BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 014597
         (422 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 224/364 (61%), Positives = 286/364 (78%), Gaps = 2/364 (0%)

Query: 35  SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST-EDTSSSGYL 93
           +I  DR+LSEY PS SS+S+++SC H LC+  S+CK+ KDPCPYI +Y   E+T+S+G+L
Sbjct: 149 NISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFENTTSAGFL 208

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           V+D LHLAS   H  +  +Q+SV++GCGRKQ GS+ DGAAPDGVMGLG GD+SVPSLLAK
Sbjct: 209 VEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAK 268

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           AGLIQN FS+CFDENDSG + FGD+G A+QQST FLPI   Y AYFVGVESYC+GNSCL 
Sbjct: 269 AGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGNSCLK 328

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
           +SGF+ALVDSG+SFT+LP+E+Y E+V +FDK V++KRIS Q   W YCYNASS+E+  +P
Sbjct: 329 RSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYNASSQELHDIP 388

Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 333
            ++L F +NQ+FVV N  +S P ++GFT+FCL++  TDG YGIIGQNFM+G+R+VFD EN
Sbjct: 389 AIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGSYGIIGQNFMIGYRMVFDIEN 448

Query: 334 LKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKS 393
           LKL WS+S C++  D + VHL PPP  +SPNPLPT EQQS     + AP    +T+ S+S
Sbjct: 449 LKLGWSNSSCQDTSDSADVHLAPPPDNKSPNPLPTNEQQSIPRTPSVAPAVAGRTS-SES 507

Query: 394 IAAS 397
            AAS
Sbjct: 508 SAAS 511


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 218/360 (60%), Positives = 277/360 (76%), Gaps = 2/360 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+LSEY PS S++S+++SC+H LC+  S CK+LKDPCPYIADY+  +TSSSG+LV+DIL
Sbjct: 147 DRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTSSSGFLVEDIL 206

Query: 99  HLASFSK--HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
           HLAS S   ++ Q  VQ+SVI+GCGRKQTG YLDGAAPDGVMGLG G +SVPSLLAKAGL
Sbjct: 207 HLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKAGL 266

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           I+ SFS+CFD N SG++ FGDQG  +Q+ST  LP    YDAY + VESYC+GNSCL QSG
Sbjct: 267 IRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESYCVGNSCLKQSG 326

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
           F+ALVDSGASFT+LP ++Y ++V++FDK V+++RIS QG  W YCYN SS+++  VP MR
Sbjct: 327 FKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGGPWNYCYNTSSKQLDNVPAMR 386

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
           L F  NQS ++ N  +  P+N+ F VFCLT+  TD +YGIIGQN+M G+R+VFD ENLKL
Sbjct: 387 LSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNYGIIGQNYMTGYRVVFDMENLKL 446

Query: 337 AWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAA 396
            WS S C+++ D++ V L P P  QSPNPLPT EQQS  N Q  AP    +T+   S+A+
Sbjct: 447 GWSSSNCKDISDETEVTLAPSPNDQSPNPLPTNEQQSVPNKQGVAPAVAGRTSSKHSVAS 506


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 212/359 (59%), Positives = 275/359 (76%), Gaps = 2/359 (0%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R+L+EY PS SS+SK +SC+  LC+  S CKS KDPCPY+A Y +E+TSSSG L++D LH
Sbjct: 149 RDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLH 208

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           LA FS+HA +SSV +SVIIGCGRKQ+G++ DGAAPDG+MGLG GD+SVPSLLAKAGL++N
Sbjct: 209 LAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRN 268

Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
           +FSICFD+N SG++ FGDQG  TQ+STSF+P+  K+  Y + VE Y +G+S L  +GFQA
Sbjct: 269 TFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 328

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           LVDSG SFTFLP EIY ++VV+FDK V++ R S +G+ WKYCYN+SS+E+L +P + L+F
Sbjct: 329 LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVTLVF 388

Query: 280 SKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           + NQSF+V N +     ENE F VFCL +     ++GIIGQNFM G+R+VFDRENLKL W
Sbjct: 389 AMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENLKLGW 448

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           S S C+++ D   +HL PPP  +SPNPLPT +QQ T +  A AP    +T P+KS A S
Sbjct: 449 STSNCQDITDGKIMHLTPPPNDRSPNPLPTNQQQMTPSRHAVAPAVAGRT-PAKSAAVS 506


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 212/359 (59%), Positives = 275/359 (76%), Gaps = 2/359 (0%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R+L+EY PS SS+SK +SC+  LC+  S CKS KDPCPY+A Y +E+TSSSG L++D LH
Sbjct: 139 RDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLH 198

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           LA FS+HA +SSV +SVIIGCGRKQ+G++ DGAAPDG+MGLG GD+SVPSLLAKAGL++N
Sbjct: 199 LAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRN 258

Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
           +FSICFD+N SG++ FGDQG  TQ+STSF+P+  K+  Y + VE Y +G+S L  +GFQA
Sbjct: 259 TFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 318

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           LVDSG SFTFLP EIY ++VV+FDK V++ R S +G+ WKYCYN+SS+E+L +P + L+F
Sbjct: 319 LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVTLVF 378

Query: 280 SKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           + NQSF+V N +     ENE F VFCL +     ++GIIGQNFM G+R+VFDRENLKL W
Sbjct: 379 AMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENLKLGW 438

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           S S C+++ D   +HL PPP  +SPNPLPT +QQ T +  A AP    +T P+KS A S
Sbjct: 439 STSNCQDITDGKIMHLTPPPNDRSPNPLPTNQQQMTPSRHAVAPAVAGRT-PAKSAAVS 496


>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 880

 Score =  427 bits (1097), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 206/384 (53%), Positives = 275/384 (71%), Gaps = 6/384 (1%)

Query: 12  NAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKS 71
           +A + +L +P   +    L  G   V DR+L++Y PS S++S+++ C H LC   S CK 
Sbjct: 123 DAGSDMLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKG 182

Query: 72  LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 131
            KDPCPY   YS+ +TSSSGY+ +D LHL S  KHA Q+SVQ+S+I+GCGRKQTG YL G
Sbjct: 183 SKDPCPYAVQYSSANTSSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRG 242

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
           A PDGV+GLG G++SVPSLLAKAGLIQNSFSICF+EN+SG + FGDQG  TQ ST FLPI
Sbjct: 243 AGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPI 302

Query: 192 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
             K++AY VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV++FDK V++  I
Sbjct: 303 DGKFNAYIVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSI 362

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
            LQ NSW+YCYNASS+E++ +P + L FS+NQ+++++N IF  P ++ +T+FCL V  +D
Sbjct: 363 VLQ-NSWEYCYNASSQELISIPPLNLAFSRNQTYLIQNPIFIDPASQEYTIFCLPVSPSD 421

Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQ 371
            DY  IGQNF+MG+R+VFDRENL+ +WS   C++    S      P +  SPNPLP  +Q
Sbjct: 422 DDYAAIGQNFLMGYRMVFDRENLRFSWSRWNCQDRASFS-----SPYSVGSPNPLPVDQQ 476

Query: 372 QSTSNGQAAAPPSTAKTAPSKSIA 395
           QS  N     P     T+P  S A
Sbjct: 477 QSFPNAHGIPPAIAGHTSPKPSAA 500


>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
          Length = 530

 Score =  423 bits (1088), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 205/361 (56%), Positives = 263/361 (72%), Gaps = 1/361 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L++Y PS SS+SK++SCSH LC+S  +C S K  CPY  +Y +E+TSSSG L++DIL
Sbjct: 145 DRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 204

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S    A  SSV++ VIIGCG +QTG YLDG APDG+MGLGLG++SVPS L+KAGL++
Sbjct: 205 HLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVK 264

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF+++DSG +FFGDQG ATQQ+T FLP   KY+ Y VGVE+ CIG+SC+ Q+ F+
Sbjct: 265 NSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFR 324

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSGASFTFLP E Y  VV +FDK V++ R S +G  W+YCY +SS+E+LK P + L 
Sbjct: 325 ALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNPSVILK 384

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F+ N SFVV N +F     +G   FCL +   DGD GI+GQNFM G+R+VFDRENLKL W
Sbjct: 385 FALNNSFVVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRENLKLGW 444

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
           S S C+++ D   + L P P  + PNPLP  EQQ+T +G     P+ A  APS   AAS 
Sbjct: 445 SRSNCQDLTDGERMPLTPSPNDRPPNPLPANEQQNTHSGHTIT-PAVAGRAPSNPSAAST 503

Query: 399 Q 399
           Q
Sbjct: 504 Q 504


>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 511

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 205/361 (56%), Positives = 263/361 (72%), Gaps = 1/361 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L++Y PS SS+SK++SCSH LC+S  +C S K  CPY  +Y +E+TSSSG L++DIL
Sbjct: 126 DRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 185

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S    A  SSV++ VIIGCG +QTG YLDG APDG+MGLGLG++SVPS L+KAGL++
Sbjct: 186 HLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVK 245

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF+++DSG +FFGDQG ATQQ+T FLP   KY+ Y VGVE+ CIG+SC+ Q+ F+
Sbjct: 246 NSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFR 305

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSGASFTFLP E Y  VV +FDK V++ R S +G  W+YCY +SS+E+LK P + L 
Sbjct: 306 ALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNPSVILK 365

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F+ N SFVV N +F     +G   FCL +   DGD GI+GQNFM G+R+VFDRENLKL W
Sbjct: 366 FALNNSFVVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRENLKLGW 425

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
           S S C+++ D   + L P P  + PNPLP  EQQ+T +G     P+ A  APS   AAS 
Sbjct: 426 SRSNCQDLTDGERMPLTPSPNDRPPNPLPANEQQNTHSGHTIT-PAVAGRAPSNPSAAST 484

Query: 399 Q 399
           Q
Sbjct: 485 Q 485


>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 542

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 207/365 (56%), Positives = 260/365 (71%), Gaps = 2/365 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L+EY PS SS+SK++SCSH LC+   +C S K PCPY  DY TE+TSSSG LV+DIL
Sbjct: 158 DRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSSSGLLVEDIL 217

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HLAS   +A   SV++ V+IGCG KQ+G YLDG APDG+MGLGL ++SVPS LAKAGLI+
Sbjct: 218 HLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPSFLAKAGLIR 277

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CFDE+DSG +FFGDQGP TQQST FL +   Y  Y VGVE +C+G+SCL Q+ F+
Sbjct: 278 NSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGSSCLKQTSFR 337

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVD+G SFTFLP  +Y  +  +FD+ V++   S  G  WKYCY +SS  + KVP ++LI
Sbjct: 338 ALVDTGTSFTFLPNGVYERITEEFDRQVNATISSFNGYPWKYCYKSSSNHLTKVPSVKLI 397

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F  N SFV+ N +F     +G T FCL +  T+GD G IGQNFM G+R+VFDREN+KL W
Sbjct: 398 FPLNNSFVIHNPVFMIYGIQGITGFCLAIQPTEGDIGTIGQNFMAGYRVVFDRENMKLGW 457

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
           SHS CE+  +   + L   P G   NPLPT EQQS+  G A + P+ A  APSK  AA+ 
Sbjct: 458 SHSSCEDRSNDKRMPLT-SPNGTLVNPLPTNEQQSSPGGHAVS-PAVAGRAPSKPSAAAV 515

Query: 399 QQLDS 403
           Q L S
Sbjct: 516 QLLPS 520


>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 525

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 208/404 (51%), Positives = 278/404 (68%), Gaps = 14/404 (3%)

Query: 17  LLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPC 76
           +L +P   +    L  G   V DR+L++Y PS S++S+++ C H LC   S CK  KDPC
Sbjct: 128 MLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPC 187

Query: 77  PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDG 136
           PY   Y++ +TSSSGY+ +D LHL S  KHA Q+SVQ+S+I+GCGRKQTG YL GA PDG
Sbjct: 188 PYEVQYASANTSSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDG 247

Query: 137 VMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD 196
           V+GLG G++SVPSLLAKAGLIQNSFSIC DEN+SG + FGDQG  TQ ST FLPI     
Sbjct: 248 VLGLGPGNISVPSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFLPI----I 303

Query: 197 AYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
           AY VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV +FDK V++ RI LQ +
Sbjct: 304 AYMVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQ-S 362

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMSTDGDY 314
           SW+YCYNASS+E++ +P ++L FS+NQ+F+++N IF  P  + + +T+FCL V  +  DY
Sbjct: 363 SWEYCYNASSQELVNIPPLKLAFSRNQTFLIQNPIFYDPASQEQEYTIFCLPVSPSADDY 422

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQST 374
             IGQNF+MG+R+VFDRENL+  WS   C++           P  G SPNPLP  +QQ+ 
Sbjct: 423 AAIGQNFLMGYRLVFDRENLRFGWSRWNCQD-----RASFTSPSNGGSPNPLPANQQQTV 477

Query: 375 SNGQAAAPPSTAKTAPSKSIAASAQQLDSVLRVACSLLVLMCLL 418
            N +   P     T+P  S A     L +  R + + L+L+C L
Sbjct: 478 PNARGVPPAIAGHTSPKPSAATPG--LVTTSRHSLASLLLICHL 519


>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
 gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
          Length = 492

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 202/353 (57%), Positives = 248/353 (70%), Gaps = 3/353 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+LSEY PS SS+SK +SCSH LC    +CK+ K  CPY  +Y TE TSSSG LV+DI+
Sbjct: 143 DRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDII 202

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HLAS       +SV++ VIIGCG KQ+G YLDG APDG++GLGL ++SVPS LAKAGLIQ
Sbjct: 203 HLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDGVAPDGLLGLGLQEISVPSFLAKAGLIQ 262

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF+E+DSG +FFGDQGPATQQS  FL +   Y  Y VGVE  C+G SCL QS F 
Sbjct: 263 NSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKLNGNYTTYIVGVEVCCVGTSCLKQSSFS 322

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSG SFTFLP +++  +  +FD  V++ R S +G SWKYCY  SS+++ K+P +RLI
Sbjct: 323 ALVDSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYKTSSQDLPKIPSLRLI 382

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F +N SF+V+N +F     +G   FCL +   DGD G IGQNFMMG+R+VFDRENLKL W
Sbjct: 383 FPQNNSFMVQNPVFMIYGIQGVIGFCLAIQPADGDIGTIGQNFMMGYRVVFDRENLKLGW 442

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPS 391
           S S CE         L   P+G   NPLPT EQQST  G A + P+ A  APS
Sbjct: 443 SRSNCE--FSGISYTLPLTPSGTPQNPLPTNEQQSTPGGHAVS-PAVAVNAPS 492


>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 523

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 191/357 (53%), Positives = 260/357 (72%), Gaps = 13/357 (3%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           V DR+LSEY+P+ SS+SK++ C H LC   ++CKS  DPC Y  DY +++TS+SG++++D
Sbjct: 146 VLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGFMIED 205

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L L SFSKH   S +Q+SV+ GCGRKQ+GSYLDGAAPDGVMGLG G++SVP+LLA+ GL
Sbjct: 206 KLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGL 265

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           ++N+FS+CFD N SG + FGD GPATQQ+T FLP+  ++ AYF+GVES+C+G+SCL +SG
Sbjct: 266 VRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPLFGEFAAYFIGVESFCVGSSCLQRSG 325

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKL--VSSKRISLQGNSWKYCYNASSEEMLKVPD 274
           FQALVDSG+SFT+LP E+Y ++V +FDK   V++ RI L+   W YCYN S+     +P 
Sbjct: 326 FQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELPWNYCYNISTLVSFNIPS 385

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
           M+L+F  NQ F + + ++  P N+G+ VFCLT+  TD DYG+IGQN M+G+R+VFDRENL
Sbjct: 386 MQLVFPLNQIF-IHDPVYVLPANQGYKVFCLTLEETDEDYGVIGQNLMVGYRMVFDRENL 444

Query: 335 KLAWSHSKCEEVIDKSHVHLVPPP---AGQSPNPLPTTEQQSTSNGQAAAPPSTAKT 388
           KL WS SKC ++   +  H  PP      +SP  LP T +Q       A  P+ A+T
Sbjct: 445 KLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALPPTNRQ-------AIAPTAART 494


>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 521

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 201/365 (55%), Positives = 257/365 (70%), Gaps = 5/365 (1%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L+EY PS S SSK++SCSH LC   S+CKS +  CPY+  Y +E+TSSSG LV+DIL
Sbjct: 142 DRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDIL 201

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S    +  SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ SVPS LAK+GLI 
Sbjct: 202 HLQSGGTLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIH 260

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
            SFS+CF+E+DSG +FFGDQGP +QQSTSFLP+   Y  Y +GVES CIGNSCL  + F+
Sbjct: 261 YSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKMTSFK 320

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           A VDSG SFTFLP  +Y  +  +FD+ V+  R S +G+ W+YCY  SS+++ KVP   L+
Sbjct: 321 AQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWEYCYVPSSQDLPKVPSFTLM 380

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F +N SFVV + +F F  NEG   FCL ++ T+GD G IGQNFM G+R+VFDR N KLAW
Sbjct: 381 FQRNNSFVVYDPVFVFYGNEGVIGFCLAILPTEGDMGTIGQNFMTGYRLVFDRGNKKLAW 440

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
           S S C+++     + L P     S NPLPT EQQ T NG A A P+ A  AP K  AAS+
Sbjct: 441 SRSNCQDLSLGKRMPLSPNET--SSNPLPTDEQQRT-NGHAVA-PAVAGRAPHKPSAASS 496

Query: 399 QQLDS 403
           + + S
Sbjct: 497 RMISS 501


>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 532

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 192/360 (53%), Positives = 262/360 (72%), Gaps = 3/360 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D++L+EY PSSSS+SK++SCSH LC S  SC+S K  CPY+ DY TE+TSSSG L+ D+L
Sbjct: 148 DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVL 207

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL+S  +++   ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG++SV S LAK  L+Q
Sbjct: 208 HLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 267

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+  KY+ Y VGVE+ CI NSCL Q+ F+
Sbjct: 268 NSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 327

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
           AL+DSG SFT+LP E Y  +V++FDK L ++  +S +G  WKYCY  S++ M KVP + L
Sbjct: 328 ALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTL 387

Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           +F  N SFVV + +F    ++G   FC  ++  DGD GI+GQN+M G+R+VFDR+NLKL 
Sbjct: 388 LFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRDNLKLG 447

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           WSH+ C+++ ++  + L P      PNPLP  EQQS S G A A P+ A  APSK  AA+
Sbjct: 448 WSHANCQDLSNEKKMPLTPAKE-TPPNPLPADEQQSASGGHAVA-PAVAGRAPSKPSAAT 505


>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 520

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 199/365 (54%), Positives = 254/365 (69%), Gaps = 5/365 (1%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L+EY PS S SSK++SCSH LC   S+CKS +  CPY+  Y +E+TSSSG LV+DIL
Sbjct: 141 DRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDIL 200

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S    +  SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ SVPS LAK+GLI 
Sbjct: 201 HLQSGGSLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIH 259

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           +SFS+CF+E+DSG +FFGDQGP  QQSTSFLP+   Y  Y +GVES C+GNSCL  + F+
Sbjct: 260 DSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKMTSFK 319

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
             VDSG SFTFLP  +Y  +  +FD+ V+  R S +G+ W+YCY  SS+E+ KVP + L 
Sbjct: 320 VQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPSSQELPKVPSLTLT 379

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F +N SFVV + +F F  NEG   FCL +  T+GD G IGQNFM G+R+VFDR N KLAW
Sbjct: 380 FQQNNSFVVYDPVFVFYGNEGVIGFCLAIQPTEGDMGTIGQNFMTGYRLVFDRGNKKLAW 439

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
           S S C+++     + L P     S NPLPT EQQ T NG A A P+ A  AP K  AA +
Sbjct: 440 SRSNCQDLSLGKRMPLSPNET--SSNPLPTDEQQRT-NGHAVA-PAVAGRAPHKPSAAPS 495

Query: 399 QQLDS 403
           + + S
Sbjct: 496 RMISS 500


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 197/385 (51%), Positives = 267/385 (69%), Gaps = 18/385 (4%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
           N + C P+++  +       S +  ++L+E+DPS+S++SK   CSH LC+S  +C+S K+
Sbjct: 126 NCVQCAPLSSAYY-------SSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKE 178

Query: 75  PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 134
            CPY   Y++E+TSSSG LV+D+LHLA +S +A  SSV++ V++GCG KQ+G +L G AP
Sbjct: 179 QCPYTVTYASENTSSSGLLVEDVLHLA-YSANA-SSSVKARVVVGCGEKQSGEFLKGIAP 236

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 194
           DGVMGLG G++SVPS LAKAGL++NSFS+CFDE DSG ++FGD GP+TQQST FLP   +
Sbjct: 237 DGVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNE 296

Query: 195 YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
           + AYFVGVE  C+GNSCL QS F  L+DSG SFTFLP EIY EV ++ D  +++    ++
Sbjct: 297 FVAYFVGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIE 356

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-MSTDGD 313
           G  W+YCY  S E   KVP ++L FS N +FV+   +F    +EG   FCL +  S +G 
Sbjct: 357 GGPWEYCYETSFEP--KVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQFCLPISASEEGT 414

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK-SHVHLVPPPAGQSPNPLPTTEQQ 372
            G+IGQN+M G+RIVFDREN+KL WS SKC+E  DK +      P +  SPNPLPT EQQ
Sbjct: 415 GGVIGQNYMAGYRIVFDRENMKLGWSASKCQE--DKIAPPQEASPGSTSSPNPLPTEEQQ 472

Query: 373 STSNGQAAAPPSTAKTAPSKSIAAS 397
           S ++   A  P+ A   PSK+ +AS
Sbjct: 473 SRTH---AVSPAIAGKTPSKTSSAS 494


>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
 gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
 gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
 gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
          Length = 528

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 189/374 (50%), Positives = 255/374 (68%), Gaps = 19/374 (5%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
           N + C P+T+  +       S +  ++L+EY+PSSSS+SK   CSH LC S S C+S K+
Sbjct: 129 NCVQCAPLTSTYY-------SSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKE 181

Query: 75  PCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDG 131
            CPY  +Y + +TSSSG LV+DILHL   + +      SSV++ V+IGCG+KQ+G YLDG
Sbjct: 182 QCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDG 241

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
            APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ QQST FL +
Sbjct: 242 VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQL 301

Query: 192 -GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
              KY  Y VGVE+ CIGNSCL Q+ F   +DSG SFT+LP EIY +V ++ D+ +++  
Sbjct: 302 DNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATS 361

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
            + +G SW+YCY +S+E   KVP ++L FS N +FV+   +F F +++G   FCL + S 
Sbjct: 362 KNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPI-SP 418

Query: 311 DGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPT 368
            G  GI  IGQN+M G+R+VFDREN+KL WS SKC+E  DK       P +  SPNPLPT
Sbjct: 419 SGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKIEPPQASPGSTSSPNPLPT 476

Query: 369 TEQQSTSNGQAAAP 382
            EQQS   G A +P
Sbjct: 477 DEQQSR-GGHAVSP 489


>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
          Length = 520

 Score =  363 bits (933), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 178/364 (48%), Positives = 246/364 (67%), Gaps = 8/364 (2%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y PS S++S+++ CSH LC   S C + K PCPY  DY +E+T+SSG L++D+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S   HAP   V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 206 HLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF ++DSG +FFGDQG  TQQST F+P+  K   Y V V+ YCIG+ C   +GFQ
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVD+G SFT LP + Y  + ++FDK +++ R S    S++YCY+    EM  VP + L 
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382

Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F++N+SF   N I  F + +G F VFCL V+ +    GIIGQNFM+G+ +VFDREN+KL 
Sbjct: 383 FAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHVVFDRENMKLG 442

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           W  S+C ++ + + V L P       +PLP+ EQQ++     A  P+ A  APS   + +
Sbjct: 443 WYRSECHDLDNSTMVSLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAGRAPSSGGSTT 498

Query: 398 AQQL 401
            Q L
Sbjct: 499 LQNL 502


>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
 gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
          Length = 520

 Score =  363 bits (932), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 178/364 (48%), Positives = 246/364 (67%), Gaps = 8/364 (2%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y PS S++S+++ CSH LC   S C + K PCPY  DY +E+T+SSG L++D+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S   HAP   V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 206 HLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF ++DSG +FFGDQG  TQQST F+P+  K   Y V V+ YCIG+ C   +GFQ
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVD+G SFT LP + Y  + ++FDK +++ R S    S++YCY+    EM  VP + L 
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382

Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F++N+SF   N I  F + +G F VFCL V+ +    GIIGQNFM+G+ +VFDREN+KL 
Sbjct: 383 FAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHVVFDRENMKLG 442

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           W  S+C ++ + + V L P       +PLP+ EQQ++     A  P+ A  APS   + +
Sbjct: 443 WYRSECHDLDNSTTVSLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAGRAPSSGGSTT 498

Query: 398 AQQL 401
            Q L
Sbjct: 499 LQNL 502


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score =  360 bits (924), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 178/364 (48%), Positives = 244/364 (67%), Gaps = 8/364 (2%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y P+ S++S+++ CSH LC+  S C + K PC Y  DY +E+T+SSG L++D L
Sbjct: 144 DRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSENTTSSGLLIEDSL 203

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S   HAP   V +SVIIGCGRKQ+G YLDG APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 204 HLNSREGHAP---VNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVR 260

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF E+ SG +FFGDQG ++QQST F+P+  K   Y V V+  CIG+ CL  S FQ
Sbjct: 261 NSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSFQ 320

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSG SFT LP ++Y     +FDK +++ R+  + ++WKYCY+AS  EM  VP + L 
Sbjct: 321 ALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASPLEMPDVPTIILA 380

Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F+ N+SF   N I  F + +G    FCL V+ +    GIIGQNF++G+ +VFDRE++KL 
Sbjct: 381 FAANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGYHVVFDRESMKLG 440

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           W  S+C +V + + V L P   G S +PLP+ EQQ++        P+T  TAP  S   +
Sbjct: 441 WYRSECRDVDNSTTVPLGPSQHGSSEDPLPSNEQQTS----PPVTPATTGTAPPSSATTN 496

Query: 398 AQQL 401
            Q L
Sbjct: 497 RQML 500


>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 529

 Score =  358 bits (919), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 188/382 (49%), Positives = 249/382 (65%), Gaps = 3/382 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 97
           DR+L+EY PS S SSK++SCSH LC   S+CK+ K   CPY  +Y +++TSSSG LV+DI
Sbjct: 145 DRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDI 204

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
            HL S       SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ SVPS LAK+GLI
Sbjct: 205 FHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLI 264

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
           ++SFS+CF+E+DSG +FFGDQG   QQST FL +   +  Y VGVE+ CIGNSC   + F
Sbjct: 265 RDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVTSF 324

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
            A  DSG SFTFLP   Y  +  +FDK V++ R + QG+ W+YCY  SS+++ K+P + L
Sbjct: 325 NAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGSPWEYCYVPSSQQLPKIPTLTL 384

Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           +F +N SFVV N +F     +G   FCL +  T+G  G IGQNFM G+R+VFDREN KLA
Sbjct: 385 MFQQNNSFVVYNPVFVSYNEQGVDGFCLAIQPTEGGMGTIGQNFMTGYRLVFDRENKKLA 444

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTS-NGQAAAPPSTAKTAPSKSIAA 396
           WSHS C+++     + L  PP G S + LP  EQQ T  +  A A    A   PS + + 
Sbjct: 445 WSHSNCQDLSLGKRMPL-SPPNGTSSSQLPADEQQRTKGHAVAPAVAVRAPQKPSVASSQ 503

Query: 397 SAQQLDSVLRVACSLLVLMCLL 418
           ++  +       C  L+L  LL
Sbjct: 504 TSYMISYWRHWHCHWLLLFHLL 525


>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
          Length = 506

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 187/373 (50%), Positives = 252/373 (67%), Gaps = 20/373 (5%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
           N + C P+T+  +       S +  ++L+EY+PSSSS+SK   CSH LC S S C+S K+
Sbjct: 129 NCVQCAPLTSTYY-------SSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKE 181

Query: 75  PCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDG 131
            CPY  +Y + +TSSSG LV+DILHL   + +      SSV++ V+IGCG+KQ+G YLDG
Sbjct: 182 QCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDG 241

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
            APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ QQST FL +
Sbjct: 242 VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQL 301

Query: 192 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
            E    Y VGVE+ CIGNSCL Q+ F   +DSG SFT+LP EIY +V ++ D+ +++   
Sbjct: 302 -ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSK 360

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
           S +G SW+YCY +S E   KVP ++L FS N +FV+   +F F +++G   FCL + S  
Sbjct: 361 SFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPI-SPS 417

Query: 312 GDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTT 369
           G  GI  IGQN+M G+R+VFDREN+KL WS SKC+E  +K       P +  SP PLPT 
Sbjct: 418 GQEGIGSIGQNYMRGYRMVFDRENMKLRWSASKCQE--EKIEPPQASPGSTSSPYPLPTE 475

Query: 370 EQQSTSNGQAAAP 382
           EQQ  S G A +P
Sbjct: 476 EQQ--SRGHAVSP 486


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score =  355 bits (910), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 184/388 (47%), Positives = 249/388 (64%), Gaps = 9/388 (2%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C  + G     DR+L  Y P+ S++S+++ CSH LC   S C S K PCPY  DY  E+T
Sbjct: 176 CAPLAGYRETLDRDLGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENT 235

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           +SSG L++DILHL S   HAP   V++SV+IGCGRKQ+GSYLDG APDG++GLG+ D+SV
Sbjct: 236 TSSGLLIEDILHLDSRESHAP---VKASVVIGCGRKQSGSYLDGIAPDGLLGLGMADISV 292

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS LA+AGL++NSFS+CF E DSG +FFGDQG + QQST F+P+  KY  Y V V+  C+
Sbjct: 293 PSFLARAGLVRNSFSMCFKE-DSGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCV 351

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
           G+ C   + F+ALVDSG SFT LP  +Y  V V+FDK V + RI+ +  S++YCY+AS  
Sbjct: 352 GHKCFEATSFEALVDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSASPL 411

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHR 326
           +M  VP + L F+ N+SF   N      + EG    FCL +  +    GIIGQNF+ G+ 
Sbjct: 412 KMPDVPTVTLTFAANKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNFLTGYH 471

Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA 386
           IVFD+EN+KL W  S+C +  + + V L P        PLP++EQQ++       PP+ A
Sbjct: 472 IVFDKENMKLGWYRSECHDPDNSTTVPLGPSQHNSPGVPLPSSEQQTSPT---VTPPAVA 528

Query: 387 KTAPSKSIAASAQQLDSVLRVACSLLVL 414
             AP+ S +     L  +L   CSLL+L
Sbjct: 529 GKAPTSS-SGPPSNLHRLLANCCSLLLL 555


>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
          Length = 378

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 176/379 (46%), Positives = 248/379 (65%), Gaps = 11/379 (2%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           +QDR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG L++D
Sbjct: 1   MQDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIED 60

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            LHL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL
Sbjct: 61  TLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGL 117

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           +QNSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG+ CL  + 
Sbjct: 118 VQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTS 177

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
           F+ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  EM  VP + 
Sbjct: 178 FKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTIT 237

Query: 277 LIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
           L F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +VFDRE++K
Sbjct: 238 LTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMK 297

Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIA 395
           L W  S+C  V D + V L P       +PLP+ EQQ++     A  P+TA TAP   ++
Sbjct: 298 LGWYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAGTAP---LS 350

Query: 396 ASAQQLDSVLRVACSLLVL 414
            +   L  +L  +  LL+L
Sbjct: 351 CATTNLQMLLASSYPLLLL 369


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score =  350 bits (897), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 175/377 (46%), Positives = 247/377 (65%), Gaps = 11/377 (2%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG L++D L
Sbjct: 110 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 169

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 170 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 226

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG+ CL  + F+
Sbjct: 227 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 286

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  EM  VP + L 
Sbjct: 287 ALVDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 346

Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +VFDRE++KL 
Sbjct: 347 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 406

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           W  S+C +V D + V L P       +PLP+ EQQ++     A  P+TA TAP   ++ +
Sbjct: 407 WYRSECHDVEDSTTVPLGPSQRDSPEDPLPSNEQQTS----PAVTPATAGTAP---LSCA 459

Query: 398 AQQLDSVLRVACSLLVL 414
              L  +L  +  LL+L
Sbjct: 460 TTNLQMLLASSYPLLLL 476


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score =  348 bits (892), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 175/377 (46%), Positives = 246/377 (65%), Gaps = 11/377 (2%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 200 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG+ CL  + F+
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  EM  VP + L 
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376

Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +VFDRE++KL 
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           W  S+C  V D + V L P       +PLP+ EQQ++     A  P+TA TAP   ++ +
Sbjct: 437 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAGTAP---LSCA 489

Query: 398 AQQLDSVLRVACSLLVL 414
              L  +L  +  LL+L
Sbjct: 490 TTNLQMLLASSYPLLLL 506


>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
          Length = 515

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 174/377 (46%), Positives = 245/377 (64%), Gaps = 11/377 (2%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL     H P   V +SVIIGCG+KQ+G YLDG APDG++ LG+ D+SVPS LA+AGL+Q
Sbjct: 200 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQ 256

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG+ CL  + F+
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  EM  VP + L 
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376

Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +VFDRE++KL 
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
           W  S+C  V D + V L P       +PLP+ EQQ++     A  P+TA TAP   ++ +
Sbjct: 437 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAGTAP---LSCA 489

Query: 398 AQQLDSVLRVACSLLVL 414
              L  +L  +  LL+L
Sbjct: 490 TTNLQMLLASSYPLLLL 506


>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 529

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 183/372 (49%), Positives = 249/372 (66%), Gaps = 18/372 (4%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
           N + C P+T+  +       S +  ++L+EY+PSSSSSSK   CSH LC S S C S K+
Sbjct: 129 NCVQCAPLTSTYY-------SSLATKDLNEYNPSSSSSSKVFLCSHKLCGSASDCDSPKE 181

Query: 75  PCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDG 131
            C Y   Y + +TSSSG LV+DILHL   + +      SSV++ V++GCG+KQ+G YLDG
Sbjct: 182 QCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVVGCGKKQSGDYLDG 241

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
            APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ QQS  FL +
Sbjct: 242 VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSAPFLQL 301

Query: 192 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
            E    Y VGVE+ CIGNSCL Q+ F   +DSG SFT+LP EIY +V ++ D+ +++   
Sbjct: 302 -ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSK 360

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
           S +G SW+YCY +S E   KVP ++L FS N +FV+   +F F +++G   FCL +  ++
Sbjct: 361 SFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSE 418

Query: 312 GD-YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTE 370
            +  G IGQN+M G+R+VFDREN+KL WS SKC+E  DK+      P +  SP PLPT E
Sbjct: 419 QEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKTEPPQASPGSTSSPYPLPTEE 476

Query: 371 QQSTSNGQAAAP 382
           QQ  S G A +P
Sbjct: 477 QQ--SRGHAVSP 486


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 173/356 (48%), Positives = 236/356 (66%), Gaps = 9/356 (2%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y P+ S++S+++ CSH LC   S C + K PCPY   Y  E+T+SSG LV+DIL
Sbjct: 252 DRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDIL 311

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL S   HAP   V++SVIIGCGRKQ+GSYLDG APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 312 HLDSRESHAP---VKASVIIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVR 368

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF + DSG +FFGDQG +TQQST F+P+  K   Y V V+  C+G+ C   + FQ
Sbjct: 369 NSFSMCFTK-DSGRIFFGDQGVSTQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSFQ 427

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           A+VDSG SFT LP +IY  V ++FDK V++ R+  +  S+ YCY+AS   M  VP + L 
Sbjct: 428 AIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLVMPDVPTVTLT 487

Query: 279 FSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F+ N+SF   N  F   + EG    FCL V+ +    GII QNF++G+ +VFDREN+KL 
Sbjct: 488 FAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIGIIAQNFLLGYHVVFDRENMKLG 547

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKS 393
           W  S+C ++ + + V L P       +PLP+ EQQ++     A  P+ A  A + S
Sbjct: 548 WYRSECHDLDNSTTVPLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAGRARASS 599


>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 530

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 182/391 (46%), Positives = 258/391 (65%), Gaps = 19/391 (4%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L++Y PS SSSS+++ C H LC   S+CK  KD CPYI +Y++++TSSSG+L++D L
Sbjct: 147 DRDLNQYSPSLSSSSRHLPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKL 206

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HLAS   +A ++S+Q+SVI+GCGRKQ+G +L+GAAP+G++GLG G +SVP+LLAKAGLI+
Sbjct: 207 HLAS--NNATKNSIQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALLAKAGLIR 264

Query: 159 NSFSICFDENDSGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
           NS SIC +E  SG + FGDQG ATQ +ST FL    +   YFVGVE +C+G+ C  ++ F
Sbjct: 265 NSISICLNEKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSFCYKETEF 324

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMR 276
           +A +D+G SFT+LP  +Y  VV +F+K V + RI+ Q  S +  CYNASS E    P M+
Sbjct: 325 KAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNASSRESNNFPPMK 384

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMGHRIVF 329
             FSKNQSF+++N   S  + +  T  CL V+ +D +       Y I  QNF+MG+ +VF
Sbjct: 385 FTFSKNQSFIIQNPFISMDQED--TTICLAVVQSDDELITIGRKYTIACQNFLMGYDMVF 442

Query: 330 DRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KT 388
           DRENL+  W  S C++ + +S  +   P  G SP+ +P+ +QQ   N   + PP+ A KT
Sbjct: 443 DRENLRFGWFRSNCQDSMGES-ANFTSPSIGGSPDSIPSNQQQRVPNNTRSVPPAIAGKT 501

Query: 389 APSKSIAASAQQLDSVLRVACSLLVLMCLLL 419
           +P  S A        +L      L L+CLLL
Sbjct: 502 SPKPSAAKPGLNSWHLLNS----LSLICLLL 528


>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
          Length = 469

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 152/307 (49%), Positives = 212/307 (69%), Gaps = 4/307 (1%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL     H P   V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 200 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF E+ SG +FFGDQG  +QQST F+P+  K   Y V V+  CIG+ CL  + F+
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           ALVDSG SFT LP ++Y    ++FDK +++ R+  +  +WKYCY+AS  EM  VP + L 
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376

Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F+ ++S    N I  F + +G    FCL V+ +    GII QNF++G+ +VFDRE++KL 
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436

Query: 338 WSHSKCE 344
           W  S+C+
Sbjct: 437 WYRSECK 443


>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
          Length = 313

 Score =  298 bits (764), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 148/276 (53%), Positives = 196/276 (71%), Gaps = 9/276 (3%)

Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
           SSV++ V+IGCG+KQ+G YLDG APDG+MGLG  ++SVPS L+KAGL++NSFS+CFDE D
Sbjct: 5   SSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED 64

Query: 170 SGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 228
           SG ++FGD GP+ QQST FL +   KY  Y VGVE+ CIGNSCL Q+ F   +DSG SFT
Sbjct: 65  SGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 124

Query: 229 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 288
           +LP EIY +V ++ D+ +++   + +G SW+YCY +S+E   KVP ++L FS N +FV+ 
Sbjct: 125 YLPEEIYRKVALEIDRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIH 182

Query: 289 NHIFSFPENEGFTVFCLTVMSTDGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
             +F F +++G   FCL + S  G  GI  IGQN+M G+R+VFDREN+KL WS SKC+E 
Sbjct: 183 KPLFVFQQSQGLVQFCLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE- 240

Query: 347 IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAP 382
            DK       P +  SPNPLPT EQQS   G A +P
Sbjct: 241 -DKIEPPQASPGSTSSPNPLPTDEQQSR-GGHAVSP 274


>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like, partial [Cucumis sativus]
          Length = 408

 Score =  296 bits (757), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 143/255 (56%), Positives = 195/255 (76%), Gaps = 1/255 (0%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D++L+EY PSSSS+SK++SCSH LC S  SC+S K  CPY+ DY TE+TSSSG L+ D+L
Sbjct: 148 DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVL 207

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL+S  +++   ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG++SV S LAK  L+Q
Sbjct: 208 HLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 267

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           NSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+  KY+ Y VGVE+ CI NSCL Q+ F+
Sbjct: 268 NSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 327

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
           AL+DSG SFT+LP E Y  +V++FDK L ++  +S +G  WKYCY  S++ M KVP + L
Sbjct: 328 ALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTL 387

Query: 278 IFSKNQSFVVRNHIF 292
           +F  N SFVV + +F
Sbjct: 388 LFPLNNSFVVHDPVF 402


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score =  224 bits (570), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 130/351 (37%), Positives = 197/351 (56%), Gaps = 21/351 (5%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASF 103
           YD   SS+SKNV+C+  LC+ ++ C S     CPY  +Y +E+TS++G+LV+D+LHL + 
Sbjct: 163 YDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITD 222

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           +    Q +    +  GCG+ QTG++LDGAAP+G+ GLG+ DVSVPS+LAK GL  NSFS+
Sbjct: 223 NDDQTQHA-NPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSM 281

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 223
           CF  +  G + FGD   +  Q  +   I   +  Y + V    +G +      F A+ D+
Sbjct: 282 CFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADLE-FNAIFDT 340

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFS 280
           G SFT+L    Y ++   FD  +  +R S   +    ++YCY+  + + ++VP++ L   
Sbjct: 341 GTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRTNQTIEVPNINLTMK 400

Query: 281 KNQSFVVRNHIF-SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 339
              ++ V + I  S   N G  V CL V+ ++ +  IIGQNFM G+RIVFDREN+ L W 
Sbjct: 401 GGDNYFVMDPIITSGGGNNG--VLCLAVLKSN-NVNIIGQNFMTGYRIVFDRENMTLGWK 457

Query: 340 HSKC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
            S C  +E+    +++SH   V P    +P       Q + SNG    P S
Sbjct: 458 ESNCYDDELSSLPVNRSHAPAVSPAMAVNPEI-----QSNPSNGPQRLPSS 503


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 134/365 (36%), Positives = 194/365 (53%), Gaps = 25/365 (6%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           N + Y P++SS+SK V CS  LC     C S  D CPY   Y +++TSS+GYLV+DILHL
Sbjct: 175 NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHL 234

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +         V + + +GCG+ Q+G++L  AAP+G+ GLG+ +VSVPS+LA AGLI NS
Sbjct: 235 TT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNS 292

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           FS+CF     G + FGD+G   Q  T F  +G ++  Y V +    +G   ++      +
Sbjct: 293 FSLCFGPARMGRIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVI 350

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLI 278
            DSG SFT+L    Y+    KF  +V  K+ ++  +  ++ CY  S ++     P M L 
Sbjct: 351 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 410

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
                 FV+ NH       E   +FCL +  +D    IIGQNFM G+ IVFDRE + L W
Sbjct: 411 MKGGGHFVI-NHPIVLISTESKRLFCLAIARSD-SINIIGQNFMTGYHIVFDREKMVLGW 468

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KTAPSKSIAAS 397
             S C    D++  +L   P G +P P             AAAP +TA K   + +I  +
Sbjct: 469 KESNCTGYEDENTNNL---PVGPTPTP-------------AAAPGTTAIKPQANSNINNT 512

Query: 398 AQQLD 402
            Q ++
Sbjct: 513 TQTIE 517


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 134/365 (36%), Positives = 194/365 (53%), Gaps = 25/365 (6%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           N + Y P++SS+SK V CS  LC     C S  D CPY   Y +++TSS+GYLV+DILHL
Sbjct: 152 NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHL 211

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +         V + + +GCG+ Q+G++L  AAP+G+ GLG+ +VSVPS+LA AGLI NS
Sbjct: 212 TT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNS 269

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           FS+CF     G + FGD+G   Q  T F  +G ++  Y V +    +G   ++      +
Sbjct: 270 FSLCFGPARMGRIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVI 327

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLI 278
            DSG SFT+L    Y+    KF  +V  K+ ++  +  ++ CY  S ++     P M L 
Sbjct: 328 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 387

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
                 FV+ NH       E   +FCL +  +D    IIGQNFM G+ IVFDRE + L W
Sbjct: 388 MKGGGHFVI-NHPIVLISTESKRLFCLAIARSD-SINIIGQNFMTGYHIVFDREKMVLGW 445

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KTAPSKSIAAS 397
             S C    D++  +L   P G +P P             AAAP +TA K   + +I  +
Sbjct: 446 KESNCTGYEDENTNNL---PVGPTPTP-------------AAAPGTTAIKPQANSNINNT 489

Query: 398 AQQLD 402
            Q ++
Sbjct: 490 TQTIE 494


>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 533

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 129/352 (36%), Positives = 191/352 (54%), Gaps = 12/352 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P++SS+S+ + C++ LC  +S C S +  CPY   Y +  TSS+G LV+D+LHL +  
Sbjct: 165 YRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTT-- 222

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
             A   ++ + +I GCGR QTGS+LDGAAP+G+ GLG+ ++SVPS LA+ G   NSFS+C
Sbjct: 223 DDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMC 282

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGD G + Q  T F  + + +  Y V +    +G        F A+ DSG
Sbjct: 283 FGRDGIGRISFGDTGSSGQGETPF-NLRQLHPTYNVSITKINVGGRDADLE-FSAIFDSG 340

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEM-LKVPDMRLIFSKN 282
            SFT+L    Y  +   F+     KR  S+    ++YCY  SS +  L++P + L+    
Sbjct: 341 TSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEIPTVNLVMQGG 400

Query: 283 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
             F V + I       G +++CL ++ + GD  IIGQNFM G+RIVF+RE   L W  S 
Sbjct: 401 SQFNVTDPIVIVILQGGASIYCLAIVKS-GDVNIIGQNFMTGYRIVFNRERNVLGWKASD 459

Query: 343 CEEVIDKSHVHLVPPPAGQSP----NPLPTTEQQSTSNGQAAAPPSTAKTAP 390
           C + +D +   + P   G  P    NP  T    +T+   +  PP     AP
Sbjct: 460 CYDDMDTTTFPVDPISPGIPPATAVNPQATAGSGNTTE-VSGTPPPVGNNAP 510


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score =  214 bits (545), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 137/393 (34%), Positives = 200/393 (50%), Gaps = 34/393 (8%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C     ++   D +L+ Y+P+ SS+SK V+C++ LC  RS C      CPY+  Y + +T
Sbjct: 129 CAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAET 188

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S+SG LV+D+LHL     H     V+++VI GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 189 STSGILVEDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISV 246

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+L++ G   +SFS+CF  +  G + FGD+G   Q  T F  +   +  Y + V    +
Sbjct: 247 PSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRV 305

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
           G + L    F AL DSG SFT+L    Y  +   F   V  +R        ++YCY+ S 
Sbjct: 306 GTT-LIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSP 364

Query: 267 EEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
           +     +P + L       F V + I      +   V+CL V+ T  +  IIGQNFM G+
Sbjct: 365 DANTSLIPSVSLTMGGGSHFAVYDPIIII-STQSELVYCLAVVKT-AELNIIGQNFMTGY 422

Query: 326 RIVFDRENLKLAWSHSKCEEVID-------KSHVHLVPPPA-----GQSPNPLPTTEQQS 373
           R+VFDRE L L W    C ++ D       + H H   PPA     G  P   PT  ++S
Sbjct: 423 RVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPHSHADVPPAVAAGLGNYPATDPT--RKS 480

Query: 374 TSNGQAAAPPSTAKTAPSKSIAASAQQLDSVLR 406
             N Q             K +  + Q L S+LR
Sbjct: 481 KYNSQ------------RKWLTNTTQWLRSMLR 501


>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 525

 Score =  214 bits (544), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 125/344 (36%), Positives = 189/344 (54%), Gaps = 9/344 (2%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L+ Y PS SS++K V CS PLC+  S+C +  D CPY  +Y + +TS+SG L +D ++  
Sbjct: 159 LNPYTPSLSSTAKPVLCSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMY-- 216

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
            F + +  + V+  V +GCG+ QTGS L GAAP+G+MGLG  D+SVP+ LA  G + +SF
Sbjct: 217 -FMRESGGNPVKLPVYLGCGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSF 275

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           S+C     SG++ FGD+GPA Q++T  +P      D Y V ++S  +GN+ L  +   AL
Sbjct: 276 SLCISPGGSGTLTFGDEGPAAQRTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMAS-HAL 334

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
            D+G SFT+L   +Y + V  +D  +S  K    + + W  CY  S+    +VP + L  
Sbjct: 335 FDTGTSFTYLSKTVYPQFVQAYDAQMSLPKWNDPRFSKWDLCYQTSNTN-FQVPVVSLAL 393

Query: 280 SKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           S   S  V + + S   +N      C+TVM +     IIGQNFM  + I ++R  + + W
Sbjct: 394 SGGNSLDVVSGLKSIVDDNNAMIAVCVTVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGW 453

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPN-PLPTTEQQSTSNGQAAA 381
           + S C   +  S+      PA   P  PLP   + ++ N    A
Sbjct: 454 TPSDCSTDLTLSNSTPGSVPAALPPTAPLPAVPRPASPNSTVTA 497


>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 417

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 124/340 (36%), Positives = 188/340 (55%), Gaps = 15/340 (4%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C    G+    D  LS Y P  SS+SK V C++ LC  R  C      CPY+  Y + +T
Sbjct: 37  CAPTEGSPYASDFELSVYSPKKSSTSKTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAET 96

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S++G L++D+LHL + +KH+    +Q+ +  GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 97  STTGILIEDLLHLKTENKHS--EPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 154

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+L++ GL+ NSFS+CF ++  G + FGD+G   Q+ T F  + + +  Y + V S  +
Sbjct: 155 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRV 213

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
           G + L  +   AL DSG SF++    IY+++   F       R        ++YCYN S 
Sbjct: 214 GTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSP 272

Query: 267 EEMLKV-PDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           +    + P + L       F V + I     +NE   ++CL V+ +  +  IIGQNFM G
Sbjct: 273 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNE--LIYCLAVVKS-AELNIIGQNFMTG 329

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
           +RIVFDRE L L W    C ++ +KS   + P     PPA
Sbjct: 330 YRIVFDREKLVLGWKKFDCYDIEEKSLFPMKPDVTTVPPA 369


>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
 gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 178/319 (55%), Gaps = 10/319 (3%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C    GAS   D  LS Y+P  SS+SK V+C++ +C  R+ C      CPYI  Y +  T
Sbjct: 130 CAPTHGASYASDFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGTFSSCPYIVSYVSAQT 189

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S+SG LV D+LHL +  +   +  V++ V  GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 190 STSGILVKDVLHLTT--EDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 247

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+L++ GLI +SFS+CF  +  G + FGD+G   Q+ T F  +   +  Y V V    +
Sbjct: 248 PSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPF-NVNPAHPTYNVTVTQARV 306

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
           G + L    F AL DSG SFT++    Y+ V  KF  L   KR        ++YCY+ S 
Sbjct: 307 G-TMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIPFEYCYDMSP 365

Query: 267 EEMLK-VPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           +     VP M L     + F V + I     +NE   V+CL V+ +  +  IIGQNFM G
Sbjct: 366 DANASLVPSMSLTMKGGRHFTVYDPIIVISTQNE--IVYCLAVVKST-ELNIIGQNFMTG 422

Query: 325 HRIVFDRENLKLAWSHSKC 343
           +R+VFDRE L L W    C
Sbjct: 423 YRVVFDREKLVLGWKKFDC 441


>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 525

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 124/340 (36%), Positives = 187/340 (55%), Gaps = 15/340 (4%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C    G+    D  LS Y P  SS+SK V C++ LC  R  C      CPY+  Y + +T
Sbjct: 145 CAPTEGSPYASDFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVVSYVSAET 204

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S++G L++D+LHL +  KH+    +Q+ +  GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 205 STTGILIEDLLHLKTEHKHS--EPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 262

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+L++ GL+ NSFS+CF ++  G + FGD+G   Q+ T F  + + +  Y + V S  +
Sbjct: 263 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRV 321

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
           G + L  +   AL DSG SF++    IY+++   F       R        ++YCYN S 
Sbjct: 322 GTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSP 380

Query: 267 EEMLKV-PDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           +    + P + L       F V + I     +NE   ++CL V+ +  +  IIGQNFM G
Sbjct: 381 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNE--LIYCLAVVKS-AELNIIGQNFMTG 437

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
           +RIVFDRE L L W    C ++ +KS   + P     PPA
Sbjct: 438 YRIVFDREKLVLGWKKFDCYDIEEKSLFPMKPDVTTVPPA 477


>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
 gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
 gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 524

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 116/316 (36%), Positives = 182/316 (57%), Gaps = 8/316 (2%)

Query: 33  GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
           GA+   +  LS Y+P  S+++K V+C++ LC  R+ C      CPY+  Y +  TS+SG 
Sbjct: 145 GATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGI 204

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L++D++HL +  K+  +  V++ V  GCG+ Q+GS+LD AAP+G+ GLG+  +SVPS+LA
Sbjct: 205 LMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLA 262

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           + GL+ +SFS+CF  +  G + FGD+G + Q+ T F  +   +  Y + V    +G + L
Sbjct: 263 REGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-L 320

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK 271
               F AL D+G SFT+L   +Y  V   F      KR S      ++YCY+ S++    
Sbjct: 321 IDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANAS 380

Query: 272 -VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
            +P + L    N  F + + I      EG  V+CL ++ +  +  IIGQN+M G+R+VFD
Sbjct: 381 LIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAIVKS-SELNIIGQNYMTGYRVVFD 438

Query: 331 RENLKLAWSHSKCEEV 346
           RE L LAW    C ++
Sbjct: 439 REKLVLAWKKFDCYDI 454


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 133/366 (36%), Positives = 192/366 (52%), Gaps = 19/366 (5%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C    G +   D  LS YDP  SS+SK V+C++ LC  R+ C      CPY+  Y +  T
Sbjct: 134 CAPTQGVAYASDFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQT 193

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S+SG LV+D+LHL S  + + Q S+++ V  GCG+ Q+GS+L+ AAP+G+ GLG+  +SV
Sbjct: 194 STSGILVEDVLHLTS--EDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISV 251

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+L++ GL  +SFS+CF  +  G + FGD+G   Q+ T F      + +Y + V    +
Sbjct: 252 PSILSREGLTADSFSMCFGHDGVGRISFGDKGSPDQEETPFNS-NPSHPSYNISVTQVRV 310

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS- 265
           G + L    F AL DSG SFT+L   IYA V   F      KR        ++YCY+ S 
Sbjct: 311 GTT-LVDVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSP 369

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
                 +P M L       F V + I     +NE   V+CL ++ +  +  IIGQNFM G
Sbjct: 370 GANSSLIPSMSLTMKGRGHFTVFDPIIVITTQNE--LVYCLAIVKST-ELNIIGQNFMTG 426

Query: 325 HRIVFDRENLKLAWSHSKCEE-----VIDKSHVHLVPPPA----GQSPNPLPTTEQQSTS 375
           +R+VFDRE L L W  + C +        + H   VPP      G   +P  T + +  S
Sbjct: 427 YRVVFDREKLVLGWKETDCYDQEYNSFPTEPHASDVPPAVAAGLGNYSSPHSTNQDRKKS 486

Query: 376 NGQAAA 381
               A+
Sbjct: 487 QSSVAS 492


>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 529

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 126/342 (36%), Positives = 189/342 (55%), Gaps = 10/342 (2%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P  SS+S+ V CS  +C  ++ C +  + CPY  +Y +++TSS G LV+D+++LA+ S
Sbjct: 157 YSPRKSSTSRKVPCSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATES 216

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H+     Q+ +  GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  G+  NSFS+C
Sbjct: 217 GHS--KITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMC 274

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F E+  G + FGD G A Q  T  L I +    Y + +     G    + + F A+VDSG
Sbjct: 275 FGEDGHGRINFGDTGSADQLETP-LNIYKHNPYYNISIVGAMAGGKTFS-TKFSAVVDSG 332

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
            SFT L   +Y E+   FDK V  KR     +  ++YCY  SS+  +  P++ L      
Sbjct: 333 TSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYTISSKGAVSPPNISLTAKGGS 392

Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            F V++ I +  +     V +CL +M ++G   +IG+NFM G ++VFDRE L L W    
Sbjct: 393 VFPVKDPIITITDISSSPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERLVLGWKSFN 451

Query: 343 CEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
           C  V   + + + P  +   P P+       +SN +AA  PS
Sbjct: 452 CYSVDHSTKLPVSPNSSAIPPKPV---SGPGSSNPEAAKRPS 490


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 120/305 (39%), Positives = 175/305 (57%), Gaps = 8/305 (2%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +L+ Y P++SS+S  V C+  LC     C S    CPY   Y +  TSS+G LV+D+LHL
Sbjct: 151 DLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHL 210

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S  K++    +++ + +GCG  QTG + DGAAP+G+ GLGL D+SVPS+LAK G+  NS
Sbjct: 211 VSMEKNS--KPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 268

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           FS+CF ++ +G + FGD+G   Q+ T  L I + +  Y V V    +G +      F A+
Sbjct: 269 FSMCFGDDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNVTVTQISVGGNT-GDLEFDAV 326

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLI 278
            D+G SFT+L    Y  +   F+ L   KR        ++YCY  S +++  + PD+ L 
Sbjct: 327 FDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAVSPNKKSFEYPDVNLT 386

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
                S+ V + +   P  E   V+CL +M ++ D  IIGQNFM G+R+VFDRE L L W
Sbjct: 387 MKGGSSYPVYHPLIVVPI-EDTVVYCLAIMKSE-DISIIGQNFMTGYRVVFDREKLILGW 444

Query: 339 SHSKC 343
             S C
Sbjct: 445 KESDC 449


>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 522

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 116/316 (36%), Positives = 182/316 (57%), Gaps = 8/316 (2%)

Query: 33  GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
           GA+   +  LS Y+P  S+++K V+C++ LC  R+ C      CPY+  Y +  TS+SG 
Sbjct: 143 GATYASEFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGI 202

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L++D++HL +  K+  +  V++ V  GCG+ Q+GS+LD AAP+G+ GLG+  +SVPS+LA
Sbjct: 203 LMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLA 260

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           + GL+ +SFS+CF  +  G + FGD+G + Q+ T F  +   +  Y + V    +G + L
Sbjct: 261 REGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-L 318

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK 271
               F AL D+G SFT+L   +Y  V   F      KR S      ++YCY+ S++    
Sbjct: 319 IDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANAS 378

Query: 272 -VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
            +P + L    N  F + + I      EG  V+CL ++ +  +  IIGQN+M G+R+VFD
Sbjct: 379 LIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAIVKS-SELNIIGQNYMTGYRVVFD 436

Query: 331 RENLKLAWSHSKCEEV 346
           RE L LAW    C ++
Sbjct: 437 REKLVLAWKKFDCYDI 452


>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 537

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 133/368 (36%), Positives = 197/368 (53%), Gaps = 26/368 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGYLVDDI 97
           +L  Y P  SS+SK V+C H LC+  ++C +  +    CPY   Y + +TSSSG LV+D+
Sbjct: 154 DLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDV 213

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           LHL+  +     ++V + V++GCG+ QTG++LDGAA DG++GLG+  VSVPS+L  AGL+
Sbjct: 214 LHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVPSVLHAAGLV 273

Query: 158 -QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
             +SFS+CF  +  G + FGD G   Q  T F  +   +  Y + V +  +    +    
Sbjct: 274 ASDSFSMCFSPDGFGRINFGDSGRRGQAETPFT-VRNTHPTYNISVTAMSVSGKEVAAE- 331

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPD 274
           F A+VDSG SFT+L    Y E+   F+  V  +R +L  +  ++YCY     +  L VP+
Sbjct: 332 FAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYELGRGQTELFVPE 391

Query: 275 MRLIFSKNQSF-VVRNHIFSFPE-NEGFTV---FCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           + L       F V R  +  + E ++G  V   +CL V+  D    IIGQNFM G ++VF
Sbjct: 392 VSLTTRGGAVFPVTRPIVVIYGETSDGRIVAAGYCLAVLKNDITIDIIGQNFMTGLKVVF 451

Query: 330 DRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTT----EQQSTSNGQ--AAAPP 383
           DRE   L W    C + ++   +       G +P P PTT     Q   +NG     A P
Sbjct: 452 DRERSVLGWHEFDCYKDVETEEL-------GAAPGPSPTTRLKPRQSEVANGTPYPGAVP 504

Query: 384 STAKTAPS 391
            T + A S
Sbjct: 505 VTPRQAGS 512


>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
 gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
          Length = 518

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 127/339 (37%), Positives = 187/339 (55%), Gaps = 14/339 (4%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C    G +   D  LS Y+P  SS+S+ V+C + LC  R+ C      CPY+  Y + +T
Sbjct: 136 CAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTFSNCPYMVSYVSAET 195

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S+SG LV+D+LHL +  +   Q  V++ V  GCG+ QTGS+LD AAP+G+ GLGL  +SV
Sbjct: 196 STSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISV 253

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+L+K G   +SFS+CF  +  G + FGD+G   Q+ T F  +   +  Y + V    +
Sbjct: 254 PSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPF-NLNALHPTYNITVTQVRV 312

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS- 265
           G + L    F AL DSG SFT+L   IY  V+  F  +   S+R       +++CY+ S 
Sbjct: 313 GTT-LIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPFEFCYDMSP 371

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
            E    +P M L       F V + I     ++   ++C+ V+ +  +  IIGQNFM G+
Sbjct: 372 GENTSLIPSMSLTMKGGSQFPVYDPIIII-SSQSELIYCMAVVRS-AELNIIGQNFMTGY 429

Query: 326 RIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
           RI+FDRE L L W   +C++ I+ S V + P     PPA
Sbjct: 430 RIIFDREKLVLGWKEFECDD-IENSSVPIRPRATSVPPA 467


>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
 gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 130/375 (34%), Positives = 187/375 (49%), Gaps = 42/375 (11%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +L+ Y P++SS+S+ V C+  LC    R  C S +  CPY   Y +  TS++GY+V D+L
Sbjct: 107 DLNIYSPNTSSTSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLL 166

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL   S  +   +V + +  GCG+ QTGS+L G AP+G+ GLG+ ++SVPS LA  G   
Sbjct: 167 HL--ISDDSQSKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTS 224

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
            SFS+CF  N  G + FGD+G   Q  TSF     +   Y + +    IG    +   + 
Sbjct: 225 GSFSMCFSPNGIGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQA-SDLVYS 283

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS------------ 266
           A+ DSG SFT+L    Y  +   F+KLV   R S     + YCY+  S            
Sbjct: 284 AIFDSGTSFTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRSFISAQILPFSCA 343

Query: 267 ---EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
              +    +P + L+ S    F V + I      +G  V+CL ++ + GD  IIGQNFM 
Sbjct: 344 YANQTEPTIPAVTLVMSGGDYFNVTDPIVLVQLADGSAVYCLGMIKS-GDVNIIGQNFMT 402

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 383
           GHRIVFDRE + L W  S C + +D + + + P                       A PP
Sbjct: 403 GHRIVFDRERMILGWKPSNCYDNMDTNTLAVSP---------------------NTAVPP 441

Query: 384 STAKTAPSKSIAASA 398
           +TA    +K I AS+
Sbjct: 442 ATAVNPEAKQIPASS 456


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 127/357 (35%), Positives = 189/357 (52%), Gaps = 20/357 (5%)

Query: 35  SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           +   D +L+ Y+P+ SS+SK V+C++ LC  RS C      CPY+  Y + +TS+SG LV
Sbjct: 140 AFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNCPYMVSYVSAETSTSGILV 199

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
           +D+LHL     H     V+++VI GCG+ Q+GS+LD AAP+G+ GLG+  +SVPS+L++ 
Sbjct: 200 EDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSRE 257

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
           G   +SFS+CF  +  G + FGD+G   Q  T F  +   +  Y + V    +G + +  
Sbjct: 258 GFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRVGTTVIDV 316

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK-V 272
             F AL DSG SFT+L    Y  +   F   V  +R        ++YCY+ S +     +
Sbjct: 317 E-FTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLI 375

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
           P + L       F V + I      +   V+CL V+ +  +  IIGQNFM G+R+VFDRE
Sbjct: 376 PSVSLTMGGGSHFAVYDPIIII-STQSELVYCLAVVKS-AELNIIGQNFMTGYRVVFDRE 433

Query: 333 NLKLAWSHSKCEEVID---------KSHVHLVPP--PAGQSPNPLPTTEQQSTSNGQ 378
            L L W    C ++ D         +SH   VPP   AG    P   + ++S  N Q
Sbjct: 434 KLVLGWKKFDCYDIEDHNDAIPTRPRSHAD-VPPAVAAGLGNYPATDSTRKSKYNSQ 489


>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
          Length = 513

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 118/323 (36%), Positives = 189/323 (58%), Gaps = 8/323 (2%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P+ S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S
Sbjct: 148 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 207

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
             A    V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+C
Sbjct: 208 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 265

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F ++  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG
Sbjct: 266 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 323

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
            SFT L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ L      
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 382

Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            F V + I +  +N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  
Sbjct: 383 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 441

Query: 343 CEEVIDKSHVHLVPPPAGQSPNP 365
           C    + S + + P P+   P P
Sbjct: 442 CYNFDESSRLPVNPSPSAVPPKP 464


>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
          Length = 829

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 130/394 (32%), Positives = 210/394 (53%), Gaps = 29/394 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           YD   SS+S+ V C+  LC+ +  C S    CPY  +Y +  TS++G+LV+D+LHL +  
Sbjct: 151 YDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNGTSTTGFLVEDVLHLITDD 210

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                +  +  +  GCG+ QTG++LDGAAP+G+ GLG+G+ SVPS+LAK GL  NSFS+C
Sbjct: 211 DETKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKEGLTSNSFSMC 268

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGD     Q  T F  +   +  Y + V    +G +      F A+ DSG
Sbjct: 269 FGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIVGGNAADLE-FHAIFDSG 326

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSK 281
            SFT L    Y ++   F+  +  +R S   +    ++YCY+ SS + +++P + L    
Sbjct: 327 TSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKTVELP-INLTMKG 385

Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
             +++V + I +    EG  + CL V+ ++ +  IIGQNFM G+RIVFDREN+ L W  S
Sbjct: 386 GDNYLVTDPIVTI-SGEGVNLLCLGVLKSN-NVNIIGQNFMTGYRIVFDRENMILGWRES 443

Query: 342 KC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIA 395
            C  +E+    I++S+   + P    +P      E  + SN    +P  + K  P+ +  
Sbjct: 444 NCYVDELSTLAINRSNSPAISPAIAVNPE-----ETSNQSNDPELSPNLSFKIKPTSAFM 498

Query: 396 AS--------AQQLDSVLRVACSLLVLMCLLLSS 421
            +        + Q+   + VA   L++M  ++S+
Sbjct: 499 MALLVPKNHRSTQISMAVMVAFLNLIIMFSVVST 532


>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 544

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 131/358 (36%), Positives = 192/358 (53%), Gaps = 39/358 (10%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           V D N+ E D   SS+ KNV C+  +CK ++ C S    C Y  +Y + DTSSSG+LV+D
Sbjct: 157 VIDLNIYELD--KSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSNDTSSSGFLVED 213

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
           +LHL   + +     + + + IGCG+ QTG +L+GAAP+G+ GLG+ +VSVPS+LA+ GL
Sbjct: 214 VLHL--ITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILAQKGL 271

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           I +SFS+CF  + SG + FGD G + Q  T F  + E +  Y V +    +G        
Sbjct: 272 ISDSFSMCFGSDGSGRITFGDTGSSDQGKTPF-NLRESHPTYNVTITQIIVGGYAADHE- 329

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNS---WKYCYNASSEEMLKV 272
           F A+ DSG SFT+L    Y  +  KF+ LV + R S L  +S   ++YCY+ S ++ ++V
Sbjct: 330 FHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQTIEV 389

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ------------- 319
           P + L       + V + I          + CL +  +D +  IIG+             
Sbjct: 390 PFLNLTMKGGDDYYVTDPIVPVSSEVEGNLLCLGIQKSD-NLNIIGREYTTEEEFLHLKH 448

Query: 320 ---------NFMMGHRIVFDRENLKLAWSHSKC-EEVI----DKSHVHLVPPPAGQSP 363
                    NFM G+RIVFDREN+ L W  S C EEV+    +KSH   + P    +P
Sbjct: 449 MIIKFFIQKNFMTGYRIVFDRENMNLGWKESNCTEEVLSIPTNKSHSPAISPAIAVNP 506


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 119/306 (38%), Positives = 173/306 (56%), Gaps = 9/306 (2%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +L+ Y P++SS+S  V C+  LC     C S +  CPY   Y +  TSS+G LV+D+LHL
Sbjct: 150 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHL 209

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S  K +   ++ + V  GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+  NS
Sbjct: 210 VSNDKSS--KAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 267

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           FS+CF  + +G + FGD+G   Q+ T  L I + +  Y + V    +G +      F A+
Sbjct: 268 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAV 325

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEEMLKVPDMRL 277
            DSG SFT+L    Y  +   F+ L   KR     +   ++YCY  S +++  + P + L
Sbjct: 326 FDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNL 385

Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
                 S+ V + +   P  +   V+CL +M  + D  IIGQNFM G+R+VFDRE L L 
Sbjct: 386 TMKGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLILG 443

Query: 338 WSHSKC 343
           W  S C
Sbjct: 444 WKESDC 449


>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
           Group]
          Length = 476

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 125/363 (34%), Positives = 203/363 (55%), Gaps = 16/363 (4%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P+ S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S
Sbjct: 111 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 170

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
             A    V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+C
Sbjct: 171 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 228

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F ++  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG
Sbjct: 229 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 286

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
            SFT L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ L      
Sbjct: 287 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 345

Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            F V + I +  +N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  
Sbjct: 346 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 404

Query: 343 CEEVIDKSHVHLVPPPAGQSPNP-------LPTTEQQSTSNG-QAAAPPSTAKTAPSKSI 394
           C    + S + + P P+     P        P   + +  NG Q    PS +     +S+
Sbjct: 405 CYNFDESSRLPVNPSPSAVPSKPGLGPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSV 464

Query: 395 AAS 397
           +A+
Sbjct: 465 SAT 467


>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 530

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 13/385 (3%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D     Y P  SS+S+ V CS  LC  ++ C +  + CPY   Y +E+TSS G LV+D+L
Sbjct: 142 DLKFDMYSPRKSSTSRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVL 201

Query: 99  HLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +L + S    QS + Q+ +  GCG+ Q+GS+L  AAP+G++GLG+   SVPSLLA  G+ 
Sbjct: 202 YLTTESG---QSKITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIA 258

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
            NSFS+CF E+  G + FGD G + Q  T  L I ++   Y + +    +G      + F
Sbjct: 259 ANSFSMCFGEDGHGRINFGDTGSSDQLETP-LNIYKQNPYYNISITGAMVGGKSF-DTKF 316

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMR 276
            A+VDSG SFT L   +Y E+   F+  V   R  L  +  ++YCY+ S++  +  P++ 
Sbjct: 317 SAVVDSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSISAQGAVNPPNIS 376

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
           L       F V   I +  +     + +CL +M ++G   +IG+NFM G +IVFDRE L 
Sbjct: 377 LTAKGGSIFPVNGPIITITDTSSRPIAYCLAIMKSEG-VNLIGENFMSGLKIVFDRERLV 435

Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIA 395
           L W    C    + S + +   P+   P P       +    + A+P  T    P  S +
Sbjct: 436 LGWKTFNCYNFDNSSKLPVNRNPSADPPKPALGPSSSNPEAAKGASPNITQIDVPHSS-S 494

Query: 396 ASAQQLD---SVLRVACSLLVLMCL 417
           +S  +L    + L    +LL L  L
Sbjct: 495 SSETRLHLSGTFLSATIALLFLAAL 519


>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 516

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 110/309 (35%), Positives = 171/309 (55%), Gaps = 10/309 (3%)

Query: 42  LSEYDPSSSSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
            + YD   SS+S  VSC++   C+ R  C S    C Y  DY + DTSS G++V+D+LHL
Sbjct: 153 FNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHL 212

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +       +  +  +  GCG+ QTG +L+GAAP+G+ GLG+ ++SVPS+LA+ GLI NS
Sbjct: 213 ITDDDQTKDADTR--IAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILAREGLISNS 270

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           FS+CF  + +G + FGD G   Q+ T F  + + +  Y + +    + +S +    F A+
Sbjct: 271 FSMCFGSDSAGRITFGDTGSPDQRKTPF-NVRKLHPTYNITITKIIVEDS-VADLEFHAI 328

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYCYNASSEEMLKVPDMR 276
            DSG SFT++    Y  +   ++  V +KR S Q       + YCY+ S  + ++VP + 
Sbjct: 329 FDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQTIEVPFLN 388

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
           L       + V + I      E   + CL +  +D    IIGQNFM G++IVFDR+N+ L
Sbjct: 389 LTMKGGDDYYVMDPIIQVSSEEEGDLLCLGIQKSDS-VNIIGQNFMTGYKIVFDRDNMNL 447

Query: 337 AWSHSKCEE 345
            W  + C +
Sbjct: 448 GWKETNCSD 456


>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
 gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
          Length = 490

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 124/363 (34%), Positives = 203/363 (55%), Gaps = 16/363 (4%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P+ S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S
Sbjct: 125 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 184

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
             +    V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+C
Sbjct: 185 AQS--KIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 242

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F ++  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG
Sbjct: 243 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 300

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
            SFT L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ L      
Sbjct: 301 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 359

Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            F V + I +  +N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  
Sbjct: 360 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 418

Query: 343 CEEVIDKSHVHLVPPPAGQSPNP-------LPTTEQQSTSNG-QAAAPPSTAKTAPSKSI 394
           C    + S + + P P+     P        P   + +  NG Q    PS +     +S+
Sbjct: 419 CYNFDESSRLPVNPSPSAVPSKPGLGPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSV 478

Query: 395 AAS 397
           +A+
Sbjct: 479 SAT 481


>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
 gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 543

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 135/374 (36%), Positives = 193/374 (51%), Gaps = 20/374 (5%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILH 99
           +L  Y P  SS+SK V+C +PLC  R+ C +  +  CPY   Y + +TSSSG LV D+LH
Sbjct: 156 SLRPYSPRRSSTSKQVACDNPLCGQRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLH 215

Query: 100 LASF--SKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAG 155
           L        A   ++Q+ V+ GCG+ QTG++LDG   A DG+MGLG+G VSVPS LA +G
Sbjct: 216 LTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASG 275

Query: 156 LI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
           L+  +SFS+CF ++  G V FGD G   Q  T F  +      Y V   S  +G+  +  
Sbjct: 276 LVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFT-VRSLNPTYNVSFTSIGVGSESVAA 334

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASSEEM 269
             F A++DSG SFT+L    Y ++  KF+  VS +R++    S     ++YCY  S  + 
Sbjct: 335 E-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRLSPNQT 393

Query: 270 -LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYG--IIGQNFMMGH 325
            + +PD+ L       F V        +  G  V +CL +M  D   G  IIGQNFM G 
Sbjct: 394 EVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAVGYCLAIMRNDMAIGIDIIGQNFMTGL 453

Query: 326 RIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPST 385
           ++VFDRE   L W    C      +      P    +P   PT      ++G  +  P  
Sbjct: 454 KVVFDRERSVLGWEKFDCYRNARVADAPDGSPGPSSAPAAGPTKITPRQNDGSGSGYPGA 513

Query: 386 A---KTAPSKSIAA 396
           A   ++A S++ AA
Sbjct: 514 APLPRSAGSRNAAA 527


>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
 gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
          Length = 541

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 131/364 (35%), Positives = 188/364 (51%), Gaps = 33/364 (9%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHL 100
           L  Y P  SS+SK V+C + LC   + C +  +  CPY   Y + +TS+SG LV D+LHL
Sbjct: 158 LRPYSPRESSTSKQVTCDNALCDRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHL 217

Query: 101 ASFSKHAPQSS------VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
              ++  P ++      +Q+ V+ GCG+ QTG++LDGAA DG+MGLG  +VSVPS+LA +
Sbjct: 218 ---TRERPGAAAEAGEALQAPVVFGCGQVQTGTFLDGAAFDGLMGLGRENVSVPSVLASS 274

Query: 155 GLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSC 211
           GL+  +SFS+CF ++  G + FGD G + Q  T F      Y+  F  V VE+  +    
Sbjct: 275 GLVASDSFSMCFGDDGVGRINFGDSGSSGQGETPFTGRRTLYNVSFTAVNVETKSVA--- 331

Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCY--NA 264
              + F A++DSG SFT+L    Y E+   F+ LV  +R +    S     ++YCY    
Sbjct: 332 ---AEFAAVIDSGTSFTYLADPEYTELATNFNSLVRERRTNFSSGSADPFPFEYCYALGP 388

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD--GDYGIIGQNFM 322
           +  E L +PD+ L       F V   +           +CL +M  D   ++ IIGQNFM
Sbjct: 389 NQTEAL-IPDVSLTTKGGARFPVTQPVIGVASGRTVVGYCLAIMKNDLGVNFNIIGQNFM 447

Query: 323 MGHRIVFDRENLKLAWSHSKC---EEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQA 379
            G ++VFDRE   L W    C     V D       P PA   P  +   +   +SNG  
Sbjct: 448 TGLKVVFDREKSVLGWEKFDCYKNARVADAPDGSPSPAPAAD-PTKITPRQNDGSSNGFP 506

Query: 380 AAPP 383
           AA P
Sbjct: 507 AAAP 510


>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa Japonica Group]
          Length = 732

 Score =  204 bits (518), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 117/323 (36%), Positives = 188/323 (58%), Gaps = 8/323 (2%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P+ S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S
Sbjct: 148 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 207

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
             A    V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+C
Sbjct: 208 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 265

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F ++  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG
Sbjct: 266 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 323

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
            SFT L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ L      
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 382

Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            F V + I +  +N    V +CL +M ++G   +IG+NFM G ++VFDRE + L W +  
Sbjct: 383 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 441

Query: 343 CEEVIDKSHVHLVPPPAGQSPNP 365
           C    + S + + P P+     P
Sbjct: 442 CYNFDESSRLPVNPSPSAVPSKP 464


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 119/307 (38%), Positives = 176/307 (57%), Gaps = 11/307 (3%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +L+ Y P++SS+S  V C+  LC     C S +  CPY   Y +  TSS+G LV+D+LHL
Sbjct: 150 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHL 209

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S  K +   ++ + V +GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+  NS
Sbjct: 210 VSNDKSS--KAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 267

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQSGFQA 219
           FS+CF  + +G + FGD+G   Q+ T  L I + +  Y + V    + GN+   +  F A
Sbjct: 268 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVEGNTGDLE--FDA 324

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEEMLKVPDMR 276
           + DSG SFT+L    Y  +   F+ L   KR     +   ++YCY  S +++  + P + 
Sbjct: 325 VFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVN 384

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
           L      S+ V + +   P  +   V+CL ++  + D  IIGQNFM G+R+VFDRE L L
Sbjct: 385 LTMKGGSSYPVYHPLVVIPMKDT-DVYCLAILKIE-DISIIGQNFMTGYRVVFDREKLIL 442

Query: 337 AWSHSKC 343
            W  S C
Sbjct: 443 GWKESDC 449


>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
 gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
          Length = 523

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 120/330 (36%), Positives = 190/330 (57%), Gaps = 7/330 (2%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +D     Y P  SS+S+ V CS  LC  +S+C+S    CPY  +Y +++TSS+G LV+D+
Sbjct: 146 RDLKFDTYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVLVEDV 205

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L+L   +++     V + +  GCGR QTGS+L  AAP+G++GLG+  +SVPSLLA  G+ 
Sbjct: 206 LYL--ITEYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLASEGVA 263

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
            NSFS+CF ++  G + FGD G + QQ T  L I ++   Y + +    +G+     + F
Sbjct: 264 ANSFSMCFGDDGRGRINFGDTGSSDQQETP-LNIYKQNPYYNISITGAMVGSKSF-NTNF 321

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMR 276
            A+VDSG SFT L   +Y+E+   F+  V  K   L  +  +++CY+ S +  +  P++ 
Sbjct: 322 NAIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSISPKGSVNPPNIS 381

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
           L+      F V + I +  ++    + +CL VM ++G   +IG+NFM G ++VFDRE   
Sbjct: 382 LMAKGGSIFPVNDPIITITDDASNPMAYCLAVMKSEG-VNLIGENFMSGLKVVFDRERKV 440

Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 365
           L W    C  V + S++ + P P+G  P P
Sbjct: 441 LGWKKFNCYSVDNSSNLPVNPNPSGVPPKP 470


>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 545

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 134/373 (35%), Positives = 192/373 (51%), Gaps = 20/373 (5%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHL 100
           L  Y P  SS+S+ V+C +PLC  R+ C +  +  CPY   Y + +TSSSG LV D+LHL
Sbjct: 159 LRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHL 218

Query: 101 ASF--SKHAPQSSVQSSVIIGCGRKQTGSYLD--GAAPDGVMGLGLGDVSVPSLLAKAGL 156
                   A   ++Q+ V+ GCG+ QTG++LD  G A DG+MGLG+G VSVPS LA +GL
Sbjct: 219 TRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSALAASGL 278

Query: 157 I-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
           +  +SFS+CF ++  G V FGD G   Q  T F  +      Y V   S  IG+  +   
Sbjct: 279 VASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFT-VRSLNPTYNVSFTSIGIGSESVAAE 337

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASSEEM- 269
            F A++DSG SFT+L    Y ++  KF+  VS +R++    S     ++YCY  S  +  
Sbjct: 338 -FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRLSPNQTE 396

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYG--IIGQNFMMGHR 326
           + +PD+ L       F V        +  G  + +CL +M  D   G  IIGQNFM G +
Sbjct: 397 VAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIMRNDMAIGIDIIGQNFMTGLK 456

Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA 386
           +VFDRE   L W    C      +      P    +P   PT      ++G  +  P  A
Sbjct: 457 VVFDRERSVLGWEKFDCYRNARVADAPDGSPGPSSAPAAGPTKITPRQNDGSGSGYPGAA 516

Query: 387 ---KTAPSKSIAA 396
              ++A S++ AA
Sbjct: 517 PLPRSAGSRNAAA 529


>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 553

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 128/380 (33%), Positives = 194/380 (51%), Gaps = 43/380 (11%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D +LS Y+P+ SS+SK V+C++ LC  R+ C      CPY+  Y + +TS+SG LV+D+L
Sbjct: 149 DFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVL 208

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           HL     +     V+++VI GCG+ Q+GS+LD AAP+G+ GLG+  +SVPS+L++ G   
Sbjct: 209 HLTQPDDN--HDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTA 266

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
           +SFS+CF  +  G + FGD+G   Q  T F  +   +  Y + +    +G + L    F 
Sbjct: 267 DSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVNPSHPTYNITINQVRVGTT-LIDVEFT 324

Query: 219 ALVDSGASFTFLPTEIYAEV--------------------------VVKFDKLVSSKRIS 252
           AL DSG SFT+L    Y+ +                          +++F   V  +R  
Sbjct: 325 ALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVEDRRRP 384

Query: 253 LQGN-SWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
                 + YCY+ S +     +P M L       FVV + I      +   V+CL V+ +
Sbjct: 385 PDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIII-STQSELVYCLAVVKS 443

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKS-------HVHLVPPPAGQSP 363
             +  IIGQNFM G+R+VFDRE L L W  S C ++ D +       H   VPP      
Sbjct: 444 -AELNIIGQNFMTGYRVVFDREKLILGWKKSDCYDIEDHNNAIPIGQHSDKVPPAVAAGL 502

Query: 364 NPLPTTE--QQSTSNGQAAA 381
              PTT+  ++S  N Q ++
Sbjct: 503 GDYPTTDSSRKSKYNSQHSS 522


>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 118/315 (37%), Positives = 173/315 (54%), Gaps = 18/315 (5%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +L+ Y P++SS+S  V C+  LC     C S +  CPY   Y +  TSS+G LV+D+LHL
Sbjct: 101 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHL 160

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S  K +   ++ + V  GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+  NS
Sbjct: 161 VSNDKSS--KAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 218

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           FS+CF  + +G + FGD+G   Q+ T  L I + +  Y + V    +G +      F A+
Sbjct: 219 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAV 276

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCY----------NASSEE 268
            DSG SFT+L    Y  +   F+ L   KR     +   ++YCY          +  +++
Sbjct: 277 FDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKD 336

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
             + P + L      S+ V + +   P  +   V+CL +M  + D  IIGQNFM G+R+V
Sbjct: 337 SFQYPAVNLTMKGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE-DISIIGQNFMTGYRVV 394

Query: 329 FDRENLKLAWSHSKC 343
           FDRE L L W  S C
Sbjct: 395 FDREKLILGWKESDC 409


>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
 gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
          Length = 499

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 118/303 (38%), Positives = 174/303 (57%), Gaps = 12/303 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P  SS+SK V C+   C  +  C +    CPY   Y +  TSSSG+LV+D+L+L++ +
Sbjct: 155 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 213

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL  NSFS+C
Sbjct: 214 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 271

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQG + Q+ T  L I +++  Y + +    IGN   T   F  + D+G
Sbjct: 272 FGRDGIGRISFGDQGSSDQEETP-LNINQQHPTYAITISGITIGNKP-TDLDFITIFDTG 329

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE    +PD+ L     
Sbjct: 330 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVSG 389

Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F V +   + S  E+E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 390 SLFPVIDPGQVISIQEHE--YVYCLAIVKSR-KLNIIGQNFMTGLRVVFDRERKILGWKK 446

Query: 341 SKC 343
             C
Sbjct: 447 FNC 449


>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 508

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 124/376 (32%), Positives = 199/376 (52%), Gaps = 21/376 (5%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C+   G S  +    + YD   SS+S+ V C+  LC+ +  C S    CPY  +Y +  T
Sbjct: 134 CVHGIGLSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGT 193

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S++G+LV+D+LHL +       +  +  +  GCG+ QTG++LDGAAP+G+ GLG+ + SV
Sbjct: 194 STTGFLVEDVLHLITDDDKTKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMSNESV 251

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+LAK GL  NSFS+CF  +  G + FGD     Q  T F  +   +  Y + V    +
Sbjct: 252 PSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIV 310

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNA 264
           G   +    F A+ DSG SFT+L    Y ++   F+  +  +R S   ++   ++YCY  
Sbjct: 311 GEK-VDDLEFHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYEL 369

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           S  + +++  + L      +++V + I +    EG  + CL V+ ++ +  IIGQNFM G
Sbjct: 370 SPNQTVEL-SINLTMKGGDNYLVTDPIVTV-SGEGINLLCLGVLKSN-NVNIIGQNFMTG 426

Query: 325 HRIVFDRENLKLAWSHSKC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQ 378
           +RIVFDREN+ L W  S C  +E+    I++S+   + P    +P       + S SN  
Sbjct: 427 YRIVFDRENMILGWRESNCYDDELSTLPINRSNTPAISPAIAVNPE-----ARSSQSNNP 481

Query: 379 AAAPPSTAKTAPSKSI 394
             +P  + K  P+ + 
Sbjct: 482 VLSPNLSFKIKPTSAF 497


>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
          Length = 530

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 113/306 (36%), Positives = 175/306 (57%), Gaps = 12/306 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y PS SS+S+ V C+   C+ R  C +    CPY   Y + DTSSSG+LV+D+L+L++  
Sbjct: 163 YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST-- 219

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
           + A    +++ ++ GCG+ QTGS+LD AAP+G+ GLG+  +S+PS+LA+ GL  NSF++C
Sbjct: 220 EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMC 279

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQG + Q+ T  L +  ++  Y + +    +GNS LT   F  + D+G
Sbjct: 280 FSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEMTVGNS-LTDLEFSTIFDTG 337

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE+ ++ P + L     
Sbjct: 338 TSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGG 397

Query: 283 QSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F V     + S  ++E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 398 SVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVFDRERKILGWKK 454

Query: 341 SKCEEV 346
             C + 
Sbjct: 455 FNCYDT 460


>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
          Length = 530

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 113/306 (36%), Positives = 175/306 (57%), Gaps = 12/306 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y PS SS+S+ V C+   C+ R  C +    CPY   Y + DTSSSG+LV+D+L+L++  
Sbjct: 163 YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST-- 219

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
           + A    +++ ++ GCG+ QTGS+LD AAP+G+ GLG+  +S+PS+LA+ GL  NSF++C
Sbjct: 220 EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMC 279

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQG + Q+ T  L +  ++  Y + +    +GNS LT   F  + D+G
Sbjct: 280 FSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LTDLEFSTIFDTG 337

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE+ ++ P + L     
Sbjct: 338 TSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGG 397

Query: 283 QSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F V     + S  ++E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 398 SVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVFDRERKILGWKK 454

Query: 341 SKCEEV 346
             C + 
Sbjct: 455 FNCYDT 460


>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
 gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
 gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
          Length = 530

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 113/306 (36%), Positives = 175/306 (57%), Gaps = 12/306 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y PS SS+S+ V C+   C+ R  C +    CPY   Y + DTSSSG+LV+D+L+L++  
Sbjct: 163 YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST-- 219

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
           + A    +++ ++ GCG+ QTGS+LD AAP+G+ GLG+  +S+PS+LA+ GL  NSF++C
Sbjct: 220 EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMC 279

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQG + Q+ T  L +  ++  Y + +    +GNS LT   F  + D+G
Sbjct: 280 FSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LTDLEFSTIFDTG 337

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE+ ++ P + L     
Sbjct: 338 TSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGG 397

Query: 283 QSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F V     + S  ++E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 398 SVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVFDRERKILGWKK 454

Query: 341 SKCEEV 346
             C + 
Sbjct: 455 FNCYDT 460


>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 197/379 (51%), Gaps = 28/379 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y PS SS+S+ V C+   C  R  C S    CPY   Y + DTSSSG+LV+D+L+L++  
Sbjct: 146 YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTED 204

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H PQ  +++ ++ GCG  QTGS+LD AAP+G+ GLG+  +SVPS+LA+ GL  NSFS+C
Sbjct: 205 TH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMC 262

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQG + Q+ T  L I +K+  Y + +    +GN+ L       + D+G
Sbjct: 263 FGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEVSTIFDTG 320

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE  ++ P + L     
Sbjct: 321 TSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRTVGG 380

Query: 283 QSF--VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F  +    + S  ++E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 381 SLFPAIDPGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVFDRERKILGWKK 437

Query: 341 SKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQQ 400
             C +                S NPL    + ST   +  +P  T   A +  +   +  
Sbjct: 438 FNCYDT--------------DSLNPLSINSRNSTP--ENYSPQETKNPAGASQLGHVSSS 481

Query: 401 LDSVLRVACSLLVLMCLLL 419
              V     SLL++M +LL
Sbjct: 482 PPLVWWHNNSLLLMMFVLL 500


>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 197/379 (51%), Gaps = 28/379 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y PS SS+S+ V C+   C  R  C S    CPY   Y + DTSSSG+LV+D+L+L++  
Sbjct: 146 YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTED 204

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H PQ  +++ ++ GCG  QTGS+LD AAP+G+ GLG+  +SVPS+LA+ GL  NSFS+C
Sbjct: 205 TH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMC 262

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQG + Q+ T  L I +K+  Y + +    +GN+ L       + D+G
Sbjct: 263 FGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEVSTIFDTG 320

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE  ++ P + L     
Sbjct: 321 TSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRTVGG 380

Query: 283 QSF--VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F  +    + S  ++E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 381 SLFPAIDPGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVFDRERKILGWKK 437

Query: 341 SKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQQ 400
             C +                S NPL    + ST   +  +P  T   A +  +   +  
Sbjct: 438 FNCYDT--------------DSLNPLSINSRNSTP--ENYSPQETKNPAGASQLRHVSSS 481

Query: 401 LDSVLRVACSLLVLMCLLL 419
              V     SLL++M +LL
Sbjct: 482 PPLVWWHNNSLLLMMFVLL 500


>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
 gi|219887047|gb|ACL53898.1| unknown [Zea mays]
 gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 416

 Score =  194 bits (492), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 116/306 (37%), Positives = 173/306 (56%), Gaps = 12/306 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P  SS+SK V C+   C  +  C +    CPY   Y +  TSSSG+LV+D+L+L++ +
Sbjct: 54  YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 112

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL  NSFS+C
Sbjct: 113 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 170

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQ  + Q+ T  L I  ++  Y + +    +GN   T   F  + D+G
Sbjct: 171 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDMDFITIFDTG 228

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE    +PD+ L     
Sbjct: 229 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTG 288

Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F V +   + S  E+E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 289 SMFPVIDPGQVISIQEHE--YVYCLAIVKS-MKLNIIGQNFMTGLRVVFDRERKILGWKK 345

Query: 341 SKCEEV 346
             C + 
Sbjct: 346 FNCYDT 351


>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
 gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 500

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 116/303 (38%), Positives = 172/303 (56%), Gaps = 12/303 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P  SS+SK V C+   C  +  C +    CPY   Y +  TSSSG+LV+D+L+L++ +
Sbjct: 156 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 214

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL  NSFS+C
Sbjct: 215 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 272

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQ  + Q+ T  L I  ++  Y + +    +GN   T   F  + D+G
Sbjct: 273 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNKP-TDMDFITIFDTG 330

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE    +PD+ L     
Sbjct: 331 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTG 390

Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F V +   + S  E+E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 391 SMFPVIDPGQVISIQEHE--YVYCLAIVKSM-KLNIIGQNFMTGLRVVFDRERKILGWKK 447

Query: 341 SKC 343
             C
Sbjct: 448 FNC 450


>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
          Length = 520

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 116/306 (37%), Positives = 173/306 (56%), Gaps = 12/306 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P  SS+SK V C+   C  +  C +    CPY   Y +  TSSSG+LV+D+L+L++ +
Sbjct: 158 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 216

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL  NSFS+C
Sbjct: 217 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 274

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQ  + Q+ T  L I  ++  Y + +    +GN   T   F  + D+G
Sbjct: 275 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNKP-TDMDFITIFDTG 332

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
            SFT+L    Y  +   F   V + R +      ++YCY+ +SSE    +PD+ L     
Sbjct: 333 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTG 392

Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             F V +   + S  E+E   V+CL ++ +     IIGQNFM G R+VFDRE   L W  
Sbjct: 393 SMFPVIDPGQVISIQEHE--YVYCLAIVKS-MKLNIIGQNFMTGLRVVFDRERKILGWKK 449

Query: 341 SKCEEV 346
             C + 
Sbjct: 450 FNCYDT 455


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 109/304 (35%), Positives = 164/304 (53%), Gaps = 9/304 (2%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L+ Y  ++SS+S  V CS  LC+  + C S K  CPY   Y +E++SS+GYLV DILH+A
Sbjct: 151 LNHYSSNASSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMA 210

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +    +    V   V +GCG+ QTG + +  AP+G++GLG+G VSVPS LA  GL  +SF
Sbjct: 211 T--DDSQLKPVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSF 268

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
           S+CF     G + FGD GP  Q+ T F P    Y+   + +    I  +  T     A++
Sbjct: 269 SMCFGYYGYGRIDFGDIGPVGQRETPFNPASLSYNVTILQI----IVTNRPTNVHLTAII 324

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFS 280
           DSGASFT+L    Y+ +    D  +  +RI    +  ++YCY  S   + + P++     
Sbjct: 325 DSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTME 384

Query: 281 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
             + F V     S   ++G    CL ++ +  D  +IG NF  G+R+VF+RE + L W  
Sbjct: 385 GGRKFDVITSYVSVDTDDG-PALCLAIVKST-DINVIGHNFFGGYRVVFNREKMTLGWKE 442

Query: 341 SKCE 344
             C+
Sbjct: 443 VDCD 446


>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 510

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 135/387 (34%), Positives = 201/387 (51%), Gaps = 35/387 (9%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           S Y PS SS+S+ V C+   C  R  C S    CPY   Y + DTSSSG+LV+D+L+L++
Sbjct: 147 SFYIPSMSSTSQAVPCNSDFCDHRKDC-STTSSCPYKMVYVSADTSSSGFLVEDVLYLST 205

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
              H PQ  +++ ++ GCG+ QTGS+LD AAP+G+ GLG+  +SVPS+LA  GL  +SFS
Sbjct: 206 EDNH-PQI-LKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSDSFS 263

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 222
           +CF  +  G + FGDQG + Q+ T  L I +K+  Y + +    +G   +    F  + D
Sbjct: 264 MCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGITVGTEPMDLE-FSTIFD 321

Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFS 280
           +G +FT+L    Y  +   F   V + R +      ++YCY+ +SSE  ++ P +     
Sbjct: 322 TGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPFEYCYDLSSSEARIQTPGVSFRTV 381

Query: 281 KNQSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
               F V +   + S  ++E   V+CL ++ +     IIGQNFM G R+VFDRE   L W
Sbjct: 382 GGSLFPVIDLGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVFDRERKILGW 438

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
               C +                S NPL    + S+        PST     +K+ A + 
Sbjct: 439 KKFNCYDT--------------DSTNPLSINSRNSS-----GFSPSTYSPQETKNPAGAT 479

Query: 399 Q--QLDSVLRVAC--SLLVLMCLLLSS 421
           Q   L+S   V    + LVLM LL+ S
Sbjct: 480 QLRHLNSSPPVMWHNNSLVLMFLLVHS 506


>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Brachypodium distachyon]
          Length = 509

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 193/363 (53%), Gaps = 22/363 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L  Y P  SS+SK V+CSH LC   ++C +    CPY   Y + +TSSSG LV+D+L++ 
Sbjct: 126 LKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMT 185

Query: 102 SFSKHAPQ-------SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
             S  +          +V + V+ GCG++QTG++LDGAA +G++GLG+  VSVPSLLA A
Sbjct: 186 RQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDRVSVPSLLAAA 245

Query: 155 GLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           GL+  +SFS+CF  + +G + FG+   A  Q+ +   + +    Y + V +  +      
Sbjct: 246 GLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNISVTAVNVKGKGAM 305

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEM-LK 271
            + F A+VDSG SFT+L    Y+ +   F+  V  KR +L  +  ++YCY  S  +  + 
Sbjct: 306 AAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCYALSRGQTEVL 365

Query: 272 VPDMRLIFSKNQSF-VVRNHIFSFPENEGFTV----FCLTVMSTDGDYGIIGQNFMMGHR 326
           +P++ L       F V R  +    E     V    +CL V  +D    IIGQNFM G +
Sbjct: 366 MPEVSLTTRGGAVFPVTRPFVIVAGETTDGQVHAVGYCLAVFKSDIPIDIIGQNFMTGLK 425

Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTE---QQSTSNGQAAAPP 383
           +VFDR+   L W+   C + +    V     PA  +P P+P T+   +QS +    A  P
Sbjct: 426 VVFDRQRSVLGWTKFDCYKNM---KVEDDGSPAA-APGPMPVTQLRPRQSDTPFPGAVQP 481

Query: 384 STA 386
            +A
Sbjct: 482 RSA 484


>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 498

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 115/302 (38%), Positives = 170/302 (56%), Gaps = 12/302 (3%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P  SS+SK V C+   C  +  C +    CPY   Y +  TSSSG+LV+D+L+L++ +
Sbjct: 156 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 214

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            H PQ  +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL  NSFS+C
Sbjct: 215 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 272

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F  +  G + FGDQ  + Q+ T  L I  ++  Y + +    +GN   T   F  + D+G
Sbjct: 273 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNKP-TDMDFITIFDTG 330

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
            SFT+L    Y  +   F   V + R +      ++YCY+  SE    +PD+ L      
Sbjct: 331 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDL-SEARFPIPDIILRTVTGS 389

Query: 284 SFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
            F V +   + S  E+E   V+CL ++ +     IIGQNFM G R+VFDRE   L W   
Sbjct: 390 MFPVIDPGQVISIQEHE--YVYCLAIVKSM-KLNIIGQNFMTGLRVVFDRERKILGWKKF 446

Query: 342 KC 343
            C
Sbjct: 447 NC 448


>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
          Length = 551

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 203/379 (53%), Gaps = 41/379 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L +Y PS SS+SK V+C+  LC   ++C +    CPY   Y+  +TSSSG LV+D+L+L 
Sbjct: 155 LRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLT 214

Query: 102 ---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
                +  A  ++V++ V+ GCG+ QTGS+LDGAA DG+MGLG+  VSVPS+LA  G+++
Sbjct: 215 REKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVK 274

Query: 159 -NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
            NSFS+CF ++  G + FGD G A Q  T F+ +   +  Y + + S  +G+  L   GF
Sbjct: 275 SNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYNISITSMSVGDKNLPL-GF 332

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEE-ML 270
            A+ DSG SFT+L    Y      F+  +S +R +  G++      ++YCY+ S ++  +
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQTTV 392

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFP---ENEGFTV--FCLTVMSTDGDYGIIGQNFMMGH 325
           ++P + L  +    F V + ++       N    +  +CL V+ +D    IIGQNFM G 
Sbjct: 393 ELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNFMTGL 452

Query: 326 RIVFDRENLKLAWSHSKC---EEVIDK--------------SHVHLVP----PPAGQSPN 364
           ++VF+RE   L W    C   E++ D               +HV   P     PAG++P 
Sbjct: 453 KVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQESDSPAGRTPI 512

Query: 365 P--LPTTEQQSTSNGQAAA 381
           P   P     S + G  A 
Sbjct: 513 PGAAPVPRSSSAAAGGRAG 531


>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
           Japonica Group]
 gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
          Length = 551

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 203/379 (53%), Gaps = 41/379 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L +Y PS SS+SK V+C+  LC   ++C +    CPY   Y+  +TSSSG LV+D+L+L 
Sbjct: 155 LRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLT 214

Query: 102 ---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
                +  A  ++V++ V+ GCG+ QTGS+LDGAA DG+MGLG+  VSVPS+LA  G+++
Sbjct: 215 REKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVK 274

Query: 159 -NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
            NSFS+CF ++  G + FGD G A Q  T F+ +   +  Y + + S  +G+  L   GF
Sbjct: 275 SNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYNISITSMSVGDKNLPL-GF 332

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEE-ML 270
            A+ DSG SFT+L    Y      F+  +S +R +  G++      ++YCY+ S ++  +
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQTTV 392

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFP---ENEGFTV--FCLTVMSTDGDYGIIGQNFMMGH 325
           ++P + L  +    F V + ++       N    +  +CL V+ +D    IIGQNFM G 
Sbjct: 393 ELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNFMTGL 452

Query: 326 RIVFDRENLKLAWSHSKC---EEVIDK--------------SHVHLVP----PPAGQSPN 364
           ++VF+RE   L W    C   E++ D               +HV   P     PAG++P 
Sbjct: 453 KVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQESDSPAGRTPI 512

Query: 365 P--LPTTEQQSTSNGQAAA 381
           P   P     S + G  A 
Sbjct: 513 PGAAPVPRSSSAAAGGRAG 531


>gi|388505672|gb|AFK40902.1| unknown [Lotus japonicus]
          Length = 207

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 95/207 (45%), Positives = 130/207 (62%), Gaps = 3/207 (1%)

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
           + F+A VDSG SFTFLP   Y  +  +FDK V++ R S +G+ W+YCY +SSE++ KVP 
Sbjct: 2   TSFKAQVDSGTSFTFLPGHAYGAITEEFDKQVNASRSSFEGSPWEYCYPSSSEQLPKVPS 61

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
           + L+F +N SFVV N +F+F +N+G   FCL +  T+GD G IGQNFM G+R+VFDREN 
Sbjct: 62  LTLMFQQNNSFVVYNPVFTFYDNQGVVGFCLAIQPTEGDMGTIGQNFMTGYRLVFDRENK 121

Query: 335 KLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSI 394
            LAWS S C+++     + L PP    S  PLPT EQQ T NG A AP    + +P  S 
Sbjct: 122 NLAWSPSNCQDLSLGKRMPLSPPNKTSS-APLPTDEQQRT-NGHAVAPAIAGRASPKPS- 178

Query: 395 AASAQQLDSVLRVACSLLVLMCLLLSS 421
           AA ++ +   +    S   L+  LLS+
Sbjct: 179 AAPSRIISCQVHYWHSYWFLLFQLLSA 205


>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
          Length = 585

 Score =  181 bits (458), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 120/332 (36%), Positives = 167/332 (50%), Gaps = 68/332 (20%)

Query: 33  GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
           G +   D  LS Y+P  SS+S+ V+C++ LC  R+ C      CPY+  Y + +TS+SG 
Sbjct: 141 GTTYASDFELSIYNPKGSSTSRKVTCNNSLCAHRNRCLGTFSNCPYMVSYVSAETSTSGI 200

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           LV+D+LHL +  +   Q  V++ V  GCG+ QTGS+LD AAP+G+ GLGL  +SVPS+L+
Sbjct: 201 LVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILS 258

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           K G   +SFS+CF  +  G + FGD+G   Q+ T F  +   +  Y + V    +G + L
Sbjct: 259 KEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPF-NLNALHPTYNITVTQVRVGTT-L 316

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
               F AL DSG SFT+L   IY  V      L SS+ I        YC           
Sbjct: 317 IDLDFTALFDSGTSFTYLVDPIYTNV------LKSSELI--------YCMA--------- 353

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
                        VVR+                       +  IIGQNFM G+RI+FDRE
Sbjct: 354 -------------VVRS----------------------AELNIIGQNFMTGYRIIFDRE 378

Query: 333 NLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
            L L W   +C++ I+ S V + P     PPA
Sbjct: 379 KLVLGWKEFECDD-IENSSVPIRPRATSVPPA 409


>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 109/315 (34%), Positives = 171/315 (54%), Gaps = 10/315 (3%)

Query: 36  IVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + Q   L+ Y P++S++S ++ CS   C     C S    CPY   YS   T + G L+ 
Sbjct: 145 VPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYSNS-TGTKGTLLQ 203

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D+LHLA+  ++   + V+++V +GCG+KQTG +    + +GV+GLG+   SVPSLLAKA 
Sbjct: 204 DVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKAN 261

Query: 156 LIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           +  NSFS+CF     + G + FGD+G   Q+ T F+ +     AY V +    +    + 
Sbjct: 262 ITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPS-TAYGVNISGVSVAGDPVD 320

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLK 271
              F A  D+G+SFT L    Y  +   FD+LV  +R  +     +++CY+ S +   ++
Sbjct: 321 IRLF-AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNATTIQ 379

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFD 330
            P + + F      ++ N  F+    EG  ++CL V+ + G    +IGQNF+ G+RIVFD
Sbjct: 380 FPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVFD 439

Query: 331 RENLKLAWSHSKCEE 345
           RE + L W  S C E
Sbjct: 440 RERMILGWKQSLCFE 454


>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
           [Cucumis sativus]
          Length = 430

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 173/335 (51%), Gaps = 29/335 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L+ Y P+ S++S  V C+  LC   + C S ++ CPY   Y + +TSS GYLV+D+LHLA
Sbjct: 3   LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 59

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +    +    V++ +  GCG  QTG +   AAP+G++GLG+  +SVPS LA  GL  NSF
Sbjct: 60  T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 117

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
           S+CF  +  G + FGD GPA Q+ T F  + E Y +Y V      +G        F A+ 
Sbjct: 118 SMCFGADGYGRIDFGDTGPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIF 175

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYN-ASSEEMLKVPDMRLI 278
           DSG SFT+L    Y+ +  + D  +  KR SL G +  ++YCY      +  +   +   
Sbjct: 176 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFT 235

Query: 279 FSKNQSFVVRNHIFSFPEN---------EGFTVFCLTVM-STDGDYGIIGQNFMMGHRIV 328
                 F   +     P +         E   V CL +  STD D  +IGQNFM G+RI 
Sbjct: 236 MKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID--LIGQNFMTGYRIT 293

Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP 363
           F+R+ + L WS S C +       + V  P+G +P
Sbjct: 294 FNRDQMVLGWSSSDCYD-------NGVGTPSGDTP 321


>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 568

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 173/335 (51%), Gaps = 29/335 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L+ Y P+ S++S  V C+  LC   + C S ++ CPY   Y + +TSS GYLV+D+LHLA
Sbjct: 151 LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 207

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +    +    V++ +  GCG  QTG +   AAP+G++GLG+  +SVPS LA  GL  NSF
Sbjct: 208 T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 265

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
           S+CF  +  G + FGD GPA Q+ T F  + E Y +Y V      +G        F A+ 
Sbjct: 266 SMCFGADGYGRIDFGDTGPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIF 323

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNA-SSEEMLKVPDMRLI 278
           DSG SFT+L    Y+ +  + D  +  KR SL G +  ++YCY      +  +   +   
Sbjct: 324 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFT 383

Query: 279 FSKNQSFVVRNHIFSFPEN---------EGFTVFCLTVM-STDGDYGIIGQNFMMGHRIV 328
                 F   +     P +         E   V CL +  STD D  +IGQNFM G+RI 
Sbjct: 384 MKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID--LIGQNFMTGYRIT 441

Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP 363
           F+R+ + L WS S C +       + V  P+G +P
Sbjct: 442 FNRDQMVLGWSSSDCYD-------NGVGTPSGDTP 469


>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
 gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 529

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 170/315 (53%), Gaps = 13/315 (4%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           Q R L+ Y P++SS+S ++ CS   C   S C S    CPY   Y ++DT ++G L +D+
Sbjct: 147 QSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDV 206

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           LHL +  +      V++++ +GCG+ QTG     AA +G++GLGL D SVPS+LAKA + 
Sbjct: 207 LHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKIT 264

Query: 158 QNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
            NSFS+CF    +  G + FGD+G   Q  T  LP  E    Y V V    +G   +   
Sbjct: 265 ANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPT-EPSPTYAVSVTEVSVGGDAV--- 320

Query: 216 GFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLK 271
           G Q  AL D+G SFT L    Y  +   FD  V+ KR  +     +++CY+ S ++  + 
Sbjct: 321 GVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTIL 380

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFD 330
            P + + F       +RN +F     +   ++CL ++ S D    IIGQNFM G+RIVFD
Sbjct: 381 FPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFD 440

Query: 331 RENLKLAWSHSKCEE 345
           RE + L W  S C E
Sbjct: 441 RERMILGWKRSDCFE 455


>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 171/317 (53%), Gaps = 19/317 (5%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           Q   L+ Y+PS S+SS  V+C+  LC  R+ C S    CPY   Y +  + S+G LV+D+
Sbjct: 161 QRIRLNIYNPSISTSSSKVTCNSTLCALRNRCISPLSDCPYRIRYLSPGSKSTGVLVEDV 220

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +H+++    A  + +      GC   Q G + +  A +G+MGL + D++VP++L KAG+ 
Sbjct: 221 IHMSTEEGEARDARIT----FGCSETQLGLFQE-VAVNGIMGLAMADIAVPNMLVKAGVA 275

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQS 215
            +SFS+CF  N  G++ FGD+G + Q  T   P+G      F  V +  + +G   + ++
Sbjct: 276 SDSFSMCFGPNGKGTISFGDKGSSDQHET---PLGGTISPLFYDVSITKFKVGKVTV-ET 331

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY---NASSEEMLK 271
            F A+ DSG + T+L    Y  +   F   V  +R+    +S +++CY   + S EE  K
Sbjct: 332 KFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFCYIITSTSDEE--K 389

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTD-GDYGIIGQNFMMGHRIVF 329
           +P +        ++ V + I  F  ++G F V+CL V+  D  D+ IIGQNFM  +RIV 
Sbjct: 390 LPSISFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQDKADFNIIGQNFMTNYRIVH 449

Query: 330 DRENLKLAWSHSKCEEV 346
           DRE + L W  S C + 
Sbjct: 450 DRERMILGWKKSNCNDT 466


>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
 gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
          Length = 575

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 113/316 (35%), Positives = 161/316 (50%), Gaps = 18/316 (5%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           Y PS SS+SK V C HPLC+   +C +       CPY   Y + +T SSG LV+D+LHL 
Sbjct: 162 YSPSLSSTSKTVPCGHPLCERPDACATAGKSSSSCPYEVKYVSANTGSSGVLVEDVLHLV 221

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNS 160
                    +VQ+ ++ GCG+ QTG++L GAA  G+MGLGL  VSVPS LA +GL+  +S
Sbjct: 222 DGGGGGGGKAVQAPIVFGCGQVQTGAFLRGAAAGGLMGLGLDKVSVPSALASSGLVASDS 281

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQA 219
           FS+CF  +  G + FGD G   Q  T  +  G    +Y+ + V +  + +  +    F A
Sbjct: 282 FSMCFSRDGVGRINFGDAGSPDQAETPLIAAGSLQPSYYNISVGAITVDSKAMAVE-FTA 340

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCYNASSEE--MLKVPDM 275
           +VDSG SFT+L    Y  +   F+  VS  S+        +++CY  S  +  M ++P M
Sbjct: 341 VVDSGTSFTYLDDPAYTFLTTNFNSRVSEASETYGSGYEKFEFCYRLSPGQTSMKRLPAM 400

Query: 276 RLIFSKNQSFVVRNHIFSF--PENEG---FTVFCLTVMST---DGDYGIIGQNFMMGHRI 327
            L       F +   I       N G      +CL ++ T     +   IGQNFM G ++
Sbjct: 401 SLTTKGGAVFPITWPIIPVLASTNGGPYHPIGYCLGIIKTSILSTEDATIGQNFMTGLKV 460

Query: 328 VFDRENLKLAWSHSKC 343
           VFDR    L W    C
Sbjct: 461 VFDRRKSVLGWEKFDC 476


>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 531

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 111/319 (34%), Positives = 172/319 (53%), Gaps = 14/319 (4%)

Query: 36  IVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + Q   L+ Y P++S++S ++ CS   C     C S K  CPY   YS   T ++G L+ 
Sbjct: 145 VPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISYSNS-TGTTGTLLQ 203

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D+LHLA+  ++   + V+++V +GCG+KQTG +    + +GV+GLG+   SVPSLLAKA 
Sbjct: 204 DVLHLATEDENL--TPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKAN 261

Query: 156 LIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           +  +SFS+CF     + G + FGD+G   Q+ T F+ +     AY + V    +G   + 
Sbjct: 262 ITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPS-TAYGLNVTGVSVGGDPVG 320

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEM-LK 271
              F A  D+G+SFT L    Y  +   FD LV  KR  +     +++CY+ S     ++
Sbjct: 321 TRLF-AKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDLSPNATSIE 379

Query: 272 VPDMRLIFSKNQSFVVRNHIFS----FPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHR 326
            P + + F      ++ N  F+        EG  ++CL V+ + G    +IGQNF+ G+R
Sbjct: 380 FPFVEMTFVGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKINVIGQNFVAGYR 439

Query: 327 IVFDRENLKLAWSHSKCEE 345
           IVFDRE + L W  S C E
Sbjct: 440 IVFDRERMILGWKPSLCFE 458


>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
          Length = 671

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 104/285 (36%), Positives = 166/285 (58%), Gaps = 7/285 (2%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P+ S++S+ V CS  LC  +++C+S  + CPY   Y +++TSSSG LV+D+L+L S S
Sbjct: 84  YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 143

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
             A    V + ++ GCG+ QTGS+L  AAP+G++GLG+   SVPSLLA  GL  NSFS+C
Sbjct: 144 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 201

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
           F ++  G + FGD G + Q+ T  L + ++   Y + +    +G+  ++   F A+VDSG
Sbjct: 202 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 259

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
            SFT L   +Y ++   FD  + S R  L  +  +++CY+ S+  ++  P++ L      
Sbjct: 260 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 318

Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRI 327
            F V + I +  +N    V +CL +M ++G   I G NF    R+
Sbjct: 319 IFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIGGYNFDESSRL 363


>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 530

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 113/313 (36%), Positives = 167/313 (53%), Gaps = 9/313 (2%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           Q R L+ Y P++SS+S ++ C+   C   S C S    CPY   Y ++DT ++G L +D+
Sbjct: 148 QSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKDTFTTGTLFEDV 207

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           LHL +  +      V++++ +GCGR QTG     AA +G++GLG+ D SVPS+LAKA + 
Sbjct: 208 LHLVT--EDVDLKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYSVPSILAKAKIT 265

Query: 158 QNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
            NSFS+CF    +  G + FGD+G   Q  T  LP  E    Y V V    +G   +   
Sbjct: 266 ANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPT-EPSPTYAVNVTEVSVGGDVVGVQ 324

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVP 273
              AL D+G SFT L    Y  +   FD  V+ KR  +     +++CY+ S +   +  P
Sbjct: 325 -LLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPEIPFEFCYDLSPNSTTILFP 383

Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRE 332
            + + F       +RN +F     +   ++CL ++ S D    IIGQNFM G+R+VFDRE
Sbjct: 384 RVAMTFEGGSLMFLRNPLFIVWNEDNTAMYCLGILKSVDFKINIIGQNFMSGYRVVFDRE 443

Query: 333 NLKLAWSHSKCEE 345
            + L W  S C E
Sbjct: 444 RMILGWKRSDCFE 456


>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
          Length = 519

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 114/313 (36%), Positives = 168/313 (53%), Gaps = 19/313 (6%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           Q R L+ Y P++SS+S ++ CS   C   S C S    CPY   Y ++DT ++G L +D+
Sbjct: 147 QSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDV 206

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           LHL +  +      V++++ +GCG+ QTG     AA +G++GLGL D SVPS+LAKA + 
Sbjct: 207 LHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKIT 264

Query: 158 QNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
            NSFS+CF    +  G + FGD+G   Q  T  LP         VG ++  +G   L   
Sbjct: 265 ANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSVTEVSVGGDA--VGVQLL--- 319

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVP 273
              AL D+G SFT L    Y  +   FD  V+ KR  +     +++CY+ S ++  +  P
Sbjct: 320 ---ALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFP 376

Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRE 332
            + + F       +RN +F     +   ++CL ++ S D    IIGQNFM G+RIVFDRE
Sbjct: 377 RVAMTFEGGSQMFLRNPLFI----DNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRE 432

Query: 333 NLKLAWSHSKCEE 345
            + L W  S C E
Sbjct: 433 RMILGWKRSDCFE 445


>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
          Length = 306

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 97/278 (34%), Positives = 154/278 (55%), Gaps = 10/278 (3%)

Query: 120 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 179
           CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF  + +G + FGD+G
Sbjct: 13  CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72

Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 239
            + Q+ T F P   +   Y + +    +G +      F A+ DSG SFT+L    Y  + 
Sbjct: 73  SSGQEETPFNPSKSQL-LYNISITQISVGGTS-ADLNFDAIFDSGTSFTYLNDPAYTSIS 130

Query: 240 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPEN 297
             F+     KR S   +  ++YCY+ S ++  ++ P + L      +F V + I      
Sbjct: 131 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ 190

Query: 298 EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP 357
            G+ V+CL V+ + GD  IIGQNFM G+RI+FDRE + L W+ S C +  + + + + P 
Sbjct: 191 GGY-VYCLGVVKS-GDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 248

Query: 358 PAGQSPNPLPTTEQQSTSNGQAA----APPSTAKTAPS 391
            +   P  +    + +  NG  +    AP   A  +P+
Sbjct: 249 NSPVVPPTVSVEPEATAGNGNGSHISEAPSPLANGSPT 286


>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
           vinifera]
          Length = 294

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 97/278 (34%), Positives = 155/278 (55%), Gaps = 10/278 (3%)

Query: 120 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 179
           CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF  + +G + FGD+G
Sbjct: 1   CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60

Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 239
            + Q+ T F P   +   Y + +    +G +    + F A+ DSG SFT+L    Y  + 
Sbjct: 61  SSGQEETPFNPSKSQL-LYNISITQISVGGTSADLN-FDAIFDSGTSFTYLNDPAYTSIS 118

Query: 240 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPEN 297
             F+     KR S   +  ++YCY+ S ++  ++ P + L      +F V + I      
Sbjct: 119 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ 178

Query: 298 EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP 357
            G+ V+CL V+ + GD  IIGQNFM G+RI+FDRE + L W+ S C +  + + + + P 
Sbjct: 179 GGY-VYCLGVVKS-GDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 236

Query: 358 PAGQSPNPLPTTEQQSTSNGQAA----APPSTAKTAPS 391
            +   P  +    + +  NG  +    AP   A  +P+
Sbjct: 237 NSPVVPPTVSVEPEATAGNGNGSHISEAPSPLANGSPT 274


>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 488

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 111/349 (31%), Positives = 177/349 (50%), Gaps = 15/349 (4%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L+ Y+PS S SS  V+C+  LC  R+ C S    CPY   Y +  + S+G LV+D++H++
Sbjct: 137 LNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMS 196

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +    A  + +      GC   Q G + +  A +G+MGL + D++VP++L KAG+  +SF
Sbjct: 197 TEEGEARDARIT----FGCSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSF 251

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQA 219
           S+CF  N  G++ FGD+G + Q  T   P+       F  V +  + +G   +  + F A
Sbjct: 252 SMCFGPNGKGTISFGDKGSSDQLET---PLSGTISPMFYDVSITKFKVGKVTV-DTEFTA 307

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY-NASSEEMLKVPDMRL 277
             DSG + T+L    Y  +   F   V  +R+S   +S +++CY   S+ +  K+P +  
Sbjct: 308 TFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSF 367

Query: 278 IFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMS-TDGDYGIIGQNFMMGHRIVFDRENLK 335
                 ++ V + I  F  ++G F V+CL V+   + D+ IIGQNFM  +RIV DRE   
Sbjct: 368 EMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRERRI 427

Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
           L W  S C +    +    +  P   +P   P T   S+     AA  S
Sbjct: 428 LGWKKSNCNDTNGFTGPTALAKPPSMAPTSSPRTINLSSRLNPLAAASS 476


>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 530

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 112/317 (35%), Positives = 166/317 (52%), Gaps = 20/317 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L+ Y P++S++S ++ CS   C     C S +  CPY    S+ +T ++G L+ D+LHL 
Sbjct: 152 LNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS-NTVTTGTLLQDVLHLV 210

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +  +      V ++V +GCG+ QTG++    A +GV+GL + + SVPSLLAKA +  NSF
Sbjct: 211 T--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSF 268

Query: 162 SICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
           S+CF    S  G + FGD+G   Q+ T  + + E   AY V V    +G   +    F A
Sbjct: 269 SMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNVTGVSVGGVPVDVPLF-A 326

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLI 278
           L D+G+SFT L    Y      FD L+  KR  +  +  +++CY+   E +      R +
Sbjct: 327 LFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPRHM 386

Query: 279 FSK-----NQSFVVR-----NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            SK        F  R         S+  NEG  ++CL ++ +  +  IIGQN M GHRIV
Sbjct: 387 QSKCYNPCRDDFRWRIQNDSQESVSY-SNEGTKMYCLGILKSI-NLNIIGQNLMSGHRIV 444

Query: 329 FDRENLKLAWSHSKCEE 345
           FDRE + L W  S C E
Sbjct: 445 FDRERMILGWKQSNCFE 461


>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
          Length = 518

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/317 (35%), Positives = 166/317 (52%), Gaps = 20/317 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L+ Y P++S++S ++ CS   C     C S +  CPY    S+ +T ++G L+ D+LHL 
Sbjct: 140 LNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS-NTVTTGTLLQDVLHLV 198

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +  +      V ++V +GCG+ QTG++    A +GV+GL + + SVPSLLAKA +  NSF
Sbjct: 199 T--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSF 256

Query: 162 SICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
           S+CF    S  G + FGD+G   Q+ T  + + E   AY V V    +G   +    F A
Sbjct: 257 SMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNVTGVSVGGVPVDVPLF-A 314

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLI 278
           L D+G+SFT L    Y      FD L+  KR  +  +  +++CY+   E +      R +
Sbjct: 315 LFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPRHM 374

Query: 279 FSK-----NQSFVVR-----NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            SK        F  R         S+  NEG  ++CL ++ +  +  IIGQN M GHRIV
Sbjct: 375 QSKCYNPCRDDFRWRIQNDSQESVSY-SNEGTKMYCLGILKSI-NLNIIGQNLMSGHRIV 432

Query: 329 FDRENLKLAWSHSKCEE 345
           FDRE + L W  S C E
Sbjct: 433 FDRERMILGWKQSNCFE 449


>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
          Length = 335

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/244 (36%), Positives = 141/244 (57%), Gaps = 9/244 (3%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C    GA+   +  LS Y+P  S+++K V+C++ LC  R+ C      CPY+  Y +  T
Sbjct: 20  CAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQT 79

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
           S+SG L++D++HL +  K+  +  V++ V  GCG+ Q+GS+LD AAP+G+ GLG+  +SV
Sbjct: 80  STSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 137

Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
           PS+LA+ GL+ +SFS+CF  +  G + FGD+G + Q+ T F  +   +  Y + V    +
Sbjct: 138 PSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRV 196

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
           G + L    F AL D+G SFT+L   +Y  V     +    KR S      ++YCY+   
Sbjct: 197 GTT-LIDDEFTALFDTGTSFTYLVDPMYTTV----SESAQDKRHSPDSRIPFEYCYDMRE 251

Query: 267 EEML 270
           + +L
Sbjct: 252 KLVL 255


>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
 gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
          Length = 455

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 140/239 (58%), Gaps = 9/239 (3%)

Query: 33  GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
           GA+   +  LS Y+P  S+++K V+C++ LC  R+ C      CPY+  Y +  TS+SG 
Sbjct: 145 GATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGI 204

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L++D++HL +  K+  +  V++ V  GCG+ Q+GS+LD AAP+G+ GLG+  +SVPS+LA
Sbjct: 205 LMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLA 262

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           + GL+ +SFS+CF  +  G + FGD+G + Q+ T F  +   +  Y + V    +G + L
Sbjct: 263 REGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-L 320

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEML 270
               F AL D+G SFT+L   +Y  V     +    KR S      ++YCY+   + +L
Sbjct: 321 IDDEFTALFDTGTSFTYLVDPMYTTV----SESAQDKRHSPDSRIPFEYCYDMREKLVL 375


>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
          Length = 263

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 91/241 (37%), Positives = 133/241 (55%), Gaps = 6/241 (2%)

Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 171
           V++ ++ GCG+ QTG++LD AAP+G+ GLG+  VSVPS+LA  G   NSFS+CF  +  G
Sbjct: 11  VKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSFSMCFGSDGMG 70

Query: 172 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 231
            ++FGD G + Q  T F  +   +  Y + +    +GNS +  +   A+VDSG SFT L 
Sbjct: 71  RIYFGDTGSSDQGETPF-DVNHSHPTYNISLIGMEVGNSSIDVNS-SAIVDSGTSFTCLA 128

Query: 232 TEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 289
             +Y ++   F   V   R  S  G  ++YCY  S ++  + +P + L       F + +
Sbjct: 129 DPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKGGSQFPIND 188

Query: 290 HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
            I     +E  + +CL ++ +     IIGQNFM G RIVFDRE L L W  S C E  D 
Sbjct: 189 PIIVI-SSEQSSFYCLGIVKSS-QLNIIGQNFMTGLRIVFDRERLVLGWKESDCYEAEDS 246

Query: 350 S 350
           S
Sbjct: 247 S 247


>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 414

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 145/296 (48%), Gaps = 20/296 (6%)

Query: 65  SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 124
           S+  C S    CPY   Y    TS+ G L +D+LHL +  +      V++++ +GCG+ Q
Sbjct: 123 SQGGCSSPASVCPYQIPYLFNTTSTRGTLFEDVLHLVT--EDEGLEPVKANITLGCGQNQ 180

Query: 125 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPAT 182
           TG Y    A +G++GLG+ D SVPS+LAK  +  NSFS+CF    +  G + FGD+G   
Sbjct: 181 TGLYRKSLAVNGLLGLGMKDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTD 240

Query: 183 QQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
           Q  T  +PI E    Y V V    +G   L +    AL D+G SFT L    Y  +   F
Sbjct: 241 QLQTPLVPI-EPNPTYAVNVTEVTVGGDIL-EIQMLALFDTGTSFTHLLEPAYGLLTKAF 298

Query: 243 DKLVSSKRISLQGN-SWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF 300
           D  V+ KR  +     +++CY+ S   +  K P + + F       +R+ +F+       
Sbjct: 299 DDHVTDKRRPIDPEIPFEFCYDTSPNIKSFKFPRVNMTFVGGSKLTLRDPLFTVWNEARH 358

Query: 301 TVFCLTVMSTDGD------------YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
             +  ++  +D +              ++ +N M G+RIVFDRE + L W  S C+
Sbjct: 359 GAWMSSLTFSDREKKKKEYVLNAFHIWVVSENLMSGYRIVFDRERMILGWKRSDCK 414


>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
          Length = 335

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 123/203 (60%), Gaps = 4/203 (1%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +D     Y P  SS+S+ V CS  LC  +S+C+S    CPY   Y +++TSS+G LV+D+
Sbjct: 130 RDLKFDTYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDV 189

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL- 156
           L+L +     P+  V + +  GCGR QTGS+L  AAP+G++GLG+  +SVPSLLA  G+ 
Sbjct: 190 LYLVTEYGRQPK-IVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVA 248

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
             NSFS+CF ++  G + FGD G + QQ T  L + ++   Y + +    +G+  +  + 
Sbjct: 249 AANSFSMCFAQDGHGRINFGDTGSSDQQETP-LNMYKQNPYYNISITGATVGSKSI-HTK 306

Query: 217 FQALVDSGASFTFLPTEIYAEVV 239
           F A+VDSG SFT L   +Y ++ 
Sbjct: 307 FNAIVDSGTSFTALSDPMYTQIT 329


>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
          Length = 475

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 93/317 (29%), Positives = 145/317 (45%), Gaps = 67/317 (21%)

Query: 36  IVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + Q   L+ Y P++S++S ++ CS   C     C S    CPY   YS   T + G L+ 
Sbjct: 145 VPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYS-NSTGTKGTLLQ 203

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D+LHLA+  ++   + V+++V +GCG+KQTG +    + +GV+GLG+   SVPSLLAKA 
Sbjct: 204 DVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKAN 261

Query: 156 LIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           +  NSFS+CF     + G + FGD+G   Q+ T F+ +  +                   
Sbjct: 262 ITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPR------------------- 302

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS-SEEMLKV 272
               +  VD    F F                               CY+ S +   ++ 
Sbjct: 303 ----RRPVDPELPFEF-------------------------------CYDLSPNATTIQF 327

Query: 273 PDMRLIFSKNQSFVVRNHIFS----FPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
           P + + F      ++ N  F+        EG  ++CL V+ +    G+   NF+ G+RIV
Sbjct: 328 PLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKS---VGLKINNFVAGYRIV 384

Query: 329 FDRENLKLAWSHSKCEE 345
           FDRE + L W  S C E
Sbjct: 385 FDRERMILGWKQSLCFE 401


>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
 gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
           Group]
 gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
          Length = 307

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 84/263 (31%), Positives = 134/263 (50%), Gaps = 36/263 (13%)

Query: 137 VMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY 195
           +MGLG+  VSVPS+LA  G+++ NSFS+CF ++  G + FGD G A Q  T F+ +   +
Sbjct: 9   LMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTH 67

Query: 196 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
             Y + + S  +G+  L   GF A+ DSG SFT+L    Y      F+  +S +R +  G
Sbjct: 68  SYYNISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSG 126

Query: 256 NS------WKYCYNASSEEM-LKVPDMRLIFSKNQSFVVRNHIFSFP---ENEGFTV--F 303
           ++      ++YCY+ S ++  +++P + L  +    F V + ++       N    +  +
Sbjct: 127 STRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGY 186

Query: 304 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC---EEVID------------ 348
           CL V+ +D    IIGQNFM G ++VF+RE   L W    C   E++ D            
Sbjct: 187 CLAVIKSDLPIDIIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSP 246

Query: 349 --KSHVHLVP----PPAGQSPNP 365
              +HV   P     PAG++P P
Sbjct: 247 GPTTHVFPQPQESDSPAGRTPIP 269


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 90/318 (28%), Positives = 152/318 (47%), Gaps = 22/318 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           L+ YD  +SS++K+VSCS   C     RS C S    C Y+  Y  + +S++GYLV D++
Sbjct: 128 LTPYDVDASSTAKSVSCSDNFCSYVNQRSECHS-GSTCQYVIMYG-DGSSTNGYLVKDVV 185

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
           HL   + +    S   ++I GCG KQ+G   +  AA DG+MG G  + S  S LA  G +
Sbjct: 186 HLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKV 245

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--LTQS 215
           + SF+ C D N+ G +F    G          P+  K   Y V + +  +GNS   L+ +
Sbjct: 246 KRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSN 303

Query: 216 GFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
            F +      ++DSG +  +LP  +Y  ++ +   L S   ++L      +     ++++
Sbjct: 304 AFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEI--LASHPELTLHTVQESFTCFHYTDKL 361

Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGH 325
            + P +   F K+ S  V  R ++F   E+     +    + T G     I+G   +   
Sbjct: 362 DRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421

Query: 326 RIVFDRENLKLAWSHSKC 343
            +V+D EN  + W++  C
Sbjct: 422 LVVYDIENQVIGWTNHNC 439


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 90/318 (28%), Positives = 150/318 (47%), Gaps = 22/318 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           L+ YD  +SS++K+VSCS   C     RS C S    C Y+  Y  + +S++GYLV D++
Sbjct: 128 LTPYDADASSTAKSVSCSDNFCSYVNQRSECHS-GSTCQYVILYG-DGSSTNGYLVRDVV 185

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
           HL   + +    S   ++I GCG KQ+G   +  AA DG+MG G  + S  S LA  G +
Sbjct: 186 HLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKV 245

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-- 215
           + SF+ C D N+ G +F    G          P+  K   Y V + +  +GNS L  S  
Sbjct: 246 KRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSD 303

Query: 216 GFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
            F +      ++DSG +  +LP  +Y  ++ +   L S + ++L      +      + +
Sbjct: 304 AFDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQI--LASHQELNLHTVQDSFTCFHYIDRL 361

Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGH 325
            + P +   F K+ S  V  + ++F   E+     +    + T G     I+G   +   
Sbjct: 362 DRFPTVTFQFDKSVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421

Query: 326 RIVFDRENLKLAWSHSKC 343
            +V+D EN  + W++  C
Sbjct: 422 LVVYDIENQVIGWTNHNC 439


>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
          Length = 426

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 136/262 (51%), Gaps = 17/262 (6%)

Query: 65  SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 124
           +++ C S    CPY   Y +  + S+G LV+D++H+++    A  +       I  G  Q
Sbjct: 124 TKARCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDAR------ITFGESQ 177

Query: 125 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 184
            G + +  A +G+MGL + D++VP++L KAG+  +SFS+CF  N  G++ FGD+G + Q 
Sbjct: 178 LGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQL 236

Query: 185 STSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
            T   P+       F  V +  + +G   +  + F A  DSG + T+L    Y  +   F
Sbjct: 237 ET---PLSGTISPMFYDVSITKFKVGKVTV-DTEFTATFDSGTAVTWLIEPYYTALTTNF 292

Query: 243 DKLVSSKRISLQGNS-WKYCY-NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG- 299
              V  +R+S   +S +++CY   S+ +  K+P +        ++ V + I  F  ++G 
Sbjct: 293 HLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGS 352

Query: 300 FTVFCLTVMS-TDGDYGIIGQN 320
           F V+CL V+   + D+ IIG+N
Sbjct: 353 FQVYCLAVLKQVNADFSIIGRN 374


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 152/328 (46%), Gaps = 29/328 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSL----KDPCPYIADYSTEDTSSSGYLVDDI 97
           L+ YDP  S +S+ VSC H  C S    + L    ++PCPY   Y  + ++++GY V D 
Sbjct: 113 LTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYG-DGSATTGYYVQDY 171

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAG 155
           L     + +   ++  SS+I GCG  Q+G++   +  A DG++G G  + SV S LA +G
Sbjct: 172 LTFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASG 231

Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            ++  FS C D N  G +F  G+      ++T  +P    Y+     +E   +    L  
Sbjct: 232 KVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIE---VDGDILQL 288

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNAS 265
                 +++G   ++DSG +  +LP  +Y +++ K   L    R+ +     +Y C+  +
Sbjct: 289 PSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKV--LAKQPRLKVYLVEEQYSCFQYT 346

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL------TVMSTDGDYGIIGQ 319
                  P ++L F  + S  V  H + F   +G + +C+      +      D  ++G 
Sbjct: 347 GNVDSGFPIVKLHFEDSLSLTVYPHDYLF-NYKGDSYWCIGWQKSASETKNGKDMTLLGD 405

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             +    +V+D EN+ + W+   C   I
Sbjct: 406 FVLSNKLVVYDLENMTIGWTDYNCSSSI 433


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 93/333 (27%), Positives = 151/333 (45%), Gaps = 36/333 (10%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKD-PCPYIADYSTEDTSSSGYLV 94
            L+ YDP+SS +SK V C    C S      S CK  KD  CPY   Y    T+S  Y+ 
Sbjct: 118 ELTLYDPNSSKTSKVVPCDDEFCTSTYDGPISGCK--KDMSCPYSITYGDGSTTSGSYIK 175

Query: 95  DDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSL 150
           DD+    +    +  P ++   SVI GCG KQ+G  S     + DG++G G  + SV S 
Sbjct: 176 DDLTFDRVVGDLRTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 232

Query: 151 LAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
           LA AG ++  FS C D  + G +F  G+      ++T  +P    Y+     +E    G+
Sbjct: 233 LAAAGKVKRVFSHCLDTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIE--VAGD 290

Query: 210 SCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
                     + SG   ++DSG +  +LP  IY +++ K     S   + L  + +  C+
Sbjct: 291 PIQLPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFT-CF 349

Query: 263 NASSEEMLK--VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DY 314
           + S E+ L    P ++  F +  +     H + FP  E   ++C+     T  + DG D 
Sbjct: 350 HYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFPFKE--DMWCIGWQKSTAQTKDGKDL 407

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
            ++G   +     ++D +N+ + W+   C   I
Sbjct: 408 ILLGDLVLTNKLFIYDLDNMSIGWTDYNCSSSI 440


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 88/332 (26%), Positives = 154/332 (46%), Gaps = 36/332 (10%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YDP  S +S+ +SC    C +        CKS + PCPY   Y  + ++++GY V 
Sbjct: 113 DLTLYDPKGSETSELISCDQEFCSATYDGPIPGCKS-EIPCPYSITYG-DGSATTGYYVQ 170

Query: 96  DIL---HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSL 150
           D L   H+    + APQ+S   S+I GCG  Q+G+    +  A DG++G G  + SV S 
Sbjct: 171 DYLTYNHVNDNLRTAPQNS---SIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQ 227

Query: 151 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
           LA +G ++  FS C D    G +F    G   +   S  P+  +   Y V ++S  +   
Sbjct: 228 LAASGKVKKIFSHCLDNIRGGGIF--AIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTD 285

Query: 211 CL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-C 261
            L        + +G   ++DSG +  +LP  +Y E++ K   +    R+ L     ++ C
Sbjct: 286 ILQLPSDIFDSGNGKGTIIDSGTTLAYLPAIVYDELIPKV--MARQPRLKLYLVEQQFSC 343

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYG 315
           +  +       P ++L F  + S  V  H + F   +G  ++C+        + +G D  
Sbjct: 344 FQYTGNVDRGFPVVKLHFEDSLSLTVYPHDYLFQFKDG--IWCIGWQKSVAQTKNGKDMT 401

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           ++G   +    +++D EN+ + W+   C   I
Sbjct: 402 LLGDLVLSNKLVIYDLENMAIGWTDYNCSSSI 433


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 166/368 (45%), Gaps = 57/368 (15%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P  SSS   V C+        +C S K  C Y   Y+ E +SSSG L +DI+     
Sbjct: 129 RFQPDLSSSYSPVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRE 182

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +     I GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+
Sbjct: 183 SELKPQHA-----IFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSL 236

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    V  G   P     ++  P+   Y  Y + ++   +    L        
Sbjct: 237 CYGGMDIGGGAMVLGGMLAPPDMIFSNSDPLRSPY--YNIELKEIHVAGKALRVESRIFN 294

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ-----GNSWK-YCYNASSEE 268
           S    ++DSG ++ +LP + +    V F + V+SK  SL+       S+K  C+  +   
Sbjct: 295 SKHGTVLDSGTTYAYLPEQAF----VAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRN 350

Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GII 317
           + K+    PD+ ++F   Q  S    N++F   + +G   +CL V     D      GII
Sbjct: 351 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKDPTTLLGGII 408

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNG 377
            +N +    + +DR N K+ +  + C E+ ++ H+       G +P+P P+++  S  + 
Sbjct: 409 VRNTL----VTYDRHNEKIGFWKTNCSELWERLHI-------GDTPSPAPSSDTSSEHDM 457

Query: 378 QAAAPPST 385
             A  PS 
Sbjct: 458 SPAPAPSN 465


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 99/368 (26%), Positives = 164/368 (44%), Gaps = 57/368 (15%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P  SSS   V C+        +C S K  C Y   Y+ E +SSSG L +DI+     
Sbjct: 130 RFQPDLSSSYSPVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRE 183

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+
Sbjct: 184 SELKPQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSL 237

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    V  G   P+    +   P+   Y  Y + ++   +    L        
Sbjct: 238 CYGGMDIGGGAMVLGGVPAPSDMVFSHSDPLRSPY--YNIELKEIHVAGKALRVDSRVFN 295

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEE 268
           S    ++DSG ++ +LP + +    V F   V+SK  SL+       N    C+  +   
Sbjct: 296 SKHGTVLDSGTTYAYLPEQAF----VAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRN 351

Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GII 317
           + K+    PD+ ++F   Q  S    N++F   + +G   +CL V     D      GII
Sbjct: 352 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKDPTTLLGGII 409

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNG 377
            +N +    + +DR N K+ +  + C E+ ++ H+         +P+P P+++  S ++ 
Sbjct: 410 VRNTL----VTYDRHNEKIGFWKTNCSELWERLHI-------SDAPSPAPSSDTNSETDM 458

Query: 378 QAAAPPST 385
             A  PS+
Sbjct: 459 SPAPAPSS 466


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 148/328 (45%), Gaps = 28/328 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YDP  S +S  VSC    C +        CKS + PCPY   Y  + ++++GY V 
Sbjct: 113 DLTLYDPKGSETSDVVSCDQDFCSATFDGPIPGCKS-EIPCPYSITYG-DGSATTGYYVQ 170

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
           D L     + +   S   SS+I GCG  Q+G+    +  A DG++G G  + SV S LA 
Sbjct: 171 DYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAA 230

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           +G ++  FS C D    G +F    G   +   S  P+  +   Y V ++S  +    L 
Sbjct: 231 SGKVKKIFSHCLDNVRGGGIF--AIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQ 288

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                  + +G   ++DSG +  +LP  +Y E++ K        ++ L    ++ C+  +
Sbjct: 289 LPSDIFDSVNGKGTVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVEQQFR-CFLYT 347

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
                  P ++L F  + S  V  H + F   +G  ++C+        + +G D  ++G 
Sbjct: 348 GNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDG--IWCIGWQRSVAQTKNGKDMTLLGD 405

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             +    +++D EN+ + W+   C   I
Sbjct: 406 LVLSNKLVIYDLENMVIGWTDYNCSSSI 433


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 137/316 (43%), Gaps = 18/316 (5%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R L+ YDP SS SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH
Sbjct: 125 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 182

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
                 +       +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +
Sbjct: 183 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 242

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
             FS C D  + G +F    G   +      PI +  + Y  V ++S  +  + L     
Sbjct: 243 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 300

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T       +DSG++  +LP  IY+E+++          I++       C++      
Sbjct: 301 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 358

Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
            K P +   F  + +  V   +++  +  N+    F    +    D  I+G   +    +
Sbjct: 359 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 418

Query: 328 VFDRENLKLAWSHSKC 343
           V+D E   + W+   C
Sbjct: 419 VYDMEKQAIGWTEHNC 434


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 144/312 (46%), Gaps = 23/312 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR-SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP  SS+ + + CS  LC     SC+     C Y  +Y + +T   G    D + L + 
Sbjct: 95  FDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSYSYEYGSGETE--GEFARDTISLGTT 152

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S  + +     S  +GCG   +G   DG   DG++GLG G VS+ S L+ A  I + FS 
Sbjct: 153 SDGSQKFP---SFAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSY 203

Query: 164 CF----DENDSGSVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQS 215
           C      +++S  + FG          QST   P  + Y  Y++  V    +    +   
Sbjct: 204 CLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP 263

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
           G   ++DSG + T++P+ +Y  V+ + + +V+  R+         CY+ SS    K P +
Sbjct: 264 G-TTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322

Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENL 334
            +  +        ++ F   ++ G TV CL + S  G    IIG     G+ I++DR + 
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTV-CLAMGSASGLPVSIIGNVMQQGYHILYDRGSS 381

Query: 335 KLAWSHSKCEEV 346
           +L++  +KCE +
Sbjct: 382 ELSFVQAKCESL 393


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 140/319 (43%), Gaps = 19/319 (5%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R L+ YDP SS SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH
Sbjct: 101 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 158

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
                 +       +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +
Sbjct: 159 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 218

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
             FS C D  + G +F    G   +      PI +  + Y  V ++S  +  + L     
Sbjct: 219 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 276

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T       +DSG++  +LP  IY+E+++          I++       C++      
Sbjct: 277 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 334

Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
            K P +   F  + +  V   +++  +  N+    F    +    D  I+G   +    +
Sbjct: 335 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 394

Query: 328 VFDRENLKLAWS-HSKCEE 345
           V+D E   + W+ H+  EE
Sbjct: 395 VYDMEKQAIGWTEHNSVEE 413


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 96/354 (27%), Positives = 156/354 (44%), Gaps = 36/354 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ Y+   S S K VSC    C        S CK+    CPY+  Y  + +S++GY V D
Sbjct: 124 LTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKD 181

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAK 153
           ++   S +      +   SVI GCG +Q+G  LD +   A DG++G G  + S+ S LA 
Sbjct: 182 VVQYDSVAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLAS 240

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           +G ++  F+ C D  + G +F    G   Q   +  P+      Y V + +  +G   LT
Sbjct: 241 SGRVKKIFAHCLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLT 298

Query: 214 ------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                 Q G +  A++DSG +  +LP  IY  +V K      + ++ +    +K C+  S
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYS 357

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQ 319
                  P++   F  +    V  H + FP +EG  ++C+      + S D  +  ++G 
Sbjct: 358 GRVDEGFPNVTFHFENSVFLRVYPHDYLFP-HEG--MWCIGWQNSAMQSRDRRNMTLLGD 414

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLVPPPAGQSPNPLPT 368
             +    +++D EN  + W+   C   I         VHLV      S  PL T
Sbjct: 415 LVLSNKLVLYDLENQLIGWTEYNCSSSIKVKDEGTGTVHLVGSHFISSALPLDT 468


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 144/312 (46%), Gaps = 23/312 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR-SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP  SS+ + + CS  LC     SC+     C Y  +Y + +T   G    D + L + 
Sbjct: 95  FDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSYSYEYGSGETE--GEFARDTISLGTT 152

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S  + +     S  +GCG   +G   DG   DG++GLG G VS+ S L+ A  I + FS 
Sbjct: 153 SGGSQKFP---SFAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSY 203

Query: 164 CF----DENDSGSVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQS 215
           C      +++S  + FG          QST   P  + Y  Y++  V    +    +   
Sbjct: 204 CLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP 263

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
           G   ++DSG + T++P+ +Y  V+ + + +V+  R+         CY+ SS    K P +
Sbjct: 264 G-TTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322

Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENL 334
            +  +        ++ F   ++ G TV CL + S  G    IIG     G+ I++DR + 
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTV-CLAMGSAGGLPVSIIGNVMQQGYHILYDRGSS 381

Query: 335 KLAWSHSKCEEV 346
           +L++  +KCE +
Sbjct: 382 ELSFVQAKCESL 393


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 94/337 (27%), Positives = 159/337 (47%), Gaps = 39/337 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  SS+ + V C+        +C   K+ C Y  +Y+ E +SS G L +D++   + 
Sbjct: 135 KFQPELSSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSSKGVLGEDLISFGNE 188

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG++GLG GD+S+   L   GLI NSF +
Sbjct: 189 SQLTPQRAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGL 242

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
           C+   D G    +  G   P+    T   P    Y  Y + +    +       NS +  
Sbjct: 243 CYGGMDVGGGSMILGGFDYPSDMIFTDSDPDRSPY--YNIDLTGIRVAGKKLSLNSRVFD 300

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK---YCYNASSE--E 268
               A++DSG ++ +LP   +A       + VS  K+I     ++K   +   AS++  E
Sbjct: 301 GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSE 360

Query: 269 MLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFM 322
           + K+ P + +IF   QS+++    + F  ++    +CL V     D+     GI+ +N +
Sbjct: 361 LSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTL 420

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
               +V+DREN K+ +  + C E+ D+ H+   PPPA
Sbjct: 421 ----VVYDRENSKVGFWRTNCSELSDRLHIDGAPPPA 453


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 90/337 (26%), Positives = 158/337 (46%), Gaps = 39/337 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  SS+ + V C+        +C   ++ C Y  +Y+ E +SS G L +D++   + 
Sbjct: 134 KFQPEMSSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNE 187

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG++GLG GD+S+   L   GLI NSF +
Sbjct: 188 SQLTPQRAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGL 241

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    +  G   P+    T   P    Y  Y + +    +    L+       
Sbjct: 242 CYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPY--YNIDLTGIRVAGKQLSLHSRVFD 299

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASSE----E 268
               A++DSG ++ +LP   +A       + VS+ K+I     ++K  C+  ++     E
Sbjct: 300 GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSE 359

Query: 269 MLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFM 322
           + K+ P + ++F   QS+++    + F  ++    +CL V     D+     GI+ +N +
Sbjct: 360 LSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTL 419

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
               +V+DREN K+ +  + C E+ D+ H+   PPPA
Sbjct: 420 ----VVYDRENSKVGFWRTNCSELSDRLHIDGAPPPA 452


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 90/328 (27%), Positives = 147/328 (44%), Gaps = 24/328 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS+++  VSCS  +C      S S+C    + C Y+  Y  + + +SGY V D
Sbjct: 127 LNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQYG-DGSGTSGYYVMD 185

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           ++HL      +  S+  +SV+ GC   QTG       A DG+ G G  D+SV S L+  G
Sbjct: 186 MIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRG 245

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +    FS C   +DSG       G   + +  + P+      Y + ++S  +    L   
Sbjct: 246 IAPKVFSHCLKGDDSGGGIL-VLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPIS 304

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
                T S    ++DSG +  +L  E Y   VV    +V  S++ + L+GN    CY  S
Sbjct: 305 PAVFATSSSQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKGNR---CYVTS 361

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFM 322
           S      P + L F+   S V+    +   +N   G TV+C+      G    I+G   +
Sbjct: 362 SSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVL 421

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
                ++D  N ++ W++  C   ++ S
Sbjct: 422 KDKIFIYDLANQRIGWTNYDCSMSVNVS 449


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 88/335 (26%), Positives = 153/335 (45%), Gaps = 40/335 (11%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
           +L+ YDP+ S +SK V C    C S      S C K +   CPY   Y    T+S  Y+ 
Sbjct: 117 DLTLYDPNLSKTSKAVPCDDEFCTSTYDGQISGCTKGMS--CPYSITYGDGSTTSGSYIK 174

Query: 95  DDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSL 150
           DD+    +    +  P ++   SVI GCG KQ+G  S     + DG++G G  + SV S 
Sbjct: 175 DDLTFDRVVGDLRTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 231

Query: 151 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
           LA AG ++  FS C D    G +F    G   Q      P+ +    Y V ++   +   
Sbjct: 232 LAAAGKVKRIFSHCLDSISGGGIFA--IGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGD 289

Query: 211 CL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
            +        + SG   ++DSG +  +LP  IY +++ K     S  ++ L  + +  C+
Sbjct: 290 PIQLPSDILDSSSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFT-CF 348

Query: 263 NASSEEMLK--VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCL-----TVMSTDGD 313
           + S EE +    P ++  F +  +     R+++F F E+    ++C+        + DG 
Sbjct: 349 HYSDEESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKED----MWCVGWQKSMAQTKDGK 404

Query: 314 YGIIGQNFMMGHR-IVFDRENLKLAWSHSKCEEVI 347
             I+  + ++ ++ +V+D +N+ + W+   C   I
Sbjct: 405 ELILLGDLVLANKLVVYDLDNMAIGWADYNCSSSI 439


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 136/312 (43%), Gaps = 18/312 (5%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R L+ YDP SS SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH
Sbjct: 125 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 182

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
                 +       +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +
Sbjct: 183 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 242

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
             FS C D  + G +F    G   +      PI +  + Y  V ++S  +  + L     
Sbjct: 243 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 300

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T       +DSG++  +LP  IY+E+++          I++       C++      
Sbjct: 301 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 358

Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
            K P +   F  + +  V   +++  +  N+    F    +    D  I+G   +    +
Sbjct: 359 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 418

Query: 328 VFDRENLKLAWS 339
           V+D E   + W+
Sbjct: 419 VYDMEKQAIGWT 430


>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
 gi|255630909|gb|ACU15817.1| unknown [Glycine max]
          Length = 244

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/234 (27%), Positives = 113/234 (48%), Gaps = 9/234 (3%)

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 222
           +CF  + +G + FGD G   Q+ T F  + + +  Y + +    + +S +    F A+ D
Sbjct: 1   MCFGPDGAGRITFGDTGSPDQRKTPFN-VRKLHPTYNITITQIVVEDS-VADLEFHAIFD 58

Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYCYNASSEEMLKVPDMRLI 278
           SG SFT++    Y  +   ++  V + R S Q       ++YCY+ S  + ++VP + L 
Sbjct: 59  SGTSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQTIEVPFLNLT 118

Query: 279 FSKNQSFVVRNHIFS-FPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
                 + V + I   F E EG  + CL +  +D    IIGQNFM+G++IVFDR+N+ L 
Sbjct: 119 MKGGDDYYVMDPIVQVFSEEEG-DLLCLGIQKSDS-VNIIGQNFMIGYKIVFDRDNMNLG 176

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPS 391
           W  + C + +  +   +  P    + +P       +TSN     P  + +  P+
Sbjct: 177 WKETNCSDDVLSNTSPINTPSPSPAVSPAIAVNPVATSNPSINPPNRSFRIKPT 230


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 83/321 (25%), Positives = 140/321 (43%), Gaps = 19/321 (5%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R L+ YDP SS SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH
Sbjct: 101 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 158

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
                 +       +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +
Sbjct: 159 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 218

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
             FS C D  + G +F    G   +      PI +  + Y  V ++S  +  + L     
Sbjct: 219 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 276

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T       +DSG++  +LP  IY+E+++          I++       C++      
Sbjct: 277 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 334

Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
            K P +   F  + +  V   +++  +  N+    F    +    D  I+G   +    +
Sbjct: 335 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 394

Query: 328 VFDRENLKLAWS-HSKCEEVI 347
           V+D E   + W+ H+    ++
Sbjct: 395 VYDMEKQAIGWTEHNSMARIV 415


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/354 (26%), Positives = 154/354 (43%), Gaps = 36/354 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ Y+   S S K VSC    C        S CK+    CPY+  Y  + +S++GY V D
Sbjct: 124 LTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKD 181

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAK 153
           ++   S +      +   SVI GCG +Q+G  LD +   A DG++G G  + S+ S LA 
Sbjct: 182 VVQYDSVAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLAS 240

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           +G ++  F+ C D  + G +F    G   Q   +  P+      Y V + +  +G   L 
Sbjct: 241 SGRVKKIFAHCLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLN 298

Query: 214 ------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                 Q G +  A++DSG +  +LP  IY  +V K      + ++ +    +K C+  S
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYS 357

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQ 319
                  P++   F  +    V  H + FP  EG  ++C+      + S D  +  ++G 
Sbjct: 358 GRVDEGFPNVTFHFENSVFLRVYPHDYLFPY-EG--MWCIGWQNSAMQSRDRRNMTLLGD 414

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLVPPPAGQSPNPLPT 368
             +    +++D EN  + W+   C   I         VHLV      S  PL T
Sbjct: 415 LVLSNKLVLYDLENQLIGWTEYNCSSSIKVKDEGTGTVHLVGSHFISSALPLDT 468


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/320 (26%), Positives = 143/320 (44%), Gaps = 18/320 (5%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           LS YD  +SS+SKNV C    C    +S     K PC Y   Y  + ++S G  V D + 
Sbjct: 121 LSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYG-DGSTSDGDFVKDNIT 179

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           L   + +   + +   V+ GCG+ Q+G      +A DG+MG G  + SV S LA  G ++
Sbjct: 180 LDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVK 239

Query: 159 NSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGV----ESYCIGNSCLT 213
             FS C D  + G +F  G+      ++T  +P    Y+    G+    E   +  S  +
Sbjct: 240 RIFSHCLDNMNGGGIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLAS 299

Query: 214 QSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
            +G    ++DSG +  +LP  +Y  ++   +K+ + +++ L      + C++ +S     
Sbjct: 300 TNGDGGTIIDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKA 356

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGHRI 327
            P + L F  +    V  H + F   E    F      + + DG D  ++G   +    +
Sbjct: 357 FPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLV 416

Query: 328 VFDRENLKLAWSHSKCEEVI 347
           V+D EN  + W+   C   I
Sbjct: 417 VYDLENEVIGWADHNCSSSI 436


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 85/337 (25%), Positives = 162/337 (48%), Gaps = 42/337 (12%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P SSS+ + V C+        +C S +  C Y   Y+ E ++SSG L +D++   + 
Sbjct: 125 KFQPESSSTYQPVKCT-----IDCNCDSDRMQCVYERQYA-EMSTSSGVLGEDLISFGNQ 178

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+ APQ +V      GC   +TG      A DG+MGLG GD+S+   L    +I +SFS+
Sbjct: 179 SELAPQRAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSL 232

Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQ 214
           C+   D   G++  G   P +  + ++  P+   Y  Y + ++   +       N+ +  
Sbjct: 233 CYGGMDVGGGAMVLGGISPPSDMAFAYSDPVRSPY--YNIDLKEIHVAGKRLPLNANVFD 290

Query: 215 SGFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
                ++DSG ++ +LP      + + +VK  +L S K+IS    ++   C++ +  ++ 
Sbjct: 291 GKHGTVLDSGTTYAYLPEAAFLAFKDAIVK--ELQSLKKISGPDPNYNDICFSGAGIDVS 348

Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNF 321
           ++    P + ++F   Q + +    + F  ++    +CL V     D      GII +N 
Sbjct: 349 QLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNT 408

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
           +    +V+DRE  K+ +  + C E+ ++  + + PPP
Sbjct: 409 L----VVYDREQTKIGFWKTNCAELWERLQISVAPPP 441


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/345 (27%), Positives = 153/345 (44%), Gaps = 40/345 (11%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKS-LKDPCPYIADYSTEDTSSSGYLV 94
           +L+ Y+   SSS K V C   LCK       + C S   D CPY+  Y  + +S++GY V
Sbjct: 116 DLTLYNIKESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYG-DGSSTAGYFV 174

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLA 152
            D++     S     +S   SVI GCG +Q+G  SY +  A DG++G G  + S+ S L+
Sbjct: 175 KDVVLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLS 234

Query: 153 KAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
            +G ++  F+ C +  + G +F  G     T  +T  LP    Y      ++   +G++ 
Sbjct: 235 SSGKVKKMFAHCLNGVNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQ---VGHTF 291

Query: 212 L---TQSGFQ-----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CY 262
           L   T +  Q      ++DSG +  +LP  IY  +V K   L     + +Q    +Y C+
Sbjct: 292 LNLSTDASEQRDSKGTIIDSGTTLAYLPDGIYQPLVYKI--LSQQPNLKVQTLHDEYTCF 349

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCL-----TVMSTDGDYGI 316
             S       P++   F    S  V  H + F  EN    ++C+        S D     
Sbjct: 350 QYSGSVDDGFPNVTFYFENGLSLKVYPHDYLFLSEN----LWCIGWQNSGAQSRDSKNMT 405

Query: 317 IGQNFMMGHRIVF-DRENLKLAWSHSKCEEVI-----DKSHVHLV 355
           +  + ++ +++VF D EN  + W+   C   I         VHLV
Sbjct: 406 LLGDLVLSNKLVFYDLENQVIGWTEYNCSSSIKVRDEKTGTVHLV 450


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 149/326 (45%), Gaps = 36/326 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP +SS++  +SC+ P C   S  C      C Y   Y+ E +SSSG L++D+L L   
Sbjct: 122 FDPEASSTASRISCTSPKCSCGSPRCGCSTQQCTYTRSYA-EQSSSSGILLEDVLALHDG 180

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
              AP       +I GC  ++TG      A DG+ GLG  D SV + L KAG+I + FS+
Sbjct: 181 LPGAP-------IIFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSL 232

Query: 164 CFD--ENDSGSVFFGDQ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS--- 215
           CF   E D G++  GD    G  + Q T  L        Y V + S  +    L  S   
Sbjct: 233 CFGMVEGD-GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSL 291

Query: 216 ---GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSW-KYCY-NASSE 267
              G+  ++DSG +FT++P+ ++       +K   S   KR+      +   C+  A S 
Sbjct: 292 FDQGYGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSH 351

Query: 268 EMLKV-----PDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
           + L+      P M + F +  S V+   N++F    N G   +CL V        ++G  
Sbjct: 352 DDLEALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSG--KYCLGVFDNGRAGTLLGGI 409

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
                 + +DR N ++ +  + C+E+
Sbjct: 410 TFRNVLVRYDRANQRVGFGPALCKEL 435


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 150/342 (43%), Gaps = 50/342 (14%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P  SS+   V CS     +  +C S K  C Y   Y+ E +SSSG L +DI+   + 
Sbjct: 126 RFQPDLSSTYSPVKCS-----ADCTCDSDKSQCTYERQYA-EMSSSSGVLGEDIVSFGTE 179

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG+MGLG G +S+   L   G+I +SFS+
Sbjct: 180 SELKPQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSM 233

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    V      P     +   P+   Y  Y + ++   +    L        
Sbjct: 234 CYGGMDIGGGAMVLGAMPAPPDMVFSRSDPVRSPY--YNIELKEIHVAGKALRLDPRIFD 291

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEE 268
           S    ++DSG ++ +LP + +    V F   V+SK   L+       N    C+  +   
Sbjct: 292 SKHGTVLDSGTTYAYLPEQAF----VAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRN 347

Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGII 317
           + ++    PD+ ++F   Q  S    N++F   + EG   +CL V     D      GI+
Sbjct: 348 VSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIV 405

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
            +N +    + +DR N K+ +  + C E+ ++ HV   P PA
Sbjct: 406 VRNTL----VTYDRHNEKIGFWKTNCSELWERLHVSGAPSPA 443


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 91/326 (27%), Positives = 134/326 (41%), Gaps = 26/326 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
           +L  YDP  SSS   VSC    C +    K    +   PC Y   Y  + +S++GY V D
Sbjct: 126 DLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYG-DGSSTTGYFVSD 184

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     S         +SVI GCG +Q G       A DG++G G  + S+ S LA AG
Sbjct: 185 SLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAG 244

Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            ++  FS C D    G +F  GD      +ST  +P    Y+   V +ES  +G + L  
Sbjct: 245 EVKKIFSHCLDTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYN---VNLESINVGGTTLQL 301

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNAS 265
                 T      ++DSG + T+LP  +Y +V+   F K   +   S+Q      C    
Sbjct: 302 PSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD---FLCIQYF 358

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNF 321
                  P +   F  +    V  H + F   +    F      + S DG D  ++G   
Sbjct: 359 QSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLV 418

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVI 347
           +    +V+D EN  + W+   C   I
Sbjct: 419 LSNKVVVYDLENQVVGWTDYNCSSSI 444


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 143/320 (44%), Gaps = 45/320 (14%)

Query: 63  CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
           C    +C + +  C Y   Y+ E +SSSG L +DI+     S+  PQ +V      GC  
Sbjct: 156 CNVDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-----FGCEN 209

Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
            +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G    V  G   
Sbjct: 210 TETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 268

Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
           P     +   P+   Y  Y + ++   +    L        S    ++DSG ++ +LP +
Sbjct: 269 PPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 326

Query: 234 IYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIFSKNQ 283
            +    V F   V++K  SL+       N    C+  +   + ++    PD+ ++F   Q
Sbjct: 327 AF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 382

Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVFDRENLKL 336
             S    N++F   + EG   +CL V     D      GI+ +N +    + +DR N K+
Sbjct: 383 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 436

Query: 337 AWSHSKCEEVIDKSHVHLVP 356
            +  + C E+ ++ H+  VP
Sbjct: 437 GFWKTNCSELWERLHISEVP 456


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 143/320 (44%), Gaps = 45/320 (14%)

Query: 63  CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
           C    +C + +  C Y   Y+ E +SSSG L +DI+     S+  PQ +V      GC  
Sbjct: 157 CNVDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-----FGCEN 210

Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
            +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G    V  G   
Sbjct: 211 TETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 269

Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
           P     +   P+   Y  Y + ++   +    L        S    ++DSG ++ +LP +
Sbjct: 270 PPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 327

Query: 234 IYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIFSKNQ 283
            +    V F   V++K  SL+       N    C+  +   + ++    PD+ ++F   Q
Sbjct: 328 AF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 383

Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIVFDRENLKL 336
             S    N++F   + EG   +CL V     D      GI+ +N +    + +DR N K+
Sbjct: 384 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 437

Query: 337 AWSHSKCEEVIDKSHVHLVP 356
            +  + C E+ ++ H+  VP
Sbjct: 438 GFWKTNCSELWERLHISEVP 457


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 143/320 (44%), Gaps = 45/320 (14%)

Query: 63  CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
           C    +C + +  C Y   Y+ E +SSSG L +DI+     S+  PQ +V      GC  
Sbjct: 146 CNVDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-----FGCEN 199

Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
            +TG      A DG+MGLG G +S+   L + G+I +SFS+C+   D G    V  G   
Sbjct: 200 TETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 258

Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
           P     +   P+   Y  Y + ++   +    L        S    ++DSG ++ +LP +
Sbjct: 259 PPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 316

Query: 234 IYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIFSKNQ 283
            +    V F   V++K  SL+       N    C+  +   + ++    PD+ ++F   Q
Sbjct: 317 AF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 372

Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVFDRENLKL 336
             S    N++F   + EG   +CL V     D      GI+ +N +    + +DR N K+
Sbjct: 373 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 426

Query: 337 AWSHSKCEEVIDKSHVHLVP 356
            +  + C E+ ++ H+  VP
Sbjct: 427 GFWKTNCSELWERLHISEVP 446


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 86/329 (26%), Positives = 140/329 (42%), Gaps = 30/329 (9%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
           +L+ YDP++S+SSK V+C    C + +      SC +   PC Y   Y  + +S++G+ V
Sbjct: 132 DLTLYDPTASASSKTVTCGQEFCATATNGGVPPSCAA-NSPCQYSITYG-DGSSTTGFFV 189

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAK 153
            D L     S     +   +SV  GCG K  G+      A DG++G G  + S+ S L  
Sbjct: 190 ADFLQYDQVSGDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTS 249

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           AG +   FS C D  + G +F    G   Q      P+      Y V +++  +G S L 
Sbjct: 250 AGKVTKIFSHCLDTVNGGGIF--AIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQ 307

Query: 214 ---------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
                          ++DSG +  +LP  +Y  V+       +   ++L+      C+  
Sbjct: 308 LPTNIFDIGGGSRGTIIDSGTTLAYLPEVVYKAVLSAV--FSNHPDVTLKNVQDFLCFQY 365

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
           S       P++   F  +   VV  H + F   E   V+C+      V S DG D  ++G
Sbjct: 366 SGSVDNGFPEVTFHFDGDLPLVVYPHDYLFQNTE--DVYCVGFQSGGVQSKDGKDMVLLG 423

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
              +    +V+D EN  + W++  C   I
Sbjct: 424 DLALSNKLVVYDLENQVIGWTNYNCSSSI 452


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 102/211 (48%), Gaps = 14/211 (6%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R L+ YDP SS SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH
Sbjct: 125 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 182

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
                 +       +SV  GCG +Q+GS  + A A DG++G G  + +  S LA AG  +
Sbjct: 183 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 242

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
             FS C D  + G +F    G   +      PI +  + Y  V ++S  +  + L     
Sbjct: 243 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 300

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVV 240
              T       +DSG++  +LP  IY+E+++
Sbjct: 301 IFGTTKTKGTFIDSGSTLVYLPEIIYSELIL 331


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 83/326 (25%), Positives = 137/326 (42%), Gaps = 28/326 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  SS+   VSC    C +        C +   PC Y   Y  + +S++GY V D
Sbjct: 133 LTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSD 190

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           +L     S         S+V  GCG +Q G       A DG++G G  + S+ S L+ AG
Sbjct: 191 LLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAG 250

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            ++  F+ C D  + G +F    G   Q      P+      Y V ++S  +G + L   
Sbjct: 251 KVKKIFAHCLDTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLP 308

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                T      ++DSG + T+LP  +Y E+++        K I+        C+     
Sbjct: 309 SHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGR 366

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFM 322
                P +   F  +    V  H + F EN G  ++C+      + S DG   ++  + +
Sbjct: 367 VDDDFPKITFHFENDLPLNVYPHDYFF-EN-GDNLYCVGFQNGGLQSKDGKGMVLLGDLV 424

Query: 323 MGHR-IVFDRENLKLAWSHSKCEEVI 347
           + ++ +V+D EN  + W+   C   I
Sbjct: 425 LSNKLVVYDLENQVIGWTEYNCSSSI 450


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 151/335 (45%), Gaps = 43/335 (12%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           QD N   + P  SS+ + + CS        +C S    C Y   Y+ E +SSSG L +DI
Sbjct: 130 QDPN---FQPDWSSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDI 180

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +     S+  PQ +V      GC   +TG      A DG+MGLG GD+S+   L + G+I
Sbjct: 181 VSFGKQSELKPQRTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVI 234

Query: 158 QNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------ 208
            NSFS+C+   D G    V  G   PA    T   P    Y  Y + ++   I       
Sbjct: 235 GNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPI 292

Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNAS 265
           N  +    +  ++DSG ++ +LP   +        K ++S ++ +QG    Y   C++  
Sbjct: 293 NPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGV 351

Query: 266 SEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GI 316
             ++ ++    P + L+FS      +    + F  ++    +CL +   + D      GI
Sbjct: 352 GSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGI 411

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
           I +N +    +++DRE+LK+ +  + C E+ +  H
Sbjct: 412 IVRNTL----VMYDREHLKIGFWKTNCSEIWEILH 442


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 151/335 (45%), Gaps = 43/335 (12%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           QD N   + P  SS+ + + CS        +C S    C Y   Y+ E +SSSG L +DI
Sbjct: 130 QDPN---FQPDWSSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDI 180

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +     S+  PQ +V      GC   +TG      A DG+MGLG GD+S+   L + G+I
Sbjct: 181 VSFGKQSELKPQRTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVI 234

Query: 158 QNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------ 208
            NSFS+C+   D G    V  G   PA    T   P    Y  Y + ++   I       
Sbjct: 235 GNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPI 292

Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNAS 265
           N  +    +  ++DSG ++ +LP   +        K ++S ++ +QG    Y   C++  
Sbjct: 293 NPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGV 351

Query: 266 SEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GI 316
             ++ ++    P + L+FS      +    + F  ++    +CL +   + D      GI
Sbjct: 352 GSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGI 411

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
           I +N +    +++DRE+LK+ +  + C E+ +  H
Sbjct: 412 IVRNTL----VMYDREHLKIGFWKTNCSEIWEILH 442


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/322 (25%), Positives = 139/322 (43%), Gaps = 22/322 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           LS YD  +SS+SKNV C    C    +S     K PC Y   Y    TS   ++ D+I  
Sbjct: 118 LSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT- 176

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           L   + +   + +   V+ GCG+ Q+G      +A DG+MG G  + S+ S LA  G  +
Sbjct: 177 LEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTK 236

Query: 159 NSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS------- 210
             FS C D  + G +F  G+      ++T  +P    Y+    G++    G+        
Sbjct: 237 RIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMD--VDGDPIDLPPSL 294

Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEM 269
             T      ++DSG +  +LP  +Y  ++   +K+ + +++ L      + C++ +S   
Sbjct: 295 ASTNGDGGTIIDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTD 351

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGH 325
              P + L F  +    V  H + F   E    F      + + DG D  ++G   +   
Sbjct: 352 KAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 411

Query: 326 RIVFDRENLKLAWSHSKCEEVI 347
            +V+D EN  + W+   C   I
Sbjct: 412 LVVYDLENEVIGWADHNCSSSI 433


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/322 (25%), Positives = 139/322 (43%), Gaps = 22/322 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           LS YD  +SS+SKNV C    C    +S     K PC Y   Y    TS   ++ D+I  
Sbjct: 122 LSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT- 180

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           L   + +   + +   V+ GCG+ Q+G      +A DG+MG G  + S+ S LA  G  +
Sbjct: 181 LEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTK 240

Query: 159 NSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS------- 210
             FS C D  + G +F  G+      ++T  +P    Y+    G++    G+        
Sbjct: 241 RIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMD--VDGDPIDLPPSL 298

Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEM 269
             T      ++DSG +  +LP  +Y  ++   +K+ + +++ L      + C++ +S   
Sbjct: 299 ASTNGDGGTIIDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTD 355

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGH 325
              P + L F  +    V  H + F   E    F      + + DG D  ++G   +   
Sbjct: 356 KAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 415

Query: 326 RIVFDRENLKLAWSHSKCEEVI 347
            +V+D EN  + W+   C   I
Sbjct: 416 LVVYDLENEVIGWADHNCSSSI 437


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/326 (25%), Positives = 137/326 (42%), Gaps = 28/326 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  SS+   VSC    C +        C +   PC Y   Y  + +S++GY V D
Sbjct: 48  LTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSD 105

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           +L     S         S+V  GCG +Q G       A DG++G G  + S+ S L+ AG
Sbjct: 106 LLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAG 165

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            ++  F+ C D  + G +F    G   Q      P+      Y V ++S  +G + L   
Sbjct: 166 KVKKIFAHCLDTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLP 223

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                T      ++DSG + T+LP  +Y E+++        K I+        C+     
Sbjct: 224 SHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGR 281

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFM 322
                P +   F  +    V  H + F EN G  ++C+      + S DG   ++  + +
Sbjct: 282 VDDDFPKITFHFENDLPLNVYPHDYFF-EN-GDNLYCVGFQNGGLQSKDGKGMVLLGDLV 339

Query: 323 MGHR-IVFDRENLKLAWSHSKCEEVI 347
           + ++ +V+D EN  + W+   C   I
Sbjct: 340 LSNKLVVYDLENQVIGWTEYNCSSSI 365


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 140/327 (42%), Gaps = 28/327 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
           +L+ YDP +SS+   V C    C      +    S   PC Y   Y  + +S+ G  V+D
Sbjct: 131 DLTLYDPKASSTGSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYG-DGSSTVGSFVND 189

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
            L     +         +SVI GCG +Q G     + A DG++G G  + S+ S LA AG
Sbjct: 190 ALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAG 249

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
            ++  F+ C D    G +F    G   Q      P+      Y V +++  +G + L   
Sbjct: 250 KVKKIFAHCLDTIKGGGIF--AIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELP 307

Query: 214 ----QSGFQ--ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASS 266
               + G +   ++DSG + T+LP  ++ +V++  F+K    + I+        C+  S 
Sbjct: 308 ADIFKPGEKRGTIIDSGTTLTYLPELVFKKVMLAVFNK---HQDITFHDVQDFLCFEYSG 364

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
                 P +   F  + +  V  H + FP   G  V+C+      + S DG D  ++G  
Sbjct: 365 SVDDGFPTLTFHFEDDLALHVYPHEYFFP--NGNDVYCVGFQNGALQSKDGKDIVLMGDL 422

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
            +    +V+D EN  + W+   C   I
Sbjct: 423 VLSNKLVVYDLENRVIGWTDYNCSSSI 449


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 85/331 (25%), Positives = 146/331 (44%), Gaps = 37/331 (11%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           L+ YDP SS+S+  + C    C +      +   K L  PC Y   Y  + +S++G+ V 
Sbjct: 126 LTLYDPQSSTSATRIYCDDDFCAATYNGVLQGCTKDL--PCQYSVVYG-DGSSTAGFFVK 182

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
           D L     + +   SS   SVI GCG KQ+G       A DG++G G  + S+ S LA A
Sbjct: 183 DNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAA 242

Query: 155 GLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           G ++  F+ C D    G +F  G+       +T  +P    Y+     +E   +G + L 
Sbjct: 243 GKVKRVFAHCLDNVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIE---VGGNVLE 299

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YCY 262
                  T      ++DSG +  +LP  +Y  ++ K    + S++  L+ ++ +    C+
Sbjct: 300 LPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESMMTK----IVSEQPGLKLHTVEEQFTCF 355

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGI 316
             +       P ++  F+ + S  V  H + F  +E   V+C       + S DG D  +
Sbjct: 356 QYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHE--EVWCFGWQNSGMQSKDGRDMTL 413

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           +G   +    +++D EN  + W+   C   I
Sbjct: 414 LGDLVLSNKLVLYDLENQAIGWTDYNCSSSI 444


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 87/358 (24%), Positives = 155/358 (43%), Gaps = 36/358 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +S +DP  S+S  ++SC+   C   S S C      CPY   Y  + +S++GYL++D+L 
Sbjct: 92  ISIFDPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYG-DGSSTAGYLINDVLS 150

Query: 100 LASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
                S ++  +S  + +  GCG  QTG++L     DG++G G  +VS+PS L+K  +  
Sbjct: 151 FNQVPSGNSTATSGTARLTFGCGSNQTGTWLT----DGLVGFGQAEVSLPSQLSKQNVSV 206

Query: 159 NSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           N F+ C   D   SG++  G         T  +P    Y+   + +     G +  T + 
Sbjct: 207 NIFAHCLQGDNKGSGTLVIGHIREPGLVYTPIVPKQSHYNVELLNIG--VSGTNVTTPTA 264

Query: 217 FQ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
           F        ++DSG + T+L    Y +   K    + S  + +      + +  + E   
Sbjct: 265 FDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCMRSGVLPV-----AFQFFCTIEGYF 319

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPE--NEGFTVFCLTVMSTDGDYG-----IIGQNFMM 323
             P++ L F+   + ++    + + E    G + +C + + +   YG     I G N + 
Sbjct: 320 --PNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLK 377

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP----PAGQSPNPLPTTEQQSTSNG 377
              +V+D  N ++ W +  C + I  S      P    P+   P     T   + SNG
Sbjct: 378 DQLVVYDNVNNRIGWKNFDCTKEISVSSTATSMPVTVFPSKAGPPGAFVTTNNAHSNG 435


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 136/327 (41%), Gaps = 28/327 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
           +L+ YDP +SSS   VSC    C +    K    +   PC Y   Y  + +S++G+ V D
Sbjct: 127 DLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFVTD 185

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     +         ++V  GCG +Q G       A DG++G G  + S+ S LA AG
Sbjct: 186 ALQFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAG 245

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            ++  F+ C D    G +F    G   Q      P+      Y V ++S  +G + L   
Sbjct: 246 KVKKIFAHCLDTIKGGGIF--AIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLP 303

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASS 266
                T      ++DSG + T+LP  ++ EV+   F+K    + I         C+    
Sbjct: 304 AHVFETGERKGTIIDSGTTLTYLPELVFKEVMAAIFNK---HQDIVFHNVQDFMCFQYPG 360

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
                 P +   F  + +  V  H + FP   G  ++C+      + S DG D  ++G  
Sbjct: 361 SVDDGFPTITFHFEDDLALHVYPHEYFFP--NGNDMYCVGFQNGALQSKDGKDIVLMGDL 418

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
            +    +++D EN  + W+   C   I
Sbjct: 419 VLSNKLVIYDLENQVIGWTDYNCSSSI 445


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 138/322 (42%), Gaps = 37/322 (11%)

Query: 51  SSSKNVSCSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + SK V C H LC S        +  C+S  + C Y+  Y+ +  SS+G LV+D     S
Sbjct: 110 TKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYA-DQGSSTGVLVND-----S 163

Query: 103 FSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNS 160
           F+      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L + G+ +N 
Sbjct: 164 FALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNV 223

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQ 218
              C      G +FFGD     Q++T + P+      + Y  G  S   G+  L     +
Sbjct: 224 VGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAK 282

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
            + DSG+SFT+   + Y  +V      +S         S   C+    E    V D+R  
Sbjct: 283 VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKE 341

Query: 279 FSKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGIIGQNFMMG 324
           F   +S V+     +  +   P      V      CL +++       D  IIG   M  
Sbjct: 342 F---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQD 398

Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
           H +++D E  K+ W  + C+  
Sbjct: 399 HMVIYDNEKGKIGWIRAPCDRA 420


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 88/331 (26%), Positives = 140/331 (42%), Gaps = 34/331 (10%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
           +L+ YDP  SSS   VSC +  C +          C + K PC Y A+Y  + +S++G  
Sbjct: 130 DLALYDPKGSSSGSAVSCDNKFCAATYGSGEKLPGCTAGK-PCEYRAEYG-DGSSTAGSF 187

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLA 152
           V D L     S +A     +++VI GCG +Q G       A DG++G G  + S  S LA
Sbjct: 188 VSDSLQYNQLSGNAQTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLA 247

Query: 153 KAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
            AG ++  FS C D    G +F  G+      +ST  LP    Y+   V ++S  +  + 
Sbjct: 248 SAGEVKKIFSHCLDTIKGGGIFAIGEVVQPKVKSTPLLPNMSHYN---VNLQSIDVAGNA 304

Query: 212 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCY 262
           L        T      ++DSG + T+LP  +Y +++   F K       ++QG     C+
Sbjct: 305 LQLPPHIFETSEKRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQG---FLCF 361

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD------GDYGI 316
             S       P +   F  +    V  H + F    G  ++CL   +         D  +
Sbjct: 362 EYSESVDDGFPKITFHFEDDLGLNVYPHDYFF--QNGDNLYCLGFQNGGFQPKDAKDMVL 419

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           +G   +    +V+D E   + W+   C   I
Sbjct: 420 LGDLVLSNKVVVYDLEKQVIGWTDYNCSSSI 450


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/345 (25%), Positives = 149/345 (43%), Gaps = 45/345 (13%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
           ++N S Y P  SS+ +N+SC  P C+  SS      CK+    CPY  DY+    ++  +
Sbjct: 207 EQNGSHYYPKDSSTYRNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDF 266

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
             +      ++     +      V+ GCG    G +  GA+  G++GLG G +S PS + 
Sbjct: 267 ASETFTVNLTWPNGKEKFKQVVDVMFGCGHWNKG-FFYGAS--GLLGLGRGPISFPSQIQ 323

Query: 153 KAGLIQNSFSICFDE-----NDSGSVFFGDQGPATQQS----TSFLPIGEKYDA--YFVG 201
              +  +SFS C  +     + S  + FG+            T+ L   E  D   Y++ 
Sbjct: 324 --SIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQ 381

Query: 202 VESYCIGNSCLTQS---------------GFQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
           ++S  +G   L  S               G   ++DSG++ TF P   Y  +   F+K +
Sbjct: 382 IKSIMVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKI 441

Query: 247 SSKRISLQGNSWKYCYNASSEEM-LKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVF 303
             ++I+        CYN S   M +++PD  + F+     +F   N+ + +  +E   V 
Sbjct: 442 KLQQIAADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDE---VI 498

Query: 304 CLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           CL +M T       IIG        I++D +  +L +S  +C EV
Sbjct: 499 CLAIMKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 543


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 153/329 (46%), Gaps = 38/329 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++DP SSS+ K + C+         C S    C Y   Y+ E ++SSG L +D++   + 
Sbjct: 124 KFDPESSSTYKPIKCNIDCI-----CDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQ 177

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG+MGLG GD+S+   L + G I +SFS+
Sbjct: 178 SELIPQRAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSL 231

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG--- 216
           C+   D G    V  G   P+    T   P+   Y  Y V + E +  G      SG   
Sbjct: 232 CYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFD 289

Query: 217 --FQALVDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNA----SSEE 268
             + A++DSG ++ +LP E ++       D++ S K+I     ++K  C++     ++E 
Sbjct: 290 GRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAEL 349

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
             K P + ++F   Q   +    + F  ++    +CL +     D      GI+ +N + 
Sbjct: 350 SNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTL- 408

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
              +++DR N K+ +  + C E+ ++  +
Sbjct: 409 ---VMYDRANSKIGFWKTNCSELWERLRI 434


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 153/329 (46%), Gaps = 38/329 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++DP SSS+ K + C+         C S    C Y   Y+ E ++SSG L +D++   + 
Sbjct: 124 KFDPESSSTYKPIKCNIDCI-----CDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQ 177

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG+MGLG GD+S+   L + G I +SFS+
Sbjct: 178 SELIPQRAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSL 231

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG--- 216
           C+   D G    V  G   P+    T   P+   Y  Y V + E +  G      SG   
Sbjct: 232 CYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFD 289

Query: 217 --FQALVDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNA----SSEE 268
             + A++DSG ++ +LP E ++       D++ S K+I     ++K  C++     ++E 
Sbjct: 290 GRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAEL 349

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
             K P + ++F   Q   +    + F  ++    +CL +     D      GI+ +N + 
Sbjct: 350 SNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTL- 408

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
              +++DR N K+ +  + C E+ ++  +
Sbjct: 409 ---VMYDRANSKIGFWKTNCSELWERLRI 434


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 145/327 (44%), Gaps = 22/327 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS+S  ++CS   C      S ++C S  + C Y   Y  + + +SGY V D
Sbjct: 122 LNFFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSD 180

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           ++HL +  + +  ++  + V+ GC  +QTG       A DG+ G G  ++SV S L+  G
Sbjct: 181 MMHLNTIFEGSMTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 240

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGN 209
           +    FS C   D +  G +  G+        TS +P    Y+       V  ++  I +
Sbjct: 241 IAPRIFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDS 300

Query: 210 SCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASS 266
           S    S  +  +VDSG +  +L  E Y   V      +  S + +  +GN    CY  +S
Sbjct: 301 SVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRGNQ---CYLITS 357

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMM 323
                 P + L F+   S ++R   +   +N   G  V+C+      G    I+G   + 
Sbjct: 358 SVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLK 417

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKS 350
              +V+D    ++ W++  C   ++ S
Sbjct: 418 DKIVVYDLAGQRIGWANYDCSLSVNVS 444


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 90/321 (28%), Positives = 137/321 (42%), Gaps = 36/321 (11%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + SK V C H LC S       +  C S  + C Y+  Y+ +  SS+G L++D     SF
Sbjct: 112 TKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLIND-----SF 165

Query: 104 SKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L + G+ +N  
Sbjct: 166 ALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVV 225

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQA 219
             C      G +FFGD     Q++T + P+      + Y  G  S   G+  L     + 
Sbjct: 226 GHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 284

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           + DSG+SFT+   + Y  +V      +S         S   C+    E    V D+R  F
Sbjct: 285 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKEF 343

Query: 280 SKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGIIGQNFMMGH 325
              +S V+     +  +   P      V      CL +++       D  IIG   M  H
Sbjct: 344 ---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDH 400

Query: 326 RIVFDRENLKLAWSHSKCEEV 346
            +++D E  K+ W  + C+  
Sbjct: 401 MVIYDNEKGKIGWIRAPCDRA 421


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 90/320 (28%), Positives = 137/320 (42%), Gaps = 36/320 (11%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + SK V C H LC S       +  C S  + C Y+  Y+ +  SS+G L++D     SF
Sbjct: 103 TKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLIND-----SF 156

Query: 104 SKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L + G+ +N  
Sbjct: 157 ALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVV 216

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQA 219
             C      G +FFGD     Q++T + P+      + Y  G  S   G+  L     + 
Sbjct: 217 GHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 275

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           + DSG+SFT+   + Y  +V      +S         S   C+    E    V D+R  F
Sbjct: 276 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKEF 334

Query: 280 SKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGIIGQNFMMGH 325
              +S V+     +  +   P      V      CL +++       D  IIG   M  H
Sbjct: 335 ---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDH 391

Query: 326 RIVFDRENLKLAWSHSKCEE 345
            +++D E  K+ W  + C+ 
Sbjct: 392 MVIYDNEKGKIGWIRAPCDR 411


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 147/340 (43%), Gaps = 36/340 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  S S + V+C    C +       SC S   PC Y   Y  + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTD 191

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     S     +   +SV  GCG K  G       A DG++G G  + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251

Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            ++  F+ C D  + G +F  G+      ++T  +P    Y+    G++   +G + L  
Sbjct: 252 KVRKMFAHCLDTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGL 308

Query: 213 ------TQSGFQALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                 + +    ++DSG +  ++P  +Y A   + FDK    + IS+Q      C+  S
Sbjct: 309 PTNIFDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYS 365

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
                  P++   F  + S +V  H + F    G  ++C+      V + DG D  ++G 
Sbjct: 366 GSVDDGFPEVTFHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVLLGD 423

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI----DKSHVHLV 355
             +    +++D EN  + W+   C   I    DK   + V
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCSSSIKISDDKGSTYTV 463


>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 320

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 128/272 (47%), Gaps = 17/272 (6%)

Query: 85  EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLG 143
           + +S++GYLV D++HL   + +    S   ++I GCG KQ+G   +  AA DG+MG G  
Sbjct: 4   DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQS 63

Query: 144 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
           + S  S LA  G ++ SF+ C D N+ G +F    G          P+  K   Y V + 
Sbjct: 64  NSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLN 121

Query: 204 SYCIGNSC--LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
           +  +GNS   L+ + F +      ++DSG +  +LP  +Y  ++ +   L S   ++L  
Sbjct: 122 AIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEI--LASHPELTLHT 179

Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG- 312
               +     ++++ + P +   F K+ S  V  R ++F   E+     +    + T G 
Sbjct: 180 VQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGG 239

Query: 313 -DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
               I+G   +    +V+D EN  + W++  C
Sbjct: 240 ASLTILGDMALSNKLVVYDIENQVIGWTNHNC 271


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 79/333 (23%), Positives = 150/333 (45%), Gaps = 32/333 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  SS+ ++V C+        +C   K  C Y   Y+ E ++SSG L +DI+   + 
Sbjct: 54  KFQPDLSSTYQSVKCN-----IDCNCDDEKQQCVYERQYA-EMSTSSGVLGEDIISFGNL 107

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S  APQ +V      GC   +TG      A DG+MG+G GD+S+   L   G+I +SFS+
Sbjct: 108 SALAPQRAV-----FGCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSL 161

Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
           C+          V  G   P+    +   P+   Y  Y + ++   +       N  +  
Sbjct: 162 CYGGMGIGGGAMVLGGISPPSNMVFSQSDPVRSPY--YNIDLKEIHVAGKPLPLNPTVFD 219

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLK 271
                ++DSG ++ +LP   +        K + S +  ++G    Y   C++ +  ++ +
Sbjct: 220 GKHGTILDSGTTYAYLPEAAFVSFKDAIMKELHSLK-PIRGPDPNYNDICFSGAGSDISQ 278

Query: 272 V----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHR 326
           +    P + ++F   Q  ++    + F  ++    +CL +     D   ++G   +    
Sbjct: 279 LSSSFPAVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTL 338

Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
           +++DREN K+ +  + C E+ ++ +V   PPPA
Sbjct: 339 VLYDRENSKIGFWKTNCSELWERLNVDGAPPPA 371


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 83/330 (25%), Positives = 144/330 (43%), Gaps = 28/330 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS+S  ++CS   C      S ++C S  + C Y   Y  + + +SGY V D
Sbjct: 119 LNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSD 177

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           ++HL +  + +  ++  + V+ GC  +QTG       A DG+ G G  ++SV S L+  G
Sbjct: 178 MMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 237

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           +    FS C   D +  G +  G+        TS +P    Y+   + ++S  +    L 
Sbjct: 238 IAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYN---LNLQSIAVNGQTLQ 294

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYN 263
                  T +    +VDSG +  +L  E Y   V      +  S   +  +GN    CY 
Sbjct: 295 IDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVVSRGNQ---CYL 351

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQN 320
            +S      P + L F+   S ++R   +   +N   G  V+C+      G    I+G  
Sbjct: 352 ITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDL 411

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
            +    +V+D    ++ W++  C   ++ S
Sbjct: 412 VLKDKIVVYDLAGQRIGWANYDCSLSVNVS 441


>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
 gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
          Length = 260

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 9/180 (5%)

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ--ALVDSGASFT 228
           G + FGD+G   Q  T  LP  E    Y V V    +G   +   G Q  AL D+G SFT
Sbjct: 11  GRISFGDKGYTDQMETPLLPT-EPSPTYAVSVTEVSVGGDAV---GVQLLALFDTGTSFT 66

Query: 229 FLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFV 286
            L    Y  +   FD  V+ KR  +     +++CY+ S ++  +  P + + F       
Sbjct: 67  HLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMF 126

Query: 287 VRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
           +RN +F     +   ++CL ++ S D    IIGQNFM G+RIVFDRE + L W  S C E
Sbjct: 127 LRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDCFE 186


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 85/327 (25%), Positives = 144/327 (44%), Gaps = 35/327 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDPS SS+   +SC    C      +  SC S    C Y   Y  + +S+ GY + D
Sbjct: 82  LTTYDPSRSSTDGALSCRDSNCGAALGSNEVSCTS-AGYCAYSTTYG-DGSSTQGYFIQD 139

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYL-DGAAPDGVMGLGLGDVSVPSLLAKAG 155
           ++       +  Q +  +SV  GCG  Q+G+ L    A DG++G G   VS+PS LA  G
Sbjct: 140 VMTFQEIHNNT-QVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMG 198

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCL 212
            + N F+ C   D    G++  G     ++ + S+ PI  + + Y VG+++  + G +  
Sbjct: 199 KVGNRFAHCLQGDNQGGGTIVIGS---VSEPNISYTPIVSR-NHYAVGMQNIAVNGRNVT 254

Query: 213 TQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           T + F          ++DSG +  +L    Y + V      VS+   S+  +  +    A
Sbjct: 255 TPASFDTTSTSAGGVIMDSGTTLAYLVDPAYTQFV----NAVSTFESSMFSSHSQCLQLA 310

Query: 265 SSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGII 317
                   P ++L F      +   RN+++S P   G   +C+     T  +    Y I+
Sbjct: 311 WCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSIL 370

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCE 344
           G   +  H +V+D +N  + W    C+
Sbjct: 371 GDIVLKDHLVVYDNDNRVVGWKSFDCK 397


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 87/328 (26%), Positives = 146/328 (44%), Gaps = 29/328 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YDP+ S +S  V C    C        S CK     CPY   Y  + +++SG  V+
Sbjct: 45  DLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTSGSFVN 102

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
           D L     S +       SSVI GCG KQ+GS    +  A DG++G G  + SV S LA 
Sbjct: 103 DSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAA 162

Query: 154 AGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           +G ++  FS C D +  G +F  G        +T  +P    Y+     ++    G   L
Sbjct: 163 SGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VDGEPIL 220

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                  + SG   ++DSG +  +LP  IY +++ K        ++ +  + +  C++ S
Sbjct: 221 LPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFT-CFHYS 279

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
            +     P ++  F +  S  V  H + F   E   ++C+     +  + +G D  +IG 
Sbjct: 280 DKLDEGFPVVKFHF-EGLSLTVHPHDYLFLYKE--DIYCIGWQKSSTQTKEGRDLILIGD 336

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             +    +V+D EN+ + W++  C   I
Sbjct: 337 LVLSNKLVVYDLENMVIGWTNFNCSSSI 364


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 90/366 (24%), Positives = 161/366 (43%), Gaps = 47/366 (12%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  S + + V C+ P C    +C    + C Y   Y+ E +SSSG L +D++   + 
Sbjct: 130 KFQPDLSETYQPVKCT-PDC----NCDGDTNQCMYDRQYA-EMSSSSGVLGEDVVSFGNL 183

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+ APQ +V      GC   +TG      A DG+MGLG GD+S+   L    +I +SFS+
Sbjct: 184 SELAPQRAV-----FGCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSL 237

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
           C+   D G    +  G   P     T   P    Y  Y + ++   +       N  +  
Sbjct: 238 CYGGMDVGGGAMILGGISPPEDMVFTHSDPDRSPY--YNINLKEMHVAGKKLQLNPKVFD 295

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEE 268
                ++DSG ++ +LP   +    + F + +  +R SL+       N    C+  +  +
Sbjct: 296 GKHGTVLDSGTTYAYLPETAF----LAFKRAIMKERNSLKQINGPDPNYKDICFTGAGID 351

Query: 269 MLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMM 323
           + ++    P + ++F       +    + F  ++    +CL V S   D   ++G  F+ 
Sbjct: 352 VSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVR 411

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 383
              +++DREN K+ +  + C E+ +  H          +P+PLP+  +   +N   A  P
Sbjct: 412 NTLVMYDRENSKIGFWKTNCSELWETLHT-------SDAPSPLPSNSE--VTNLTKAFAP 462

Query: 384 STAKTA 389
           S A +A
Sbjct: 463 SVAPSA 468


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 87/328 (26%), Positives = 146/328 (44%), Gaps = 29/328 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YDP+ S +S  V C    C        S CK     CPY   Y  + +++SG  V+
Sbjct: 115 DLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTSGSFVN 172

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
           D L     S +       SSVI GCG KQ+GS    +  A DG++G G  + SV S LA 
Sbjct: 173 DSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAA 232

Query: 154 AGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           +G ++  FS C D +  G +F  G        +T  +P    Y+     ++    G   L
Sbjct: 233 SGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VDGEPIL 290

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                  + SG   ++DSG +  +LP  IY +++ K        ++ +  + +  C++ S
Sbjct: 291 LPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFT-CFHYS 349

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
            +     P ++  F +  S  V  H + F   E   ++C+     +  + +G D  +IG 
Sbjct: 350 DKLDEGFPVVKFHF-EGLSLTVHPHDYLFLYKE--DIYCIGWQKSSTQTKEGRDLILIGD 406

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             +    +V+D EN+ + W++  C   I
Sbjct: 407 LVLSNKLVVYDLENMVIGWTNFNCSSSI 434


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 143/339 (42%), Gaps = 34/339 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  S S + V+C    C +       SC S   PC Y   Y  + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTD 191

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     S     +   +SV  GCG K  G       A DG++G G  + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            ++  F+ C D  + G +F    G   Q      P+      Y V ++   +G + L   
Sbjct: 252 KVRKMFAHCLDTVNGGGIFA--IGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLP 309

Query: 213 -----TQSGFQALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                + +    ++DSG +  ++P  +Y A   + FDK    + IS+Q      C+  S 
Sbjct: 310 TNIFDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSG 366

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
                 P++   F  + S +V  H + F    G  ++C+      V + DG D  ++G  
Sbjct: 367 SVDDGFPEVTFHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVLLGDL 424

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI----DKSHVHLV 355
            +    +++D EN  + W+   C   I    DK   + V
Sbjct: 425 VLSNKLVLYDLENQAIGWADYNCSSSIKISDDKGSTYTV 463


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 87/341 (25%), Positives = 144/341 (42%), Gaps = 34/341 (9%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YD   SSS K V C    CK       + C +    CPY+  Y  + +S++GY V 
Sbjct: 126 DLTLYDIKESSSGKLVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVK 183

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           DI+     S      S   S++ GCG +Q+G  S  +  A DG++G G  + S+ S LA 
Sbjct: 184 DIVLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLAS 243

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           +G ++  F+ C +  + G +F    G   Q   +  P+      Y V + +  +G++ L 
Sbjct: 244 SGKVKKMFAHCLNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLS 301

Query: 213 --TQSGFQA-----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
             T +  Q      ++DSG +  +LP  IY  +V K        ++    + +  C+  S
Sbjct: 302 LSTDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYT-CFQYS 360

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQN 320
                  P +   F    S  V  H + FP       +C+        S D     +  +
Sbjct: 361 ESVDDGFPAVTFFFENGLSLKVYPHDYLFPS---VNFWCIGWQNSGTQSRDSKNMTLLGD 417

Query: 321 FMMGHRIVF-DRENLKLAWSHSKCEEVID-----KSHVHLV 355
            ++ +++VF D EN  + W+   C   I         VHLV
Sbjct: 418 LVLSNKLVFYDLENQAIGWAEYNCSSSIKVRDERTGTVHLV 458


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 137/325 (42%), Gaps = 34/325 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP  S+++K ++C  PLC   + SC    D C Y   Y+ E +SS G++++D       
Sbjct: 55  FDPDKSTTAKKLACGDPLCNCGTPSCTCNNDRCYYSRTYA-ERSSSEGWMIEDTFGF--- 110

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
               P S     ++ GC   +TG      A DG+MG+G    +  S L +  +I++ FS+
Sbjct: 111 ----PDSDSPVRLVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSL 165

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG--------NSCLTQS 215
           CF     G +  GD       +T + P+      ++  V+   I         ++ +   
Sbjct: 166 CFGYPKDGILLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDR 225

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKY---CYNASSEEMLK 271
           G+  ++DSG +FT+LPT+ +  +       V  K + S  G   +Y   C+  + ++   
Sbjct: 226 GYGTVLDSGTTFTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKD 285

Query: 272 V----PDMRLIFSKNQSFV---VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           +    P    +F          +R    S P       +CL +        ++G   +  
Sbjct: 286 LDKYFPPAEFVFGGGAKLTLPPLRYLFLSKPAE-----YCLGIFDNGNSGALVGGVSVRD 340

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDK 349
             + +DR N K+ ++   C +V  K
Sbjct: 341 VVVTYDRRNSKVGFTTMACADVARK 365


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 149/340 (43%), Gaps = 32/340 (9%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
           +L+ Y+ + S + K V C    C   +  +    +    CPY+  Y  + +S++GY V D
Sbjct: 121 DLTLYNINESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEIYG-DGSSTAGYFVKD 179

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKA 154
           ++  A  S     ++   SVI GCG +Q+G     +  A DG++G G  + S+ S LA  
Sbjct: 180 VVQYARVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVT 239

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
           G ++  F+ C D  + G +F    G   Q   +  P+      Y V + +  +G+  L+ 
Sbjct: 240 GKVKKIFAHCLDGTNGGGIFV--IGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSL 297

Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                ++G +  A++DSG +  +LP  +Y  +V K        ++    + +  C+  S 
Sbjct: 298 PTDVFEAGDRKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYT-CFQYSD 356

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
                 P++   F  +    V  H + FP  EG  ++C+      V S D  +  ++G  
Sbjct: 357 SLDDGFPNVTFHFENSVILKVYPHEYLFPF-EG--LWCIGWQNSGVQSRDRRNMTLLGDL 413

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLV 355
            +    +++D EN  + W+   C   I         VHLV
Sbjct: 414 VLSNKLVLYDLENQAIGWTEYNCSSSIQVQDERTGTVHLV 453


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 136/327 (41%), Gaps = 28/327 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDD 96
           +L+ YDP +SS+   V C    C +    K  K     PC Y   Y  + +S+ G  V D
Sbjct: 129 DLTLYDPKASSTGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYG-DGSSTIGSFVTD 187

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     ++        +SVI GCG +Q G       A DG++G G  + S+ S L  AG
Sbjct: 188 ALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAG 247

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
            ++  F+ C D    G +F    G   Q      P+      Y V +++  +G + L   
Sbjct: 248 KVKKIFAHCLDTIKGGGIF--SIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLP 305

Query: 214 ----QSGFQ--ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASS 266
               + G +   ++DSG + T+LP  ++ EV++  F+K    + I+        C+    
Sbjct: 306 AHIFEPGEKKGTIIDSGTTLTYLPELVFKEVMLAVFNK---HQDITFHDVQGFLCFQYPG 362

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-----MSTDG-DYGIIGQN 320
                 P +   F  + +  V  H + F    G  V+C+        S DG D  ++G  
Sbjct: 363 SVDDGFPTITFHFEDDLALHVYPHEYFFA--NGNDVYCVGFQNGASQSKDGKDIVLMGDL 420

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
            +    +++D EN  + W+   C   I
Sbjct: 421 VLSNKLVIYDLENRVIGWTDYNCSSSI 447


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 79/336 (23%), Positives = 158/336 (47%), Gaps = 40/336 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P SSS+ + V C+        +C   +  C Y   Y+ E ++SSG L +D++   + 
Sbjct: 153 KFQPESSSTYQPVKCT-----IDCNCDGDRMQCVYERQYA-EMSTSSGVLGEDVISFGNQ 206

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+ APQ +V      GC   +TG      A DG+MGLG GD+S+   L    +I +SFS+
Sbjct: 207 SELAPQRAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSL 260

Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQS 215
           C+   D   G++  G   P +  + ++    ++   Y + ++   +       N+ +   
Sbjct: 261 CYGGMDVGGGAMVLGGISPPSDMTFAY-SDPDRSPYYNIDLKEMHVAGKRLPLNANVFDG 319

Query: 216 GFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLK 271
               ++DSG ++ +LP      + + +VK  +L S K+IS    ++   C++ +  ++ +
Sbjct: 320 KHGTVLDSGTTYAYLPEAAFLAFKDAIVK--ELQSLKQISGPDPNYNDICFSGAGNDVSQ 377

Query: 272 V----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFM 322
           +    P + ++F     + +    + F  ++    +CL +     D      GII +N +
Sbjct: 378 LSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTL 437

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
               +++DRE  K+ +  + C E+ ++    + PPP
Sbjct: 438 ----VMYDREQTKIGFWKTNCAELWERLQTSIAPPP 469


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 77/271 (28%), Positives = 122/271 (45%), Gaps = 26/271 (9%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ Y+   S S K VSC    C        S CK+    CPY+  Y  + +S++GY V D
Sbjct: 124 LTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKD 181

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAK 153
           ++   S +      +   SVI GCG +Q+G  LD +   A DG++G G  + S+ S LA 
Sbjct: 182 VVQYDSVAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLAS 240

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           +G ++  F+ C D  + G +F    G   Q   +  P+      Y V + +  +G   LT
Sbjct: 241 SGRVKKIFAHCLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLT 298

Query: 214 ------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                 Q G +  A++DSG +  +LP  IY  +V K   L    ++ +    +K C+  S
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKEPAL----KVHIVDKDYK-CFQYS 353

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 296
                  P++   F  +    V  H + FP 
Sbjct: 354 GRVDEGFPNVTFHFENSVFLRVYPHDYLFPH 384


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 86/341 (25%), Positives = 144/341 (42%), Gaps = 34/341 (9%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YD   SSS K V C    CK       + C +    CPY+  Y  + +S++GY V 
Sbjct: 128 DLTLYDIKESSSGKFVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVK 185

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           DI+     S      S   S++ GCG +Q+G  S  +  A  G++G G  + S+ S LA 
Sbjct: 186 DIVLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLAS 245

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           +G ++  F+ C +  + G +F    G   Q   +  P+      Y V + +  +G++ L 
Sbjct: 246 SGKVKKMFAHCLNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLS 303

Query: 213 --TQSGFQA-----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
             T +  Q      ++DSG +  +LP  IY  +V K        ++    + +  C+  S
Sbjct: 304 LSTDTSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYT-CFQYS 362

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQN 320
                  P +   F    S  V  H + FP  +    +C+        S D     +  +
Sbjct: 363 ESVDDGFPAVTFYFENGLSLKVYPHDYLFPSGD---FWCIGWQNSGTQSRDSKNMTLLGD 419

Query: 321 FMMGHRIVF-DRENLKLAWSHSKCEEVID-----KSHVHLV 355
            ++ +++VF D EN  + W+   C   I         VHLV
Sbjct: 420 LVLSNKLVFYDLENQVIGWTEYNCSSSIKVRDERTGTVHLV 460


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/383 (23%), Positives = 170/383 (44%), Gaps = 51/383 (13%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  SS+ + V C+        +C + +  C Y   Y+ E ++SSG L +D++   + 
Sbjct: 122 KFQPDLSSTYQPVKCT-----LDCNCDNDRMQCVYERQYA-EMSTSSGVLGEDVVSFGNQ 175

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+ APQ +V      GC   +TG      A DG+MGLG GD+S+   L    ++ +SFS+
Sbjct: 176 SELAPQRAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSL 229

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
           C+   D G    V  G   P+        P+   Y  Y + ++   +       N  +  
Sbjct: 230 CYGGMDVGGGAMVLGGISPPSDMVFAQSDPVRSPY--YNIDLKEIHVAGKRLPLNPSVFD 287

Query: 215 SGFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
               +++DSG ++ +LP E    + E +VK  +L S  +IS    ++   C++ +  ++ 
Sbjct: 288 GKHGSVLDSGTTYAYLPEEAFLAFKEAIVK--ELQSFSQISGPDPNYNDLCFSGAGIDVS 345

Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNF 321
           ++    P + +IF     + +    + F  ++    +CL +     D      GI+ +N 
Sbjct: 346 QLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNT 405

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAA 381
           +    +++DRE  K+ +  + C E+ ++  +   PPP        P TE    +N   + 
Sbjct: 406 L----VLYDREQTKIGFWKTNCAELWERLQISSAPPPMP------PNTE---ATNSTKSV 452

Query: 382 PPSTAKTAPSKSIAASAQQLDSV 404
            PS A +    +I     Q+  +
Sbjct: 453 DPSVAPSVSQHNIPRGEFQIAQI 475


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 144/332 (43%), Gaps = 27/332 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS+++  VSCS   C      S S C S  + C Y   Y  + + +SGY V D
Sbjct: 128 LTFFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYG-DGSGTSGYYVAD 186

Query: 97  ILHLASFSKHAPQ-----SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSL 150
           ++HL +    + +      +  SSV   C   QTG       A DG+ G G  ++SV S 
Sbjct: 187 LMHLDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQ 246

Query: 151 LAKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYF----VGVES 204
           LA  G+    FS C   +DSG   +  G+        T  +P    Y+ Y     V  ++
Sbjct: 247 LASQGITPRVFSHCLKGDDSGGGVLVLGEIVEPNIVYTPLVPSQPHYNLYLQSISVAGQT 306

Query: 205 YCIGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYC 261
             I  S    S  Q  +VDSG +  +L    Y   V     +VS + R  L +GN    C
Sbjct: 307 LAIDPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQ---C 363

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQ 319
           Y  +S      P + L F+   S ++    +   +N   G  V+C+    T G    I  
Sbjct: 364 YLVTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILG 423

Query: 320 NFMMGHRI-VFDRENLKLAWSHSKCEEVIDKS 350
           + ++  +I V+D  N ++ W++  C   ++ S
Sbjct: 424 DLVLKDKIFVYDIANQRVGWTNYDCSMSVNVS 455


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 164/384 (42%), Gaps = 54/384 (14%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  SSS K + C +P C    +C      C Y   Y+ E +SSSG L +D++   + 
Sbjct: 121 KFQPELSSSYKALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNE 174

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG+MGLG G +SV   L   G+I++ FS+
Sbjct: 175 SQLTPQRAV-----FGCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSL 228

Query: 164 CFD--ENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   E   G++  G   P      S   P    Y  Y + ++   +    L        
Sbjct: 229 CYGGMEVGGGAMVLGKISPPAGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFN 286

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEML 270
                ++DSG ++ + P E +  +     K + S KRI   G    Y   C++ +  ++ 
Sbjct: 287 GKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRI--HGPDPNYDDVCFSGAGRDVA 344

Query: 271 KV----PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           ++    P++ + F   Q  ++   N++F   +  G   +CL +        ++G   +  
Sbjct: 345 EIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVRG--AYCLGIFPDRDSTTLLGGIVVRN 402

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
             + +DREN KL +  + C ++  +    L  P   +SP P     Q  +SN      PS
Sbjct: 403 TLVTYDRENDKLGFLKTNCSDLWRR----LAAP---ESPAPTSPISQNKSSN----ISPS 451

Query: 385 TAKTAPSKSIAASAQQLDSVLRVA 408
            AK+       +    L  VLRV 
Sbjct: 452 PAKS------ESPTTDLPGVLRVG 469


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 147/340 (43%), Gaps = 36/340 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  S S + V+C    C +       SC S   PC Y   Y  + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTD 191

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     S     +   +SV  GCG K  G       A DG++G G  + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251

Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            ++  F+ C D  + G +F  G+      ++T  +P    Y+    G++   +G + L  
Sbjct: 252 KVRKMFAHCLDTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGL 308

Query: 213 ------TQSGFQALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                 + +    ++DSG +  ++P  +Y A   + FDK    + IS+Q      C+  S
Sbjct: 309 PTNIFDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYS 365

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-----STDGDYGIIGQN 320
                  P++   F  + S +V  H + F    G  ++C+        + DG    +  +
Sbjct: 366 GSVDDGFPEVTFHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGGKTKDGKDLGLLGD 423

Query: 321 FMMGHRIV-FDRENLKLAWSHSKCEEVI----DKSHVHLV 355
            ++ +++V +D EN  + W+   C   I    DK   + V
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCSSSIKISDDKGSTYTV 463


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 72/319 (22%), Positives = 134/319 (42%), Gaps = 19/319 (5%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
            L+ +D +SSS+++ V CSHP+C S+     + C    + C Y   Y  + + +SGY V 
Sbjct: 124 QLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYG-DGSGTSGYYVS 182

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D  +  +    +  ++  ++++ GC   Q+G       A DG+ G G G++SV S L+  
Sbjct: 183 DTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSH 242

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           G+    FS C    DSG       G   +    + P+      Y + ++S  +    L  
Sbjct: 243 GITPRVFSHCLKGEDSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPI 301

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                 T S    ++D+G +  +L  E Y   V      V S+  +   N    CY  S+
Sbjct: 302 DPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAV-SQLATPTINKGNQCYLVSN 360

Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
                 P +   F+   + +++   ++       G  ++C+      G   I+G   +  
Sbjct: 361 SVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKD 420

Query: 325 HRIVFDRENLKLAWSHSKC 343
              V+D  + ++ W++  C
Sbjct: 421 KIFVYDLAHQRIGWANYDC 439


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 89/348 (25%), Positives = 152/348 (43%), Gaps = 52/348 (14%)

Query: 63  CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
           C    +C S K+ C Y   Y+ E +SSSG L +DI+   + S+  PQ +V      GC  
Sbjct: 143 CNVDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-----FGCEN 196

Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
            +TG      A DG+MGLG G +S+   L   G+I +SFS+C+   D G    V      
Sbjct: 197 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255

Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
           P     T    +   Y  Y + ++   +    L             ++DSG ++ +LP +
Sbjct: 256 PPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQ 313

Query: 234 IYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDMRLIFSKNQ 283
            +    V F   VSS     K+I    +++K  C+  +   + ++    P + ++F   Q
Sbjct: 314 AF----VAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQ 369

Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIVFDRENLKL 336
             S    N++F   + EG   +CL V     D      GI+ +N +    + +DR N K+
Sbjct: 370 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 423

Query: 337 AWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
            +  + C E+ ++         +G +P+P P+ +    ++   A  PS
Sbjct: 424 GFWKTNCSELWERLQ-------SGGAPSPAPSNDPGPQADLSPAPAPS 464


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 153/337 (45%), Gaps = 42/337 (12%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  S + + V C+      + +C + +  C Y   Y+ E ++SSG L +D++   + 
Sbjct: 134 KFRPEDSETYQPVKCTW-----QCNCDNDRKQCTYERRYA-EMSTSSGALGEDVVSFGNQ 187

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           ++ +PQ +     I GC   +TG   +  A DG+MGLG GD+S+   L +  +I +SFS+
Sbjct: 188 TELSPQRA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSL 241

Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
           C+          V  G   PA    T   P+   Y  Y + ++   +       N  +  
Sbjct: 242 CYGGMGVGGGAMVLGGISPPADMVFTRSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFD 299

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEML 270
                ++DSG ++ +LP   +        K   S KRIS  G   +Y   C++ +  ++ 
Sbjct: 300 GKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRIS--GPDPRYNDICFSGAEIDVS 357

Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNF 321
           ++    P + ++F       +    + F  ++    +CL V S   D      GI+ +N 
Sbjct: 358 QISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNT 417

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
           +    +++DRE+ K+ +  + C E+ ++ HV   PPP
Sbjct: 418 L----VMYDREHTKIGFWKTNCSELWERLHVSDAPPP 450


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 89/348 (25%), Positives = 151/348 (43%), Gaps = 52/348 (14%)

Query: 63  CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
           C    +C S K+ C Y   Y+ E +SSSG L +DI+   + S+  PQ +V      GC  
Sbjct: 143 CNVDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-----FGCEN 196

Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
            +TG      A DG+MGLG G +S+   L   G+I +SFS+C+   D G    V      
Sbjct: 197 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255

Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
           P     T    +   Y  Y + ++   +    L             ++DSG ++ +LP +
Sbjct: 256 PPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQ 313

Query: 234 IYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDMRLIFSKNQ 283
            +    V F   VSS     K+I     ++K  C+  +   + ++    P + ++F   Q
Sbjct: 314 AF----VAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQ 369

Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIVFDRENLKL 336
             S    N++F   + EG   +CL V     D      GI+ +N +    + +DR N K+
Sbjct: 370 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 423

Query: 337 AWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
            +  + C E+ ++         +G +P+P P+ +    ++   A  PS
Sbjct: 424 GFWKTNCSELWERLQ-------SGGAPSPAPSNDPGPQADLSPAPAPS 464


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 148/338 (43%), Gaps = 52/338 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP+SSSSS  + C    C   R  C  S K  C Y   Y+ E +SS+G LV D L L  
Sbjct: 106 FDPASSSSSAVIGCDSDKCICGRPPCGCSEKRECTYQRTYA-EQSSSAGLLVSDQLQLR- 163

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                   +V+  V+ GC  K+TG   +  A DG++GLG  +VS+ + LA +G+I + F+
Sbjct: 164 ------DGAVE--VVFGCETKETGEIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFA 214

Query: 163 ICFD--ENDSGSVFFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
           +CF   E D G++  GD   A      Q T+ L        Y V +E+  +G   L    
Sbjct: 215 LCFGSVEGD-GALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKP 273

Query: 214 ---QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK----------- 259
              + G+  ++DSG +FT+LP+E +      F + VS+  +    NS K           
Sbjct: 274 ERYEEGYGTVLDSGTTFTYLPSEAFQ----LFKEAVSAYALEHGLNSVKGPDPKEKSFAQ 329

Query: 260 ---YCY-------NASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
               C+       +A   ++ KV P   L F+           + F        +CL V 
Sbjct: 330 FHDICFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVF 389

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                  ++G        + +DR N ++ +  + C+E+
Sbjct: 390 DNGASGTLLGGISFRNILVQYDRRNRRVGFGAASCQEI 427


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 143/308 (46%), Gaps = 34/308 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           ++PS SSS KN+ C+   CK  +    SC +  D C Y   Y   D  S G L +D L L
Sbjct: 131 FNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGG-DAKSQGDLSNDSLTL 189

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S S     S +  +++IGCG        D +   GV+G+G G +S+   +  +  + + 
Sbjct: 190 DSTSG---SSVLFPNIVIGCGHINV--LQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSK 243

Query: 161 FSICF-----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCL 212
           FS C      D N S  + FG+    + +   ST  + +  + + YF+ +E++ +GN+ +
Sbjct: 244 FSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRI 303

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                   S    L+DSG   T LP    +++V    + V   RI    +    CYN + 
Sbjct: 304 EYGERSNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTG 363

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMG 324
           ++ L VPD+   F+     +  N  F FP  +G  + C   +S++G   +G I QN ++ 
Sbjct: 364 KQ-LNVPDITAHFNGADVKLNSNGTF-FPFEDG--IMCFGFISSNGLEIFGNIAQNNLL- 418

Query: 325 HRIVFDRE 332
             I +D E
Sbjct: 419 --IDYDLE 424


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 138/341 (40%), Gaps = 40/341 (11%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
           +L+ YDP +SSS   VSC    C +    K    +   PC Y   Y  + +S++G+ + D
Sbjct: 130 DLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFITD 188

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     +         +++  GCG +Q G   +   A DG++G G  + S+ S LA AG
Sbjct: 189 ALQFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAG 248

Query: 156 LIQNSFSICFDENDSGS--------------VFFGDQGPATQQSTSFLPIGEKYDAYFVG 201
             +  F+ C D    G               VFF   G         + I      Y V 
Sbjct: 249 KAKKIFAHCLDTIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVN 308

Query: 202 VESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-IS 252
           ++S  +G + L        T      ++DSG + T+LP  ++ +V+   D + S  R I+
Sbjct: 309 LKSIDVGGTTLQLPAHVFETGEKKGTIIDSGTTLTYLPELVFKQVM---DVVFSKHRDIA 365

Query: 253 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TV 307
                   C+  S       P +   F  + +  V  H + FP   G  ++C+      +
Sbjct: 366 FHNLQDFLCFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFP--NGNDIYCVGFQNGAL 423

Query: 308 MSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
            S DG D  ++G   +    +V+D EN  + W+   C   I
Sbjct: 424 QSKDGKDIVLMGDLVLSNKLVVYDLENQVIGWTDYNCSSSI 464


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 134/326 (41%), Gaps = 29/326 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDPS SSS   V+C    C +       SC     PC Y   Y  + +S++G+ V D
Sbjct: 125 LTLYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAA-PCQYSISYG-DGSSTTGFFVTD 182

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
            L     S ++  +   +S+  GCG K  G     + A DG++G G  + S+ S LA AG
Sbjct: 183 FLQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAG 242

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            ++  F+ C D  + G +F    G   Q   S  P+      Y V +E+  +G   L   
Sbjct: 243 KVRKVFAHCLDTINGGGIF--AIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLP 300

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                       ++DSG +  +LP  +Y  ++ K         + L+ +    C+  S  
Sbjct: 301 TNIFDIGESKGTIIDSGTTLAYLPGVVYNAIMSKV--FAQYGDMPLKNDQDFQCFRYSGS 358

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFM 322
                P +   F       +  H + F   E   ++C+      + + DG   ++  +  
Sbjct: 359 VDDGFPIITFHFEGGLPLNIHPHDYLFQNGE---LYCMGFQTGGLQTKDGKDMVLLGDLA 415

Query: 323 MGHRIV-FDRENLKLAWSHSKCEEVI 347
             +R+V +D EN  + W+   C   I
Sbjct: 416 FSNRLVLYDLENQVIGWTDYNCSSSI 441


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 140/327 (42%), Gaps = 28/327 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YD  +S++S  V C    C         CK  L+  C Y   Y  + +S++GY V 
Sbjct: 117 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 173

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
           D +     S +   +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +
Sbjct: 174 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 233

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
           G ++  FS C D  D G +F    G   +   +  P+ +    Y V ++   +G   L  
Sbjct: 234 GKVKKVFSHCLDNVDGGGIFA--IGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDV 291

Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                +SG +   ++DSG +  + P E+Y  ++ K        R+     ++  C++ + 
Sbjct: 292 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTG 350

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
                 P + L F K+ S  V  H + F   E    +C+        + DG D  ++G  
Sbjct: 351 NVDDGFPTVTLHFDKSISLTVYPHEYLFQVKE--FEWCIGWQNSGAQTKDGKDLTLLGDL 408

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
            +    +V+D E   + W    C   I
Sbjct: 409 VLSNKLVVYDLEKQGIGWVEYNCSSSI 435


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/320 (25%), Positives = 136/320 (42%), Gaps = 19/320 (5%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDIL 98
            LS +D ++SS+SK V C    C   S   S +    C Y   Y+ E TS  G  + D+L
Sbjct: 117 RLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDML 175

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
            L   +       +   V+ GCG  Q+G   +G +A DGVMG G  + SV S LA  G  
Sbjct: 176 TLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDA 235

Query: 158 QNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCL 212
           +  FS C D    G +F  G       ++T  +P    Y+   +G++    S  +  S +
Sbjct: 236 KRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV 295

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
              G   +VDSG +  + P  +Y  ++   + +++ + + L      + C++ S+     
Sbjct: 296 RNGG--TIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEA 350

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGII--GQNFMMGHRI 327
            P +   F  +    V  H + F   E    F      ++TD    +I  G   +    +
Sbjct: 351 FPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLV 410

Query: 328 VFDRENLKLAWSHSKCEEVI 347
           V+D +N  + W+   C   I
Sbjct: 411 VYDLDNEVIGWADHNCSSSI 430


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 155/352 (44%), Gaps = 44/352 (12%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  S+S + + C +P C    +C      C Y   Y+ E +SSSG L +D++   + 
Sbjct: 117 KFQPELSTSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNE 170

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+ +PQ +V      GC  ++TG      A DG+MGLG G +SV   L   G+I++ FS+
Sbjct: 171 SQLSPQRAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSL 224

Query: 164 CFD--ENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   E   G++  G   P      S   P    Y  Y + ++   +    L        
Sbjct: 225 CYGGMEVGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFN 282

Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEE 268
                ++DSG ++ + P E +    + V+K  ++ S KRI   G    Y   C++ +  +
Sbjct: 283 GKHGTVLDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRD 338

Query: 269 MLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           + ++    P++ + F   Q  ++    + F   +    +CL +        ++G   +  
Sbjct: 339 VAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRN 398

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 376
             + +DREN KL +  + C ++  +    L  P   +SP P     Q  +SN
Sbjct: 399 TLVTYDRENDKLGFLKTNCSDIWRR----LAAP---ESPAPTSPISQNKSSN 443


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 155/352 (44%), Gaps = 44/352 (12%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  S+S + + C +P C    +C      C Y   Y+ E +SSSG L +D++   + 
Sbjct: 117 KFQPELSTSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNE 170

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+ +PQ +V      GC  ++TG      A DG+MGLG G +SV   L   G+I++ FS+
Sbjct: 171 SQLSPQRAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSL 224

Query: 164 CFD--ENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   E   G++  G   P      S   P    Y  Y + ++   +    L        
Sbjct: 225 CYGGMEVGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFN 282

Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEE 268
                ++DSG ++ + P E +    + V+K  ++ S KRI   G    Y   C++ +  +
Sbjct: 283 GKHGTVLDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRD 338

Query: 269 MLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           + ++    P++ + F   Q  ++    + F   +    +CL +        ++G   +  
Sbjct: 339 VAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRN 398

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 376
             + +DREN KL +  + C ++  +    L  P   +SP P     Q  +SN
Sbjct: 399 TLVTYDRENDKLGFLKTNCSDIWRR----LAAP---ESPAPTSPISQNKSSN 443


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 83/328 (25%), Positives = 142/328 (43%), Gaps = 24/328 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS ++  +SCS   C      S S C +  + C Y   Y  + + +SGY V D
Sbjct: 96  LNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYG-DGSGTSGYYVSD 154

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           +LH  +    +  ++  + ++ GC   QTG       A DG+ G G  D+SV S LA  G
Sbjct: 155 LLHFDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQG 214

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +   +FS C   +DSG       G   + +  + P+      Y + ++S  +    L   
Sbjct: 215 ISPRAFSHCLKGDDSGGGIL-VLGEIVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAID 273

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYCYNAS 265
                T S    ++DSG +  +L    Y   +     +VS S R  L +GN   +CY  S
Sbjct: 274 PSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVRPYLSKGN---HCYLIS 330

Query: 266 SEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFM 322
           S      P + L F+   S ++  ++++       G  ++C+      G    I+G   +
Sbjct: 331 SSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVL 390

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
                V+D  N ++ W++  C   ++ S
Sbjct: 391 KDKIFVYDIANQRIGWANYDCSMSVNVS 418


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 140/327 (42%), Gaps = 28/327 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YD  +S++S  V C    C         CK  L+  C Y   Y  + +S++GY V 
Sbjct: 198 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 254

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
           D +     S +   +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +
Sbjct: 255 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
           G ++  FS C D  D G +F    G   +   +  P+ +    Y V ++   +G   L  
Sbjct: 315 GKVKKVFSHCLDNVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDV 372

Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                +SG +   ++DSG +  + P E+Y  ++ K        R+     ++  C++ + 
Sbjct: 373 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTG 431

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
                 P + L F K+ S  V  H + F   E    +C+        + DG D  ++G  
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVYPHEYLFQVKE--FEWCIGWQNSGAQTKDGKDLTLLGDL 489

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
            +    +V+D E   + W    C   I
Sbjct: 490 VLSNKLVVYDLEKQGIGWVEYNCSSSI 516


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 138/328 (42%), Gaps = 24/328 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
            L+ +D SSSS++  V CS P+C S      + C S  D C Y   Y  + + +SGY V 
Sbjct: 109 QLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYG-DGSGTSGYYVS 167

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D L+  +    +   +  + ++ GC   Q+G       A DG+ G G G++SV S L+  
Sbjct: 168 DTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTR 227

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           G+    FS C   + SG       G   +    + P+      Y + + S  +    L  
Sbjct: 228 GITPRVFSHCLKGDGSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPI 286

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR--ISLQGNSWKYCYNA 264
                 T +    +VDSG +  +L  E Y   V   + +VS     I+ +GN    CY  
Sbjct: 287 DPAAFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKGNQ---CYLV 343

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           S+      P     F+   S V++  +++  F  + G  ++C+      G   I+G   +
Sbjct: 344 STSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQG-VTILGDLVL 402

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
                V+D    ++ W++  C   ++ S
Sbjct: 403 KDKIFVYDLVRQRIGWANYDCSLSVNVS 430


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 144/334 (43%), Gaps = 31/334 (9%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L  Y+P SSS+S  ++C  P C +        CK     C Y   Y  + ++++GY V+
Sbjct: 116 DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVN 173

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
           D + L     +   S    S++ GCG KQ+G     + A DG++G G  + S+ S LA  
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233

Query: 155 GLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           G ++  F+ C D    G +F  G+      ++T  +P    Y+    GV+   +G++ L 
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVK---VGDTALD 290

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNA 264
                  T     A++DSG +  +LP  IY  ++ K   L +   + L+    ++ C+  
Sbjct: 291 LPLGLFETSYKRGAIIDSGTTLAYLPDSIYLPLMEKI--LGAQPDLKLRTVDDQFTCFVF 348

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
                   P +   F ++    +  H + F   +   V+C+        S DG +  ++G
Sbjct: 349 DKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRD--DVWCVGWQNSGAQSKDGNEVTLLG 406

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
              +    + ++ EN  + W+   C   I    V
Sbjct: 407 DLVLQNKLVYYNLENQTIGWTEYNCSSGIKLKDV 440


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 141/328 (42%), Gaps = 32/328 (9%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           L++YDP+ S ++  V C    C + S      +C S   PC +   Y  + +S++G+ V 
Sbjct: 129 LTQYDPAGSGTT--VGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYG-DGSSTTGFYVS 185

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
           D +     S +   +   +S+  GCG +  G     + A DG++G G  D S+ S LA A
Sbjct: 186 DSVQYNQVSGNGQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAA 245

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
             ++  F+ C D    G +F    G   Q      P+ +    Y V ++   +G + L  
Sbjct: 246 RKVRKIFAHCLDTVHGGGIF--AIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQL 303

Query: 214 -QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNAS 265
             S F +      ++DSG +  +LP E+Y  ++   FDK    + ++L       C+  S
Sbjct: 304 PSSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKY---QDLALHNYQDFVCFQFS 360

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
                  P +   F    +  V  H + F +NE   ++C+      V + DG D  ++G 
Sbjct: 361 GSIDDGFPVVTFSFEGEITLNVYPHDYLF-QNEN-DLYCMGFLDGGVQTKDGKDMVLLGD 418

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             +    +V+D E   + W+   C   I
Sbjct: 419 LVLSNKLVVYDLEKQVIGWADYNCSSSI 446


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 153/360 (42%), Gaps = 43/360 (11%)

Query: 57  SCSHPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 115
           S  HP+ C    +C      C Y   Y+ E +SSSG L +DI+   + S+  PQ +V   
Sbjct: 136 STYHPVKCNMDCNCDHDGVNCVYERRYA-EMSSSSGVLGEDIISFGNQSEVVPQRAV--- 191

Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSV 173
              GC   +TG      A DG+MGLG G +S+   L    +I +SFS+C+       G++
Sbjct: 192 --FGCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAM 248

Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVE---SYCIGNSC-LTQSGFQ----ALVDSGA 225
             G   P      S     + Y + +  +E    +  G    L+ S F      ++DSG 
Sbjct: 249 VLGGIPPPPDMVFSR---SDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGT 305

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 275
           ++ +LP E +    V F   +  K  +L+       N    C++ +  ++ ++    P++
Sbjct: 306 TYAYLPEEAF----VAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEV 361

Query: 276 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 333
            ++FS  Q  S    N++F   +  G   +CL +        ++G   +    + +DREN
Sbjct: 362 DMVFSNGQKLSLTPENYLFQHTKVHG--AYCLGIFRNGDSTTLLGGIIVRNTLVTYDREN 419

Query: 334 LKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP----LPTTEQQSTSNGQAAAPPSTAKTA 389
            K+ +  + C E+  + H+   P  A   P P     P       +N     PP+ A + 
Sbjct: 420 EKIGFWKTNCSELWKRLHIPGAPAAAPIVPTPKSVSAPAPVVSYNNNTTVGMPPTVAPSG 479


>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 410

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 136/325 (41%), Gaps = 35/325 (10%)

Query: 44  EYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +Y P  ++    V CS P+C      +   C + K+ C Y  +Y+ + +S    ++D   
Sbjct: 96  QYKPKGNT----VPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFP 151

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAG 155
                 K    S++Q  +  GCG  Q  SY     P    GV+GLG G + + + L  AG
Sbjct: 152 F-----KLLNGSAMQPRLAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAG 204

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
           L +N    C      G +FFGD         ++ P+    + Y  G              
Sbjct: 205 LTRNVVGHCLSSKGGGYLFFGDTL-IPSLGVAWTPLLPPDNHYTTGPAELLFNGKPTGLK 263

Query: 216 GFQALVDSGASFTFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNASS--EEMLK 271
           G + + D+G+S+T+  ++ Y  +V  +  D  VS  +++ +  +   C+  +   + +L+
Sbjct: 264 GLKLIFDTGSSYTYFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLE 323

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNF 321
           V +     + N +   RN     P  E + +       CL +++       +  +IG   
Sbjct: 324 VKNFFKTITINFTNARRNTQLQIPP-ESYLIISKTGNACLGLLNGSEVGLQNSNVIGDIS 382

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
           M G  I++D E  +L W  S C ++
Sbjct: 383 MQGLLIIYDNEKQQLGWVSSNCNKL 407


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/324 (26%), Positives = 144/324 (44%), Gaps = 32/324 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           YDP+ SS+   + C+ PLC++  S     +    + DY      ++GYL  D L +    
Sbjct: 139 YDPARSSTFSKLPCASPLCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGD 198

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                SS  + V  GC     G  +DGA+  G++GLG    S  SLL++ G+    FS C
Sbjct: 199 GDGDASSSFAGVAFGCSTANGGD-MDGAS--GIVGLGR---SALSLLSQIGV--GRFSYC 250

Query: 165 FDEN-DSGS--VFFGDQGPATQ---QSTSFL--PIGEKYDA--YFVGVESYCIGNSCLTQ 214
              + D+G+  + FG     T    QST+ L  P+  +  A  Y+V +    +G++ L  
Sbjct: 251 LRSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPV 310

Query: 215 S----GFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY--CY 262
           +    GF A      +VDSG +FT+L    Y  +   F    +     + G  + +  C+
Sbjct: 311 TSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCF 370

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
            A + +   VP +   F+    + V    +    +EG  V CL V+ T G   +IG    
Sbjct: 371 EAGAADT-PVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRG-VSVIGNVMQ 428

Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
           M   +++D +    +++ + C  +
Sbjct: 429 MDLHVLYDLDGATFSFAPADCASL 452


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 145/335 (43%), Gaps = 53/335 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           YDP SSS+ + + C+ P C+       C +    C Y+  Y  + ++SSG L  D L   
Sbjct: 130 YDPRSSSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDRLVF- 187

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 P  +   +V +GCG    G  L+ AA  G++G+G G +S P+ LA A    + F
Sbjct: 188 ------PDDTHVHNVTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVF 236

Query: 162 SICFD------ENDSGSVFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNSCL 212
           S C        +N S  + FG        ST+F P+     +   Y+V +  + +G   +
Sbjct: 237 SYCLGDRLSRAQNGSSYLVFGRT--PEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERV 294

Query: 213 TQSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS----KRISLQ 254
           T  GF                +VDSG + +    + YA V   FD   ++    ++++ +
Sbjct: 295 T--GFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATK 352

Query: 255 GNSWKYCY----NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVM 308
            + +  CY    N +    ++VP + L F+      +   N++      +  T FCL + 
Sbjct: 353 FSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQ 412

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           + D    ++G     G  +VFD E  ++ ++ + C
Sbjct: 413 AADDGLNVLGNVQQQGFGLVFDVERGRIGFTPNGC 447


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/331 (25%), Positives = 138/331 (41%), Gaps = 42/331 (12%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + +K V C   +C +       R  C S K  C Y   Y+ +  SS G LV D   L   
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159

Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            + A  S V+  +  GCG  +Q GS  + +A DGV+GLG G VS+ S L + G+ +N   
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218

Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
            C      G +FFGD   P ++ + + +      + Y  G  +   G   L     + + 
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
           DSG+SFT+   + Y  +V      +S     +  +S   C+    +    V D++  F  
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKG-KKPFKSVLDVKKEF-- 335

Query: 282 NQSFVVRNHIFSF-----------PEN----EGFTVFCLTVMSTD----GDYGIIGQNFM 322
                 R  + SF           PEN      +   CL +++       D  I+G   M
Sbjct: 336 ------RTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITM 389

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
               +++D E  ++ W  + C+ + + + +H
Sbjct: 390 QDQMVIYDNERGQIGWIRAPCDRIPNDNTIH 420


>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 87/365 (23%), Positives = 153/365 (41%), Gaps = 51/365 (13%)

Query: 45  YDPSSSSSSKNVSCSHP----LCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           YD   S   + + C       LC+   + +C+S    C Y+  Y+ E +SS GY+V D +
Sbjct: 80  YDYDRSMEFERLDCGEASDATLCEETMKGTCQS-DGRCSYVVSYA-EGSSSRGYVVRDRV 137

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            L        + ++ + +  GC   +T +  +  A DG+ G G G  +V + LA AGLI+
Sbjct: 138 RLG-------EGTLSAMLAFGCEEAETNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIE 189

Query: 159 NSFSIC---FDENDS----GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
           N FS C   F  N      G   FG   PA  + T  +        + V   S+ +G+S 
Sbjct: 190 NVFSFCVEGFGANGGVLTLGRFDFGADAPALAR-TPLVADPANPAFHNVRTSSWKLGDSL 248

Query: 212 LTQ-SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL-QGNSWKY---CYNASS 266
           +   + +   +DSG +FTF+P  ++     + D   +   + +  G   +Y   CY  S+
Sbjct: 249 IEHLNSYTTTLDSGTTFTFVPRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSA 308

Query: 267 EEMLKV----------PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDY 314
             M             P + + +    S  +   N++F+   N     FC+ + +   + 
Sbjct: 309 AAMNMTLSQSTVSEWFPPLTIAYEGGVSLTLGPENYLFAHETNS--AAFCVGIFANPNNQ 366

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQST 374
            ++GQ  M    + FD  N ++  + + C  + +K + H        SP P P+     +
Sbjct: 367 ILLGQITMRDTLMEFDVANSRVGMAPANCRRLREK-YTH-------DSPEPTPSNSSTPS 418

Query: 375 SNGQA 379
             G A
Sbjct: 419 GGGDA 423


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 89/344 (25%), Positives = 146/344 (42%), Gaps = 44/344 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           Y+P  +   K V C  P+C          C S    C Y  +Y+ + +S+ G LV+D L 
Sbjct: 83  YNPKKA---KVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEYA-DGSSTMGVLVEDTLT 138

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           +    +    + +Q+  IIGCG  Q G+     A+ DGV+GL    V++P+ LA+ G+I+
Sbjct: 139 V----RLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIK 194

Query: 159 NSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGE--------KYDAYFVGVESYCIG 208
           N    C  +  N  G +FFGD+   +   T    +G+        +  +   G +S  + 
Sbjct: 195 NVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLN 254

Query: 209 N-SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS- 266
           N   LT+S    + DSG SFT+L  + YA V+    K     R+     +  YC+   S 
Sbjct: 255 NDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVK-SDTTLPYCWRGPSP 313

Query: 267 -EEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTV------FCLTVMSTDGD-- 313
            + +  V      + L F     F   + +   P  +G+ +       CL ++   G   
Sbjct: 314 FQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSP--QGYLIVSTQGNVCLGILDASGASL 371

Query: 314 --YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLV 355
               IIG   M G+ +V+D    ++ W    C     K+    V
Sbjct: 372 EVTNIIGDVSMRGYLVVYDNVRDRIGWIRRNCHSRPTKTSSQFV 415


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 138/324 (42%), Gaps = 23/324 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YD  +S++S  V C    C         CK  L+  C Y   Y  + +S++GY V 
Sbjct: 198 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 254

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
           D +     S +   +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +
Sbjct: 255 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
           G ++  FS C D  D G +F    G   +   +  P+ +    Y V ++   +G   L  
Sbjct: 315 GKVKKVFSHCLDNVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDV 372

Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                +SG +   ++DSG +  + P E+Y  ++ K        R+     ++  C++ + 
Sbjct: 373 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTG 431

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT--VMSTDG-DYGIIGQNFMM 323
                 P + L F K+ S  V  H + F     + +        + DG D  ++G   + 
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLS 491

Query: 324 GHRIVFDRENLKLAWSHSKCEEVI 347
              +V+D E   + W    C   I
Sbjct: 492 NKLVVYDLEKQGIGWVEYNCSSSI 515


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 145/325 (44%), Gaps = 37/325 (11%)

Query: 60  HPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 118
           HP+ C    +C +  D C Y   Y+ E +SSSG L +D++   + S+  PQ +V      
Sbjct: 47  HPVKCNPDCTCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100

Query: 119 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 176
           GC   +TG      A DG+MGLG GD+S+   L + G+I +SFS+C+   E   G++  G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159

Query: 177 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 229
              P +    S   P    Y  Y + +    +       N  +       ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217

Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIF 279
           LP   +    + F + ++S+   L+       N    C++ +  E+ ++    P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273

Query: 280 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAW 338
              + + +    + F  ++    +CL V     D   ++G   +    + +DRE+ K+ +
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333

Query: 339 SHSKCE---EVIDKSHVHLVPPPAG 360
             + C    E ++ S +   P P G
Sbjct: 334 WKTNCSVLWERLNASSISPAPAPLG 358


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 141/330 (42%), Gaps = 28/330 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L  +D + SS++  VSC  P+C      + S C S  + C Y   Y  + + ++GY V D
Sbjct: 127 LDFFDTAGSSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYG-DGSGTTGYYVSD 185

Query: 97  ILHLAS-FSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
            ++  +     +  ++  S++I GC   Q+G       A DG+ G G G +SV S L+  
Sbjct: 186 TMYFDTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSR 245

Query: 155 GLIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           G+    FS C    EN  G +  G+     + S  + P+      Y + ++S  +    L
Sbjct: 246 GVTPKVFSHCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLL 302

Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCY 262
                   T +    +VDSG +  +L  E Y   V      VS  SK I  +GN    CY
Sbjct: 303 PIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ---CY 359

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
             S+      P + L F    S V+   +++  +   +G  ++C+     +  + I+G  
Sbjct: 360 LVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDL 419

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
            +     V+D  N ++ W+   C   ++ S
Sbjct: 420 VLKDKIFVYDLANQRIGWADYDCSLSVNVS 449


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 143/334 (42%), Gaps = 31/334 (9%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L  Y+P SSS+S  ++C  P C +        CK     C Y   Y  + ++++GY V+
Sbjct: 116 DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVN 173

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
           D + L     +   S    S++ GCG KQ+G     + A DG++G G  + S+ S LA  
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233

Query: 155 GLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           G ++  F+ C D    G +F  G+       +T  +P    Y+    GV+   +G++ L 
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVK---VGDTALD 290

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNA 264
                  T     A++DSG +  +LP  IY  ++ K   L +   + L+    ++ C+  
Sbjct: 291 LPLGLFETSYKRGAIIDSGTTLAYLPESIYLPLMEKI--LGAQPDLKLRTVDDQFTCFVF 348

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
                   P +   F ++    +  H + F   +   V+C+        S DG +  ++G
Sbjct: 349 DKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRD--DVWCVGWQNSGAQSKDGNEVTLLG 406

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
              +    + ++ EN  + W+   C   I    V
Sbjct: 407 DLVLQNKLVYYNLENQTIGWTEYNCSSGIKLKDV 440


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 145/325 (44%), Gaps = 37/325 (11%)

Query: 60  HPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 118
           HP+ C    +C +  D C Y   Y+ E +SSSG L +D++   + S+  PQ +V      
Sbjct: 47  HPVKCNPDCTCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100

Query: 119 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 176
           GC   +TG      A DG+MGLG GD+S+   L + G+I +SFS+C+   E   G++  G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159

Query: 177 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 229
              P +    S   P    Y  Y + +    +       N  +       ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217

Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIF 279
           LP   +    + F + ++S+   L+       N    C++ +  E+ ++    P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273

Query: 280 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAW 338
              + + +    + F  ++    +CL V     D   ++G   +    + +DRE+ K+ +
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333

Query: 339 SHSKCE---EVIDKSHVHLVPPPAG 360
             + C    E ++ S +   P P G
Sbjct: 334 WKTNCSVLWERLNASSISPAPAPLG 358


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 80/331 (24%), Positives = 143/331 (43%), Gaps = 36/331 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLV 94
           L++YDP+ S ++  V C    C + S+       C S   PC +   Y  + +S++G+ V
Sbjct: 129 LTQYDPAGSGTT--VGCEQEFCVANSAASGVPPACPSAASPCQFRITYG-DGSSTTGFYV 185

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAK 153
            D +     S +   +    S+  GCG +  G     + A DG++G G  D S+ S LA 
Sbjct: 186 TDFVQYNQVSGNGQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAA 245

Query: 154 AGLIQNSFSICFDENDSGSVF-FGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
           A  ++  F+ C D    G +F  G+   P   ++T  +P    Y+    G+    +G + 
Sbjct: 246 ARKVRKIFAHCLDTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGIS---VGGAT 302

Query: 212 LT--QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCY 262
           L    S F +      ++DSG +  +LP E+Y  ++   FDK      ++++      C+
Sbjct: 303 LQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDK---HPDLAVRNYEDFICF 359

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGI 316
             S     + P +   F  + +  V  H + F    G  ++C+      V + DG D  +
Sbjct: 360 QFSGSLDEEFPVITFSFEGDLTLNVYPHDYLF--QNGNDLYCMGFLDGGVQTKDGKDMVL 417

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           +G   +    +V+D E   + W+   C   I
Sbjct: 418 LGDLVLSNKLVVYDLEKQVIGWTDYNCSSSI 448


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 85/355 (23%), Positives = 147/355 (41%), Gaps = 33/355 (9%)

Query: 5   ICFGSHANAYNALLCLPV-TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC 63
           + FG+ A  Y  +       + + CL   G    Q   +  +DP+ S++   V C HP C
Sbjct: 139 VGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPI--FDPTKSATYSVVPCGHPQC 196

Query: 64  KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 123
            +    K     C Y  +Y  + +SS+G L  + L L S       +        GCG+ 
Sbjct: 197 AAADGSKCSNGTCLYKVEYG-DGSSSAGVLSHETLSLTS-------TRALPGFAFGCGQT 248

Query: 124 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPA 181
             G + D    DG++GLG G +S+ S  A +     +FS C   D    G +  G   PA
Sbjct: 249 NLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTIGPTTPA 303

Query: 182 TQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLP 231
           +     +  + +K D    YFV + S  IG   L       T  G    +DSG   T+LP
Sbjct: 304 SNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDG--TFLDSGTILTYLP 361

Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH- 290
            E Y  +  +F   ++  + +   + +  CY+ + +  + +P +   FS    F +    
Sbjct: 362 PEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFG 421

Query: 291 IFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           I  FP++    + CL  ++      + I+G        +++D    K+ ++ + C
Sbjct: 422 ILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 80/335 (23%), Positives = 150/335 (44%), Gaps = 38/335 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P +S + + V C+      + +C   +  C Y   Y+ E ++SSG L +D++   + 
Sbjct: 134 KFRPEASETYQPVKCT-----WQCNCDDDRKQCTYERRYA-EMSTSSGVLGEDVVSFGNQ 187

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+ +PQ +     I GC   +TG   +  A DG+MGLG GD+S+   L +  +I ++FS+
Sbjct: 188 SELSPQRA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSL 241

Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
           C+          V  G   PA    T   P+   Y  Y + ++   +       N  +  
Sbjct: 242 CYGGMGVGGGAMVLGGISPPADMVFTHSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFD 299

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASSEEMLKV 272
                ++DSG ++ +LP   +        K   S KRIS     +   C++ +   + ++
Sbjct: 300 GKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQL 359

Query: 273 ----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
               P + ++F       +    + F  ++    +CL V S   D      GI+ +N + 
Sbjct: 360 SKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTL- 418

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
              +++DRE+ K+ +  + C E+ ++ HV   PPP
Sbjct: 419 ---VMYDREHSKIGFWKTNCSELWERLHVSNAPPP 450


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 134/317 (42%), Gaps = 33/317 (10%)

Query: 51  SSSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           + +K V C++ +C +  S  S       +  C Y   Y T+  SS G LV D   L   +
Sbjct: 103 TKNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVTDSFSLPLRN 161

Query: 105 KHAPQSSVQSSVIIGCGR-KQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNS 160
           K    S+V+ S+  GCG  +Q G   +GAAP   DG++GLG G VS+ S L + G+ +N 
Sbjct: 162 K----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNV 215

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQ 218
              C   +  G +FFGD    T + T ++P+      + Y  G  +       L+    +
Sbjct: 216 LGHCLSTSGGGFLFFGDDMVPTSRVT-WVPMVRSTSGNYYSPGSATLYFDRRSLSTKPME 274

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
            + DSG+++T+   + Y   +      +S     +   S   C+    +    V D++  
Sbjct: 275 VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKD 333

Query: 279 FSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
           F   Q    +N +   P      V      CL ++  DG      + IIG   M    ++
Sbjct: 334 FKSLQFIFGKNAVMEIPPENYLIVTKNGNVCLGIL--DGSAAKLSFSIIGDITMQDQMVI 391

Query: 329 FDRENLKLAWSHSKCEE 345
           +D E  +L W    C  
Sbjct: 392 YDNEKAQLGWIRGSCSR 408


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 79/319 (24%), Positives = 135/319 (42%), Gaps = 18/319 (5%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS ++  +SCS   C      S S C +  + C Y   Y  + + +SGY V D
Sbjct: 134 LNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYG-DGSGTSGYYVSD 192

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
           +LH  +    +   +  + ++ GC   QTG       A DG+ G G  D+SV S LA  G
Sbjct: 193 LLHFDTILGGSVMKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQG 252

Query: 156 LIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGN 209
           +    FS C   +DSG   +  G+        T  +P    Y+    + +V  ++  I  
Sbjct: 253 ITPRVFSHCLKGDDSGGGILVLGEIVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDP 312

Query: 210 SCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
           S    S  Q  ++DSG +  +L    Y   +      V S  +S   +    CY  SS  
Sbjct: 313 SVFATSSNQGTIIDSGTTLAYLTEAAYDPFISAITSTV-SPSVSPYLSKGNQCYLTSSSI 371

Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGH 325
               P + L F+   S ++  ++++       G  ++C+      G +  I+G   +   
Sbjct: 372 NDVFPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDK 431

Query: 326 RIVFDRENLKLAWSHSKCE 344
             V+D    ++ W++  C+
Sbjct: 432 IFVYDIAGQRIGWANYDCK 450


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 134/316 (42%), Gaps = 26/316 (8%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + +K V C   LC S       +  C S K  C Y   Y+ +  SS G L+ D     SF
Sbjct: 104 TKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYA-DQGSSLGVLLTD-----SF 157

Query: 104 SKHAPQSS-VQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +     SS V+ S+  GCG  +Q GS  + A  DGV+GLG G +S+ S L + G+ +N  
Sbjct: 158 AVRLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVV 217

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQAL 220
             C      G +FFGD      ++T    +   +  Y+  G  S   G   L     + +
Sbjct: 218 GHCLSIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVV 277

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPD---- 274
           +DSG+SFT+   + Y  +V      +S     +   S   C+      + +L V      
Sbjct: 278 LDSGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPLCWKGKKPFKSVLDVKKEFKS 337

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFD 330
           + L FS  +  ++     ++     F   CL +++       D  I+G   M    +++D
Sbjct: 338 LVLSFSNGKKALMEIPPENYLIVTKFGNACLGILNGSEIGLKDLNIVGDITMQDQMVIYD 397

Query: 331 RENLKLAWSHSKCEEV 346
            E  ++ W  + C+ +
Sbjct: 398 NERGQIGWIRAPCDRI 413


>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 142/317 (44%), Gaps = 36/317 (11%)

Query: 54  KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
           K++ C   L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +        +
Sbjct: 257 KDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GREK 309

Query: 114 SSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDS 170
              + GC   Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   D N  
Sbjct: 310 LDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGG 369

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVDSG 224
           G +F GD        TS  PI    D  F    +    G+  L+  G      Q + DSG
Sbjct: 370 GYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFDSG 428

Query: 225 ASFTFLPTEIYAEVVV-------KFDKLVSSKRISL-QGNSWKYCYNASSEEMLKVPDMR 276
           +S+T+LP EIY  ++         F +  S + + L     +   Y    +++ K   + 
Sbjct: 429 SSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLFK--PLN 486

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIV 328
           L F K + FV+       P+N          CL  ++  D D+G   I+G N + G  +V
Sbjct: 487 LHFGK-RWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVV 545

Query: 329 FDRENLKLAWSHSKCEE 345
           +D +  ++ W++S C +
Sbjct: 546 YDNQQRQIGWTNSDCTK 562


>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 142/317 (44%), Gaps = 36/317 (11%)

Query: 54  KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
           K++ C   L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +        +
Sbjct: 258 KDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GREK 310

Query: 114 SSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDS 170
              + GC   Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   D N  
Sbjct: 311 LDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGG 370

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVDSG 224
           G +F GD        TS  PI    D  F    +    G+  L+  G      Q + DSG
Sbjct: 371 GYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFDSG 429

Query: 225 ASFTFLPTEIYAEVVV-------KFDKLVSSKRISL-QGNSWKYCYNASSEEMLKVPDMR 276
           +S+T+LP EIY  ++         F +  S + + L     +   Y    +++ K   + 
Sbjct: 430 SSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLFK--PLN 487

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIV 328
           L F K + FV+       P+N          CL  ++  D D+G   I+G N + G  +V
Sbjct: 488 LHFGK-RWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVV 546

Query: 329 FDRENLKLAWSHSKCEE 345
           +D +  ++ W++S C +
Sbjct: 547 YDNQQRQIGWTNSDCTK 563


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 82/323 (25%), Positives = 145/323 (44%), Gaps = 27/323 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS+S  +SC    C+S      +SC    + C Y   Y  + + +SGY V D
Sbjct: 121 LNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSD 179

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
           ++H AS  +    ++  +SV+ GC   QTG       A DG+ G G   +SV S L+  G
Sbjct: 180 LMHFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQG 239

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +    FS C   ++SG       G   + +  + P+      Y + ++S  +    +   
Sbjct: 240 IAPRVFSHCLKGDNSGGGVL-VLGEIVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIA 298

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
                T +    +VDSG +  +L  E Y   V+    ++  S + +  +GN    CY  +
Sbjct: 299 PSVFATSNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQ---CYLIT 355

Query: 266 SEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNF 321
           +   + + P + L F+   S V+R   +   +N   EG +V+C+      G    I  + 
Sbjct: 356 TSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEG-SVWCIGFQKISGQSITILGDL 414

Query: 322 MMGHRI-VFDRENLKLAWSHSKC 343
           ++  +I V+D    ++ W++  C
Sbjct: 415 VLKDKIFVYDLAGQRIGWANYDC 437


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 141/330 (42%), Gaps = 28/330 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L  +D + SS++  VSC+ P+C      + S C S  + C Y   Y  + + ++GY V D
Sbjct: 127 LDFFDTAGSSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYG-DGSGTTGYYVSD 185

Query: 97  ILHLAS-FSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
            ++  +     +  ++  S+++ GC   Q+G       A DG+ G G G +SV S L+  
Sbjct: 186 TMYFDTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSR 245

Query: 155 GLIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           G+    FS C    EN  G +  G+     + S  + P+      Y + ++S  +    L
Sbjct: 246 GVTPKVFSHCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLL 302

Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCY 262
                   T +    +VDSG +  +L  E Y   V      VS  SK I  +GN    CY
Sbjct: 303 PIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ---CY 359

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
             S+      P + L F    S V+   +++  +   +   ++C+     +  + I+G  
Sbjct: 360 LVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDL 419

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
            +     V+D  N ++ W+   C   ++ S
Sbjct: 420 VLKDKIFVYDLANQRIGWADYNCSLAVNVS 449


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/320 (25%), Positives = 142/320 (44%), Gaps = 40/320 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + P +SSS  N SC+  LC +  R +C S+++ C Y   Y     +   +         +
Sbjct: 50  FIPLASSSYSNASCTDSLCDALPRPTC-SMRNTCTYSYSYGDGSNTRGDF---------A 99

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           F       S  + +  GCG  Q G++   A  DG++GLG G +S+PS L  +    + FS
Sbjct: 100 FETVTLNGSTLARIGFGCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFS 154

Query: 163 ICF-DENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ- 214
            C  D++ +G+   + FG+   A     SF P+ +  D    Y+VGVES  +GN  +   
Sbjct: 155 YCLVDQSTTGTFSPITFGNA--AENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTP 212

Query: 215 -SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
            S F+         ++DSG + T+     +  ++ +  + +S             CY+ S
Sbjct: 213 PSAFRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDIS 272

Query: 266 --SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
             S   L +P M +  +     +  ++++   +N G TV   T MST   + IIG     
Sbjct: 273 SVSASSLTLPSMTVHLTNVDFEIPVSNLWVLVDNFGETV--CTAMSTSDQFSIIGNVQQQ 330

Query: 324 GHRIVFDRENLKLAWSHSKC 343
            + IV D  N ++ +  + C
Sbjct: 331 NNLIVTDVANSRVGFLATDC 350


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 142/303 (46%), Gaps = 34/303 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           ++PS SS+ KN+ CS P+CK   ++ C S  K  C Y   Y  + + S G +  D L L 
Sbjct: 132 FNPSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITY-LDRSGSQGDISKDTLTLN 190

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           S +  +P S  +  ++IGCG K + +  +G A  G++G G G+ S+ S L  +  I   F
Sbjct: 191 S-NDGSPISFPK--IVIGCGHKNSLT-TEGLA-SGIIGFGRGNFSIVSQLGSS--IGGKF 243

Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGN----- 209
           S C        N S  ++FGD    +       P+ + +    YF  +E++ +G+     
Sbjct: 244 SYCLASLFSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKL 303

Query: 210 ---SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
              S +  +   A++DSG++ T LP ++Y+++      +V  KR+         CY  + 
Sbjct: 304 KDSSLIPDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTT- 362

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFM 322
              LK  ++ +I +  +   V+ + F+        V C    S+   + + G    QNF+
Sbjct: 363 ---LKKYEVPIITAHFRGADVKLNAFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFL 419

Query: 323 MGH 325
           +G+
Sbjct: 420 VGY 422


>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1336

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/292 (26%), Positives = 127/292 (43%), Gaps = 31/292 (10%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
           C Y   Y+ + +SS G LV D LHL + +     S  + +V+ GCG  Q G  L+  A  
Sbjct: 271 CDYEIQYA-DHSSSLGVLVRDELHLVTTNG----SKTKLNVVFGCGYDQEGLILNTLAKT 325

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIG 192
           DG+MGL    VS+P  LA  GLI+N    C   + +G   +F GD         +++P+ 
Sbjct: 326 DGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYMFLGDDF-VPYWGMNWVPMA 384

Query: 193 EKY--DAYFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
                D Y   +     GN  L   G     +   DSG+S+T+ P E Y ++V   +++ 
Sbjct: 385 YTLTTDLYQTEILGINYGNRQLKFDGQSKVGKVFFDSGSSYTYFPKEAYLDLVASLNEVS 444

Query: 247 SSKRISLQGNS-----WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT 301
               +    ++     W+  +   S + +K     L       + + + +F  P  EG+ 
Sbjct: 445 GLGLVQDDSDTTLPICWQANFQIRSIKDVKDYFKTLTLRFGSKWWILSTLFQIPP-EGYL 503

Query: 302 VF------CLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +       CL ++      DG   I+G   + G+ +V+D    K+ W  + C
Sbjct: 504 IISNKGHVCLGILDGSKVNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADC 555


>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 425

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/321 (23%), Positives = 134/321 (41%), Gaps = 44/321 (13%)

Query: 56  VSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 108
           V CS P+C +  S       C     PC Y   Y+ +  S+ G LV D +H+ S     P
Sbjct: 116 VKCSDPICVATQSTHVLGQICSKQSPPCVYNVQYA-DHASTLGVLVRDYMHIGS-----P 169

Query: 109 QSSVQSSVI-IGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
            SS +  ++  GCG +Q  +G     + P G++GLG G  S+ S L   G I N    C 
Sbjct: 170 SSSTKDPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCL 229

Query: 166 DENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
                G +F GD+          P  Q S       EK+  Y  G              G
Sbjct: 230 SAEGGGYLFLGDKFVPSSGIVWTPIIQSSL------EKH--YNTGPVDLFFNGKPTPAKG 281

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYC--YNASSEEM 269
            Q + DSG+S+T+  + +Y  V    +  +  K +S   +      WK    + + +E  
Sbjct: 282 LQIIFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSLNEVN 341

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGH 325
                + L F+K+++   +    ++     +   CL +++ +    G+  ++G   +   
Sbjct: 342 NYFKPLTLSFTKSKNLQFQLPPVAYLIITKYGNVCLGILNGNEAGLGNRNVVGDISLQDK 401

Query: 326 RIVFDRENLKLAWSHSKCEEV 346
            +V+D E  ++ W+ + C+++
Sbjct: 402 VVVYDNEKQQIGWASANCKQI 422


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 134/312 (42%), Gaps = 19/312 (6%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDIL 98
            LS +D ++SS+SK V C    C   S   S +    C Y   Y+ E TS  G  + D+L
Sbjct: 117 RLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDML 175

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
            L   +       +   V+ GCG  Q+G   +G +A DGVMG G  + SV S LA  G  
Sbjct: 176 TLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDA 235

Query: 158 QNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCL 212
           +  FS C D    G +F  G       ++T  +P    Y+   +G++    S  +  S +
Sbjct: 236 KRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV 295

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
              G   +VDSG +  + P  +Y  ++   + +++ + + L      + C++ S+     
Sbjct: 296 RNGG--TIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEA 350

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGII--GQNFMMGHRI 327
            P +   F  +    V  H + F   E    F      ++TD    +I  G   +    +
Sbjct: 351 FPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLV 410

Query: 328 VFDRENLKLAWS 339
           V+D +N  + W+
Sbjct: 411 VYDLDNEVIGWA 422


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/329 (24%), Positives = 148/329 (44%), Gaps = 44/329 (13%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P SSS+ K + C+ P C    +C      C Y   Y+ E +SSSG L +D+L   + 
Sbjct: 129 RFQPESSSTYKPMQCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGLLAEDVLSFGNE 182

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +     I GC   +TG      A DG+MGLG G +SV   L    ++ NSFS+
Sbjct: 183 SELTPQRA-----IFGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSL 236

Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV---ESYCIG-----NSCLT 213
           C+   D   G++  G+  P      +     + Y + +  +   E +  G     N  + 
Sbjct: 237 CYGGMDVVGGAMVLGNIPPPPDMVFAH---SDPYRSAYYNIELKELHVAGKRLKLNPRVF 293

Query: 214 QSGFQALVDSGASFTFLPTEIYA---EVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEM 269
                 ++DSG ++ +LP E +    + ++K  K +  K+I     S+   C++ +  ++
Sbjct: 294 DGKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIKFL--KQIHGPDPSYNDICFSGAGRDV 351

Query: 270 LKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQN 320
            ++    P++ ++F   Q   +    + F   +    +CL +     D      GI+ +N
Sbjct: 352 SQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRN 411

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
            +    + +DR+N K+ +  + C E+  +
Sbjct: 412 TL----VTYDRDNDKIGFWKTNCSELWKR 436


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 142/342 (41%), Gaps = 38/342 (11%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YD   S++ K VSC    C        S C +    CPY+  Y  + +S++GY V D
Sbjct: 131 LTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKD 188

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKA 154
            +     S     ++   S+  GCG +Q+G        A DG++G G  + S+ S LA  
Sbjct: 189 YVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLAST 248

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
             ++  F+ C D  + G +F    G   Q   +  P+      Y V +    +G+  L  
Sbjct: 249 RKVKKMFAHCLDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNI 306

Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNAS 265
           S   F+A      ++DSG +  +LP  IY  +V K   L     + +Q    +Y C+  S
Sbjct: 307 SADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKI--LSQQHNLEVQTIHGEYKCFQYS 364

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCL-----TVMSTDGDYGIIGQ 319
                  P +   F  +    V  H + F  EN    ++C+      + S D     +  
Sbjct: 365 ERVDDGFPPVIFHFENSLLLKVYPHEYLFQYEN----LWCIGWQNSGMQSRDRKNVTLFG 420

Query: 320 NFMMGHRIV-FDRENLKLAWSHSKCEEVI-----DKSHVHLV 355
           + ++ +++V +D EN  + W+   C   I         VHLV
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLV 462


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/324 (25%), Positives = 134/324 (41%), Gaps = 42/324 (12%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + +K V C   +C +       R  C S K  C Y   Y+ +  SS G LV D   L   
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159

Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            + A  S V+  +  GCG  +Q GS  + +A DGV+GLG G VS+ S L + G+ +N   
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218

Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
            C      G +FFGD   P ++ + + +      + Y  G  +   G   L     + + 
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
           DSG+SFT+   + Y  +V      +S     +  +S   C+    +    V D++  F  
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKG-KKPFKSVLDVKKEF-- 335

Query: 282 NQSFVVRNHIFSF-----------PEN----EGFTVFCLTVMSTD----GDYGIIGQNFM 322
                 R  + SF           PEN      +   CL +++       D  I+G   M
Sbjct: 336 ------RTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITM 389

Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
               +++D E  ++ W  + C+ +
Sbjct: 390 QDQMVIYDNERGQIGWIRAPCDRI 413


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 139/314 (44%), Gaps = 33/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++P  S+S  +V C+   C +       ++  C Y   Y  + T S G L  + + + S 
Sbjct: 122 FNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS- 179

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 SSV+S  +IGCG   +G +       GV+GLG G +S+ S +++   I   FS 
Sbjct: 180 ------SSVKS--VIGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 228

Query: 164 CFD---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQS 215
           C      + +G + FG     +       P+  K     Y++ +E+  IGN       + 
Sbjct: 229 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQ 288

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY----NASSEEMLK 271
           G   ++DSG + +FLP E+Y  VV    K+V +KR+   GN W  C+    N ++   + 
Sbjct: 289 G-NVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 347

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVF 329
           +   +     N + +  N       N    V CLT+   S   ++GIIG   +    I +
Sbjct: 348 IITAQFSGGANVNLLPVNTFQKVANN----VNCLTLTPASPTDEFGIIGNLALANFLIGY 403

Query: 330 DRENLKLAWSHSKC 343
           D E  +L++  + C
Sbjct: 404 DLEAKRLSFKPTVC 417


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 131/316 (41%), Gaps = 31/316 (9%)

Query: 51  SSSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           + +K V C++ +C +  S  S       +  C Y   Y T+  SS G LV D   L   +
Sbjct: 103 TKNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVMDSFSLPLRN 161

Query: 105 KHAPQSSVQSSVIIGCGR-KQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNS 160
           K    S+V+ S+  GCG  +Q G   +GAAP   DG++GLG G VS+ S L + G+ +N 
Sbjct: 162 K----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNV 215

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQA 219
              C   +  G +FFGD    T + T    +      Y+  G  +       L+    + 
Sbjct: 216 LGHCLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV 275

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           + DSG+++T+   + Y   +      +S     +   S   C+    +    V D++  F
Sbjct: 276 VFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKDF 334

Query: 280 SKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIVF 329
              Q    +N +   P      +      CL ++  DG      + IIG   M    +++
Sbjct: 335 KSLQFIFGKNAVMDIPPENYLIITKNGNVCLGIL--DGSAAKLSFSIIGDITMQDQMVIY 392

Query: 330 DRENLKLAWSHSKCEE 345
           D E  +L W    C  
Sbjct: 393 DNEKAQLGWIRGSCSR 408


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 95/213 (44%), Gaps = 18/213 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  SS+   VSC    C +        C +   PC Y   Y  + +S++GY V D
Sbjct: 77  LTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSD 134

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           +L     S         S+V  GCG +Q G       A DG++G G  + S+ S L+ AG
Sbjct: 135 LLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAG 194

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            ++  F+ C D  + G +F    G   Q      P+      Y V ++S  +G + L   
Sbjct: 195 KVKKIFAHCLDTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLP 252

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVV 240
                T      ++DSG + T+LP  +Y E+++
Sbjct: 253 SHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIML 285


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 136/323 (42%), Gaps = 45/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           YD  +S+SS  V CS P C      S S C   ++ C Y   Y  + + + GYLV+D+LH
Sbjct: 83  YDVKASASSSKVPCSDPSCTLITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLH 140

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
                      +  ++VI GCG KQ+G       A DG++G G  D+S  S LAK G   
Sbjct: 141 Y--------MVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTP 192

Query: 159 NSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
           N F+ C D  E   G +  G+      Q T  +P    Y+   V ++S  + N+ LT   
Sbjct: 193 NVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMYHYN---VVLQSISVNNANLTIDP 249

Query: 214 ----QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                   Q  + DSG +  +LP E Y            ++ +SL    +  C    S  
Sbjct: 250 KLFSNDVMQGTIFDSGTTLAYLPDEAYQAF---------TQAVSLVVAPFLLCDTRLSRF 300

Query: 269 MLKV-PDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQNF 321
           + K+ P++ L F   + +     ++          ++C+   S     ++  Y I G   
Sbjct: 301 IYKLFPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLV 360

Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
           +    +V+D E  ++ W    C+
Sbjct: 361 LKNKLVVYDLERGRIGWRPFDCK 383


>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
 gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
          Length = 429

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/315 (25%), Positives = 132/315 (41%), Gaps = 25/315 (7%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + +K V C   LC S       +  C S  + C Y+  Y+ +  SS+G LV+D   L   
Sbjct: 112 TKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYA-DQGSSTGVLVNDSFAL--- 167

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
            + A  S V+ S+  GCG  Q  S  + +  DGV+GLG G VS+ S   + G+ +N    
Sbjct: 168 -RLANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGH 226

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQALVD 222
           C      G +FFGD     Q+ T    +      Y+  G  S   G+  L     + + D
Sbjct: 227 CLSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFD 286

Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-SK 281
           SG+SFT+   + Y  +V      +S     +   S   C+    +    V D++  F S 
Sbjct: 287 SGSSFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPLCWKG-KKPFKSVLDVKKEFKSL 345

Query: 282 NQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIVFDR 331
             +F   N  F     + + +       CL +++       D  I+G   M    +++D 
Sbjct: 346 VLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDITMQDQMVIYDN 405

Query: 332 ENLKLAWSHSKCEEV 346
           E  ++ W  + C+ +
Sbjct: 406 EKGQIGWIRAPCDRI 420


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 136/323 (42%), Gaps = 45/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           YD  +S+SS  V CS P C      S S C   ++ C Y   Y  + + + GYLV+D+LH
Sbjct: 83  YDVKASASSSKVPCSDPSCTLITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLH 140

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
                      +  ++VI GCG KQ+G       A DG++G G  D+S  S LAK G   
Sbjct: 141 Y--------MVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTP 192

Query: 159 NSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
           N F+ C D  E   G +  G+      Q T  +P    Y+   V ++S  + N+ LT   
Sbjct: 193 NVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMSHYN---VVLQSISVNNANLTIDP 249

Query: 214 ----QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                   Q  + DSG +  +LP E Y            ++ +SL    +  C    S  
Sbjct: 250 KLFSNDVMQGTIFDSGTTLAYLPDEAYQAF---------TQAVSLVVAPFLLCDTRLSRF 300

Query: 269 MLKV-PDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQNF 321
           + K+ P++ L F   + +     ++          ++C+   S     ++  Y I G   
Sbjct: 301 IYKLFPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLV 360

Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
           +    +V+D E  ++ W    C+
Sbjct: 361 LKNKLVVYDLERGRIGWRPFDCK 383


>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
 gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 150/342 (43%), Gaps = 46/342 (13%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSC----------KSLKDPCPYIADYS-TEDTSSSG 91
           +E + S S +   + C  P C+ R+SC             +  C Y   Y    + S++G
Sbjct: 139 TEKECSRSKTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAG 198

Query: 92  YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
            + +D L + +  SK  P S     V IGC    T  + D +   GV GLG    S+P  
Sbjct: 199 VMYEDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 257

Query: 151 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 197
           L  +      FS C   + E D  S       P          A   +T+  P  +    
Sbjct: 258 LNFS-----KFSYCLSSYQEPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTL 312

Query: 198 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
           YFV +++  IG +      T+SG    VD+GASFT L   ++A++V + D+++  ++   
Sbjct: 313 YFVHLQNISIGGTRFPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVK 372

Query: 254 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
           +    N+ + CY   + +++E  K+PDM L F+ + + V+    + +      +  CL +
Sbjct: 373 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAI 429

Query: 308 MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             ++  G   ++G   M    ++ D  N KL++  + C +VI
Sbjct: 430 YKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 471


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/339 (24%), Positives = 149/339 (43%), Gaps = 39/339 (11%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
           ++N   Y+P+ SSS +N+SC  P C+  SS      CK+    CPY  DY+    ++  +
Sbjct: 206 EQNGPHYNPNESSSYRNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDF 265

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
            ++      ++     +      V+ GCG    G +        ++GLG G +S PS L 
Sbjct: 266 ALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGG---LLGLGRGPLSFPSQLQ 322

Query: 153 KAGLIQNSFSICFDE---NDSGS---VFFGDQGPATQQSTSFLPI--GEKY---DAYFVG 201
              +  +SFS C  +   N S S   +F  D+      + +F  +  GE+      Y++ 
Sbjct: 323 --SIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQ 380

Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
           ++S  +G   L          ++     ++DSG++ TF P   Y  +   F+K +  ++I
Sbjct: 381 IKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQI 440

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCLTVMS 309
           +        CYN S    +++PD  + F+     +F   N+ + +  +E   V CL ++ 
Sbjct: 441 AADDFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDE---VICLAILK 497

Query: 310 T--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           T       IIG        I++D +  +L +S  +C EV
Sbjct: 498 TPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 536


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 80/302 (26%), Positives = 136/302 (45%), Gaps = 34/302 (11%)

Query: 67  SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
           ++CK     C Y  +Y+ + +SS G L  D +H+ + +        +   + GC   Q G
Sbjct: 263 ATCKQ----CDYEIEYA-DRSSSMGVLAKDDMHMIATNG----GREKLDFVFGCAYDQQG 313

Query: 127 SYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQ 183
             L   A  DG++GL    +S+PS LA  G+I N F  C   + N  G +F GD     +
Sbjct: 314 QLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPNGGGYMFLGDDY-VPR 372

Query: 184 QSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSG-----FQALVDSGASFTFLPTEIYAE 237
              ++ PI G   + Y    +    G+  L   G      Q + DSG+S+T+LP EIY +
Sbjct: 373 WGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFDSGSSYTYLPDEIYKK 432

Query: 238 VV--VKFD--KLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQSFVVRNHI 291
           +V  +K+D    V     +     WK  ++    E +K     + L F  N+ FV+    
Sbjct: 433 LVTAIKYDYPSFVQDTSDTTLPLCWKADFDVRYLEDVKQFFKPLNLHFG-NRWFVIPRTF 491

Query: 292 FSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              P++          CL +++  + D+    I+G   + G  +V+D E  ++ W+ S+C
Sbjct: 492 TILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDVSLRGKLVVYDNERRQIGWADSEC 551

Query: 344 EE 345
            +
Sbjct: 552 TK 553


>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
 gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
 gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
 gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
 gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
          Length = 583

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 131/298 (43%), Gaps = 40/298 (13%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-AP 134
           C Y  +Y+ + + S G L  D  HL    K    S  +S ++ GCG  Q G  L+     
Sbjct: 281 CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTLLKT 335

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFG-DQGPATQQSTSFLPI 191
           DG++GL    +S+PS LA  G+I N    C   D N  G +F G D  P+     +++P+
Sbjct: 336 DGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPS--HGMTWVPM 393

Query: 192 --GEKYDAYFVGVESYCIGNSCLTQSGF-----QALVDSGASFTFLPTEIYAEVVVKFDK 244
               + DAY + V     G   L+  G      + L D+G+S+T+ P + Y+++V    +
Sbjct: 394 LHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQE 453

Query: 245 LVSSKRISLQ--GNSWKYCYNASSE-EMLKVPDMRLIFSK------NQSFVVRNHIFSFP 295
            VS   ++      +   C+ A +      + D++  F        ++  ++   +   P
Sbjct: 454 -VSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQP 512

Query: 296 E------NEGFTVFCLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           E      N+G    CL ++      DG   I+G   M GH IV+D    ++ W  S C
Sbjct: 513 EDYLIISNKGNV--CLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 568


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 84/340 (24%), Positives = 143/340 (42%), Gaps = 34/340 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ Y+   S S K V C    C        S C +    CPY+  Y  + +S++GY V D
Sbjct: 130 LTLYNIKDSVSGKLVPCDEEFCYEVNGGPLSGCTA-NMSCPYLEIYG-DGSSTAGYFVKD 187

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKA 154
           ++     S     +S   SVI GCG +Q+G        A DG++G G  + S+ S LA  
Sbjct: 188 VVQYDRVSGDLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAAT 247

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
             ++  F+ C D  + G +F    G   Q   +  P+      Y V + +  +G   L  
Sbjct: 248 RKVKKIFAHCLDGINGGGIF--AIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHL 305

Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                ++G +  A++DSG +  +LP  +Y  +V K        ++ +  + +  C+  S 
Sbjct: 306 PTEEFEAGDRKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYT-CFQYSG 364

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
                 P++   F  +    V  H + FP  EG  ++C+      + S D  +  ++G  
Sbjct: 365 SVDDGFPNVTFHFENSVFLKVHPHEYLFP-FEG--LWCIGWQNSGMQSRDRRNMTLLGDL 421

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLV 355
            +    +++D EN  + W+   C   I         VHLV
Sbjct: 422 VLSNKLVLYDLENQAIGWTEYNCSSSIKVQDERTGTVHLV 461


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 138/329 (41%), Gaps = 34/329 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           L++YDP+ S ++  V C    C + S      +C S   PC +   Y  + ++++G+ V 
Sbjct: 128 LTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVT 184

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
           D +     S +   ++  +S+  GCG  Q G  L  +  A DG++G G  D S+ S LA 
Sbjct: 185 DFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAA 243

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           A  ++  F+ C D    G +F    G   Q      P+      Y V ++   +G + L 
Sbjct: 244 ARRVRKIFAHCLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQ 301

Query: 214 --QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNA 264
              S F +      ++DSG +  +LP E+Y  ++   FDK    + + L       C+  
Sbjct: 302 LPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKY---QDLPLHNYQDFVCFQF 358

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
           S       P +   F  + +  V    + F       ++C+      V + DG D  ++G
Sbjct: 359 SGSIDDGFPVITFSFEGDLTLNVYPDDYLFQNRN--DLYCMGFLDGGVQTKDGKDMLLLG 416

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
              +    +V+D E   + W+   C   I
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSSSI 445


>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 381

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 91/329 (27%), Positives = 140/329 (42%), Gaps = 43/329 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           YDP  +   + V C  PLC         +C      C Y  +Y+ + +S+ G L++D + 
Sbjct: 66  YDPKKA---RLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYA-DGSSTMGVLMEDTIT 121

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           L     +  +S  +++ IIGCG  Q G+     A+ DGVMGL    +S+PS LAK G+++
Sbjct: 122 L--LLTNGTRS--KTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVR 177

Query: 159 NSFSICF--DENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
           N    C     N  G +FFGD   PA     ++ PI  K     +G +S    +      
Sbjct: 178 NVIGHCLAGGSNGGGYLFFGDSLVPAL--GMTWTPIMGKSITGNIGGKSGDADDKTGDIG 235

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK---RISLQGNSWKYCYNASS--EEML 270
           G   + DSG SFT+L  E Y  V+   +  V      RI    N+  +C+   S  E + 
Sbjct: 236 G--VMFDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTD-NTLPFCWRGPSPFESVA 292

Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTV------FCLTVMSTDGD----YGI 316
            V      + L F K   +     +   P  EG+ +       CL ++   G       I
Sbjct: 293 DVQRYFKTVTLDFGKRNWYSASRVLELSP--EGYLIVSTQGNVCLGILDASGASLEVTNI 350

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
           IG   M G+ +V+D    ++ W    C  
Sbjct: 351 IGDVSMRGYLVVYDNARNQIGWVRRNCHN 379


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 138/329 (41%), Gaps = 34/329 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           L++YDP+ S ++  V C    C + S      +C S   PC +   Y  + ++++G+ V 
Sbjct: 128 LTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVT 184

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
           D +     S +   ++  +S+  GCG  Q G  L  +  A DG++G G  D S+ S LA 
Sbjct: 185 DFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAA 243

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
           A  ++  F+ C D    G +F    G   Q      P+      Y V ++   +G + L 
Sbjct: 244 ARRVRKIFAHCLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQ 301

Query: 214 --QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNA 264
              S F +      ++DSG +  +LP E+Y  ++   FDK    + + L       C+  
Sbjct: 302 LPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKY---QDLPLHNYQDFVCFQF 358

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
           S       P +   F  + +  V    + F       ++C+      V + DG D  ++G
Sbjct: 359 SGSIDDGFPVITFSFKGDLTLNVYPDDYLFQNRN--DLYCMGFLDGGVQTKDGKDMLLLG 416

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
              +    +V+D E   + W+   C   I
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSSSI 445


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 134/315 (42%), Gaps = 24/315 (7%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + +K V C   +C +       R  C S K  C Y   Y+ +  SS G LV D   L   
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159

Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            + A  S V+  +  GCG  +Q GS  + +A DGV+GLG G VS+ S L + G+ +N   
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218

Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
            C      G +FFGD   P ++ + + +      + Y  G  +   G   L     + + 
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPD----M 275
           DSG+SFT+   + Y  +V      +S     +  +S   C+      + +L V      +
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTV 338

Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDR 331
            L FS  +  ++     ++     +   CL +++       D  I+G   M    +++D 
Sbjct: 339 VLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398

Query: 332 ENLKLAWSHSKCEEV 346
           E  ++ W  + C+ +
Sbjct: 399 ERGQIGWIRAPCDRI 413


>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
          Length = 415

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 149/342 (43%), Gaps = 46/342 (13%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS-TEDTSSSG 91
           +E + S S +   + C  P C+ R+SC   +            C Y   Y    + S++G
Sbjct: 83  TEKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAG 142

Query: 92  YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
            L +D L + +  SK  P S     V IGC    T  + D +   GV GLG    S+P  
Sbjct: 143 VLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 201

Query: 151 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 197
           L  +      FS C   + + D  S       P          A   +T+  P  +    
Sbjct: 202 LNFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTR 256

Query: 198 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
           YFV ++   IG + L    T+SG    VD+G SFT L   ++A++V + D+++  ++   
Sbjct: 257 YFVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVK 316

Query: 254 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
           +    N+ + CY   + +++E  K+PDM L F+ + + V+    + +      +  CL +
Sbjct: 317 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAI 373

Query: 308 MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             ++  G   ++G   M    ++ D  N KL++  + C +VI
Sbjct: 374 DKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 415


>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
 gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 405

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)

Query: 56  VSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           + CS+P+C +     +  C + ++ C Y   Y+ +  SS G LV D   L    K    S
Sbjct: 99  IPCSNPICTALHWPNKPHCPNPQEQCDYEVKYA-DQGSSMGALVTDQFPL----KLVNGS 153

Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 167
            +Q  V  GCG  Q  SY     P    GV+GLG G + + + L  AGL +N    C   
Sbjct: 154 FMQPPVAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSS 211

Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 227
              G +FFGD         ++ P+  + + Y  G              G + + D+G+S+
Sbjct: 212 KGGGFLFFGDN-LVPSIGVAWTPLLSQDNHYTTGPADLLFNGKPTGLKGLKLIFDTGSSY 270

Query: 228 TFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPDMRLIFSKNQ 283
           T+  ++ Y  ++  +  D  VS  +++ +  +   C+  +   + +L+V +     + N 
Sbjct: 271 TYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINF 330

Query: 284 SFVVRN-HIFSFPE------NEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENL 334
           +   RN  ++  PE        G     L   S  G  +  +IG   M G  +++D E  
Sbjct: 331 TNGRRNTQLYLAPELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQ 390

Query: 335 KLAWSHSKCEEV 346
           +L W  S C ++
Sbjct: 391 QLGWVSSDCNKL 402


>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
 gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
          Length = 410

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 131/298 (43%), Gaps = 40/298 (13%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-AP 134
           C Y  +Y+ + + S G L  D  HL    K    S  +S ++ GCG  Q G  L+     
Sbjct: 108 CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTLLKT 162

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFG-DQGPATQQSTSFLPI 191
           DG++GL    +S+PS LA  G+I N    C   D N  G +F G D  P+     +++P+
Sbjct: 163 DGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPS--HGMTWVPM 220

Query: 192 --GEKYDAYFVGVESYCIGNSCLTQSGF-----QALVDSGASFTFLPTEIYAEVVVKFDK 244
               + DAY + V     G   L+  G      + L D+G+S+T+ P + Y+++V    +
Sbjct: 221 LHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQE 280

Query: 245 LVSSKRISLQ--GNSWKYCYNASSE-EMLKVPDMRLIFS------KNQSFVVRNHIFSFP 295
            VS   ++      +   C+ A +      + D++  F        ++  ++   +   P
Sbjct: 281 -VSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQP 339

Query: 296 E------NEGFTVFCLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           E      N+G    CL ++      DG   I+G   M GH IV+D    ++ W  S C
Sbjct: 340 EDYLIISNKGNV--CLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 395


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 139/317 (43%), Gaps = 40/317 (12%)

Query: 46  DPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           DP+ S+S KN+SCS   CK        SC S    C Y   Y  + + S G+   + L L
Sbjct: 177 DPTKSTSYKNISCSSAFCKLLDTEGGESCSS--PTCLYQVQYG-DGSYSIGFFATETLTL 233

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +S       S+V  + + GCG++ +G +  GAA  G++GLG   +S+PS  A+    +  
Sbjct: 234 SS-------SNVFKNFLFGCGQQNSGLF-RGAA--GLLGLGRTKLSLPSQTAQK--YKKL 281

Query: 161 FSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIG 208
           FS C   + S  G + FG Q     ++  F P+ E + +            VG     I 
Sbjct: 282 FSYCLPASSSSKGYLSFGGQ---VSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSID 338

Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
            S  + SG   ++DSG   T LP+  Y+ +   F KL++    +   + +  CY+ S  E
Sbjct: 339 ASIFSTSG--TVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNE 396

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQNFMMGHR 326
            +K+P + + F       +      +P N G    CL       D    I G      ++
Sbjct: 397 TIKIPKVGVSFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNGDDVKAAIFGNTQQKTYQ 455

Query: 327 IVFDRENLKLAWSHSKC 343
           +V+D    ++ ++ S C
Sbjct: 456 VVYDDAKGRVGFAPSGC 472


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 129/315 (40%), Gaps = 31/315 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS+S + K++SC+   C S          C++  + C Y A Y  + + S GYL  D+
Sbjct: 56  FDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDL 114

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L LA        S      + GCG+   G +   A   G++GLG   +S+   ++     
Sbjct: 115 LTLAP-------SQTLPGFVYGCGQDSEGLFGRAA---GILGLGRNKLSMLGQVSSK--F 162

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQ 214
             +FS C      G      +      +  F P+         YF+ + +  +G   L  
Sbjct: 163 GYAFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV 222

Query: 215 SGFQ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEM 269
           +  Q     ++DSG   T LP  +Y      F K++SSK     G S    C+  + ++M
Sbjct: 223 AAAQYRVPTIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDM 282

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
             VP++RLIF       +R        +EG T  CL     +G   IIG +     ++  
Sbjct: 283 QSVPEVRLIFQGGADLNLRPVNVLLQVDEGLT--CLAFAGNNG-VAIIGNHQQQTFKVAH 339

Query: 330 DRENLKLAWSHSKCE 344
           D    ++ ++   C 
Sbjct: 340 DISTARIGFATGGCN 354


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 125/279 (44%), Gaps = 17/279 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS+S  ++CS   C      S ++C S  + C Y   Y  + + +SGY V D
Sbjct: 69  LNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSD 127

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           ++HL +  + +  ++  + V+ GC  +QTG       A DG+ G G  ++SV S L+  G
Sbjct: 128 MMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 187

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGN 209
           +    FS C   D +  G +  G+        TS +P    Y+    +  V  ++  I +
Sbjct: 188 IAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDS 247

Query: 210 SCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
           S    S  +  +VDSG +  +L  E Y   V      +  + +    +    CY  +S  
Sbjct: 248 SVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVSRGNQCYLITSSV 306

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCL 305
               P + L F+   S ++R   +   +N   G  V+C+
Sbjct: 307 TEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCI 345


>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
          Length = 392

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 149/342 (43%), Gaps = 46/342 (13%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS-TEDTSSSG 91
           +E + S S +   + C  P C+ R+SC   +            C Y   Y    + S++G
Sbjct: 60  TEKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAG 119

Query: 92  YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
            L +D L + +  SK  P S     V IGC    T  + D +   GV GLG    S+P  
Sbjct: 120 VLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 178

Query: 151 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 197
           L  +      FS C   + + D  S       P          A   +T+  P  +    
Sbjct: 179 LNFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTR 233

Query: 198 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
           YFV ++   IG + L    T+SG    VD+G SFT L   ++A++V + D+++  ++   
Sbjct: 234 YFVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVK 293

Query: 254 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
           +    N+ + CY   + +++E  K+PDM L F+ + + V+    + +      +  CL +
Sbjct: 294 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAI 350

Query: 308 MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
             ++  G   ++G   M    ++ D  N KL++  + C +VI
Sbjct: 351 DKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 392


>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 132/316 (41%), Gaps = 39/316 (12%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 108
           + +K V C+  LC S +  K    P  C Y   Y T+  SS G L+ D   L+  +    
Sbjct: 119 TKNKIVPCAASLCTSLTPNKKCAVPQQCDYQIKY-TDKASSLGVLIADNFTLSLRN---- 173

Query: 109 QSSVQSSVIIGCGR-KQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            S+V++++  GCG  +Q G   +GA   A DG++GLG G VS+ S L + G+ +N    C
Sbjct: 174 SSTVRANLTFGCGYDQQVGK--NGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHC 231

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQALVD 222
           F  N  G +FFGD    T + T ++P+      + Y  G  +       L     + + D
Sbjct: 232 FSTNGGGFLFFGDDIVPTSRVT-WVPMARTTSGNYYSPGSGTLYFDRRSLGMKPMEVVFD 290

Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS------SEEMLKVPDMR 276
           SG+++ +   E Y   V      +S     +   S   C+         SE       + 
Sbjct: 291 SGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPLCWKGQKVFKSVSEVKNDFKSLF 350

Query: 277 LIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTDG-----DYGIIGQNFMMGHRI 327
           L F KN    +       PEN      +   CL ++  DG      + IIG   M    I
Sbjct: 351 LSFGKNSVMEIP------PENYLIVTKYGNVCLGIL--DGTTAKLKFNIIGDITMQDQMI 402

Query: 328 VFDRENLKLAWSHSKC 343
           ++D E  +L W    C
Sbjct: 403 IYDNEKGQLGWIRGSC 418


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 83/328 (25%), Positives = 137/328 (41%), Gaps = 42/328 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SSS   +SC   LC S  R SC      C Y   Y  + + + G L  + + L S
Sbjct: 82  FDPEGSSSYTTMSCGDTLCDSLPRKSCSP---DCDYSYGYG-DGSGTRGTLSSETVTLTS 137

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                 +     ++  GCG    GS+ D +   G++GLG G++S  S L    L  + FS
Sbjct: 138 TQG---EKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFS 189

Query: 163 ICF-----DENDSGSVFFGDQGPATQQST----SFLPIGEK---YDAYFVGVESYCIGNS 210
            C        + +  +FFGD+  +         +F P+         Y+V ++   I   
Sbjct: 190 YCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGR 249

Query: 211 CL---------TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
            L            G   ++ DSG + T LP   Y  V+      +S  +I         
Sbjct: 250 ALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDL 309

Query: 261 CYNASSEEM---LKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
           CY+ S  +    +K+P M   F   +    V N+  +   N+  T+ CL ++S++ D GI
Sbjct: 310 CYDVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIA--ANDAGTIVCLAMVSSNMDIGI 367

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
            G       R+++D  + K+ W+ S+C+
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQCD 395


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 142/314 (45%), Gaps = 33/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S+S  +V C+   CK+   S C + +  C Y   Y  + T + G L  + + + S
Sbjct: 134 FDPLKSTSFSHVPCNSQNCKAIDDSHCGA-QGVCDYSYTYG-DQTYTKGDLGFEKITIGS 191

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  SSV+S  +IGCG +  G +   +    V+GLG G +S+ S +++   I   FS
Sbjct: 192 -------SSVKS--VIGCGHESGGGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFS 239

Query: 163 ICFD---ENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGF 217
            C      + +G + FG     +       P+  K     Y+V +E+  IGN     S  
Sbjct: 240 YCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAK 299

Query: 218 QA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY----NASSEEMLK 271
           Q   ++DSG + +FLP E+Y  VV    K+V +KR+   GN W  C+    N ++   + 
Sbjct: 300 QGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 359

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVF 329
           +   +     N + +  N       N    V CLT+   S   ++GIIG   +    I +
Sbjct: 360 IITAQFSGGANVNLLPVNTFQKVANN----VNCLTLTPASPTDEFGIIGNLALANFLIGY 415

Query: 330 DRENLKLAWSHSKC 343
           D E  +L++  + C
Sbjct: 416 DLEAKRLSFKPTVC 429


>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 136/305 (44%), Gaps = 36/305 (11%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
           C Y   Y+ + +SS G LV D   L    + +  S  + + I GC   Q G  L+  +  
Sbjct: 275 CNYEVQYADQ-SSSLGVLVKDEFTL----RFSNGSLTKLNAIFGCAYDQQGLLLNTLSKT 329

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFL--- 189
           DG++GL    VS+PS LA  G+I N    C   D    G +F GD     Q   +++   
Sbjct: 330 DGILGLSRAKVSLPSQLASRGIINNVVGHCLTGDPAGGGYLFLGDDF-VPQWGMAWVAML 388

Query: 190 --PIGEKYDAYFVGVESYCIGNSCLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 245
             P  + Y    V ++   I  S  T   S  Q + DSG+S+T+   E Y ++V   ++ 
Sbjct: 389 DSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQVVFDSGSSYTYFTKEAYYQLVANLEE- 447

Query: 246 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK------NQSFVVRNHIFSFPEN-- 297
           VS+  + LQ +S   C+  + + +  V D++  F        ++ ++V   +   PEN  
Sbjct: 448 VSAFGLILQDSSDTICWK-TEQSIRSVKDVKHFFKPLTLQFGSRFWLVSTKLVILPENYL 506

Query: 298 ----EGFTVFCLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
               EG    CL ++      DG   I+G N + G  +V+D  N ++ W+ S C      
Sbjct: 507 LINKEGNV--CLGILDGSQVHDGSTIILGDNALRGKLVVYDNVNQRIGWTSSDCHNPRKI 564

Query: 350 SHVHL 354
            H+ L
Sbjct: 565 KHLPL 569


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 133/322 (41%), Gaps = 37/322 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYIADYSTEDTSSSGYL 93
           YDPS SSS K V C+   C+   +              +K  C Y+  Y  + + + G L
Sbjct: 178 YDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYG-DGSYTRGDL 236

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             + + L          +   +++ GCGR   G +       G+MGLG   VS+ S   K
Sbjct: 237 ASESIVLG--------DTKLENLVFGCGRNNKGLF---GGASGLMGLGRSSVSLVSQTLK 285

Query: 154 AGLIQNSFSIC---FDENDSGSVFFGDQGPATQQSTS--FLPIGEK---YDAYFVGVESY 205
                  FS C    ++  SG++ FG+     + STS  + P+ +       Y + +   
Sbjct: 286 T--FNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGA 343

Query: 206 CIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
            IG   L    F    L+DSG   T LP  IY  V  +F K  S    +   +    C+N
Sbjct: 344 SIGGVELKTLSFGRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFN 403

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNF 321
            +S E + +P +++IF  N    V    +F F + +   V   L  +S + + GIIG   
Sbjct: 404 LTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQ 463

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
               R+++D    +L  +   C
Sbjct: 464 QKNQRVIYDTTQERLGIAGENC 485


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 135/317 (42%), Gaps = 32/317 (10%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFS 104
           ++++ V C++ LC +  S +   + CP      Y   Y T+  SS G L++D     SFS
Sbjct: 99  TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFS 152

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                S+++  +  GCG  Q         AA DG++GLG G VS+ S L + G+ +N   
Sbjct: 153 LPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVG 212

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQAL 220
            C   N  G +FFGD    + + T ++P+ ++   + Y  G  +       L     + +
Sbjct: 213 HCLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVV 271

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF- 279
            DSG+++T+   + Y  VV      +S     +   +   C+    +    V D++  F 
Sbjct: 272 FDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFK 330

Query: 280 SKNQSFV-VRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
           S   SF   +N     P      V      CL ++  DG      + +IG   M    ++
Sbjct: 331 SMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVI 388

Query: 329 FDRENLKLAWSHSKCEE 345
           +D E  +L W+   C  
Sbjct: 389 YDNEKSQLGWARGACTR 405


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 146/323 (45%), Gaps = 27/323 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS+S  +SCS   C+S      +SC S  + C Y   Y  + + +SGY V D
Sbjct: 121 LNYFDPRSSSTSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYG-DGSGTSGYYVSD 179

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
           ++H A   +    ++  +SV+ GC   QTG       A DG+ G G   +SV S L+  G
Sbjct: 180 LMHFAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQG 239

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +    FS C   ++SG       G   + +  + P+ +    Y + ++S  +    +   
Sbjct: 240 IAPRVFSHCLKGDNSGGGVL-VLGEIVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIA 298

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
                T +    +VDSG +  +L  E Y   V     LV  S + +  +GN    CY  +
Sbjct: 299 PAVFATSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQ---CYLIT 355

Query: 266 SEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNF 321
           +   + + P + L F+   S V+R   +   +N   EG +V+C+      G    I  + 
Sbjct: 356 TSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEG-SVWCIGFQRIPGQSITILGDL 414

Query: 322 MMGHRI-VFDRENLKLAWSHSKC 343
           ++  +I V+D    ++ W++  C
Sbjct: 415 VLKDKIFVYDLAGQRIGWANYDC 437


>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
          Length = 357

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 135/317 (42%), Gaps = 32/317 (10%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFS 104
           ++++ V C++ LC +  S +   + CP      Y   Y T+  SS G L++D     SFS
Sbjct: 41  TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFS 94

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                S+++  +  GCG  Q         AA DG++GLG G VS+ S L + G+ +N   
Sbjct: 95  LPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVG 154

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQAL 220
            C   N  G +FFGD    + + T ++P+ ++   + Y  G  +       L     + +
Sbjct: 155 HCLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVV 213

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF- 279
            DSG+++T+   + Y  VV      +S     +   +   C+    +    V D++  F 
Sbjct: 214 FDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFK 272

Query: 280 SKNQSFV-VRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
           S   SF   +N     P      V      CL ++  DG      + +IG   M    ++
Sbjct: 273 SMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVI 330

Query: 329 FDRENLKLAWSHSKCEE 345
           +D E  +L W+   C  
Sbjct: 331 YDNEKSQLGWARGACTR 347


>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 418

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 135/321 (42%), Gaps = 29/321 (9%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           P    S+  V C  PLC S  S    +    D C Y  +Y+ +  SS G LV D+  L +
Sbjct: 98  PLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYA-DGGSSLGVLVRDVFPL-N 155

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            +   P   ++  + +GCG  Q          DG++GLG G VS+ S L   G+++N   
Sbjct: 156 LTNGDP---IRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVG 212

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALV 221
            CF+    G +FFGD G        + P+   Y  ++  G                  + 
Sbjct: 213 HCFNSKGGGYLFFGD-GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVF 271

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           DSG+S+T+   + Y  +    ++ ++ K  R ++  ++   C+    + +  + D+R  F
Sbjct: 272 DSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG-RKPIKSLRDVRKYF 330

Query: 280 S----KNQSFVVRNHIFSFPENEGFTVF------CLTVMS-TD---GDYGIIGQNFMMGH 325
                   S      +F  P  EG+ +       CL +++ TD    +  IIG   M   
Sbjct: 331 KPLALSFSSGGRSKAVFEIP-TEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389

Query: 326 RIVFDRENLKLAWSHSKCEEV 346
            +V++ E   + W+ + C+ V
Sbjct: 390 MVVYNNEKQAIGWATANCDRV 410


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 135/317 (42%), Gaps = 32/317 (10%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFS 104
           ++++ V C++ LC +  S +   + CP      Y   Y T+  SS G L++D     SFS
Sbjct: 99  TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFS 152

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                S+++  +  GCG  Q         AA DG++GLG G VS+ S L + G+ +N   
Sbjct: 153 LPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVG 212

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQAL 220
            C   N  G +FFGD    + + T ++P+ ++   + Y  G  +       L     + +
Sbjct: 213 HCLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVV 271

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF- 279
            DSG+++T+   + Y  VV      +S     +   +   C+    +    V D++  F 
Sbjct: 272 FDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFK 330

Query: 280 SKNQSF-VVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
           S   SF   +N     P      V      CL ++  DG      + +IG   M    ++
Sbjct: 331 SMFLSFSSAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVI 388

Query: 329 FDRENLKLAWSHSKCEE 345
           +D E  +L W+   C  
Sbjct: 389 YDNEKSQLGWARGACTR 405


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 87/322 (27%), Positives = 143/322 (44%), Gaps = 36/322 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++PS SSS KN+ CS  LC S R +  S ++ C Y   Y  + + S G L  D L L S 
Sbjct: 129 FNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYG-DSSHSQGDLSVDTLSLEST 187

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S  +P S  +  ++IGCG    G++  G A  G++GLG G VS+ + L  +  I   FS 
Sbjct: 188 SG-SPVSFPK--IVIGCGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSY 240

Query: 164 CF------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG 216
           C       + N S  + FGD    +       P+ +K    YF+ ++++ +GN  +   G
Sbjct: 241 CLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGG 300

Query: 217 F--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                      ++DSG + T +P+++Y  +      LV   R+      +  CY+  S E
Sbjct: 301 SSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNE 360

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFS--FPENEGFTVFCLTVMSTDGD-YGIIG-QNFMMG 324
                D  +I    +   V  H  S   P  +G   F        G  +G +  QN ++G
Sbjct: 361 Y----DFPIITVHFKGADVELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVG 416

Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
               +D +   +++  + C +V
Sbjct: 417 ----YDLQQKTVSFKPTDCTKV 434


>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
          Length = 217

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 54/81 (66%), Gaps = 3/81 (3%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DR+L  Y P+ S++S+++ CSH LC+S   C + K PCPY  DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199

Query: 99  HLASFSKHAPQSSVQSSVIIG 119
           HL     H P   V +SVIIG
Sbjct: 200 HLNYREDHVP---VNASVIIG 217


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 145/318 (45%), Gaps = 32/318 (10%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           +N  ++DP++S+S KNVSCS   CK     +  +   + + C Y   Y +  T   G+L 
Sbjct: 178 QNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSGYTI--GFLA 235

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            + L +AS       S V  + + GC  +  G++ +G    G++GLG   +++PS     
Sbjct: 236 TETLAIAS-------SDVFKNFLFGCSEESRGTF-NGTT--GLLGLGRSPIALPSQTTNK 285

Query: 155 GLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLP-IGEKYDAYFVGVESYCIGNSC 211
              +N FS C   + S  G + FG +     +ST   P + + Y    VG+    +    
Sbjct: 286 --YKNLFSYCLPASPSSTGHLSFGVEVSQAAKSTPISPKLKQLYGLNTVGIS---VRGRE 340

Query: 212 LTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--E 267
           L  +G   + ++DSG +FTFLP+  Y+ +   F +++++  ++   +S++ CY+ S+   
Sbjct: 341 LPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGN 400

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGH 325
             L +P + + F       +       P N G    CL    T  D D+ I G      +
Sbjct: 401 GTLTIPGISIFFEGGVEVEIDVSGIMIPVN-GLKEVCLAFADTGSDSDFAIFGNYQQKTY 459

Query: 326 RIVFDRENLKLAWSHSKC 343
            +++D     + ++   C
Sbjct: 460 EVIYDVAKGMVGFAPKGC 477


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 130/315 (41%), Gaps = 29/315 (9%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPC--PYIADYS---TEDTSSSGYLVDDILHLASFSK 105
           + +K V C+  +C +  S +S    C  P   DY    T+  SS G LV D   L   + 
Sbjct: 98  TKNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRN- 156

Query: 106 HAPQSSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
               SSV+ S   GCG  Q    + +  A  DG++GLG G VS+ S L   G+ +N    
Sbjct: 157 ---SSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGH 213

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQALV 221
           C   N  G +FFGD    T ++T ++P+      + Y  G  +       L     + + 
Sbjct: 214 CLSTNGGGFLFFGDNVVPTSRAT-WVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVF 272

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFS 280
           DSG+++T+   + Y   V      +S     +   S   C+    +++ K V D++  F 
Sbjct: 273 DSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKG--QKVFKSVSDVKNDFK 330

Query: 281 KNQSFVVRNHIFSFPENEGFTVF-----CLTVMSTDGD-----YGIIGQNFMMGHRIVFD 330
                 V+N +   P      V      CL ++  DG      + IIG   M    I++D
Sbjct: 331 SLFLSFVKNSVLEIPPENYLIVTKNGNACLGIL--DGSAAKLTFNIIGDITMQDQLIIYD 388

Query: 331 RENLKLAWSHSKCEE 345
            E  +L W    C  
Sbjct: 389 NERGQLGWIRGSCSR 403


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 83/329 (25%), Positives = 144/329 (43%), Gaps = 40/329 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK---------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +DP++S S +NV+C    C+          R   +   DPCPY   Y  +  ++      
Sbjct: 191 FDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGD---- 246

Query: 96  DILHLASFSKHAPQSSVQS--SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             L L +F+ +  QS  +    V  GCG +  G +   A    ++GLG G +S  S L +
Sbjct: 247 --LALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-R 300

Query: 154 AGLIQNSFSICFDENDSGS---VFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYC 206
                ++FS C  E+ S +   + FG             T+F P  +    Y++ ++S  
Sbjct: 301 GVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSIL 360

Query: 207 IGNSCL-----TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKY 260
           +G   +     T S    ++DSG + ++ P   Y  +   F D++  S  + L       
Sbjct: 361 VGGEAVNISSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSP 420

Query: 261 CYNASSEEMLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMST-DGDYGII 317
           CYN S  E ++VP++ L+F+   +  F   N+     E EG  + CL V+ T      II
Sbjct: 421 CYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRL-EPEG--IMCLAVLGTPRSGMSII 477

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G        +++D E+ +L ++  +C +V
Sbjct: 478 GNYQQQNFHVLYDLEHNRLGFAPRRCADV 506


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/328 (25%), Positives = 135/328 (41%), Gaps = 42/328 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SSS   +SC   LC S  R SC      C Y   Y  + + + G L  + + L S
Sbjct: 82  FDPEGSSSYTTMSCGDTLCDSLPRKSCSP---NCDYSYGYG-DGSGTRGTLSSETVTLTS 137

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                 +     ++  GCG    GS+ D +   G++GLG G++S  S L    L  + FS
Sbjct: 138 TQG---EKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFS 189

Query: 163 ICF-----DENDSGSVFFGDQGPATQQST----SFLPIGEK---YDAYFVGVESYCIGNS 210
            C        + +  +FFGD+  +         +F P+         Y+V ++   I   
Sbjct: 190 YCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGR 249

Query: 211 CL---------TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
            L            G   ++ DSG + T LP   Y  V+      VS   I         
Sbjct: 250 ALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDL 309

Query: 261 CYNASSEEM---LKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
           CY+ S  +     K+P M   F   +    V N+  +   N+  T+ CL ++S++ D GI
Sbjct: 310 CYDVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIA--ANDAGTIVCLAMVSSNMDIGI 367

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
            G       R+++D  + K+ W+ S+C+
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQCD 395


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 33/314 (10%)

Query: 56  VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           V CS+ LC++ S+     C +  D C Y  +Y+ +  SS G L+ D   L    + +  +
Sbjct: 104 VPCSNSLCQAVSTGENYHCDAPDDQCDYEIEYA-DLGSSIGVLLSDSFPL----RLSNGT 158

Query: 111 SVQSSVIIGCG--RKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
            +Q  +  GCG  +K  G +     PD  G++GLG G VS+ S L   G+ QN    CF 
Sbjct: 159 LLQPKMAFGCGYDQKHLGPH---PPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFS 215

Query: 167 ENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA 225
               G +FFGD   P+++ + + +        Y  G      G       G Q + DSG+
Sbjct: 216 RARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGS 275

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASS----EEMLKVPDMR 276
           S+T+   ++Y  ++    K ++ K +           WK      S    +   K   + 
Sbjct: 276 SYTYFNAQVYQSILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSILDIKSYFKPLTIS 335

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRE 332
            + +KN    +    +     +G    CL +++      G++ +IG  FM    +++D E
Sbjct: 336 FMNAKNVQLQLAPEDYLIITKDGNV--CLGILNGSEQQLGNFNVIGDIFMQDRVVIYDNE 393

Query: 333 NLKLAWSHSKCEEV 346
             ++ W  + C+ +
Sbjct: 394 KQQIGWFPANCDRL 407


>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 85/326 (26%), Positives = 138/326 (42%), Gaps = 51/326 (15%)

Query: 56  VSCSHPLC------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 109
           V  S P C      +    C+S    C Y  +Y+ + + S G L  D  HL    K    
Sbjct: 251 VRSSEPFCVEVQRNQLTEHCESCHQ-CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNG 304

Query: 110 SSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--D 166
           S  +S ++ GCG  Q G  L+     DG++GL    +S+PS LA  G+I N    C   D
Sbjct: 305 SLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASD 364

Query: 167 ENDSGSVFFG-DQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSGF-----Q 218
            N  G +F G D  P+     +++P+      + Y + V     GN+ L+  G      +
Sbjct: 365 LNGEGYIFMGSDLVPS--HGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVGK 422

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEEM--- 269
            L D+G+S+T+ P + Y+++V    + VS   ++   +       W+   N+    +   
Sbjct: 423 VLFDTGSSYTYFPNQAYSQLVTSLQE-VSDLELTRDDSDEALPICWRAKTNSPISSLSDV 481

Query: 270 --------LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS----TDGDYGII 317
                   L++    LI SK    +++   +    N+G    CL ++      DG   II
Sbjct: 482 KKFFRPITLQIGSKWLIISK--KLLIQPEDYLIISNKGNV--CLGILDGSNVHDGSTIII 537

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G   M G  IV+D    ++ W  S C
Sbjct: 538 GDISMRGRLIVYDNVKQRIGWMKSDC 563


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 137/323 (42%), Gaps = 25/323 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDIL 98
           +LS +D ++SS+SK V C    C   S   S +    C Y   Y+ E TS  G  + D L
Sbjct: 117 HLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSE-GNFIRDKL 175

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
            L   +       +   V+ GCG  Q+G      +A DGVMG G  + SV S LA  G  
Sbjct: 176 TLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDA 235

Query: 158 QNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCL 212
           +  FS C D    G +F  G       ++T  +P    Y+   +G++    +  +  S +
Sbjct: 236 KRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIM 295

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
              G   +VDSG +  + P  +Y  ++   + +++ + + L      + C++ S    + 
Sbjct: 296 RNGG--TIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEDTFQCFSFSENVDVA 350

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMS---TDGDYG---IIGQNFMMG 324
            P +   F  +    V  H + F  E E   ++C    +   T G+     ++G   +  
Sbjct: 351 FPPVSFEFEDSVKLTVYPHDYLFTLEKE---LYCFGWQAGGLTTGERTEVILLGDLVLSN 407

Query: 325 HRIVFDRENLKLAWSHSKCEEVI 347
             +V+D EN  + W+   C   I
Sbjct: 408 KLVVYDLENEVIGWADHNCSSSI 430


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 116/242 (47%), Gaps = 30/242 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++PS SSS KN+ CS  LC+S R +  + ++ C Y  ++S + + S G L  + L L S 
Sbjct: 129 FNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFS-DQSYSQGELSVETLTLDST 187

Query: 104 SKHA---PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           + H+   P++      +IGCG    G +       G++GLG+G VS+ + L  +  I   
Sbjct: 188 TGHSVSFPKT------VIGCGHNNRGMF--QGETSGIVGLGIGPVSLTTQLKSS--IGGK 237

Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCL- 212
           FS C      D N +  + FGD    +       P  +K     Y++ +E++ +GN  + 
Sbjct: 238 FSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIE 297

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                 ++ G   ++DSG + T LP+ +Y  +     +LV   R+         CY+ +S
Sbjct: 298 FEVLDDSEEG-NIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITS 356

Query: 267 EE 268
           ++
Sbjct: 357 DQ 358


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 148/332 (44%), Gaps = 46/332 (13%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P  SS+ + V C+     +  +C      C Y   Y+ E ++SSG L +D++     
Sbjct: 130 RFQPELSSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKE 183

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   ++G      A DG+MGLG G +SV   L   G++ NSFS+
Sbjct: 184 SELVPQRAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSL 237

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    V  G   P     +   P    Y  Y + ++   +    L        
Sbjct: 238 CYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFD 295

Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
             + A++DSG ++ + P + Y    + ++K  K+   K+IS    ++K  C++ +  ++ 
Sbjct: 296 GKYGAILDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVT 353

Query: 271 KV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQ 319
           ++    P++ ++F+  Q  S    N++F   +  G   +CL +     D      GII +
Sbjct: 354 ELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSG--AYCLGIFKNGNDQTTLLGGIIVR 411

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
           N +    + ++REN  + +  + C E+    H
Sbjct: 412 NTL----VTYNRENSTIGFWKTNCSELWKNLH 439


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 89/326 (27%), Positives = 139/326 (42%), Gaps = 41/326 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S S   V C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A 
Sbjct: 170 FDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFAR 228

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            ++      VQ  V IGCG    G ++   A  G++GLG G +S PS +A++     SFS
Sbjct: 229 GAR------VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFS 276

Query: 163 ICFDEN---------DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS 210
            C  +           S +V FG    A     SF P+G        Y+V +  + +G +
Sbjct: 277 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 336

Query: 211 ---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
               ++QS  +          ++DSG S T L   +Y  V   F       R+S  G S 
Sbjct: 337 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 396

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           +  CYN S   ++KVP + +  +   S  +    +  P +   T FC  +  TDG   II
Sbjct: 397 FDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSII 455

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G     G R+VFD +  ++ +    C
Sbjct: 456 GNIQQQGFRVVFDGDAQRVGFVPKSC 481


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 143/340 (42%), Gaps = 36/340 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YD   S + K VSC    C +      S C +    C Y   Y+ + +SS GY V D
Sbjct: 142 LTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVRD 199

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
           I+     S     +S   SVI GC   Q+G      A DG++G G  + S+ S LA +G 
Sbjct: 200 IVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGK 259

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
           ++  F+ C D  + G +F    G   Q   +  P+      Y V +++  +G   L    
Sbjct: 260 VRKMFAHCLDGLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPT 317

Query: 214 -------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                  + G   ++DSG +  +LP  +Y +++ K     S  ++    + +  C+  S 
Sbjct: 318 DVFDVGDKKG--TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQYSE 374

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
                 P +   F  +    V  H + F  +    ++C+      + S D  +  ++G  
Sbjct: 375 SLDDGFPAVTFHFENSLYLKVHPHEYLFSYD---GLWCIGWQNSGMQSRDRRNITLLGDL 431

Query: 321 FMMGHRIVFDRENLKLAWSHSKCE---EVIDKSH--VHLV 355
            +    +++D EN  + W+   C    +V+D+    VHLV
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNCSSSIKVVDEQSGTVHLV 471


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/333 (24%), Positives = 140/333 (42%), Gaps = 38/333 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P  SS+ + V C+ P C    +C      C Y   Y+ E +SSSG + +D++   + 
Sbjct: 118 RFQPDLSSTYRPVKCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGVIAEDVVSFGNE 171

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG+MGLG G +SV   L   G+I +SFS+
Sbjct: 172 SELKPQRAV-----FGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSL 225

Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D   G++  G   P      S   P    Y  Y + ++   +    L        
Sbjct: 226 CYGGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPY--YNIELKELHVAGKPLKLKPKVFD 283

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRI-SLQGNSWKYCYNASSEEMLKV 272
                ++DSG ++ + P   +  +     K +   K+I     N    C++ +  E+  +
Sbjct: 284 EKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHL 343

Query: 273 ----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
               P++ ++F   Q   +    + F   +    +CL +     D      GI+ +N + 
Sbjct: 344 SKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTL- 402

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP 356
              + +DREN K+ +  + C E+     V  VP
Sbjct: 403 ---VTYDRENDKIGFWKTNCSELWKSLQVPGVP 432


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/304 (25%), Positives = 129/304 (42%), Gaps = 38/304 (12%)

Query: 70  KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY- 128
           K     C Y   Y+ +   S G+LV D +     +K    + + ++ + GCG  Q  S  
Sbjct: 151 KEASQRCDYDVAYA-DHGYSEGFLVRDSVRALLTNK----TVLTANSVFGCGYNQRESLP 205

Query: 129 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQST 186
           +  A  DG++GLG G  S+PS  AK GLI+N    C      D G +FFGD   +T   T
Sbjct: 206 VSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGYMFFGDDLVSTSAMT 265

Query: 187 SFLPIGE-KYDAYFVGVESYCIGNSCLTQSGFQA-----LVDSGASFTFLPTEIYAEVVV 240
               +G      Y+VG      GN  L + G        + DSG+++T+   + Y   + 
Sbjct: 266 WVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKLGGIIFDSGSTYTYFTNQAYGAFLS 325

Query: 241 KFDKLVSSKRISLQGNS------W--KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 292
              + +S K++    +       W  K  + + +E       + L F   ++      + 
Sbjct: 326 VVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFRSTKT----KQME 381

Query: 293 SFPENEGFTV------FCLTVMSTDG----DYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            FP  EG+ V       CL +++       D  ++G     G  +V+D E  ++ W+ S 
Sbjct: 382 IFP--EGYLVVNKKGNVCLGILNGTAIGIVDTNVLGDISFQGQLVVYDNEKNQIGWARSD 439

Query: 343 CEEV 346
           C+E+
Sbjct: 440 CQEI 443


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 148/332 (44%), Gaps = 46/332 (13%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P  SS+ + V C+     +  +C      C Y   Y+ E ++SSG L +D++     
Sbjct: 130 RFQPELSSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKE 183

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   ++G      A DG+MGLG G +SV   L   G++ NSFS+
Sbjct: 184 SELVPQRAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSL 237

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    V  G   P     +   P    Y  Y + ++   +    L        
Sbjct: 238 CYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFD 295

Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
             + A++DSG ++ + P + Y    + ++K  K+   K+IS    ++K  C++ +  ++ 
Sbjct: 296 GKYGAILDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVT 353

Query: 271 KV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQ 319
           ++    P++ ++F+  Q  S    N++F   +  G   +CL +     D      GII +
Sbjct: 354 ELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSG--AYCLGIFKNGNDQTTLLGGIIVR 411

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
           N +    + ++REN  + +  + C E+    H
Sbjct: 412 NTL----VTYNRENSTIGFWKTNCSELWKNLH 439


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 144/333 (43%), Gaps = 51/333 (15%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           ++L  ++PS S++ + VSCS P+C      +SC S K  C Y   Y  +++ S G    D
Sbjct: 122 QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVD 179

Query: 97  ILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
            L + S S      P+++      IGCG    GS+   A   G++GLGLG  S+   +  
Sbjct: 180 TLTMGSTSGRVVAFPRTA------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGS 231

Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF------- 199
           A  +   FS C      D+  S  + FG     +       PI   +K+ +++       
Sbjct: 232 A--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289

Query: 200 -VGVES--YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
            VG  +  Y   NS L       ++DSG + T LP ++Y          ++ +R      
Sbjct: 290 SVGRNNTFYSTANSILGGKA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348

Query: 257 SWKYCYNASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGD- 313
             +YC+  ++++  KVP + + F   N      N +    +N    V CL    + D D 
Sbjct: 349 FLEYCFETTTDD-YKVPFIAMHFEGANLRLQRENVLIRVSDN----VICLAFAGAQDNDI 403

Query: 314 --YGIIGQ-NFMMGHRIVFDRENLKLAWSHSKC 343
             YG I Q NF++G    +D  N+ L++    C
Sbjct: 404 SIYGNIAQINFLVG----YDVTNMSLSFKPMNC 432


>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 133/320 (41%), Gaps = 41/320 (12%)

Query: 55  NVSCSHPLCKS-RSSCK-----SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
            V C  PLC + R         S  DP  C Y   Y T    S G L  DI+ +    K 
Sbjct: 93  KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIISVNGRDK- 149

Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
                    +  GCG KQ        +P DG++GLG+G   + + L    +I +N    C
Sbjct: 150 -------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHC 202

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDS 223
                 G ++ GD  P T+  T + P+ E    Y  G+    I    +     F+A+ DS
Sbjct: 203 LSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDS 261

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS--------EEMLKVPD 274
           G+++T +P +IY E+V K    +S   +  ++G +   C+            +   K   
Sbjct: 262 GSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALS 321

Query: 275 MRLIFSKNQSFV-VRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMGHR 326
           +++  ++  S + +    + F + +G T  CL ++    D       + +IG   M    
Sbjct: 322 LKITHARGTSNLDIPPQNYLFVKEDGET--CLAILDASLDPVLKELNFILIGAVTMQDLF 379

Query: 327 IVFDRENLKLAWSHSKCEEV 346
           +++D E  +L W  ++C+ V
Sbjct: 380 VIYDNEKKQLGWVRAQCDRV 399


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 89/326 (27%), Positives = 139/326 (42%), Gaps = 41/326 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S S   V C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A 
Sbjct: 164 FDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFAR 222

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            ++      VQ  V IGCG    G ++   A  G++GLG G +S PS +A++     SFS
Sbjct: 223 GAR------VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFS 270

Query: 163 ICFDEN---------DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS 210
            C  +           S +V FG    A     SF P+G        Y+V +  + +G +
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330

Query: 211 ---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
               ++QS  +          ++DSG S T L   +Y  V   F       R+S  G S 
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 390

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           +  CYN S   ++KVP + +  +   S  +    +  P +   T FC  +  TDG   II
Sbjct: 391 FDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSII 449

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G     G R+VFD +  ++ +    C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 144/333 (43%), Gaps = 51/333 (15%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           ++L  ++PS S++ + VSCS P+C      +SC S K  C Y   Y  +++ S G    D
Sbjct: 122 QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVD 179

Query: 97  ILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
            L + S S      P+++      IGCG    GS+   A   G++GLGLG  S+   +  
Sbjct: 180 TLTMGSTSGRVVAFPRTA------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGS 231

Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF------- 199
           A  +   FS C      D+  S  + FG     +       PI   +K+ +++       
Sbjct: 232 A--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289

Query: 200 -VGVES--YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
            VG  +  Y   NS L       ++DSG + T LP ++Y          ++ +R      
Sbjct: 290 SVGRNNTFYSTANSILGGKA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348

Query: 257 SWKYCYNASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGD- 313
             +YC+  ++++  KVP + + F   N      N +    +N    V CL    + D D 
Sbjct: 349 FLEYCFETTTDD-YKVPFIAMHFEGANLRLQRENVLIRVSDN----VICLAFAGAQDNDI 403

Query: 314 --YGIIGQ-NFMMGHRIVFDRENLKLAWSHSKC 343
             YG I Q NF++G    +D  N+ L++    C
Sbjct: 404 SIYGNIAQINFLVG----YDVTNMSLSFKPMNC 432


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 80/329 (24%), Positives = 132/329 (40%), Gaps = 26/329 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS ++  VSCS   C      S S C    + C Y   Y  + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           +L        +   +  + V+ GC   QTG  +    A DG+ G G   +SV S LA  G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243

Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           L    FS C   EN  G +     G   + +  F P+      Y V + S  +    L  
Sbjct: 244 LAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
                 T +G   ++D+G +  +L    Y   V      VS   + +  +GN    CY  
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQNFM 322
           ++      P + L F+   S  +    +   +N   G  V+C+           I  + +
Sbjct: 359 ATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLV 418

Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVIDKS 350
           +  +I V+D    ++ W++  C   ++ S
Sbjct: 419 LKDKIFVYDLVGQRIGWANYDCSMSVNVS 447


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 136/328 (41%), Gaps = 57/328 (17%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP+SS S+  VSC    C S  +     CK  + PC Y   Y  + +S++GY V D
Sbjct: 71  LTLYDPASSVSATRVSCDDDFCTSTYNGLLPDCKK-ELPCQYNVVYG-DGSSTAGYFVSD 128

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            +     + +        +V  GCG +Q+G     G A DG++G                
Sbjct: 129 AVQFERVTGNLQTGLSNGTVTFGCGAQQSGGLGTSGEALDGILG---------------- 172

Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
               +F+ C D  + G +F  G+       +T  +P    Y+ Y   +E   +G + L  
Sbjct: 173 ----AFAHCLDNVNGGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIE---VGGTVLEL 225

Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-----ISLQGNSWKY- 260
                 SG +   ++DSG +  +LP  +Y       D +++  R     +SL     ++ 
Sbjct: 226 PTDVFDSGDRRGTIIDSGTTLAYLPEVVY-------DSMMNEIRSQQPGLSLHTVEEQFI 278

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGI 316
           C+  S       PD++  F  + +  V  H + F  +E    F      + S DG D  +
Sbjct: 279 CFKYSGNVDDGFPDIKFHFKDSLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTL 338

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           +G   +    +++D EN  + W+   C+
Sbjct: 339 LGDLVLSNKLVLYDIENQAIGWTEYNCK 366


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 137/327 (41%), Gaps = 25/327 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
           L  ++P +SS+S  + CS   C      S + C++  + PC Y   Y  + + +SGY V 
Sbjct: 135 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 193

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D ++  S   +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L   
Sbjct: 194 DTMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
           G+    FS C   +D+G       G   +    + P+      Y + +ES         I
Sbjct: 254 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312

Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
            +S  T S  Q  +VDSG +  +L    Y   V      VS    SL  +GN    C+  
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 369

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           SS      P + L F    +  V+  N++      +   ++C+      G    I  + +
Sbjct: 370 SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 429

Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVID 348
           +  +I V+D  N+++ W+   C   ++
Sbjct: 430 LKDKIFVYDLANMRMGWTDYDCSTSVN 456


>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
          Length = 383

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 73/240 (30%), Positives = 107/240 (44%), Gaps = 19/240 (7%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + SK V C H LC S       +  C S  + C Y+  Y+ +  SS+G L++D     SF
Sbjct: 112 TKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLIND-----SF 165

Query: 104 SKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +      SV + SV  GCG  Q     D ++P DGV+GLG G VS+ S L + G+ +N  
Sbjct: 166 ALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVV 225

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQA 219
             C      G +FFGD     Q++T + P+      + Y  G  S   G+  L     + 
Sbjct: 226 GHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 284

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           + DSG+SFT+   + Y  +V      +S         S   C+    E    V D+R  F
Sbjct: 285 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKEF 343


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 78/296 (26%), Positives = 139/296 (46%), Gaps = 23/296 (7%)

Query: 62  LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
           L   ++ C++ K  C Y  +Y+ + +SS G L  D +HL + +        +   + GC 
Sbjct: 252 LQGDQNYCETCKQ-CDYEIEYA-DRSSSMGVLAKDDMHLIATNG----GREKLDFVFGCA 305

Query: 122 RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
             Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   + N  G +F GD 
Sbjct: 306 YDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGGYMFLGDD 365

Query: 179 GPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCL-TQSGFQALVDSGASFTFLPTEIYA 236
               +   ++ PI G   + Y    +    G+  L   +  Q + DSG+S+T+LP E+Y 
Sbjct: 366 Y-VPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSSYTYLPEEMYK 424

Query: 237 EVV--VKFD--KLVSSKRISLQGNSWKYCYNASS-EEMLKVPDMRLIFSKNQSFVVRNHI 291
            ++  +K D    V     +     WK  ++  S  + L +   R  F   ++F +    
Sbjct: 425 NLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRWFVVPKTFTIVPDD 484

Query: 292 FSFPENEGFTVFCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +    ++G    CL +++ T+ ++G   I+G   + G  +V+D E  ++ W++S+C
Sbjct: 485 YLIISDKGNV--CLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQIGWANSEC 538


>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
           [Glycine max]
          Length = 1388

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 75/301 (24%), Positives = 131/301 (43%), Gaps = 34/301 (11%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
           C Y   Y+ + +SS G LV D LHL + +     S  + +V+ GCG  Q G  L+     
Sbjct: 269 CDYEIQYA-DHSSSLGVLVRDELHLVTTNG----SKTKLNVVFGCGYDQAGLLLNTLGKT 323

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIG 192
           DG+MGL    VS+P  LA  GLI+N    C   + +G   +F GD         +++P+ 
Sbjct: 324 DGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYMFLGDDF-VPYWGMNWVPMA 382

Query: 193 EKY--DAYFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
                D Y   +     GN  L   G     + + DSG+S+T+ P E Y ++V   +++ 
Sbjct: 383 YTLTTDLYQTEILGINYGNRQLRFDGQSKVGKMVFDSGSSYTYFPKEAYLDLVASLNEVS 442

Query: 247 SSKRISLQGNS-----WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT 301
               +    ++     W+  +   S + +K     L       + + + +F     EG+ 
Sbjct: 443 GLGLVQDDSDTTLPICWQANFPIKSVKDVKDYFKTLTLRFGSKWWILSTLFQISP-EGYL 501

Query: 302 VF------CLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
           +       CL ++      DG   I+G   + G+ +V+D    K+ W  + C   +D+ +
Sbjct: 502 IISNKGHVCLGILDGSNVNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADC---VDRCY 558

Query: 352 V 352
           +
Sbjct: 559 I 559


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 90/327 (27%), Positives = 141/327 (43%), Gaps = 42/327 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           YDPS+SS+   V CS   C    +SR +C +   PC YI  YS +   S G L  + L +
Sbjct: 108 YDPSASSTFSPVPCSSATCLPTWRSR-NCSNPSSPCRYIYSYS-DGAYSVGILGTETLTI 165

Query: 101 ASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
            S     P  +V   SV  GCG    G  L+     G +GLG G +   SLLA+ G+   
Sbjct: 166 GS---SVPGQTVSVGSVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--G 214

Query: 160 SFSIC----FDENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
            FS C    F+       F G       GP T QST  L        YFV ++   +G+ 
Sbjct: 215 KFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDV 274

Query: 211 CL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
            L                 +VDSG +FT L    + EVV +  +L+    ++        
Sbjct: 275 RLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLD-SP 333

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
           C+ +   E   +PD+ L F+      + R++  S+  NE  + FCL ++ +   +  +G 
Sbjct: 334 CFPSPDGEPF-MPDLVLHFAGGADMRLHRDNYMSY--NEDDSSFCLNIVGSPSTWSRLGN 390

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEV 346
                 +++FD    +L++  + C ++
Sbjct: 391 FQQQNIQMLFDMTVGQLSFLPTDCSKL 417


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 88/326 (26%), Positives = 139/326 (42%), Gaps = 41/326 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S S   V C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A 
Sbjct: 164 FDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFAR 222

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            ++      VQ  V IGCG    G ++   A  G++GLG G +S P+ +A++     SFS
Sbjct: 223 GAR------VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPTQIARS--FGRSFS 270

Query: 163 ICFDEN---------DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS 210
            C  +           S +V FG    A     SF P+G        Y+V +  + +G +
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330

Query: 211 ---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
               ++QS  +          ++DSG S T L   +Y  V   F       R+S  G S 
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 390

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           +  CYN S   ++KVP + +  +   S  +    +  P +   T FC  +  TDG   II
Sbjct: 391 FDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSII 449

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G     G R+VFD +  ++ +    C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 137/327 (41%), Gaps = 25/327 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
           L  ++P +SS+S  + CS   C      S + C++  + PC Y   Y  + + +SGY V 
Sbjct: 135 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 193

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D ++  +   +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L   
Sbjct: 194 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
           G+    FS C   +D+G       G   +    + P+      Y + +ES         I
Sbjct: 254 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312

Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
            +S  T S  Q  +VDSG +  +L    Y   V      VS    SL  +GN    C+  
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 369

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           SS      P + L F    +  V+  N++      +   ++C+      G    I  + +
Sbjct: 370 SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 429

Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVID 348
           +  +I V+D  N+++ W+   C   ++
Sbjct: 430 LKDKIFVYDLANMRMGWTDYDCSTSVN 456


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 137/327 (41%), Gaps = 25/327 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
           L  ++P +SS+S  + CS   C      S + C++  + PC Y   Y  + + +SGY V 
Sbjct: 161 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 219

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D ++  +   +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L   
Sbjct: 220 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 279

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
           G+    FS C   +D+G       G   +    + P+      Y + +ES         I
Sbjct: 280 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 338

Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
            +S  T S  Q  +VDSG +  +L    Y   V      VS    SL  +GN    C+  
Sbjct: 339 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 395

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           SS      P + L F    +  V+  N++      +   ++C+      G    I  + +
Sbjct: 396 SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 455

Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVID 348
           +  +I V+D  N+++ W+   C   ++
Sbjct: 456 LKDKIFVYDLANMRMGWTDYDCSTSVN 482


>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
 gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 77/322 (23%), Positives = 131/322 (40%), Gaps = 45/322 (13%)

Query: 55  NVSCSHPLCKS-RSSCK-----SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
            V C  PLC + R         S  DP  C Y   Y T    S G L  DI+ +    K 
Sbjct: 93  KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIISVNGRDK- 149

Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
                    +  GCG KQ        +P DG++GLG+G     + L    +I +N    C
Sbjct: 150 -------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHC 202

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDS 223
                 G ++ GD  P T+   ++ P+ E    Y  G+    I    +     F+A+ DS
Sbjct: 203 LSSKGKGVLYVGDFNPPTR-GVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDS 261

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS--------EEMLKVPD 274
           G+++T +P +IY E+V K    +S   +  ++G +   C+            +   K   
Sbjct: 262 GSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALS 321

Query: 275 MRLIFSK---NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMG 324
           +++  ++   N     +N++F   + E     CL ++    D       + +IG   M  
Sbjct: 322 LKITHARGTNNLDIPPQNYLFVKEDGE----TCLAILDASLDPVLKELNFILIGAVTMQD 377

Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
             +++D E  +L W  ++C+ V
Sbjct: 378 LFVIYDNEKKQLGWVRAQCDRV 399


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 81/315 (25%), Positives = 129/315 (40%), Gaps = 38/315 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
           +DP++SSS   VSC   +C++ S             DYS    + + + G L  + L L 
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                   ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVF 278

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSC--------- 211
           S C     +G       G      T  +P G +  + Y+VG+    +G            
Sbjct: 279 SYCLASRGAGGA-----GSLVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQ 333

Query: 212 LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
           LT+ G   +V D+G + T LP E YA +   FD  + +   S   +    CY+ S    +
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASV 393

Query: 271 KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
           +VP +   F +     +  RN +       G  VFCL    +     I+G     G +I 
Sbjct: 394 RVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQIT 449

Query: 329 FDRENLKLAWSHSKC 343
            D  N  + +  + C
Sbjct: 450 VDSANGYVGFGPNTC 464


>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
 gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
          Length = 424

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 130/313 (41%), Gaps = 30/313 (9%)

Query: 56  VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           +SC  PLC +  +     C+S  D C Y   Y+ E  SS G LV D   L   +     S
Sbjct: 117 LSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEG-SSLGVLVTDYFPLRLMNG----S 171

Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
            ++  +  GCG  Q         P  GV+GLG G  S+ S L   G++ N    C     
Sbjct: 172 FLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKG 231

Query: 170 SGSVFFGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLTQSGFQALVDSGASF 227
            G +FFG Q P      S+ P+ +K  D Y+  G      G         + + DSG+S+
Sbjct: 232 GGFLFFG-QDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSSY 290

Query: 228 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNAS------SEEMLKVPDMRLIF 279
           T+   ++Y   +    K +S K  R + +  +   C+  +      +E         L F
Sbjct: 291 TYFNAQVYQSTLNLIRKELSGKPLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALSF 350

Query: 280 SKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDREN 333
           +K +S    +    +    N+G    CL +++      G++ +IG N      +++D + 
Sbjct: 351 TKAKSVQLQIPPEDYLIVTNDGNV--CLGILNGSEVGLGNFNVIGDNLFQDKLVIYDSDK 408

Query: 334 LKLAWSHSKCEEV 346
            ++ W  + C+ +
Sbjct: 409 HQIGWIPANCDRL 421


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 138/320 (43%), Gaps = 36/320 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++PS+S S + V CS P C+S  S       C S    C Y+ +Y  + + + G L  + 
Sbjct: 175 FNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYG-DGSYTRGELGTEH 233

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L L +       S+  ++ I GCGR   G +       G++GLG   +S+ S    + + 
Sbjct: 234 LDLGN-------STAVNNFIFGCGRNNQGLF---GGASGLVGLGRSSLSLIS--QTSAMF 281

Query: 158 QNSFSICF---DENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN 209
              FS C    +   SGS+  G      + +     T  +P   +   YF+ +    +G+
Sbjct: 282 GGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIP-NPQLPFYFLNLTGITVGS 340

Query: 210 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
             +    F     ++DSG   T LP  IY  +  +F K  S    +        C+N S 
Sbjct: 341 VAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSG 400

Query: 267 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMM 323
            + +++P++++ F  N    V    +F F + +   V CL +  +S + + GIIG     
Sbjct: 401 YQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQV-CLAIASLSYENEVGIIGNYQQK 459

Query: 324 GHRIVFDRENLKLAWSHSKC 343
             R+++D +   L ++   C
Sbjct: 460 NQRVIYDTKGSMLGFAAEAC 479


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 133/304 (43%), Gaps = 52/304 (17%)

Query: 75  PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 134
           PC Y  DY+ + +S++G+L  D    A+ S      +    V  GCG +  G    G   
Sbjct: 141 PCGYAYDYA-DGSSTTGFLARDT---ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTG- 195

Query: 135 DGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDENDSGS-------VFFGDQGPATQQST 186
            GV+GLG G +S P   A++G L   +FS C  + + G        +F G   P  + + 
Sbjct: 196 -GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLGR--PERRAAF 249

Query: 187 SFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTE 233
           ++ P+     A   Y+VGV +  +GN  L   G +           ++DSG++ T+L   
Sbjct: 250 AYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLG 309

Query: 234 IYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYNASSEEMLK-----VPDMRLIFSKNQ 283
            Y  +V  F   V   RI       QG   + CYN SS   L       P + + F++  
Sbjct: 310 AYLHLVSAFAASVHLPRIPSSATFFQG--LELCYNVSSSSSLAPANGGFPRLTIDFAQGL 367

Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWS 339
           S  +   N++    ++    V CL +  T   +   ++G     G+ + FDR + ++ ++
Sbjct: 368 SLELPTGNYLVDVADD----VKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFA 423

Query: 340 HSKC 343
            ++C
Sbjct: 424 RTEC 427


>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 508

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 137/306 (44%), Gaps = 33/306 (10%)

Query: 62  LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
           L  +++ C + K  C Y   Y+ + +SS+G L  D + L +    A        ++ GC 
Sbjct: 190 LQGNQNYCDTCKQ-CDYEIAYA-DRSSSAGVLARDNMELIT----ADGERENMDLVFGCA 243

Query: 122 RKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQ 178
             Q G  L   A+ DG++GL  G +S+P+ LAK G+I N F  C   + SGS  +F GD 
Sbjct: 244 HDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSAYMFLGDD 303

Query: 179 GPATQQSTSFLPIG----EKYDAYFVGVESYCIGNSCLTQSG--FQALVDSGASFTFLPT 232
               +   +++P+     + Y      V   C   +   Q+G   Q + DSG+S+T+ P 
Sbjct: 304 Y-VPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSSYTYFPH 362

Query: 233 EIYAEVVVKFDKLVSSKRISLQGNSWKYC----YNASSEEMLKVPDMRLIFSKNQSFVVR 288
           EIY  ++   + +           +  +C    +   S + +K     L+   +++++V 
Sbjct: 363 EIYTSLITSLEAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLLHFSKTWLVI 422

Query: 289 NHIFSF-PEN----EGFTVFCLTVMSTDG-DYG-----IIGQNFMMGHRIVFDRENLKLA 337
              F   PEN     G    CL V+  DG + G     +IG   + G  + +D +  ++ 
Sbjct: 423 PRTFEISPENYLIISGKGNVCLGVL--DGTEIGHSSTIVIGDVSLRGKLVAYDNDANQIG 480

Query: 338 WSHSKC 343
           W+ S C
Sbjct: 481 WAQSDC 486


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 71/297 (23%), Positives = 132/297 (44%), Gaps = 29/297 (9%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAP 134
           C Y   Y  + +S++GY V D + L   + +   +S   S++ GCG +Q+G      AA 
Sbjct: 156 CEYRVAYG-DGSSTAGYFVRDHVVLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAAL 214

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGE 193
           DG++G G  + S+ S LA +G ++  F+ C D  + G +F  G+      ++T  +P   
Sbjct: 215 DGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGIFAIGEVVQPKVRTTPLVPQQA 274

Query: 194 KYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDK 244
            Y+ +   +E   + N  L        T      ++DSG +  + P  IY  ++ K F +
Sbjct: 275 HYNVFMKAIE---VDNEVLNLPTDVFDTDLRKGTIIDSGTTLAYFPDVIYEPLISKIFAR 331

Query: 245 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH--IFSFPENEGFTV 302
             + K  +++     + Y+ + ++    P +   F  + S  V  H  +F    N+    
Sbjct: 332 QSTLKLHTVEEQFTCFEYDGNVDDGF--PTVTFHFEDSLSLTVYPHEYLFDIDSNK---- 385

Query: 303 FCL-----TVMSTDGDYGIIGQNFMMGHRIV-FDRENLKLAWSHSKCEEVIDKSHVH 353
           +C+        S DG   I+  + ++ +R+V +D EN  + W+   C   I     H
Sbjct: 386 WCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYDLENQTIGWTEYNCSSSIKVRDEH 442


>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 82/331 (24%), Positives = 139/331 (41%), Gaps = 51/331 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI-- 97
           Y P++++    ++C  PLC S        CKS  D C Y  +Y+ +  SS G LV+D   
Sbjct: 98  YKPNNNA----LNCFEPLCTSLHPITNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVP 152

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGL 156
           L L + S  AP+      +  GCG     S  D + P  GV+GLG G+VS  S L+  G+
Sbjct: 153 LKLTNGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGV 206

Query: 157 IQNSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           ++N    C  + + G +FFGD+       T  S S   IG  Y +   G      G    
Sbjct: 207 VRNVVGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSS---GPAEVYFGGKAT 262

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS---- 266
                  + DSG+S+T+  ++ Y  ++      +  K +  + +  S   C+  +     
Sbjct: 263 GIKDLTLVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKS 322

Query: 267 ----EEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTD----GDY 314
               ++   +  +R   +KN    +       PEN      +   C  +++      GD 
Sbjct: 323 LRDVKKYFNLLALRFTKTKNAQIQLP------PENYLIITKYGNVCFGILNGTEVGLGDL 376

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
            IIG   +    +++D E  ++ W  + C +
Sbjct: 377 NIIGDISLKDKMVIYDNERRRIGWFPTNCNK 407


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 130/329 (39%), Gaps = 26/329 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS ++  +SCS   C      S S C    + C Y   Y  + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           +L        +   +  + V+ GC   QTG  +    A DG+ G G   +SV S LA  G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243

Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           +    FS C   EN  G +     G   + +  F P+      Y V + S  +    L  
Sbjct: 244 IAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
                 T +G   ++D+G +  +L    Y   V      VS   + +  +GN    CY  
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNF 321
           ++      P + L F+   S  +    +   +N   G  V+C+           I+G   
Sbjct: 359 TTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLV 418

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
           +     V+D    ++ W++  C   ++ S
Sbjct: 419 LKDKIFVYDLVGQRIGWANYDCSTSVNVS 447


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 82/332 (24%), Positives = 147/332 (44%), Gaps = 34/332 (10%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSC--KSLKDPCPYIADYSTEDTS-SSGYL 93
           D++L   DP++SS+   + C    C++   +SC  ++L +    I  Y   D S + G +
Sbjct: 120 DQDLPVLDPAASSTYAALPCGAARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEI 179

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             D       S  + +S     +  GCG    G +       G+ G G G  S+PS L  
Sbjct: 180 ATDRFTFGD-SGGSGESLHTRRLTFGCGHLNKGVFQSNET--GIAGFGRGRWSLPSQLNV 236

Query: 154 AGLIQNSFSICFD---ENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGV 202
                 SFS CF    E+ S  V  G    A          ++T  L    +   YF+ +
Sbjct: 237 -----TSFSYCFTSMFESKSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSL 291

Query: 203 ESYCIGNSCLT--QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
           +   +G + L   ++ F++ ++DSGAS T LP E+Y  V  +F   V      ++G++  
Sbjct: 292 KGISVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALD 351

Query: 260 YCYNASSEEMLK---VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYG 315
            C+      + +   VP + L        + R N++F   E+ G  V C+ + +  G+  
Sbjct: 352 LCFALPVTALWRRPAVPSLTLHLEGADWELPRSNYVF---EDLGARVMCIVLDAAPGEQT 408

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           +IG        +V+D EN +L+++ ++C+ ++
Sbjct: 409 VIGNFQQQNTHVVYDLENDRLSFAPARCDRLV 440


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 85/322 (26%), Positives = 143/322 (44%), Gaps = 36/322 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++PS SSS KN+ C   LC S R +  S ++ C Y   Y  + + S G L  D L L S 
Sbjct: 129 FNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYG-DSSHSQGDLSVDTLSLEST 187

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S  +P S  ++  +IGCG    G++  G A  G++GLG G VS+ + L  +  I   FS 
Sbjct: 188 SG-SPVSFPKT--VIGCGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSY 240

Query: 164 CF------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG 216
           C       + N S  + FGD    +       P+ +K    YF+ ++++ +GN  +   G
Sbjct: 241 CLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGG 300

Query: 217 F--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                      ++DSG + T +P+++Y  +      LV   R+      +  CY+  S E
Sbjct: 301 SSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNE 360

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFS--FPENEGFTVFCLTVMSTDGD-YGIIG-QNFMMG 324
                D  +I +  +   +  H  S   P  +G   F        G  +G +  QN ++G
Sbjct: 361 Y----DFPIITAHFKGADIELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVG 416

Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
               +D +   +++  + C +V
Sbjct: 417 ----YDLQQKTVSFKPTDCTKV 434


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 87/320 (27%), Positives = 134/320 (41%), Gaps = 36/320 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTS-SSGYLVD 95
           YDPS S + K +SC+   C SR    +L DP        C Y A Y   DTS S GYL  
Sbjct: 29  YDPSVSKTYKKLSCASVEC-SRLKAATLNDPLCETDSNACLYTASYG--DTSFSIGYLSQ 85

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KA 154
           D+L L S S+  PQ         GCG+   G +   A   G++GL    +S+ + L+ K 
Sbjct: 86  DLLTLTS-SQTLPQ------FTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTKY 135

Query: 155 GLIQNSFSICFDENDSGSVFFGDQ-----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
           G   ++FS C    +SGS   G        P + + T  L   +    YF+ + +  +  
Sbjct: 136 G---HAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSG 192

Query: 210 SCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNA 264
             L  +        L+DSG   T LP  +YA +   F K++S+K       S    C+  
Sbjct: 193 RPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKG 252

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           S + +  VP++++IF       +R        ++G T       S      IIG      
Sbjct: 253 SLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQT 312

Query: 325 HRIVFDRENLKLAWSHSKCE 344
           + I +D    ++ ++   C 
Sbjct: 313 YNIAYDVSTSRIGFAPGSCH 332


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 130/329 (39%), Gaps = 26/329 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS ++  +SCS   C      S S C    + C Y   Y  + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           +L        +   +  + V+ GC   QTG  +    A DG+ G G   +SV S LA  G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243

Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           +    FS C   EN  G +     G   + +  F P+      Y V + S  +    L  
Sbjct: 244 IAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
                 T +G   ++D+G +  +L    Y   V      VS   + +  +GN    CY  
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNF 321
           ++      P + L F+   S  +    +   +N   G  V+C+           I+G   
Sbjct: 359 TTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLV 418

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
           +     V+D    ++ W++  C   ++ S
Sbjct: 419 LKDKIFVYDLVGQRIGWANYDCSTSVNVS 447


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/318 (26%), Positives = 144/318 (45%), Gaps = 41/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++P  S+S  +V C+   C +       ++  C Y   Y  + T S G L  + + + S 
Sbjct: 134 FNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS- 191

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 SSV+S  +IGCG   +G +       GV+GLG G +S+ S +++   I   FS 
Sbjct: 192 ------SSVKS--VIGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240

Query: 164 CFD---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQS 215
           C      + +G + FG+    +       P+  K     Y++ +E+  IGN       + 
Sbjct: 241 CLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQ 300

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEMLKVP 273
           G   ++DSG + T LP E+Y  VV    K+V +KR+     S   C++   ++   L +P
Sbjct: 301 G-NVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIP 359

Query: 274 DMRLIFS--KNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQ----NFMMGH 325
            +   FS   N + +  N      +N    V CLT+   S   ++GIIG     NF++G 
Sbjct: 360 VITAHFSGGANVNLLPINTFRKVADN----VNCLTLKAASPTTEFGIIGNLAQANFLIG- 414

Query: 326 RIVFDRENLKLAWSHSKC 343
              +D E  +L++  + C
Sbjct: 415 ---YDLEAKRLSFKPTVC 429


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 78/333 (23%), Positives = 138/333 (41%), Gaps = 51/333 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           YDP +S + + + C+ P C+       C +    C Y+  Y  + ++SSG L  D L L 
Sbjct: 134 YDPRNSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDTLVL- 191

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 P  +   +V +GCG    G     A   G++G G G +S P+ LA A    + F
Sbjct: 192 ------PDDTRVHNVTLGCGHDNEGLLASAA---GLLGAGRGQLSFPTQLAPA--YGHVF 240

Query: 162 SICFDE------NDSGSVFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNSCL 212
           S C  +      N S  + FG        ST+F P+     +   Y+V +  + +G   +
Sbjct: 241 SYCLGDRMSRARNSSSYLVFGRT--PELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERV 298

Query: 213 TQSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQG 255
             +GF                +VDSG + +    + YA V   F    ++   +R+  + 
Sbjct: 299 --AGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKF 356

Query: 256 NSWKYCYNASSE---EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMST 310
           + +  CY+         ++VP + L F+      +   N++      +  T FCL + + 
Sbjct: 357 SVFDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAA 416

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           D    ++G     G  +VFD E  ++ ++ + C
Sbjct: 417 DDGLNVLGNVQQQGFGVVFDVERGRIGFTPNGC 449


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 87/320 (27%), Positives = 134/320 (41%), Gaps = 36/320 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTS-SSGYLVD 95
           YDPS S + K +SC+   C SR    +L DP        C Y A Y   DTS S GYL  
Sbjct: 168 YDPSVSKTYKKLSCASVEC-SRLKAATLNDPLCETDSNACLYTASYG--DTSFSIGYLSQ 224

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KA 154
           D+L L S S+  PQ         GCG+   G +   A   G++GL    +S+ + L+ K 
Sbjct: 225 DLLTLTS-SQTLPQ------FTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTKY 274

Query: 155 GLIQNSFSICFDENDSGSVFFGDQ-----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
           G   ++FS C    +SGS   G        P + + T  L   +    YF+ + +  +  
Sbjct: 275 G---HAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSG 331

Query: 210 SCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNA 264
             L  +        L+DSG   T LP  +YA +   F K++S+K       S    C+  
Sbjct: 332 RPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKG 391

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           S + +  VP++++IF       +R        ++G T       S      IIG      
Sbjct: 392 SLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQT 451

Query: 325 HRIVFDRENLKLAWSHSKCE 344
           + I +D    ++ ++   C 
Sbjct: 452 YNIAYDVSTSRIGFAPGSCH 471


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/324 (23%), Positives = 135/324 (41%), Gaps = 31/324 (9%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YD   S + K VSC    C +      S C +    C Y   Y+ + +SS GY V D
Sbjct: 142 LTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVRD 199

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
           I+     S     +S   SVI GC   Q+G      A DG++G G  + S+ S LA +G 
Sbjct: 200 IVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGK 259

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
           ++  F+ C D  + G +F    G   Q   +  P+      Y V +++  +G   L    
Sbjct: 260 VRKMFAHCLDGLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPT 317

Query: 214 -------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                  + G   ++DSG +  +LP  +Y +++ K     S  ++    + +  C+  S 
Sbjct: 318 DVFDVGDKKG--TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQYSE 374

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
                 P +   F  +    V  H + F  +    ++C+      + S D  +  ++G  
Sbjct: 375 SLDDGFPAVTFHFENSLYLKVHPHEYLFSYD---GLWCIGWQNSGMQSRDRRNITLLGDL 431

Query: 321 FMMGHRIVFDRENLKLAWSHSKCE 344
            +    +++D EN  + W+   C+
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNCK 455


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 140/341 (41%), Gaps = 46/341 (13%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYL 93
           D+    +DPS SS+ + V+C  P+C+     S S+C      C Y+  Y  + + ++GY+
Sbjct: 124 DQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYG-DKSITAGYI 182

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             D     S +         S +  GCG   TG +    +  G+ G G G +S+PS L +
Sbjct: 183 FKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNES--GIAGFGRGPLSLPSQL-R 239

Query: 154 AGLIQNSFSICF------DENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAY 198
            G     FS C       + N + +VF G           GP   +ST  +        Y
Sbjct: 240 VG----RFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGPF--RSTPIIHSPSFPTFY 293

Query: 199 FVGVESYCIGNSCL-TQSGFQAL---------VDSGASFTFLPTEIYAEVVVKFDKLVSS 248
           ++ +E   +G + L   S   AL         +DSG   T  P  ++ ++  +F   +  
Sbjct: 294 YLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPL 353

Query: 249 KRISLQGNSWKYCYNASSEEMLKVPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCL 305
            R                +   +VP  +LIF   S +      N+I   PE+    V CL
Sbjct: 354 PRYDNTSEVGNLLCFQRPKGGKQVPVPKLIFHLASADMDLPRENYI---PEDTDSGVMCL 410

Query: 306 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            +   + D  +IG        IV+D EN KL ++ ++C+++
Sbjct: 411 MINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQCDKM 451


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/269 (26%), Positives = 112/269 (41%), Gaps = 22/269 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YD   S++ K VSC    C        S C +    CPY+  Y  + +S++GY V D
Sbjct: 131 LTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKD 188

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKA 154
            +     S     ++   S+  GCG +Q+G        A DG++G G  + S+ S LA  
Sbjct: 189 YVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLAST 248

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
             ++  F+ C D  + G +F    G   Q   +  P+      Y V +    +G+  L  
Sbjct: 249 RKVKKMFAHCLDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNI 306

Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNAS 265
           S   F+A      ++DSG +  +LP  IY  +V K   L     + +Q    +Y C+  S
Sbjct: 307 SADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKI--LSQQHNLEVQTIHGEYKCFQYS 364

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
                  P +   F  +    V  H + F
Sbjct: 365 ERVDDGFPPVIFHFENSLLLKVYPHEYLF 393


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 84/339 (24%), Positives = 145/339 (42%), Gaps = 40/339 (11%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
           ++N   YDP  SSS KN++C  P C+  SS      CK     CPY   Y     ++  +
Sbjct: 231 EQNGPYYDPKDSSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDF 290

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
            ++      +  +  P+  +  +V+ GCG    G +   A    ++GLG G +S  + L 
Sbjct: 291 ALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL- 346

Query: 153 KAGLIQNSFSICFDENDSGS------VFFGDQGPATQQSTSFLP-IGEKYDA----YFVG 201
              L  +SFS C  + +S S      +F  D+   +  + +F   +G K +     Y+V 
Sbjct: 347 -QSLYGHSFSYCLVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVL 405

Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
           ++S  +G   L           Q G   ++DSG + T+     Y  +   F + +    +
Sbjct: 406 IKSIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL 465

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSF-PENEGFTVFCLTVM 308
                  K CYN S  E +++P+  ++F+      F V N+     PE+    V CL ++
Sbjct: 466 VETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPED----VVCLAIL 521

Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            T      IIG        I++D +  +L ++  KC +V
Sbjct: 522 GTPRSALSIIGNYQQQNFHILYDLKKSRLGYAPMKCADV 560


>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 418

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/321 (23%), Positives = 128/321 (39%), Gaps = 50/321 (15%)

Query: 56  VSCSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 107
           V CS P+C +           C     PC Y  +Y+ ++  S+G L  D +H+ S     
Sbjct: 116 VKCSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEYA-DNAESTGALARDYMHIGS----- 169

Query: 108 PQSSVQSSVIIGCGRKQT-GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
           P  S    V+ GCG +Q         +  GV+GLG G +S+ S L   G I N    C  
Sbjct: 170 PSGSNVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLS 229

Query: 167 ENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
               G +F GD+          P  Q S       EK+  Y  G              G 
Sbjct: 230 AEGGGYLFLGDKFIPSSGIFWTPIIQSSL------EKH--YSTGPVDLFFNGKPTPAKGL 281

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYC--YNASSEEM 269
           Q + DSG+S+T+    +Y  V    +  +  K +  +         WK    + + +E  
Sbjct: 282 QIIFDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKDPSLPICWKGVKPFKSLNEVN 341

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGH 325
                + L F+K+     +N  F  P  + F   CL +++ +    G+  ++G   +   
Sbjct: 342 NYFKPLTLSFTKS-----KNLQFQLPPVK-FGNVCLGILNGNEAGLGNRNVVGDISLQDK 395

Query: 326 RIVFDRENLKLAWSHSKCEEV 346
            +V+D E  ++ W+ + C+++
Sbjct: 396 VVVYDNEKQQIGWASANCKQI 416


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/325 (22%), Positives = 134/325 (41%), Gaps = 33/325 (10%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
            L+ +D + SSS++ + C+ P+C + S+    C +  D C Y   Y  + + +SG+ V D
Sbjct: 127 ELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTD 185

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
            +H       +  ++  ++++ GC   Q G       A DG+ G G G+ SV S L+  G
Sbjct: 186 SMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRG 245

Query: 156 LIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC-- 211
           +    FS C    EN  G +  G+     + S  + P+      Y + ++S  +      
Sbjct: 246 ITPKVFSHCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP 302

Query: 212 ------LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                 ++ +G + ++DSG +  +L  E+Y  +V      VS           + C+  S
Sbjct: 303 NPTMFPISNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVS 360

Query: 266 SEEMLKVPDMRLIFSKNQSFVVR-------NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
                  P +R  F    S VV        + I   P      ++C+     +    I+G
Sbjct: 361 MSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVREP-----ALWCIGFQKAEDGLNILG 415

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
              +    IV+D    ++ W++  C
Sbjct: 416 DLVLKDKIIVYDLARQRIGWANYDC 440


>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 395

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 138/331 (41%), Gaps = 39/331 (11%)

Query: 51  SSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
           +  K V    PLC+    +++ C++ K  C Y   Y+ + +SS G L  D + L +    
Sbjct: 62  TEGKIVHPRDPLCEELQGNQNYCETCKQ-CDYEITYA-DRSSSKGVLARDNMQLTT---- 115

Query: 107 APQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
           A         + GC   Q G  LD   + DG++GL  G +S+ + LA +G+I N F  C 
Sbjct: 116 ADGEMKNVDFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCM 175

Query: 166 --DENDSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLTQSG-----F 217
             D +  G +F GD     +   +++PI     + Y   V     G   L   G      
Sbjct: 176 ATDPSSGGYMFLGDDY-VPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLT 234

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
           Q + DSG+S+T+ P EIY  ++   +             +  +C   +   +  V D+  
Sbjct: 235 QVIFDSGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPFCMKPNV-PVRSVGDVEQ 293

Query: 278 IFS------KNQSFVVRNHIFSFPENEGFTV----FCLTVMSTDG-DYG-----IIGQNF 321
           +F+      + + FV+       PEN          CL V+  DG + G     IIG   
Sbjct: 294 LFNPLILQLRKRWFVIPTTFAISPENYLIISDKGNVCLGVL--DGTEIGHSSTIIIGDAS 351

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
           + G  +V+D +  ++ W  S C     +S V
Sbjct: 352 LRGKFVVYDNDENRIGWVQSDCTRPQKQSRV 382


>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
 gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
          Length = 583

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 34/294 (11%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
           C Y  +Y+ + +SS G L  D LHL      A  SS       GC   Q G  L+     
Sbjct: 284 CDYEIEYA-DHSSSMGVLARDELHLT----MANGSSTNLKFNFGCAYDQQGLLLNTLVKT 338

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIG 192
           DG++GL    VS+PS LA  G+I N    C   D    G +F GD     +   S++P+ 
Sbjct: 339 DGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFLGDDF-VPRWGMSWVPML 397

Query: 193 E--KYDAYFVGVESYCIGNSCLTQSGFQALV-----DSGASFTFLPTEIYAEVVVKFDKL 245
           +    D+Y   +     G+  L+  G +  V     DSG+S+T+   E Y+E+V    ++
Sbjct: 398 DSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQV 457

Query: 246 VSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSK-----NQSFVVRNHIFSFPENEG 299
                I    + +  +C+ A    +  V D++  F          + + +  F  P  EG
Sbjct: 458 SGEALIQDTSDPTLPFCWRAKF-PIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPP-EG 515

Query: 300 FTVF------CLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           + +       CL ++      DG   I+G   + G  I++D  N K+ W+ S C
Sbjct: 516 YLIISNKGNVCLGILDGSDVHDGSSIILGDISLRGQLIIYDNVNNKIGWTQSDC 569


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/316 (22%), Positives = 131/316 (41%), Gaps = 32/316 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DPS SS+  +++CS   C+      + +C S K  CPY   Y+ +D+ + G L  D L 
Sbjct: 176 FDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKK-CPYEITYA-DDSYTVGNLARDTLT 233

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           L      +P  +V    + GCG    GS+      DG++GLG G  S+ S +  A     
Sbjct: 234 L------SPTDAVP-GFVFGCGHNNAGSF---GEIDGLLGLGRGKASLSSQV--AARYGA 281

Query: 160 SFSICFDENDSGSVFFGDQG-----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
            FS C   + S + +    G     P   Q T  +  G+    Y++ +    +    +  
Sbjct: 282 GFSYCLPSSPSATGYLSFSGAAAAAPTNAQFTEMV-AGQHPSFYYLNLTGITVAGRAIKV 340

Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                 +    ++DSG +F+ LP   YA +       +   + +     +  CY+ +  E
Sbjct: 341 PPSVFATAAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHE 400

Query: 269 MLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
            +++P + L+F+   +  +  + +     N   T         D   G++G        +
Sbjct: 401 TVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAV 460

Query: 328 VFDRENLKLAWSHSKC 343
           ++D +N K+ +  + C
Sbjct: 461 IYDVDNQKVGFGANGC 476


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/328 (22%), Positives = 136/328 (41%), Gaps = 24/328 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +D   SS++  + CS P+C SR     + C    + C Y   Y  + + +SGY V D
Sbjct: 122 LNFFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSD 180

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
            ++ +      P  +  ++++ GC   Q+G       A DG+ G G G +SV S L+  G
Sbjct: 181 AMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRG 240

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +    FS C  + D         G   + S  + P+      Y + ++S  +    L   
Sbjct: 241 ITPKVFSHCL-KGDGDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIN 299

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNA 264
                 + +    +VD G +  +L  E Y  +V   +  V  S+++ + +GN    CY  
Sbjct: 300 PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CYLV 356

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFTVFCLTVMSTDGDYGIIGQNFM 322
           S+      P + L F    S V++   +       +G  ++C+          I+G   +
Sbjct: 357 STSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVL 416

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
               +V+D    ++ W++  C   ++ S
Sbjct: 417 KDKIVVYDIAQQRIGWANYDCSLSVNVS 444


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 140/319 (43%), Gaps = 36/319 (11%)

Query: 53  SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
            K V C+ PLC        +   C+   D C Y  +Y+ + T+S G L+ D   L + S 
Sbjct: 89  KKLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYA-DGTTSLGVLLLDKFSLPTGS- 146

Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI-QNS 160
                    ++  GCG  Q       A      DG++GLG G V + S L  +G + +N 
Sbjct: 147 -------ARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNV 199

Query: 161 FSICFDENDSGSVFFGDQG-PATQQSTSFLP-IGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
              C      G +F G++  P++     ++  I  + + Y  G  +  +G + +    F+
Sbjct: 200 IGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFK 259

Query: 219 ALVDSGASFTFLPTEIYAEVV--VKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKVPDM 275
           A+ DSG+++T+LP  ++A++V  +K   + SS K +S        C+    +    V D+
Sbjct: 260 AIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKG-PKPFKTVHDL 318

Query: 276 RLIFSKNQSFVVRNHIFSF---PEN----EGFTVFCLTVMSTDG-DYGIIGQNFMMGHRI 327
              F K+   +  +H  +    PEN     G    C  ++   G D  +IG   M    +
Sbjct: 319 PKEF-KSLVTLKFDHGVTMTIPPENYLIITGHGNACFGILELPGYDLFVIGGISMQEQLV 377

Query: 328 VFDRENLKLAWSHSKCEEV 346
           + D E  +LAW  S C+++
Sbjct: 378 IHDNEKGRLAWMPSPCDKM 396


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 137/329 (41%), Gaps = 25/329 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
            L+ +D SSSS++  V CS P+C S      + C    + C Y   Y  + + +SGY V 
Sbjct: 109 QLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYE-DGSGTSGYYVS 167

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
           D L+  +    +   +  + ++ GC   Q+G   +   A DG+ G G G++SV S L+  
Sbjct: 168 DTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTH 227

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           G+    FS C  + +         G   +    + P+      Y + ++S  +    L  
Sbjct: 228 GITPRVFSHCL-KGEGIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGKLLPI 286

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
                 T +    +VDSG +  +L  E Y   V   + +VS     +  +GN    CY  
Sbjct: 287 DPSVFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ---CYLV 343

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNF 321
           S+      P     F+   S V++  +++  F P   G  ++C+      G   I+G   
Sbjct: 344 STSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQG-VTILGDLV 402

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
           +     V+D    ++ W++  C   ++ S
Sbjct: 403 LKDKIFVYDLVRQRIGWANYDCSLSVNVS 431


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 127/291 (43%), Gaps = 41/291 (14%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P  SSS   V C+        +C S K  C Y   Y+ E +SSSG L +DI+     
Sbjct: 130 RFQPDLSSSYSPVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRE 183

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+   Q +V      GC   +TG      A DG+MGLG G +S+   L + G+I +SFS+
Sbjct: 184 SELKAQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSL 237

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    V  G   P+    +   P+   Y  Y + ++   +    L        
Sbjct: 238 CYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY--YNIELKEIHVAGKALRVDSRIFD 295

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY---CYNASSEE 268
           S    ++DSG ++ +LP + +    + F   V+SK  SL   +G    Y   C+  +   
Sbjct: 296 SKHGTVLDSGTTYAYLPEQAF----MAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRN 351

Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
           + K+    PD+ ++F   Q  S    N++F   + +G   +CL V     D
Sbjct: 352 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKD 400


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 134/331 (40%), Gaps = 44/331 (13%)

Query: 40  RNLSE-YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           R LS  YDP  SS+     CS P C++  +C      C Y   Y  + +S+SG L  D L
Sbjct: 135 RQLSPLYDPRGSSTYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYG-DASSTSGNLATDRL 193

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
             ++       +SV  +V +GCG    G +   A   G++G+  G+ S  + +A +    
Sbjct: 194 VFSN------DTSV-GNVTLGCGHDNEGLFGSAA---GLLGVARGNNSFATQVADS--YG 241

Query: 159 NSFSICF-DENDSGS----VFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNS 210
             F+ C  D   SGS    + FG   P    S  F P+     +   Y+V +  + +G  
Sbjct: 242 RYFAYCLGDRTRSGSSSSYLVFGRTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGE 300

Query: 211 CLTQSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKL---VSSKRISL 253
            +T  GF                +VDSG S T    + Y  +   FD     V  +++  
Sbjct: 301 PVT--GFSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGR 358

Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDG 312
             + +  CY+     +   P + L F+      +    +  PE  G +  F L     DG
Sbjct: 359 GISVFDACYDLRGVAVADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDG 418

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              +IG       R+VFD EN ++ +  + C
Sbjct: 419 -LSVIGNVLQQRFRVVFDVENERVGFEPNGC 448


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 80/322 (24%), Positives = 132/322 (40%), Gaps = 35/322 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++PSSSSS K + CS  LC +      L + C Y ADY     +    + D+++   +F 
Sbjct: 58  FNPSSSSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAF- 116

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
              P   V +++ +GCG    G++   A   G++GLG G +S P+ L  +   +N FS C
Sbjct: 117 --GPGQVVLTNIPLGCGHDNEGTFGTAA---GILGLGRGPLSFPNNLDAS--TRNIFSYC 169

Query: 165 F-----DENDSGSVFFGDQG-PATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCLTQ 214
                 D N   ++ FGD   P T   S  F+P          Y+V +    +G + LT 
Sbjct: 170 LPDRESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTN 229

Query: 215 ---SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
              S FQ         + DSG + T L    Y  V   F         +     +  CY+
Sbjct: 230 IPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYD 289

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
            +    + VP +   F  +    +   N+I     N    +FC    ++ G   +IG   
Sbjct: 290 FTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNN---IFCFAFAASMGP-SVIGNVQ 345

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
               R+++D  + ++     +C
Sbjct: 346 QQSFRVIYDNVHKQIGLLPDQC 367


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 73/323 (22%), Positives = 135/323 (41%), Gaps = 26/323 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
            L+ +D + SSS++ + C+ P+C + S+    C +  D C Y   Y  + + +SG+ V D
Sbjct: 127 ELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTD 185

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
            +H       +  ++  ++++ GC   Q G       A DG+ G G G+ SV S L+  G
Sbjct: 186 SMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRG 245

Query: 156 LIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC-- 211
           +    FS C    EN  G +  G+     + S  + P+      Y + ++S  +      
Sbjct: 246 ITPKVFSHCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP 302

Query: 212 ------LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                 ++ +G + ++DSG +  +L  E+Y  +V      VS           + C+  S
Sbjct: 303 NPTMFPISNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVS 360

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIF----SFPENEGF-TVFCLTVMSTDGDYGIIGQN 320
                  P +R  F    S VV    +    S      F +++C+     +    I+G  
Sbjct: 361 MSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDL 420

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
            +    IV+D    ++ W++  C
Sbjct: 421 VLKDKIIVYDLAQQRIGWANYDC 443


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 81/318 (25%), Positives = 142/318 (44%), Gaps = 55/318 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           + PS SS+ K + C+ P+CK+                      +   YL  D L L S +
Sbjct: 132 FHPSKSSTYKTIPCTSPICKN----------------------ADGHYLGVDTLTLNS-N 168

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
              P S    +++IGCG +  G  L+G    G +GL  G +S  S L  +  I   FS C
Sbjct: 169 NGTPISF--KNIVIGCGHRNQGP-LEGYV-SGNIGLARGPLSFISQLNSS--IGGKFSYC 222

Query: 165 F-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----TQS 215
                  EN S  + FGD+   +   T   PI E+ + YFV +E++ +G+  +    + +
Sbjct: 223 LVPLFSKENVSSKLHFGDKSTVSGLGTVSTPIKEE-NGYFVSLEAFSVGDHIIKLENSDN 281

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML-KVPD 274
              +++DSG + T LP ++Y+ +      +V  KR+      +  CY  +S  +L KV  
Sbjct: 282 RGNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLI 341

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-------GIIGQNFMMGHRI 327
           +   FS ++  +   + F +P  +   V C   +S  G++        ++ QNF++G   
Sbjct: 342 ITAHFSGSEVHLNALNTF-YPITD--EVICFAFVS-GGNFSSLAIFGNVVQQNFLVG--- 394

Query: 328 VFDRENLKLAWSHSKCEE 345
            FD     +++  + C +
Sbjct: 395 -FDLNKKTISFKPTDCTK 411


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/325 (25%), Positives = 138/325 (42%), Gaps = 40/325 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S S   V C+ PLC+   S  C   +  C Y   Y  + + ++G    + L  A 
Sbjct: 182 FDPRRSRSYNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYG-DGSVTAGDFATETLTFAG 240

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            ++ A        V +GCG    G ++  A    ++GLG G +S P+ +++      SFS
Sbjct: 241 GARVA-------RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--RYGRSFS 288

Query: 163 ICFDEN--------DSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFV---------GVE 203
            C  +          S +V FG     +  ++SF P+    + + ++          G  
Sbjct: 289 YCLVDRTSSANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGAR 348

Query: 204 SYCIGNSCLT---QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-W 258
              + NS L     SG    +VDSG S T L    Y+ +   F    +  R+S  G S +
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLF 408

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
             CY+ S  +++KVP + + F+      +    +  P +   T FC     TDG   IIG
Sbjct: 409 DTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGTDGGVSIIG 467

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                G R+VFD +  ++A++   C
Sbjct: 468 NIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/319 (24%), Positives = 134/319 (42%), Gaps = 34/319 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++PS S S + + C+   C+S          C S    C Y+ +Y  + + + G L  + 
Sbjct: 107 FNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYG-DGSYTRGDLGMEQ 165

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L+L +   H       S+ I GCGR   G +       G+MGLG  D+S+ S    + + 
Sbjct: 166 LNLGT--THV------SNFIFGCGRNNKGLF---GGASGLMGLGKSDLSLVS--QTSAIF 212

Query: 158 QNSFSICFDE---NDSGSVFFGDQGPATQQST-----SFLPIGEKYDAYFVGVESYCIGN 209
           +  FS C      + SGS+  G      + +T       +   +    YF+ +    IG 
Sbjct: 213 EGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG 272

Query: 210 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
             L    ++    L+DSG   T LP  +Y ++  +F K  S    +   +    C+N + 
Sbjct: 273 VALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNG 332

Query: 267 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMG 324
            + + +P +R+ F  N    V    IF F + +   V   L  +S D +  IIG      
Sbjct: 333 YDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRN 392

Query: 325 HRIVFDRENLKLAWSHSKC 343
            R++++ +  KL ++   C
Sbjct: 393 QRVIYNTKESKLGFAAEAC 411


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 76/327 (23%), Positives = 131/327 (40%), Gaps = 23/327 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
           L  ++P SSS++  ++CS   C +          +  S   PC Y   Y  + + +SGY 
Sbjct: 135 LESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYY 193

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLA 152
           V D +   +   +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L 
Sbjct: 194 VSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLN 253

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
             G+    FS C   +D+G       G   +    + P+      Y + +ES        
Sbjct: 254 SLGVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKL 312

Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
            I +S  T S  Q  +VDSG +  +L    Y   V      VS    SL     + C+  
Sbjct: 313 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFIT 371

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNF 321
           SS      P + L F    +  V+  N++      +   ++C+      G +  I+G   
Sbjct: 372 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 431

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVID 348
           +     V+D  N+++ W+   C   ++
Sbjct: 432 LKDKIFVYDLANMRMGWADYDCSMSVN 458


>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 127/315 (40%), Gaps = 36/315 (11%)

Query: 56  VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           V C  PLCK+  S     C    + C Y  +Y+ +  SS G L+ D + L    K    S
Sbjct: 114 VKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNIPL----KFTNGS 168

Query: 111 SVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
             +  +  GCG  Q    +   A+  GV+GLG G  S+ S L   GLI+N    C  E  
Sbjct: 169 LARPILAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERG 228

Query: 170 SGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVDSGASF 227
            G +FFGDQ    Q    + P+ +      Y  G           +  G Q + DSG+S+
Sbjct: 229 GGFLFFGDQ-LVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSY 287

Query: 228 TFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASS------EEMLKVPDMRLIF 279
           T+  ++ +  +V      +  K +S   + +S   C+          +       + L F
Sbjct: 288 TYFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPLLLSF 347

Query: 280 SKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFD 330
           +K+     +N +   P      V      CL ++       G+  IIG   +    +++D
Sbjct: 348 TKS-----KNSLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYD 402

Query: 331 RENLKLAWSHSKCEE 345
            E  ++ W+ + C+ 
Sbjct: 403 NEKQQIGWASANCDR 417


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 145/328 (44%), Gaps = 24/328 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L  +D   S ++ +V+CS P+C S      + C S  + C Y   Y  + + +SGY + 
Sbjct: 143 DLHFFDAPGSFTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 200

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
           D  +  +    +  ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  
Sbjct: 201 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 260

Query: 155 GLIQNSFSICFDENDSGSVFF--GDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNS 210
           G+    FS C   + SG   F  G+        +  LP    Y+     +GV    +   
Sbjct: 261 GITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILP-- 318

Query: 211 CLTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
            +  + F+A      +VD+G + T+L  E Y   +      V S+ ++L  ++ + CY  
Sbjct: 319 -IDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSV-SQLVTLIISNGEQCYLV 376

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           S+      P + L F+   S ++R  +++F +   +G +++C+       +  I+G   +
Sbjct: 377 STSISDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVL 436

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
                V+D    ++ W++  C   ++ S
Sbjct: 437 KDKVFVYDLARQRIGWANYDCSMSVNVS 464


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 140/332 (42%), Gaps = 43/332 (12%)

Query: 34  ASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSS-GY 92
           +SI+    ++ YDP  S ++   +CS PLC    SC+   + C Y  D S EDTSSS G 
Sbjct: 132 SSIIMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNNSCAY--DISYEDTSSSTGI 189

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
              D++HL        ++S+ +++ +GC    +G +      DG+MG G   VSVP+ LA
Sbjct: 190 YFRDVVHLGH------KASLNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLA 239

Query: 153 KAGLIQNSFSICFD-ENDSGSVFF---GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
                 N F  C   E + G +      D+ P       + P+      Y V + S  + 
Sbjct: 240 AQAGSYNIFYHCLSGEKEGGGILVLGKNDEFP----EMVYTPMLANDIVYNVKLVSLSVN 295

Query: 209 NSCL--TQSGFQ---------ALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQ 254
           +  L    S F+          ++DSG S    P++   ++ + V KF   + +  +   
Sbjct: 296 SKALPIEASEFEYNATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAPLESS 355

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF-------SFPENEGFTVFCLTV 307
           G+      +  +   +  P++ L F    +  +  H +          E+  F    L  
Sbjct: 356 GSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVC 415

Query: 308 MS-TDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           +S + G+  I+G   +    +V+D E  ++ W
Sbjct: 416 ISWSVGNSTILGDAILKDKVVVYDMEKSRIGW 447


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 76/327 (23%), Positives = 131/327 (40%), Gaps = 23/327 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
           L  ++P SSS++  ++CS   C +          +  S   PC Y   Y  + + +SGY 
Sbjct: 133 LESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYY 191

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLA 152
           V D +   +   +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L 
Sbjct: 192 VSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLN 251

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
             G+    FS C   +D+G       G   +    + P+      Y + +ES        
Sbjct: 252 SLGVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKL 310

Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
            I +S  T S  Q  +VDSG +  +L    Y   V      VS    SL     + C+  
Sbjct: 311 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFIT 369

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNF 321
           SS      P + L F    +  V+  N++      +   ++C+      G +  I+G   
Sbjct: 370 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 429

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVID 348
           +     V+D  N+++ W+   C   ++
Sbjct: 430 LKDKIFVYDLANMRMGWADYDCSMSVN 456


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 90/322 (27%), Positives = 140/322 (43%), Gaps = 37/322 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS SS+ K + CS P CK+   + C S  K  C Y   Y  E   S G L  D L L 
Sbjct: 131 FDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGE-AYSQGDLSIDTLTLN 189

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           S +   P S    +++IGCG +  G  L+G    G +GLG G +S  S L  +  I   F
Sbjct: 190 S-NNDTPISF--KNIVIGCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKF 242

Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
           S C      +E  SG + FGD+   +   T   PI      Y   + +  +G+  +    
Sbjct: 243 SYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFEN 302

Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                 +    ++DSG + T LP  +Y+ +      +V  +R       +K CY A+ + 
Sbjct: 303 STSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKN 362

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG-IIG----QNFMM 323
            L VP +   F+     +   + F   ++E   V C   +S     G IIG    QNF++
Sbjct: 363 -LDVPIITAHFNGADVHLNSLNTFYPIDHE---VVCFAFVSVGNFPGTIIGNIAQQNFLV 418

Query: 324 GHRIVFDRENLKLAWSHSKCEE 345
           G    FD +   +++  + C +
Sbjct: 419 G----FDLQKNIISFKPTDCTK 436


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 78/328 (23%), Positives = 137/328 (41%), Gaps = 29/328 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  SS++  VSCS PLC      + + C    + C YI  Y  + ++S GY V D
Sbjct: 46  LTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFSYG-DGSTSEGYYVRD 104

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            +     S +   ++  S V+ GC  +QTG       A DG++G G  ++SVP+ LA   
Sbjct: 105 AMQYNVISSNG-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQ 163

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            I   FS C  E +         G   +   ++ P+      Y V +    + ++ L   
Sbjct: 164 NIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPID 222

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                + +    ++DSG +  + P+  Y   V    +  S+  + +QG   + C+  S  
Sbjct: 223 AEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGR 281

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFT-VFCLTVMSTDGDYG--------I 316
                P++ L F      +  ++   +      G T V+C+   S+    G        I
Sbjct: 282 LSDLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTI 341

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           +G   +    +V+D +N ++ W    C+
Sbjct: 342 LGDIVLKDKLVVYDLDNSRIGWMSYNCK 369


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 76/327 (23%), Positives = 131/327 (40%), Gaps = 23/327 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
           L  ++P SSS++  ++CS   C +          +  S   PC Y   Y  + + +SGY 
Sbjct: 49  LESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYY 107

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLA 152
           V D +   +   +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L 
Sbjct: 108 VSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLN 167

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
             G+    FS C   +D+G       G   +    + P+      Y + +ES        
Sbjct: 168 SLGVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKL 226

Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
            I +S  T S  Q  +VDSG +  +L    Y   V      VS    SL     + C+  
Sbjct: 227 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFIT 285

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNF 321
           SS      P + L F    +  V+  N++      +   ++C+      G +  I+G   
Sbjct: 286 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 345

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVID 348
           +     V+D  N+++ W+   C   ++
Sbjct: 346 LKDKIFVYDLANMRMGWADYDCSMSVN 372


>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 86/336 (25%), Positives = 143/336 (42%), Gaps = 61/336 (18%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD--I 97
           Y P++++    ++C  PLC S        CKS  D C Y  +Y+ +  SS G LV+D   
Sbjct: 98  YKPNNNA----LNCFEPLCTSLHPITNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVP 152

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGL 156
           L L + S  AP+      +  GCG     S  D + P  GV+GLG G+VS  S L+  G+
Sbjct: 153 LKLTNGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGV 206

Query: 157 IQNSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKY-----DAYFVGVESYCI 207
           ++N    C  + + G +FFGD+       T  S S   IG  Y     + YF G  +   
Sbjct: 207 VRNVVGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKAT--- 262

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS 265
           G   LT      + DSG+S+T+  ++ Y  ++      +  K +  + +  S   C+  +
Sbjct: 263 GIKDLT-----LVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGT 317

Query: 266 S--------EEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTD-- 311
                    ++      +R   +KN    +       PEN      +   C  +++    
Sbjct: 318 RPFKSLRDVKKYFNPLALRFTKTKNAQIQLP------PENYLIITKYGNVCFGILNGTEV 371

Query: 312 --GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
             GD  IIG   +    +++D E  ++ W  + C +
Sbjct: 372 GLGDLNIIGDISLKDKMVIYDNERRRIGWFPTNCNK 407


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 129/324 (39%), Gaps = 47/324 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
           +DP++SSS   VSC   +C++ S             DYS    + + + G L  + L L 
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                   ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQL--GGAAGGVF 278

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY----------DAYFVGVESYCIGNSC 211
           S C     +G       G      T  +P+G  +            Y+VG+    +G   
Sbjct: 279 SYCLASRGAGGA-----GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGER 333

Query: 212 ---------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
                    LT+ G   +V D+G + T LP E YA +   FD  + +   S   +    C
Sbjct: 334 LPLQDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTC 393

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
           Y+ S    ++VP +   F +     +  RN +       G  VFCL    +     I+G 
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGN 449

Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
               G +I  D  N  + +  + C
Sbjct: 450 IQQEGIQITVDSANGYVGFGPNTC 473


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 129/324 (39%), Gaps = 47/324 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
           +DP++SSS   VSC   +C++ S             DYS    + + + G L  + L L 
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                   ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVF 278

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY----------DAYFVGVESYCIGNSC 211
           S C     +G       G      T  +P+G  +            Y+VG+    +G   
Sbjct: 279 SYCLASRGAGGA-----GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGER 333

Query: 212 ---------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
                    LT+ G   +V D+G + T LP E YA +   FD  + +   S   +    C
Sbjct: 334 LPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTC 393

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
           Y+ S    ++VP +   F +     +  RN +       G  VFCL    +     I+G 
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGN 449

Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
               G +I  D  N  + +  + C
Sbjct: 450 IQQEGIQITVDSANGYVGFGPNTC 473


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 83/331 (25%), Positives = 136/331 (41%), Gaps = 40/331 (12%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           D++   +DP  S S   V CS PLC+   S  C   +  C Y   Y  + + ++G    +
Sbjct: 178 DQSGQVFDPRRSRSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATE 236

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L  A  ++ A        + +GCG    G ++  A    ++GLG G +S P+ +++   
Sbjct: 237 TLTFAGGARVA-------RIALGCGHDNEGLFVAAAG---LLGLGRGSLSFPAQISR--R 284

Query: 157 IQNSFSICFDENDSG--------SVFFGDQGPATQQSTSFLPIGEK------YDAYFVGV 202
              SFS C  +  S         +V FG     +  + SF P+ +       Y    VG+
Sbjct: 285 YGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGI 344

Query: 203 ESYCIGNSCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
                  S +  S  +          +VDSG S T L    Y+ +   F    +  R+S 
Sbjct: 345 SVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSP 404

Query: 254 QGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
            G S +  CY+ S  +++KVP + + F+      +    +  P +   T FC     TDG
Sbjct: 405 GGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGTDG 463

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              IIG     G R+VFD +  ++ +    C
Sbjct: 464 GVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 73/330 (22%), Positives = 136/330 (41%), Gaps = 28/330 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +D   SS++  + CS  +C S      + C    + C Y   Y  + + +SGY V D
Sbjct: 112 LNFFDTVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSD 170

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
            ++        P  +  ++++ GC   Q+G       A DG+ G G G +SV S L+  G
Sbjct: 171 AMYFNLIMGQPPAVNSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQG 230

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           +    FS C   D N  G +  G+     + S  + P+      Y + ++S  +    L 
Sbjct: 231 ITPKVFSHCLKGDGNGGGILVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQPLP 287

Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCY 262
                   + +    +VD G +  +L  E Y  +V   +  V  S+++ + +GN    CY
Sbjct: 288 INPAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CY 344

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFTVFCLTVMSTDGDYGIIGQN 320
             S+      P + L F    S V++   +       +G  ++C+          I+G  
Sbjct: 345 LVSTSIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDL 404

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
            +    +V+D    ++ W++  C   ++ S
Sbjct: 405 VLKDKIVVYDIAQQRIGWANYDCSLSVNVS 434


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 73/309 (23%), Positives = 126/309 (40%), Gaps = 24/309 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  NVSC+ P C    +       C Y   Y  + + S G+   D L L+S+ 
Sbjct: 222 FDPARSSTYANVSCAAPACSDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 279

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 280 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 327

Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ- 218
           C     +G+ +  FG   PA + +T+ + +      Y+VG+    +G   L   QS F  
Sbjct: 328 CLPARSTGTGYLDFGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFAT 387

Query: 219 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPD 274
              +VDSG   T LP   Y+ +   F   +S++  + +   +    CY+ +    + +P 
Sbjct: 388 AGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPT 447

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
           + L+F       V      +  +              GD GI+G   +    + +D    
Sbjct: 448 VSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 507

Query: 335 KLAWSHSKC 343
            +++S   C
Sbjct: 508 VVSFSPGAC 516


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 78/327 (23%), Positives = 136/327 (41%), Gaps = 29/327 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  SS++  VSCS PLC      + + C    + C YI  Y  + ++S GY V D
Sbjct: 73  LTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYG-DGSTSEGYYVRD 131

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            +     S +   ++  S V+ GC  +QTG       A DG++G G  ++SVP+ LA   
Sbjct: 132 AMQYNVISSNG-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQ 190

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
            I   FS C  E +         G   +   ++ P+      Y V +    + ++ L   
Sbjct: 191 NIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPID 249

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                + +    ++DSG +  + P+  Y   V    +  S+  + +QG   + C+  S  
Sbjct: 250 AEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGR 308

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFT-VFCLTVMSTDGDYG--------I 316
                P++ L F      +  ++   +      G T V+C+   S+    G        I
Sbjct: 309 LSDLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTI 368

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +G   +    +V+D +N ++ W    C
Sbjct: 369 LGDIVLKDKLVVYDLDNSRIGWMSYNC 395


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 141/320 (44%), Gaps = 34/320 (10%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCK--SLKDPCPYIADYSTEDTSSSGYLVD 95
           +N  ++DP+ S+S KN+SCS   CKS  + S +  S  + C Y   Y T  T   G+L  
Sbjct: 170 QNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGTGYT--VGFLAT 227

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           + L +         S V  + +IGCG +  G +   A   G++GLG   V++PS  +   
Sbjct: 228 ETLTIT-------PSDVFENFVIGCGERNGGRFSGTA---GLLGLGRSPVALPSQTSST- 276

Query: 156 LIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKY-DAYFVGVESYCIGNSCL 212
             +N FS C   + S  G + FG       Q+  F PI  K  + Y + V    +G   L
Sbjct: 277 -YKNLFSYCLPASSSSTGHLSFGG---GVSQAAKFTPITSKIPELYGLDVSGISVGGRKL 332

Query: 213 --TQSGFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS-- 265
               S F+    ++DSG + T+LP+  ++ +   F +++++  ++   +  + CY+ S  
Sbjct: 333 PIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKH 392

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMM 323
           + + + +P + + F       + +       N G    CL       D D  I G     
Sbjct: 393 ANDNITIPQISIFFEGGVEVDIDDSGIFIAAN-GLEEVCLAFKDNGNDTDVAIFGNVQQK 451

Query: 324 GHRIVFDRENLKLAWSHSKC 343
            + +V+D     + ++   C
Sbjct: 452 TYEVVYDVAKGMVGFAPGGC 471


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 73/315 (23%), Positives = 129/315 (40%), Gaps = 29/315 (9%)

Query: 52  SSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 109
            +  V C+  LC++   ++C    + C Y  +Y+ +  SS G L+ D   L    +    
Sbjct: 114 KNNRVPCASSLCQAIQNNNCDIPTEQCDYEVEYA-DLGSSLGVLLSDYFPL----RLNNG 168

Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
           S +Q  +  GCG  Q   YL   +P    G++GLG G  S+ S L   G+ QN    CF 
Sbjct: 169 SLLQPRIAFGCGYDQ--KYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFS 226

Query: 167 ENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA 225
               G +FFGD   P +  + + +        Y  G      G       G Q + DSG+
Sbjct: 227 RVTGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGS 286

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS--------EEMLKVPDM 275
           S+T+   ++Y  ++    K +S   +  + +  +   C+  +         +   K   +
Sbjct: 287 SYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTI 346

Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDR 331
             I +KN    +    +     +G    CL +++      G+  +IG  FM    +V+D 
Sbjct: 347 NFIKAKNVQLQLAPEDYLIITKDGNV--CLGILNGGEQGLGNLNVIGDIFMQDRVVVYDN 404

Query: 332 ENLKLAWSHSKCEEV 346
           E  ++ W  + C  +
Sbjct: 405 ERQQIGWFPTNCNRL 419


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 84/335 (25%), Positives = 145/335 (43%), Gaps = 61/335 (18%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           Y P+ S++  NVSC  P+C++     S C      C Y   Y  + TS+ G L  +   L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTL 193

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S        +    V  GCG +  GS  + +   G++G+G G +   SL+++ G+ +  
Sbjct: 194 GS-------DTAVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR-- 238

Query: 161 FSIC---FDENDSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNS 210
           FS C   F+   +  +F G      +  ++T F+P        +   Y++ +E   +G++
Sbjct: 239 FSYCFTPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298

Query: 211 C---------LTQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--- 257
                     LT  G    ++DSG +FT L    +   V     L S  R+ L   +   
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAF---VALARALASRVRLPLASGAHLG 355

Query: 258 WKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
              C+ A+S E ++VP + L F       + +S+VV        E+    V CL ++S  
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVV--------EDRSAGVACLGMVSAR 407

Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G   ++G        I++D E   L++  +KC E+
Sbjct: 408 G-MSVLGSMQQQNTHILYDLERGILSFEPAKCGEL 441


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 134/314 (42%), Gaps = 37/314 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP+ S+S K + CS  LC+S R  C S K  C Y+  Y  +++SS+G L  + +  +  
Sbjct: 173 FDPTKSASFKGLPCSSKLCQSIRQGCSSPK--CTYLTAY-VDNSSSTGTLATETISFSHL 229

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                      +++IGC  + +G  L      G+MGL    +S+ S    A +    FS 
Sbjct: 230 KYDF------KNILIGCSDQVSGESL---GESGIMGLNRSPISLAS--QTANIYDKLFSY 278

Query: 164 CFDEN--DSGSVFFGDQGPATQQSTSFLPIGEK-----YDAYFVGVESYCIGNSCL--TQ 214
           C       +G + FG + P       F P+ +      YD    G+    +G   L    
Sbjct: 279 CIPSTPGSTGHLTFGGKVP---NDVRFSPVSKTAPSSDYDIKMTGIS---VGGRKLLIDA 332

Query: 215 SGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
           S F+  + +DSGA  T LP + Y+ +   F +++    +  Q +    CY+ S+   + +
Sbjct: 333 SAFKIASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAI 392

Query: 273 PDMRLIFSK--NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           P + + F         V   ++  P   G  V+CL     D +  I G      + +VFD
Sbjct: 393 PSISVFFEGGVEMDIDVSGIMWQVP---GSKVYCLAFAELDDEVSIFGNFQQKTYTVVFD 449

Query: 331 RENLKLAWSHSKCE 344
               ++ ++   C+
Sbjct: 450 GAKERIGFAPGGCD 463


>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 134/330 (40%), Gaps = 32/330 (9%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           R L    P    SS  + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV
Sbjct: 94  RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 151

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            D+  +     +     +   + +GCG  Q          DGV+GLG G VS+ S L   
Sbjct: 152 RDVFSM----NYTKGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQ 207

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSC 211
           G ++N    C      G +FFGD    + +  S+ P+  +Y  ++   +G E    G   
Sbjct: 208 GYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGRT 265

Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS---- 265
                   + DSG+S+T+  ++ Y  V     + +S K +  +   ++   C+       
Sbjct: 266 TGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFM 325

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGI 316
           S E +K     L  S    +  +  +F  P      +      CL +++       +  +
Sbjct: 326 SIEEVKKYFKPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNL 384

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           IG   M    I++D E   + W  + C+E+
Sbjct: 385 IGDISMQDQMIIYDNEKQSIGWMPADCDEL 414


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 110/236 (46%), Gaps = 41/236 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS SSS +N+ C    C S   +SC    D   Y++  +    S++GY V       S
Sbjct: 130 FDPSLSSSYQNIPCLSDTCHSMRTTSC----DVRGYLSVETLTLDSTTGYSV-------S 178

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           F K           +IGCG + TG++   ++  G++GLG G +S+PS L  +  I   FS
Sbjct: 179 FPK----------TMIGCGYRNTGTFHGPSS--GIVGLGSGPMSLPSQLGTS--IGGKFS 224

Query: 163 ICFDE---NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSG 216
            C      N +  + FGD            PI +K DA   Y++ +E++ +GN  +   G
Sbjct: 225 YCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKK-DAQSGYYLTLEAFSVGNKLIEFGG 283

Query: 217 -------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                     L+DSG +FTFLP ++Y        + ++ + +     ++K CYN +
Sbjct: 284 PTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVA 339


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/335 (25%), Positives = 145/335 (43%), Gaps = 61/335 (18%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           Y P+ S++  NVSC  P+C++     S C      C Y   Y  + TS+ G L  +   L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTL 193

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S        +    V  GCG +  GS  + +   G++G+G G +   SL+++ G+ +  
Sbjct: 194 GS-------DTAVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR-- 238

Query: 161 FSIC---FDENDSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNS 210
           FS C   F+   +  +F G      +  ++T F+P        +   Y++ +E   +G++
Sbjct: 239 FSYCFTPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298

Query: 211 C---------LTQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--- 257
                     LT  G    ++DSG +FT L    +   V     L S  R+ L   +   
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEESAF---VALARALASRVRLPLASGAHLG 355

Query: 258 WKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
              C+ A+S E ++VP + L F       + +S+VV        E+    V CL ++S  
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVV--------EDRSAGVACLGMVSAR 407

Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G   ++G        I++D E   L++  +KC E+
Sbjct: 408 G-MSVLGSMQQQNTHILYDLERGILSFEPAKCGEL 441


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/329 (22%), Positives = 135/329 (41%), Gaps = 25/329 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +D   SS++  V CS P+C S      + C    + C Y   Y  + + +SG  V D
Sbjct: 128 LNFFDTVGSSTAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYE-DGSGTSGVYVSD 186

Query: 97  ILHLASFSKHAPQSSVQSS--VIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAK 153
            ++       +  ++V SS  ++ GC   Q+G       A DG++G G G++SV S L+ 
Sbjct: 187 AMYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSS 246

Query: 154 AGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
            G+    FS C   D N  G +  G+     + S  + P+      Y + ++S  +    
Sbjct: 247 RGITPKVFSHCLKGDGNGGGILVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQV 303

Query: 212 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
           L        T      ++DSG + ++L  E Y  +V   D  VS    S      + CY 
Sbjct: 304 LSINPAVFATSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-CYL 362

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
             +      P +   F    S  ++   ++ +    +G  ++C+          I+G   
Sbjct: 363 VLTSIDDSFPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLV 422

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
           +    +V+D    ++ W++  C   ++ S
Sbjct: 423 LKDKIVVYDLARQQIGWTNYDCSMSVNVS 451


>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/331 (24%), Positives = 135/331 (40%), Gaps = 34/331 (10%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           R L    P    SS  + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV
Sbjct: 82  RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 139

Query: 95  DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
            D+     FS +  Q   +   + +GCG  Q          DGV+GLG G VS+ S L  
Sbjct: 140 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 194

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
            G ++N    C      G +FFGD    + +  S+ P+  +Y  ++   +G E    G  
Sbjct: 195 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 252

Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS--- 265
                    + DSG+S+T+  ++ Y  V     + +S K +  +   ++   C+      
Sbjct: 253 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPF 312

Query: 266 -SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYG 315
            S E +K     L  S    +  +  +F  P      +      CL +++       +  
Sbjct: 313 MSIEEVKKYFKPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLN 371

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +IG   M    I++D E   + W    C+E+
Sbjct: 372 LIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 402


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 139/327 (42%), Gaps = 23/327 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGYLVDDI 97
            LS +DP  SSS+  VSCS   C S    +S   P   C Y   Y  + + +SG+ + D 
Sbjct: 127 QLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGFYISDF 185

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGL 156
           +   +        +  +  + GC   QTG       A DG+ GLG G +SV S LA  GL
Sbjct: 186 MSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGL 245

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
               FS C   + SG       G   +  T + P+      Y V ++S  +    L    
Sbjct: 246 APRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDP 304

Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                 +G   ++D+G +  +LP E Y+  +      VS     +   S++ C+  ++ +
Sbjct: 305 SVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ-CFEITAGD 363

Query: 269 MLKVPDMRLIFSKNQSFVVRNH----IFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMM 323
           +   P++ L F+   S V+R H    IFS   + G +++C+     +     I+G   + 
Sbjct: 364 VDVFPEVSLSFAGGASMVLRPHAYLQIFS---SSGSSIWCIGFQRMSHRRITILGDLVLK 420

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKS 350
              +V+D    ++ W+   C   ++ S
Sbjct: 421 DKVVVYDLVRQRIGWAEYDCSLEVNVS 447


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/330 (25%), Positives = 144/330 (43%), Gaps = 26/330 (7%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
            LS +DPSSSS++  VSCSHP+C S      + C    + C Y   Y  + + ++GY V 
Sbjct: 129 ELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYG-DGSGTTGYYVS 187

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D+L+  +    +  ++  +S++ GC   Q+G       A DG+ G G  D+SV S L+  
Sbjct: 188 DMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSL 247

Query: 155 GLIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           G+    FS C   E D G       G   + +  + P+      Y + ++S  +    L 
Sbjct: 248 GITPKVFSHCLKGEGDGGGKLV--LGEILEPNIIYSPLVPSQSHYNLNLQSISVNGQLLP 305

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYN 263
                  T +    +VDSG + T+L    Y   V      VSS    +  +GN    CY 
Sbjct: 306 IDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKGNQ---CYL 362

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMST-DGDYGIIGQN 320
            S+      P + L F+   S V++   ++     ++G  ++C+      +    I+G  
Sbjct: 363 VSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDL 422

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
            +     V+D  + ++ W++  C   ++ S
Sbjct: 423 VLKDKIFVYDLAHQRIGWANYDCSLSVNVS 452


>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/331 (24%), Positives = 135/331 (40%), Gaps = 34/331 (10%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           R L    P    SS  + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV
Sbjct: 94  RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 151

Query: 95  DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
            D+     FS +  Q   +   + +GCG  Q          DGV+GLG G VS+ S L  
Sbjct: 152 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 206

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
            G ++N    C      G +FFGD    + +  S+ P+  +Y  ++   +G E    G  
Sbjct: 207 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 264

Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS--- 265
                    + DSG+S+T+  ++ Y  V     + +S K +  +   ++   C+      
Sbjct: 265 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPF 324

Query: 266 -SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYG 315
            S E +K     L  S    +  +  +F  P      +      CL +++       +  
Sbjct: 325 MSIEEVKKYFKPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLN 383

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +IG   M    I++D E   + W    C+E+
Sbjct: 384 LIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 414


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/269 (24%), Positives = 126/269 (46%), Gaps = 33/269 (12%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +++P  SS+ + VSC+        +C + +  C Y   Y+ E +SSSG L +DI+   + 
Sbjct: 131 KFEPELSSTYQPVSCN-----IDCTCDNERKQCVYERQYA-EMSSSSGVLGEDIISFGNQ 184

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +     I GC  ++TG      A DG+MGLG GD+S+   L + G+I +SFS+
Sbjct: 185 SELVPQRA-----IFGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSL 238

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   D G    +  G   P+        P+  +Y  Y + +++  +    L        
Sbjct: 239 CYGGMDIGGGAMILGGISPPSGMVFAESDPVRSQY--YNIDLKAIHVAGKQLHLDPSIFD 296

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLK 271
                ++DSG ++ +LP   +        K ++S +  + G    Y   C++ +  ++ +
Sbjct: 297 GKHGTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLK-QIHGPDPNYNDICFSGAESDVSQ 355

Query: 272 V----PDMRLIFSKNQSFVV--RNHIFSF 294
           +    P + ++FS  Q   +   N++F +
Sbjct: 356 LSNTFPAVEMVFSNGQKLSLSPENYLFQY 384


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/328 (26%), Positives = 131/328 (39%), Gaps = 47/328 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYI-----ADYSTEDTS 88
           YDPS SSS K V C+   C+   +  S           +K PC Y+       Y+  D +
Sbjct: 175 YDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLA 234

Query: 89  SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
           S   L+ D   L +F             + GCGR   G +   +   G+       VS+ 
Sbjct: 235 SESILLGDT-KLENF-------------VFGCGRNNKGLFGGSSGLMGLG---RSSVSLV 277

Query: 149 SLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFV 200
           S   K       FS C    ++  SGS+ FG+       ST  S+ P+ +       Y +
Sbjct: 278 SQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYIL 335

Query: 201 GVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
            +    IG   L  S F    L+DSG   T LP  IY  V ++F K  S    +   +  
Sbjct: 336 NLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSIL 395

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGI 316
             C+N +S E + +P +++IF  N    V    +F F + +   V   L  +S + + GI
Sbjct: 396 DTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI 455

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           IG       R+++D    +L      C 
Sbjct: 456 IGNYQQKNQRVIYDTTQERLGIVGENCR 483


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 140/333 (42%), Gaps = 43/333 (12%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           D++   +DP +S S   V C+ PLC+   S  C   +  C Y   Y  + + ++G    +
Sbjct: 183 DQSGQMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATE 241

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L  AS ++  P+      V +GCG    G ++  A    ++GLG G +S PS +++   
Sbjct: 242 TLTFASGAR-VPR------VALGCGHDNEGLFVAAAG---LLGLGRGSLSFPSQISR--R 289

Query: 157 IQNSFSICFDENDSGS---------VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVES 204
              SFS C  +  S S         V FG        + SF P+ +       Y+V +  
Sbjct: 290 FGRSFSYCLVDRTSSSASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMG 349

Query: 205 YCIGNSCL-------------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
             +G + +             T  G   +VDSG S T L    YA +   F    +  R+
Sbjct: 350 ISVGGARVPGVAVSDLRLDPSTGRG-GVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRL 408

Query: 252 SLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
           S  G S +  CY+ S  +++KVP + + F+      +    +  P +   T FC     T
Sbjct: 409 SPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT 467

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           DG   IIG     G R+VFD +  +L +    C
Sbjct: 468 DGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 139/317 (43%), Gaps = 33/317 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA 101
           + P+ SS+   +SC    C+  S++SC +  + C Y   YS  D S + G L  +     
Sbjct: 149 FQPTRSSTYSQLSCQSNACQALSQASCDADSE-CQY--QYSYGDGSRTIGVLSTETF--- 202

Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           SF     +  V+   V  GC     G++      DG++GLG G  S+ S L     I   
Sbjct: 203 SFVDGGGKGQVRVPRVNFGCSTASAGTFRS----DGLVGLGAGAFSLVSQLGATTHIDRK 258

Query: 161 FSIC----FDENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCIGNSCLTQ 214
            S C    +D N S ++ FG +   ++   +  P +    D+Y+ V +ES  +G   +  
Sbjct: 259 LSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVAT 318

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---SSEEMLK 271
              + +VDSG + TFL   +   +V + ++ +  +R+       + CY+    S  +   
Sbjct: 319 HDSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFG 378

Query: 272 VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHR 326
           +PD+ L F    +  +R  + FS  + EG     L  +S      I+G    QNF +G  
Sbjct: 379 IPDVTLRFGGGAAVTLRPENTFSLLQ-EGTLCLVLVPVSESQPVSILGNIAQQNFHVG-- 435

Query: 327 IVFDRENLKLAWSHSKC 343
             +D +   + ++ + C
Sbjct: 436 --YDLDARTVTFAAADC 450


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 140/330 (42%), Gaps = 50/330 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP++S++   VSC   +C+    S+C   +   C Y   Y+ + + + G L  + L L 
Sbjct: 213 FDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYA-DGSYTKGALALETLTLG 271

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                    +    V+IGCG +  G ++  A   G+MGLG G +S+   L   G +  +F
Sbjct: 272 --------GTAVEGVVIGCGHRNRGLFVGAA---GLMGLGWGPMSLVGQL--GGEVGGAF 318

Query: 162 SICF----------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIG 208
           S C            ++D+G +  G +  A  +   ++P+     A   Y+VG+    +G
Sbjct: 319 SYCLASRGGYGSGAADDDAGWLVLG-RSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVG 377

Query: 209 NSCL-TQSG-FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
           +  L  Q+G FQ         ++D+G + T LP E YA +   F   ++      QG S 
Sbjct: 378 DERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSS 437

Query: 258 --WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD 313
                CY+ S    ++VP +   F  +   ++  RN +          ++CL    +   
Sbjct: 438 SVLDTCYDLSGYASVRVPTVSFCFDGDARLILAARNVLLEVD----MGIYCLAFAPSSSG 493

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             I+G     G +I  D  N  + +  + C
Sbjct: 494 LSIMGNTQQAGIQITVDSANGYIGFGPANC 523


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/319 (24%), Positives = 135/319 (42%), Gaps = 32/319 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP SSSS  N++C    C    S  C + +  C Y   Y+ +++ + G L  + L L S
Sbjct: 102 FDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYA-DNSITQGVLAQETLTLTS 160

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA-GLIQNSF 161
            +    +      +I GCG   +G + D     G++GLG G +S+ S +  + G   N F
Sbjct: 161 TTG---EPVAFQGIIFGCGHNNSG-FNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMF 214

Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVE------SYC 206
           S C      D + +  + FG         T   P+    G  Y A  +G+        + 
Sbjct: 215 SQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFS 274

Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
            G+S  T +    L+DSG + T+LP E Y  ++ +    V+ +   + G  ++ CY   +
Sbjct: 275 NGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG--YELCYQTPT 332

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
              L  P + + F      +    +F   +++    FC  V  T+ +Y   G      + 
Sbjct: 333 N--LNGPTLTIHFEGGDVLLTPAQMFIPVQDDN---FCFAVFDTNEEYVTYGNYAQSNYL 387

Query: 327 IVFDRENLKLAWSHSKCEE 345
           I FD E   +++  + C +
Sbjct: 388 IGFDLERQVVSFKATDCTK 406


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 133/319 (41%), Gaps = 33/319 (10%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           ++DPS S S +  +C+  LC   +    +C +  + C Y   Y  +  ++     + I  
Sbjct: 80  KFDPSKSRSFRKAACTDNLCNVSALPLKACAA--NVCQYQYTYGDQSNTNGDLAFETI-- 135

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
             S +  A   SV  +   GCG +  G++   A   G++GLG G +S+ S L+      N
Sbjct: 136 --SLNNGAGTQSV-PNFAFGCGTQNLGTF---AGAAGLVGLGQGPLSLNSQLSHT--FAN 187

Query: 160 SFSICFDENDSGS---VFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
            FS C    +S S   + FG    A   Q TS +        Y+V + S  +G   L   
Sbjct: 188 KFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLA 247

Query: 214 -------QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
                  QS  +   ++DSG + T L    Y+ V+  ++  V+  R+         C+N 
Sbjct: 248 PSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNI 307

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           +      VPDM   F +   F +R        +   T  CL +  + G + IIG      
Sbjct: 308 AGVSNPSVPDMVFKF-QGADFQMRGENLFVLVDTSATTLCLAMGGSQG-FSIIGNIQQQN 365

Query: 325 HRIVFDRENLKLAWSHSKC 343
           H +V+D E  K+ ++ + C
Sbjct: 366 HLVVYDLEAKKIGFATADC 384


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/328 (26%), Positives = 131/328 (39%), Gaps = 47/328 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYI-----ADYSTEDTS 88
           YDPS SSS K V C+   C+   +  S           +K PC Y+       Y+  D +
Sbjct: 175 YDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLA 234

Query: 89  SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
           S   L+ D   L +F             + GCGR   G +   +   G+       VS+ 
Sbjct: 235 SESILLGDT-KLENF-------------VFGCGRNNKGLFGGSSGLMGLG---RSSVSLV 277

Query: 149 SLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFV 200
           S   K       FS C    ++  SGS+ FG+       ST  S+ P+ +       Y +
Sbjct: 278 SQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYIL 335

Query: 201 GVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
            +    IG   L  S F    L+DSG   T LP  IY  V ++F K  S    +   +  
Sbjct: 336 NLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSIL 395

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGI 316
             C+N +S E + +P +++IF  N    V    +F F + +   V   L  +S + + GI
Sbjct: 396 DTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI 455

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           IG       R+++D    +L      C 
Sbjct: 456 IGNYQQKNQRVIYDSTQERLGIVGENCR 483


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/328 (26%), Positives = 131/328 (39%), Gaps = 47/328 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYI-----ADYSTEDTS 88
           YDPS SSS K V C+   C+   +  S           +K PC Y+       Y+  D +
Sbjct: 127 YDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLA 186

Query: 89  SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
           S   L+ D   L +F             + GCGR   G +   +   G+       VS+ 
Sbjct: 187 SESILLGDT-KLENF-------------VFGCGRNNKGLFGGSSGLMGLG---RSSVSLV 229

Query: 149 SLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFV 200
           S   K       FS C    ++  SGS+ FG+       ST  S+ P+ +       Y +
Sbjct: 230 SQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYIL 287

Query: 201 GVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
            +    IG   L  S F    L+DSG   T LP  IY  V ++F K  S    +   +  
Sbjct: 288 NLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSIL 347

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGI 316
             C+N +S E + +P +++IF  N    V    +F F + +   V   L  +S + + GI
Sbjct: 348 DTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI 407

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           IG       R+++D    +L      C 
Sbjct: 408 IGNYQQKNQRVIYDTTQERLGIVGENCR 435


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/326 (22%), Positives = 137/326 (42%), Gaps = 20/326 (6%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L  +D   S ++ +V+CS P+C S      + C S  + C Y   Y  + + +SGY + 
Sbjct: 143 DLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 200

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
           D  +  +    +  ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  
Sbjct: 201 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 260

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
           G+    FS C   + SG   F   G        + P+      Y + + S  +    L  
Sbjct: 261 GITPPVFSHCLKGDGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPL 319

Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
               F+A      +VD+G + T+L  E Y   +      VS     +  N  + CY  S+
Sbjct: 320 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG-EQCYLVST 378

Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
                 P + L F+   S ++R  +++F +   +G +++C+       +  I+G   +  
Sbjct: 379 SISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKD 438

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
              V+D    ++ W+   C   ++ S
Sbjct: 439 KVFVYDLARQRIGWASYDCSMSVNVS 464


>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
          Length = 654

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 78/322 (24%), Positives = 144/322 (44%), Gaps = 37/322 (11%)

Query: 45  YDPSSSSSSKNVSCS----HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +   +SS+  +V+CS    H  CK    C    D C     Y  E +S    +V+D+++L
Sbjct: 107 FQADNSSTLIHVTCSQQQSHFQCKE---CTEKSDTCAISQSY-MEGSSWKASVVEDVVYL 162

Query: 101 ---ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
              +SF   A +    +    GC   +TG ++   A DG+MGL   D  + + L +   I
Sbjct: 163 GGESSFHDEAMRDRYGTHFQFGCQSSETGLFVTQVA-DGIMGLSNSDTHIVAKLHRENKI 221

Query: 158 -QNSFSICFDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL 212
             N FS+CF EN  G++  G+    A +   S+  + +   A   Y V ++   IG   +
Sbjct: 222 PSNLFSLCFTEN-GGTMSVGEPNTKAHRGEISYAKVIKDRSAGHFYNVNMKDIRIGGKSI 280

Query: 213 TQ-----SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                  +    +VDSG + ++LP  +  E +  F ++    R    G S   C+  ++E
Sbjct: 281 NAKEEAYTRGHYIVDSGTTDSYLPRAMKNEFLQVFKEVAG--RDYQVGTS---CHGYTNE 335

Query: 268 EMLKVPDMRLIFSKNQSFVVRNH--IFSFPENEGF----TVFCLTVMSTDGDYGIIGQNF 321
           ++  +P ++L+    +++   N   I   P  +        +C ++  ++   G+IG N 
Sbjct: 336 DLASLPKIQLVM---EAYGDENGEVIIDIPPEQYLLHNDNSYCGSIYLSENAGGVIGANL 392

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
           MM   ++FD  N ++ +  + C
Sbjct: 393 MMNRDVIFDNGNQRVGFVDADC 414


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 86/335 (25%), Positives = 132/335 (39%), Gaps = 50/335 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           YDP  +   + V C  P C       + +C      C Y  DY  + +S+ G LV+D + 
Sbjct: 74  YDPKRA---RVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDY-VDGSSTMGILVEDTIT 129

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           L   +     +  Q+  +IGCG  Q G+     A  DGV+GL    +S+PS LA  G+  
Sbjct: 130 LVLTNG----TRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIAN 185

Query: 159 NSFSICF--DENDSGSVFFGDQ-GPATQQSTSFL---PIGEKYDAYFVGVESYCIGNSCL 212
           N    C     N  G +FFGD   PA   + + +   P+ E Y A    ++    G   L
Sbjct: 186 NVIGHCLAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIK---YGGEVL 242

Query: 213 TQSGFQ-----ALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYNA 264
              G       A+ DSG SFT+L    Y  V   VV+  +    +RI     +  +C+  
Sbjct: 243 ELEGTTDDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTD-TTLPFCWRG 301

Query: 265 SSEEMLKVPDMRLIFSK------NQSFVVRNHIFSFPENEGFTV------FCLTVMSTDG 312
            S     V D+   F          ++     +      EG+ +       CL V+    
Sbjct: 302 PS-PFESVADVSAYFKTVTLDFGGSTWWSSGKLLEL-SPEGYLIVSTQGNVCLGVLDASV 359

Query: 313 D----YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                  I+G   M G+ +V+D    ++ W    C
Sbjct: 360 ASLEVTNILGDISMRGYLVVYDNMREQIGWVRRNC 394


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 62  LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
           L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +        +   + GC 
Sbjct: 248 LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GREKLDFVFGCA 301

Query: 122 RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
             Q G  L   A  DG++GL    +S PS LA  G+I N F  C   ++   G +F GD 
Sbjct: 302 YDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDD 361

Query: 179 GPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVDSGASFTFLPT 232
               +   ++  I    D  Y         G+  L +     S  Q + DSG+S+T+LP 
Sbjct: 362 Y-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPN 420

Query: 233 EIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQ 283
           EIY  +V         F +  S + + L    WK  +     E +K     + L F K  
Sbjct: 421 EIYENLVAAIKYASPGFVQDTSDRTLPL---CWKADFPVRYLEDVKQFFEPLNLHFGKKW 477

Query: 284 SFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLK 335
            F+ +    S PE+          CL +++ T+ ++G   I+G   + G  +V+D +  +
Sbjct: 478 LFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQ 536

Query: 336 LAWSHSKC 343
           + W+ S C
Sbjct: 537 IGWADSDC 544


>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/322 (23%), Positives = 130/322 (40%), Gaps = 45/322 (13%)

Query: 55  NVSCSHPLCKS-RSSCK-----SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
            V C  PLC + R         S  DP  C Y   Y T    S G L  DI+ +    K 
Sbjct: 93  KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIISVNGRDK- 149

Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
                    +  GCG KQ        +P +G++GLG+G     + L    +I +N    C
Sbjct: 150 -------KRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHC 202

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDS 223
                 G ++ GD  P T+   ++ P+ E    Y  G+    I    +     F+A+ DS
Sbjct: 203 LSSKGKGVLYVGDFNPPTR-GVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDS 261

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS--------EEMLKVPD 274
           G+++T +P +IY E+V K     S   +  ++G +   C+            +   K   
Sbjct: 262 GSTYTHVPAQIYNEIVSKVRGTFSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALS 321

Query: 275 MRLIFSK---NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMG 324
           +++  ++   N     +N++F   + E     CL ++    D       + +IG   M  
Sbjct: 322 LKITHARGTNNLDIPPQNYLFVKEDGET----CLAILDASLDPVLKELNFILIGAVTMQD 377

Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
             +++D E  +L W  ++C+ V
Sbjct: 378 LFVIYDNEKKQLGWVRAQCDRV 399


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 62  LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
           L  +++ C++ K  C Y  +Y+ + +SS G L  D +H+ + +        +   + GC 
Sbjct: 248 LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GREKLDFVFGCA 301

Query: 122 RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
             Q G  L   A  DG++GL    +S PS LA  G+I N F  C   ++   G +F GD 
Sbjct: 302 YDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDD 361

Query: 179 GPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVDSGASFTFLPT 232
               +   ++  I    D  Y         G+  L +     S  Q + DSG+S+T+LP 
Sbjct: 362 Y-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPN 420

Query: 233 EIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQ 283
           EIY  +V         F +  S + + L    WK  +     E +K     + L F K  
Sbjct: 421 EIYENLVAAIKYASPGFVQDTSDRTLPL---CWKADFPVRYLEDVKQFFEPLNLHFGKKW 477

Query: 284 SFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLK 335
            F+ +    S PE+          CL +++ T+ ++G   I+G   + G  +V+D +  +
Sbjct: 478 LFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQ 536

Query: 336 LAWSHSKC 343
           + W+ S C
Sbjct: 537 IGWADSDC 544


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 127/314 (40%), Gaps = 49/314 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
           +DP++SSS   VSC   +C++ S             DYS    + + + G L  + L L 
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                   ++VQ  V IGCG + +G ++  A   G++GLG G +S+   L   G     F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVF 278

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC---------L 212
           S C     +G         A   ++SF         Y+VG+    +G            L
Sbjct: 279 SYCLASRGAGG--------AGSLASSF---------YYVGLTGIGVGGERLPLQDSLFQL 321

Query: 213 TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
           T+ G   +V D+G + T LP E YA +   FD  + +   S   +    CY+ S    ++
Sbjct: 322 TEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVR 381

Query: 272 VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           VP +   F +     +  RN +       G  VFCL    +     I+G     G +I  
Sbjct: 382 VPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQITV 437

Query: 330 DRENLKLAWSHSKC 343
           D  N  + +  + C
Sbjct: 438 DSANGYVGFGPNTC 451


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 139/325 (42%), Gaps = 33/325 (10%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +N   +DP  SS+ K V C    C     S+ +C      C Y   Y          LV 
Sbjct: 129 QNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHT------LVS 182

Query: 96  DILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            IL   S +  +  ++++   +  GC      +  +     G++GLG+G +S+ S L   
Sbjct: 183 GILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQ 242

Query: 155 GLIQNSFSICF---DENDSGSVFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESY 205
             I   FS CF     N +  + FG+     Q     ST  +   IG  Y  Y++ +E  
Sbjct: 243 --IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSY--YYLNLEGV 298

Query: 206 CIGNSCLTQSGFQA----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
            IGN  +  S  Q     L+DSG SFT L    Y + V    ++   + + +    + +C
Sbjct: 299 SIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFC 358

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQN 320
           +    +   + PD+  +F+  +  V  +++F   E E   + C+  + T D D  I G +
Sbjct: 359 FENKGKRK-RFPDVVFLFTGAKVRVDASNLF---EAEDNNLLCMVALPTSDEDDSIFGNH 414

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEE 345
             +G+++ +D +   ++++ + C +
Sbjct: 415 AQIGYQVEYDLQGGMVSFAPADCAK 439


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 74/326 (22%), Positives = 137/326 (42%), Gaps = 20/326 (6%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L  +D   S ++ +V+CS P+C S      + C S  + C Y   Y  + + +SGY + 
Sbjct: 148 DLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 205

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
           D  +  +    +  ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  
Sbjct: 206 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 265

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
           G+    FS C   + SG   F   G        + P+      Y + + S  +    L  
Sbjct: 266 GITPPVFSHCLKGDGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPL 324

Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
               F+A      +VD+G + T+L  E Y   +      VS     +  N  + CY  S+
Sbjct: 325 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG-EQCYLVST 383

Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
                 P + L F+   S ++R  +++F +   +G +++C+       +  I+G   +  
Sbjct: 384 SISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKD 443

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
              V+D    ++ W+   C   ++ S
Sbjct: 444 KVFVYDLARQRIGWASYDCSMSVNVS 469


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/320 (22%), Positives = 135/320 (42%), Gaps = 20/320 (6%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L  +D   S ++ +V+CS P+C S      + C S  + C Y   Y  + + +SGY + 
Sbjct: 143 DLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 200

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
           D  +  +    +  ++  + ++ GC   Q+G       A DG+ G G G +SV S L+  
Sbjct: 201 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 260

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
           G+    FS C   + SG   F   G        + P+      Y + + S  +    L  
Sbjct: 261 GITPPVFSHCLKGDGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPL 319

Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
               F+A      +VD+G + T+L  E Y   +      VS     +  N  + CY  S+
Sbjct: 320 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG-EQCYLVST 378

Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
                 P + L F+   S ++R  +++F +   +G +++C+       +  I+G   +  
Sbjct: 379 SISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKD 438

Query: 325 HRIVFDRENLKLAWSHSKCE 344
              V+D    ++ W+   C+
Sbjct: 439 KVFVYDLARQRIGWASYDCK 458


>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
          Length = 947

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/321 (23%), Positives = 136/321 (42%), Gaps = 27/321 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +D S S+SS  V+C    C     C+  K  C +   YS E +S   Y V+D+L +   +
Sbjct: 168 WDQSKSTSSHIVTCED--CHGSFRCQKDKR-CGFSQRYS-EGSSWRAYQVEDVLWVGELT 223

Query: 105 KHAPQ------SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI- 157
               +      S+     + GC   QTG +    A DG+MG+     ++   LAKAG I 
Sbjct: 224 LQQSEKINHDESAYSVEFMFGCIESQTGLFKTQLA-DGIMGMSADSHTLVWQLAKAGKIK 282

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIG------N 209
           + +FS+CF +N    V  G      +      + P  +    + V V    +       +
Sbjct: 283 ERTFSLCFGKNGGTMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQD 342

Query: 210 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
             + Q G   +VDSG + T+LP  +       +++   S   + + N   +C   +S E+
Sbjct: 343 PAIFQRGKGIIVDSGTTDTYLPRSVAKGFSAAWERATGSPYANCKDN--HFCMILTSAEL 400

Query: 270 LKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             +P + +         VR   ++ +  ++     +   +  T+   G++G N M+ H +
Sbjct: 401 EALPTVTIHMDGGLEVNVRPSGYMDALGKD---NAYAPRIYLTESMGGVLGANVMLDHNV 457

Query: 328 VFDRENLKLAWSHSKCEEVID 348
           VFD EN  + ++   C+   D
Sbjct: 458 VFDYENHLVGFAEGVCDYRAD 478


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 74/295 (25%), Positives = 122/295 (41%), Gaps = 28/295 (9%)

Query: 44  EYDPSSSSSSKNVSC--SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +Y P++S + ++  C  SHP      +   L   C Y   Y  ++T+  G L  +++   
Sbjct: 100 KYRPAASITYRDAMCEDSHPKSNPHFAFDPLTRICTYQQHY-LDETNIKGTLAQEMI--- 155

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +   H         V  GC     GSY  G    G++GLG+G  S+       G   + F
Sbjct: 156 TVDTHDGGFKRVHGVYFGCNTLSDGSYFTGT---GILGLGVGKYSI------IGEFGSKF 206

Query: 162 SICFDE----NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
           S C  E      S ++  GD G   Q   + + I E +  +   +ES  +G         
Sbjct: 207 SFCLGEISEPKASHNLILGD-GANVQGHPTVINITEGHTIF--QLESIIVGEEITLDDPV 263

Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
           Q  VD+G++ + L T +Y + V  FD L+ S+ +S +      CY A + E L+  D+  
Sbjct: 264 QVFVDTGSTLSHLSTNLYYKFVDAFDDLIGSRPLSYEPT---LCYKADTIERLEKMDVGF 320

Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFD 330
            F       V  H   F +     + CL + +    +   IIG   M G+ + +D
Sbjct: 321 KFDVGAELSVNIHNI-FIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYD 374


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 119/280 (42%), Gaps = 28/280 (10%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ YD  +S++S  V C    C         CK  L+  C Y   Y  + +S++GY V 
Sbjct: 121 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 177

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
           D +     S +   +    +V+ GCG KQ+G     + A DG++G G  + S+ S LA +
Sbjct: 178 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 237

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE--------KYDAYFVGVESYC 206
           G ++  FS C D  D G +F    G   +    FL +              Y V ++   
Sbjct: 238 GKVKKVFSHCLDNVDGGGIFA--IGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIE 295

Query: 207 IGNSCLT------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
           +G   L       +SG +   ++DSG +  + P E+Y  ++ K        R+     ++
Sbjct: 296 VGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF 355

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 298
             C++ +       P + L F K+ S  V  H + F   E
Sbjct: 356 T-CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKE 394


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/323 (23%), Positives = 137/323 (42%), Gaps = 35/323 (10%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P +SSS + VSC+ P C ++  C +    C Y   Y+ E +SS G L  D+L   + 
Sbjct: 143 RFKPDNSSSYQTVSCNSPDCITKM-CDARVHQCKYERVYA-EMSSSKGVLGKDLLGFGNG 200

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           S+  P       ++ GC   +TG  YL  A  DG+MGLG G +S+   L   G +++SFS
Sbjct: 201 SRLQPHP-----LLFGCETAETGDLYLQHA--DGIMGLGRGPLSIVDQLVGTGAMEDSFS 253

Query: 163 ICFDENDSG--SVFFGDQGPATQQSTSFLPIGEKYDAYF------VGVESYCIGNSCLTQ 214
           +C+   D G  S+  G   P    +  F         Y+      + V+   +       
Sbjct: 254 LCYGGMDEGGGSMVLGAIPPPP--AMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF 311

Query: 215 SG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY---CYNASSE 267
           +G    ++DSG ++ +LP + +      F   ++ +  SLQ   G    Y   C+  +  
Sbjct: 312 NGRLGTVLDSGTTYAYLPDKAFD----AFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGS 367

Query: 268 EMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
           +   +    P +  +FS NQ   +    + F   +    +CL          ++G   + 
Sbjct: 368 DSKALGKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVR 427

Query: 324 GHRIVFDRENLKLAWSHSKCEEV 346
              + +DR N ++ +  + C  +
Sbjct: 428 NTLVTYDRANHQIGFFKTNCTNL 450


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 82/310 (26%), Positives = 139/310 (44%), Gaps = 37/310 (11%)

Query: 62  LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
           L  +++ C++ K  C Y  +Y+ + +SS G L  D +HL + +        +   + GC 
Sbjct: 248 LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHLIATNG----GREKLDFVFGCA 301

Query: 122 RKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
             Q G  L   A  DG++GL    +S+PS LA  G+I N F  C   ++   G +F GD 
Sbjct: 302 YDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGYMFLGDD 361

Query: 179 GPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPT 232
               +   ++  I    D  Y         G+  L    Q+G   Q + DSG+S+T+LP 
Sbjct: 362 Y-VPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYTYLPD 420

Query: 233 EIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQ 283
           EIY  +V         F +  S + + L    WK  +     E +K     + L F K  
Sbjct: 421 EIYENLVAAIKYASPGFVQDSSDRTLPL---CWKADFPVRYLEDVKQFFKPLNLHFGKKW 477

Query: 284 SFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLK 335
            F+ +    S PE+          CL +++ T+ ++G   I+G   + G  +V+D +  +
Sbjct: 478 LFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRRQ 536

Query: 336 LAWSHSKCEE 345
           + W++S C +
Sbjct: 537 IGWTNSDCTK 546


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 82/338 (24%), Positives = 141/338 (41%), Gaps = 47/338 (13%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGY 92
           D+ L  +DPS+SS+    SC   LC+    +SC S K      C Y   Y  + + ++G+
Sbjct: 71  DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGF 129

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L  D           P       V  GCG    G +       G+ G G G +S+PS L 
Sbjct: 130 LEVDKFTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL- 180

Query: 153 KAGLIQNSFSICFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAY 198
           K G    +FS CF             +    +F   QG    Q+T  +   +       Y
Sbjct: 181 KVG----NFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLY 234

Query: 199 FVGVESYCIGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
           ++ ++   +G++          LT      ++DSG S T LP ++Y  V  +F   +   
Sbjct: 235 YLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLP 294

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM 308
            +         C++A S+    VP + L F      + R N++F  P++ G ++ CL + 
Sbjct: 295 VVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAIN 354

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
             D +  IIG        +++D +N  L++  ++C+++
Sbjct: 355 KGD-ETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 391


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 82/324 (25%), Positives = 140/324 (43%), Gaps = 37/324 (11%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P +SSS + + C    C +   C S    C Y   Y+ E ++S G L  D+L     
Sbjct: 92  RFKPENSSSYQKIGCRSSDCIT-GLCDSNSHQCKYERMYA-EMSTSKGVLGKDLLDFG-- 147

Query: 104 SKHAPQSSVQSSVI-IGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
               P S +QS ++  GC   ++G  YL  A  DG+MGLG G +S+   L   G I++SF
Sbjct: 148 ----PASRLQSQLLSFGCETAESGDLYLQVA--DGIMGLGRGPLSIVDQLVGNGAIEDSF 201

Query: 162 SICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIG-NSCLTQ 214
           S+C+   DE     V      P+        P    Y   +   + V+   +  +S +  
Sbjct: 202 SLCYGGMDEGGGSMVLGAIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDSNVFN 261

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNAS--- 265
             F  ++DSG ++ +LP   +      F   V ++  SLQ       N    CY  +   
Sbjct: 262 GKFGTILDSGTTYAYLPDRAFE----AFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTD 317

Query: 266 SEEMLK-VPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           ++E+ K  P +  +F++NQ  S    N++F   +  G   +CL          ++G   +
Sbjct: 318 TKELGKHFPLVDFVFAENQKVSLAPENYLFKHTKVPG--AYCLGFFKNQDATTLLGGIIV 375

Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
               + +DR N ++ +  + C E+
Sbjct: 376 RNMLVTYDRYNHQIGFLKTNCTEL 399


>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 467

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 127/325 (39%), Gaps = 33/325 (10%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +Y P+ ++    + CSH LC          C   +D C Y   YS +  SS G LV D  
Sbjct: 110 QYKPNHNT----LPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGYS-DHASSIGALVTDEF 164

Query: 99  HLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
            L    K A  S +   +  GCG  +Q           G++GLG G V + + L   G+ 
Sbjct: 165 PL----KLANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGIT 220

Query: 158 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           +N    C      G +  GD+  P++  + + L        Y  G       +      G
Sbjct: 221 KNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKG 280

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS-------- 266
              + DSG+S+T+   E Y  ++    K ++ K +  +    S   C+            
Sbjct: 281 INVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEV 340

Query: 267 EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNF 321
           ++  K   +R  + KN Q F V    +     +G    CL +++        Y I+G   
Sbjct: 341 KKYFKTITLRFGYQKNGQLFQVPPESYLIITEKGNV--CLGILNGTEVGLDSYNIVGDIS 398

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
             G  +++D E  ++ W  S C+++
Sbjct: 399 FQGIMVIYDNEKQRIGWISSDCDKI 423


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 82/326 (25%), Positives = 137/326 (42%), Gaps = 46/326 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP +SS+ ++ SC    C    K RS  K  K  C +   Y+ + + + G L  + L +
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKK--CTFRYSYA-DGSFTGGNLASETLTV 190

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S    A +         GCG    G +    +  G++GLG G++S+ S L     I   
Sbjct: 191 DS---TAGKPVSFPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQLKST--INGL 243

Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLT 213
           FS C      D + S  + FG  G  +   T   P+ +K     Y++ +E   +G   L 
Sbjct: 244 FSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLP 303

Query: 214 QSGF---------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
             G+           +VDSG ++TFLP E Y+++       +  KR+      +  CYN 
Sbjct: 304 YKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNT 363

Query: 265 SSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ---- 319
           ++E  +  P +   F   N      N      E+    + C TV  T  D G++G     
Sbjct: 364 TAE--INAPIITAHFKDANVELQPLNTFMRMQED----LVCFTVAPTS-DIGVLGNLAQV 416

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEE 345
           NF++G    FD    ++++  + C +
Sbjct: 417 NFLVG----FDLRKKRVSFKAADCTQ 438


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 77/304 (25%), Positives = 132/304 (43%), Gaps = 52/304 (17%)

Query: 75  PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 134
           PC Y  DY+ + +S++G+L  D    A+ S      +    V  GCG +  G    G   
Sbjct: 140 PCGYAYDYA-DGSSTTGFLARDT---ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTG- 194

Query: 135 DGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDENDSGS-------VFFGDQGPATQQST 186
            GV+GLG G +S P   A++G L   +FS C  + + G        +F G   P  + + 
Sbjct: 195 -GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLGR--PERRAAF 248

Query: 187 SFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTE 233
           ++ P+     A   Y+VGV +  +GN  L   G +           ++DSG++ T+L   
Sbjct: 249 AYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLG 308

Query: 234 IYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYNASSEEMLK-----VPDMRLIFSKNQ 283
            Y  +V  F   V   RI       QG   + CYN SS           P + + F++  
Sbjct: 309 AYLHLVSAFAASVHLPRIPSSATFFQG--LELCYNVSSSSSSAPANGGFPRLTIDFAQGL 366

Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWS 339
           S  +   N++    ++    V CL +  T   +   ++G     G+ + FDR + ++ ++
Sbjct: 367 SLELPTGNYLVDVADD----VKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFA 422

Query: 340 HSKC 343
            ++C
Sbjct: 423 RTEC 426


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 78/338 (23%), Positives = 140/338 (41%), Gaps = 48/338 (14%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDP---CPYIADYSTEDTSSSGY 92
           D+ L  +D S SS++  + C    CK   + + C  L      C Y   Y  +++ + G 
Sbjct: 71  DQPLPYFDTSRSSTNALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYG-DNSVTIGL 129

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L  D     +        +    V  GCG   TG +   +   G+ G G G +S+PS L 
Sbjct: 130 LAADKFTFVA-------GTSLPGVTFGCGLNNTGVF--NSNETGIAGFGRGPLSLPSQL- 179

Query: 153 KAGLIQNSFSICFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAY 198
           K G    +FS CF             +    +F   QG    Q+T  +   +       Y
Sbjct: 180 KVG----NFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLY 233

Query: 199 FVGVESYCIGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
           ++ ++   +G++          LT      ++DSG S T LP ++Y  V  +F   +   
Sbjct: 234 YLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLP 293

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM 308
            +         C++A S+    VP + L F      + R N++F  P++ G ++ CL + 
Sbjct: 294 VVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAIN 353

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
             D +  IIG        +++D +N  L++  ++C+++
Sbjct: 354 KGD-ETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 390


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 142/337 (42%), Gaps = 29/337 (8%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYS 83
           L W   V   +  + RN   +DP  S++ +N+SC   LC K  +   S +  C Y   Y+
Sbjct: 95  LTWTSCVPCNNCYKQRN-PMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYA 153

Query: 84  TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
           +   +  G L  + + L+S      +S     ++ GCG   TG + D     G++GLG G
Sbjct: 154 SAAITR-GVLAQETITLSSTKG---KSVPLKGIVFGCGHNNTGGFNDHEM--GIIGLGGG 207

Query: 144 DVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA- 197
            VS+ S +  +      FS C      D + S  + FG     + +     P+  K D  
Sbjct: 208 PVSLISQMGSS-FGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKT 266

Query: 198 -YFVGVESYCIGNSCLTQSG-------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
            YFV +    + N+ L  +G           +DSG   T LPT++Y +VV +    V+ K
Sbjct: 267 PYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMK 326

Query: 250 RISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
            ++   +   + CY   ++  L+ P +   F      +     F  P++    VFCL   
Sbjct: 327 PVTDDPDLGPQLCYR--TKNNLRGPVLTAHFEGADVKLSPTQTFISPKDG---VFCLGFT 381

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
           +T  D G+ G      + I FD +   +++    C +
Sbjct: 382 NTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDCTK 418


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 85/352 (24%), Positives = 138/352 (39%), Gaps = 63/352 (17%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLC--------KSR-SSCKSLKDPC-----PYIADY 82
           ++   +  + P  SSSSK + C +P C        +S+   C S    C     PY+  Y
Sbjct: 125 IKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQY 184

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
            +   S++G L+ + L         P        ++GC      S      P+G+ G G 
Sbjct: 185 GSG--STAGLLLSETLDF-------PNKKTIPDFLVGC------SIFSIKQPEGIAGFGR 229

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG-------PATQQSTSFL--PIGE 193
              S+PS L          S  FD+  + S    D G        A    T FL  P   
Sbjct: 230 SPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTA 289

Query: 194 KYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
             D Y+V + +  IG++ +          T      +VDSG +FTF+   +Y  V  +F+
Sbjct: 290 FRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFE 349

Query: 244 KLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEG 299
           K ++   ++ +  +    + CYN S E+ L VPD+   F       +  ++ FS  ++  
Sbjct: 350 KQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSG- 408

Query: 300 FTVFCLTVMSTDGDYG--------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             V CLT++S +            I+G        + FD EN K  +    C
Sbjct: 409 --VICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 79/334 (23%), Positives = 136/334 (40%), Gaps = 47/334 (14%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L  +D S+S +   V C+ P+C++          C Y  +Y  +++ + G L  D     
Sbjct: 132 LPRFDTSASDTVHGVLCTDPICRALRPHACFLGGCTYQVNYG-DNSVTIGQLAKDSF--- 187

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +F            ++ GCG+  TG++       G+ G G G +S+P  L  +     SF
Sbjct: 188 TFDGKGGGKVTVPDLVFGCGQYNTGNFHSNET--GIAGFGRGPLSLPRQLGVS-----SF 240

Query: 162 SICFD---ENDSGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
           S CF    E+ S  VF G            GP    ST FLP   +Y  Y++ ++   +G
Sbjct: 241 SYCFTTIFESKSTPVFLGGAPADGLRAHATGPIL--STPFLPNHPEY--YYLSLKGITVG 296

Query: 209 NSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GN 256
            + L   +S F          ++DSG + T  P  ++  +   F   V     S    G 
Sbjct: 297 KTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGE 356

Query: 257 SWKYCYNASS---EEMLKVPDMRL-IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
               C++  S      + VP M L +   +      N++  +P+++     C+ V++ D 
Sbjct: 357 PTLQCFSTESVPDASKVPVPKMTLHLEGADWELPRENYMAEYPDSD---QLCVVVLAGDD 413

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           D  +IG        IV D    KL    ++C+++
Sbjct: 414 DRTMIGNFQQQNMHIVHDLAGNKLVIEPAQCDKM 447


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 136/319 (42%), Gaps = 31/319 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP  SS+  N+SC  PLC K  +   S +  C Y   Y  +++ + G L  D    A+F
Sbjct: 110 FDPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYG-DNSLTKGVLAQDT---ATF 165

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSF 161
           + +  +    S  + GCG   TG + D     G++GLG G     SL+++ G +     F
Sbjct: 166 TSNTGKPVSLSRFLFGCGHNNTGGFNDHEM--GLIGLGGGPT---SLISQIGPLFGGKKF 220

Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGV------ESYCIG 208
           S C      D   S  + FG             P+   EK  +YFV +      ++Y   
Sbjct: 221 SQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPM 280

Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSE 267
           NS + ++    LVDSG     LP ++Y +V  +    V+ K I+   +   + CY   + 
Sbjct: 281 NSTIGKA--NMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN 338

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHR 326
             LK P +   F      +     F  P  +   +FCL + + T+ D G+ G      + 
Sbjct: 339 --LKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYL 396

Query: 327 IVFDRENLKLAWSHSKCEE 345
           I FD +   +++  + C +
Sbjct: 397 IGFDLDRQVVSFKPTDCTK 415


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 77/323 (23%), Positives = 140/323 (43%), Gaps = 42/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD--PCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP +SS+ ++ SC    C +  + +S ++   C ++  Y+ + + + G L  + L +AS
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYA-DGSFTGGNLAVETLTVAS 192

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
               A +         GC  +  G + + ++  G++GLG+ ++S+ S L     I   FS
Sbjct: 193 ---TAGKPVSFPGFAFGCVHRSGGIFDEHSS--GIVGLGVAELSMISQLKST--INGRFS 245

Query: 163 ICF-----DENDSGSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQ 214
            C      D + S  + FG  G  +   T   P+   G     Y + +E + +G   L+ 
Sbjct: 246 YCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSY 305

Query: 215 SGF---------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
            GF           +VDSG ++T+LP E Y ++       +  KR+         CYN +
Sbjct: 306 KGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTT 365

Query: 266 SEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----N 320
            ++ +  P +   F   N      N      E+    + C TV+ T  D GI+G     N
Sbjct: 366 VDQ-IDAPIITAHFKDANVELQPWNTFLRMQED----LVCFTVLPTS-DIGILGNLAQVN 419

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
           F++G    FD    ++++  + C
Sbjct: 420 FLVG----FDLRKKRVSFKAADC 438


>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
          Length = 642

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 86/356 (24%), Positives = 146/356 (41%), Gaps = 40/356 (11%)

Query: 53  SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ--- 109
           SK+ +  +  C    SC+S +    YI+    E +     +VD+++ +  FS  A +   
Sbjct: 140 SKSTTAKYLACHDFDSCRSCEQDRCYISQSYMEGSMWEAVMVDELVWVGGFSSPADEMEG 199

Query: 110 --SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFD 166
              +      +GC  K+TG ++     +G+MGLG    +V S +  AG + QN F++CF 
Sbjct: 200 VLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQNLFTLCF- 257

Query: 167 ENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCL------TQSGFQ 218
             D G + FG    +   S   + P+     AY+ V V+   +    L        SG  
Sbjct: 258 AGDGGELVFGGVDYSHHTSDVGYTPLLSDKSAYYPVHVKDILLNGVSLGIDTGTINSGRG 317

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRISLQGNSWKYCYNASSEEMLKVPD 274
            +VDSG + TF   +     +  F K      S  R+ L           +SEE+  +P 
Sbjct: 318 VIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYSESRMKL-----------TSEELAALPV 366

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMSTDGDYGIIGQNFMMGHRIV 328
           + +I S  +     +     P ++  T       +      ++   G++G + M+G  ++
Sbjct: 367 ISIILSGMKGDGTDDVQLDVPASQYLTPADDGKSYYGNFHFSERSGGVLGASAMVGFDVI 426

Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPN-PLPTTEQQSTSNGQAAAPP 383
           FD EN ++ ++ S C      S+     P A  S N P P T     SN      P
Sbjct: 427 FDVENKRVGFAESDCGR--SYSNATTAAPIASDSTNQPAPATPVSVDSNATEQPAP 480


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 40/331 (12%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD--PCPYIADYSTEDTSSSGYLVDDI 97
           +N   +DPS S++ KNV+CS P+C       S  D   C Y   Y  +D+ S G L  D 
Sbjct: 120 QNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYG-DDSHSQGNLAVDT 178

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           + + S S    +       +IGCG    G++   A   G++GLG G  S+ + L  A   
Sbjct: 179 VTMQSTSG---RPVAFPRTVIGCGHDNAGTF--NANVSGIVGLGRGPASLVTQLGPA--T 231

Query: 158 QNSFSICF------DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF-VGVESYCIG 208
              FS C         NDS  + FG     +   T   PI    +Y  ++ + +E+  +G
Sbjct: 232 GGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVG 291

Query: 209 NSCL------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
           ++        ++ G ++  ++DSG + T+LP+ +         + +S            Y
Sbjct: 292 DTKFNFPEGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDY 351

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGI 316
           C+ A++ +  ++P + + F      + R ++F    ++     CL   S   D    YG 
Sbjct: 352 CF-ATTTDDYEMPPVTMHFEGADVPLQRENLFVRLSDD---TICLAFGSFPDDNIFIYGN 407

Query: 317 IGQ-NFMMGHRIVFDRENLKLAWSHSKCEEV 346
           I Q NF++G    +D +NL +++  + C  V
Sbjct: 408 IAQSNFLVG----YDIKNLAVSFQPAHCGAV 434


>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 81/319 (25%), Positives = 138/319 (43%), Gaps = 47/319 (14%)

Query: 56  VSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQ 109
           V C  P+C S       C    D C Y  +Y+ +  SS G LV+D+  ++L S  +  P+
Sbjct: 117 VVCKDPICASLHPDNYRCDD-PDQCDYEVEYA-DGGSSIGVLVNDLFPVNLTSGMRARPR 174

Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
                 + IGCG  Q    L G A    DGV+GLG G  S+ + L+  GL++N    CF 
Sbjct: 175 ------LTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFS 224

Query: 167 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DS 223
               G +FFGD          + P+   Y  ++    +  I N     SG + L+   DS
Sbjct: 225 RRGGGYLFFGDD-IYDSSKVIWTPMSRDYLKHYTPGFAELILNG--RSSGLKNLLVVFDS 281

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFS- 280
           G+S+T+  T+ Y  ++    K +  K +  +++ ++   C+    +    + D +  F  
Sbjct: 282 GSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRG-KKPFKSIRDAKKYFKP 340

Query: 281 ---------KNQS-FVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHR 326
                    K +S F ++   +    ++G    CL +++       +Y IIG   M    
Sbjct: 341 LALSFGSGWKTKSQFEIQQESYLIISSKGSV--CLGILNGTEVGLQNYNIIGDISMQEKL 398

Query: 327 IVFDRENLKLAWSHSKCEE 345
           +++D E   + W  S C+ 
Sbjct: 399 VIYDNEKQVIGWQPSNCDR 417


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 95/340 (27%), Positives = 141/340 (41%), Gaps = 59/340 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           YDPS+SS+    SCS   C+S   S C S    C Y   Y  + +S+ G    + L L S
Sbjct: 46  YDPSASSTFAKTSCSTSSCQSLPASGCSSSAKTCIYGYQYG-DSSSTQGDFALETLTLRS 104

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S    +   GCGR  +GS+  GAA  G++GLG G +S+ + L  A  I N FS
Sbjct: 105 ---SGGSSKAFPNFQFGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFS 156

Query: 163 IC---FDENDSGS--VFFGDQGPATQQ--STSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
            C   FD++ S +  + FG          ST  +P   +   YFVG+E   +G   L+  
Sbjct: 157 YCLVDFDDDSSKTSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLA 216

Query: 214 ----------------------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
                                  SG   + DSG + T L   +Y++V   F   VS   +
Sbjct: 217 TRAIDFLSVRSKKKLRVRALEVNSG-GTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTV 275

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF-------TVFC 304
               + +  CY+ S  +  K P + L F        +   FS P+   F       TV C
Sbjct: 276 DASSSGFDLCYDVSKSKNFKFPALTLAF--------KGTKFSPPQKNYFVIVDTAETVAC 327

Query: 305 LTVMSTDGDYGIIGQNFM-MGHRIVFDRENLKLAWSHSKC 343
           L +  +      I  N M   + +V+DR    ++ S ++C
Sbjct: 328 LAMGGSGSLGLGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 90/301 (29%), Positives = 143/301 (47%), Gaps = 31/301 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS S++ K +  S   C+S   +SC S  +  C Y   Y  + + S G L  + L L 
Sbjct: 128 FDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYG-DGSYSQGDLSVETLTLG 186

Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQN 159
           S       SSV+    +IGCGR  T S+ +G +  G++GLG G VS +  L  ++  I  
Sbjct: 187 S----TNGSSVKFRRTVIGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGR 240

Query: 160 SFSICFDE--NDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCL--T 213
            FS C     N S  + FGD    +   T   PI   +    Y++ +E++ +GN+ +  T
Sbjct: 241 KFSYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFT 300

Query: 214 QSGFQ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
            S F+       ++DSG + T LP +IY+++      LV   R+         CY ++ +
Sbjct: 301 SSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTFD 360

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIG-QNFMMG 324
           E L  P +   FS     V  N + +F E E   V CL  +S+     +G +  QNF++G
Sbjct: 361 E-LNAPVIMAHFSGAD--VKLNAVNTFIEVEQ-GVTCLAFISSKIGPIFGNMAQQNFLVG 416

Query: 325 H 325
           +
Sbjct: 417 Y 417


>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
 gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
          Length = 478

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 77/337 (22%), Positives = 141/337 (41%), Gaps = 50/337 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           YD  +S+    V CS   C            C Y   Y  E + S GYLV D++ L    
Sbjct: 77  YDYDASADFSRVECS--ACAGIGGKCGTSGVCRYDVHY-LEGSGSEGYLVRDVVSLGG-- 131

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                S   ++V+ GC  ++ GS +   + DG+ G G    ++ + LA A +I + FS+C
Sbjct: 132 -----SVGNATVVFGCEERELGS-IKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMC 185

Query: 165 FDENDS------------GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
            +  +             G+  FG   PA      + P+      Y V   S+ +GNS +
Sbjct: 186 VEGYEKLSGEHVGGLLTLGNFDFGADAPAL----VYTPMVSSAMYYQVTTTSWTLGNSVV 241

Query: 213 TQS-GFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQ-------------GN 256
             S G   ++DSG S+T++P  ++A    +F +L   +++   L+             GN
Sbjct: 242 EGSRGVLTIIDSGTSYTYVPGNMHA----RFLQLAEDAARESGLEKVAPPEDYPDLCFGN 297

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
           S    ++  SE     P +++ +  +    +    + +   +  + FC+ ++  D +  +
Sbjct: 298 SGGLGWSTVSEYF---PALKIEYHGSARLTLSPETYLYWHQKNASAFCVGILEHDDNRIL 354

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
           +GQ  M      FD    ++  + + CE + +K   H
Sbjct: 355 LGQITMRNTFTEFDVARSQVGMASANCEMLREKYVEH 391


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 79/302 (26%), Positives = 130/302 (43%), Gaps = 27/302 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA 101
           + PS S++   +SC    C+  S++SC +  + C Y   Y+  D S + G L  +    A
Sbjct: 144 FHPSRSTTYSLLSCQSAACQALSQASCDADSE-CQY--QYAYGDGSRTIGVLSTETFSFA 200

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +             V  GC     GS+      DG++GLG G +S+ S L  A  I   F
Sbjct: 201 AAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAARIARRF 256

Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCI-GNSCLT 213
           S C        N S ++ FG +   +    +  P +  + D+Y+ V +ES  + G    +
Sbjct: 257 SYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVAS 316

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---SSEEML 270
            +  + +VDSG + TFL   +   +V + ++ +   R        + CY+    S  E  
Sbjct: 317 ANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDF 376

Query: 271 KVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGH 325
            +PD+ L F    S  +R  + FS  E EG     L  +S      I+G    QNF +G+
Sbjct: 377 GIPDVTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQNFHVGY 435

Query: 326 RI 327
            +
Sbjct: 436 DL 437


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 86/361 (23%), Positives = 144/361 (39%), Gaps = 53/361 (14%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSS 68
           N L C P      CL  F      D+    +DP +S+S +NV+C    C       +  +
Sbjct: 174 NWLQCAP------CLDCF------DQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRT 221

Query: 69  CKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
           C+S + DPCPY   Y  +  ++     D  L   + +  A  S     V++GCG +  G 
Sbjct: 222 CRSSRSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTASSSRRVDGVVLGCGHRNRGL 277

Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG---SVFFGDQGPATQQ 184
           +   A   G+    L   S   L A  G   ++FS C  ++ S     + FGD       
Sbjct: 278 FHGAAGLLGLGRGPLSFAS--QLRAVYG---HAFSYCLVDHGSAVGSKIVFGDDNVLLSH 332

Query: 185 S----TSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTF 229
                T+F P   +   Y+V ++   +G   L                  ++DSG + ++
Sbjct: 333 PQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSY 392

Query: 230 LPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFV 286
            P   Y  +   F D++  +  +         CYN S  E ++VP+  L+F+      F 
Sbjct: 393 FPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFP 452

Query: 287 VRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
             N+     + EG  + CL V+ T      IIG        +++D  + +L ++  +C E
Sbjct: 453 AENYFIRL-DTEG--IMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRCAE 509

Query: 346 V 346
           V
Sbjct: 510 V 510


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 83/355 (23%), Positives = 144/355 (40%), Gaps = 32/355 (9%)

Query: 5   ICFGSHANAYNALLCLPVTTLLW--CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPL 62
           + FG+ A  Y  L+    + + W  CL   G    Q   +  +DP+ S++   V C HP 
Sbjct: 124 VGFGTPAQTYT-LMFDTGSDVSWIQCLPCSGHCYKQHDPI--FDPTKSATYSAVPCGHPQ 180

Query: 63  CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
           C +     S    C Y   Y  + +S++G L  + L L S       +        GCG 
Sbjct: 181 CAAAGGKCSSNGTCLYKVQYG-DGSSTAGVLSHETLSLTS-------ARALPGFAFGCGE 232

Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPAT 182
              G + D    DG++GLG G +S+ S  A +     S+ +       G +  G   PA+
Sbjct: 233 TNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPAS 289

Query: 183 -QQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLP 231
                 +  + +K D    YFV + S  +G   L       T+ G   L+DSG   T+LP
Sbjct: 290 GSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDG--TLLDSGTVLTYLP 347

Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH- 290
            E Y  +  +F   ++  + +   + +  CY+ + +  + +P +   FS   SF +    
Sbjct: 348 PEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFG 407

Query: 291 IFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +  FP++      CL  +       + I+G        +++D    K+ +    C
Sbjct: 408 VLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 435

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 73/320 (22%), Positives = 131/320 (40%), Gaps = 27/320 (8%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCK--SLKDP--CPYIADYSTEDTSSSGYLVDDILHLAS 102
           P    S+  + C  PLC S       + +DP  C Y   Y+ +  S+ G L++D+ +L +
Sbjct: 115 PLYKPSNDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKYA-DQYSTLGVLLNDV-YLLN 172

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           F+       ++  + +GCG  Q  S       DG++GLG G  S+ S L   GL++N   
Sbjct: 173 FTNGV---QLKVRMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMG 229

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 222
            C      G +FFG+   +++ S + +   +    Y  G      G           + D
Sbjct: 230 HCLSSRGGGYIFFGNVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFD 289

Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------W--KYCYNASSEEMLKVPD 274
           +G+S+T+  ++ Y  ++   +K +  K I    +       W  K  + + +E       
Sbjct: 290 TGSSYTYFNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKP 349

Query: 275 MRLIFSK----NQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHR 326
           + L F+        F +    +    N G    CL +++      G+  +IG   M+   
Sbjct: 350 LTLSFTNGGRVKPQFEIPPEAYLIISNMGNV--CLGILNGPEVGLGELNLIGDISMLDKV 407

Query: 327 IVFDRENLKLAWSHSKCEEV 346
           +VFD E   + W  + C  V
Sbjct: 408 MVFDNEKQLIGWGPADCNSV 427


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 140/338 (41%), Gaps = 30/338 (8%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYS 83
           L W   V      + RN   +DP  S+S +N+SC   LC K  +   S +  C Y   Y+
Sbjct: 48  LTWTSCVPCNKCYKQRN-PIFDPQKSTSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYA 106

Query: 84  TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
           +    + G L  + + L+S      +S     ++ GCG   TG + D     G++GLG G
Sbjct: 107 SAAI-TQGVLAQETITLSSTKG---ESVPLKGIVFGCGHNNTGGFNDREM--GIIGLGGG 160

Query: 144 DVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA- 197
            VS  S +  +      FS C      D + S  +  G     + +     P+  K D  
Sbjct: 161 PVSFISQIGSS-FGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKT 219

Query: 198 -YFVGVESYCIGNSCLTQSG--------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
            YFV +    +GN+ L  +G            +DSG   T LPT++Y  +V +    V+ 
Sbjct: 220 PYFVTLLGISVGNTYLHFNGSSSQSVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAM 279

Query: 249 KRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
           K ++   +   + CY   ++  L+ P +   F      ++    F  P++    VFCL  
Sbjct: 280 KPVTNDLDLGPQLCYR--TKNNLRGPVLTAHFEGGDVKLLPTQTFVSPKDG---VFCLGF 334

Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
            +T  D G+ G      + I FD +   +++    C +
Sbjct: 335 TNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCTK 372


>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
          Length = 395

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 63/223 (28%), Positives = 98/223 (43%), Gaps = 14/223 (6%)

Query: 51  SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + +K V C   +C +       R  C S K  C Y   Y+ +  SS G LV D   L   
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159

Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            + A  S V+  +  GCG  +Q GS  + +A DGV+GLG G VS+ S L + G+ +N   
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218

Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
            C      G +FFGD   P ++ + + +      + Y  G  +   G   L     + + 
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           DSG+SFT+   + Y  +V      +S     +  +S   C+  
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKG 321


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 140/331 (42%), Gaps = 43/331 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI- 97
           YDP  SSS +N++C  P CK  SS      CK     CPY   Y     ++  + ++   
Sbjct: 234 YDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFT 293

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           ++L + +  + Q  V++ V+ GCG    G +   A    ++GLG G +S  S L    + 
Sbjct: 294 VNLTTPNGKSEQKHVEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQLQ--SIY 347

Query: 158 QNSFSICF-DENDSGSV----FFGDQGPATQQS----TSFLPIGEKYDA---YFVGVESY 205
            +SFS C  D N   SV     FG+            TSF+  GE+      Y+VG++S 
Sbjct: 348 GHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVG-GEENSVDTFYYVGIKSI 406

Query: 206 CIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
            +    L           + G   ++DSG + T+     Y  +   F K +    +    
Sbjct: 407 MVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGF 466

Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DG 312
              K CYN S  E +++PD  ++FS      F V N+      +    + CL ++ T   
Sbjct: 467 PPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQIEPD----LVCLAILGTPKS 522

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              IIG        I++D +  +L ++  KC
Sbjct: 523 ALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553


>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
 gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 466

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 130/329 (39%), Gaps = 31/329 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +Y P+ ++    + CSH LC          C   +D C Y   YS +  SS G LV D +
Sbjct: 109 QYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYS-DHASSIGALVTDEV 163

Query: 99  HLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
            L    K A  S +   +  GCG  +Q           G++GLG G V + + L   G+ 
Sbjct: 164 PL----KLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGIT 219

Query: 158 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           +N    C      G +  GD+  P++  + + L        Y  G       +      G
Sbjct: 220 KNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKG 279

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASS-------- 266
              + DSG+S+T+   E Y  ++    K ++ K ++      S   C+            
Sbjct: 280 INVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEV 339

Query: 267 EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEG---FTVFCLTVMSTDGDYGIIGQNFM 322
           ++  K   +R    KN Q F V    +     +G     +   T +  +G Y IIG    
Sbjct: 340 KKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEG-YNIIGDISF 398

Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
            G  +++D E  ++ W  S C+++ + +H
Sbjct: 399 QGIMVIYDNEKQRIGWISSDCDKLPNVNH 427


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 127/314 (40%), Gaps = 32/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  N+SC+ P C    +       C Y   Y  + + S G+   D L L+S+ 
Sbjct: 229 FDPARSSTDANISCAAPACSDLYTKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 286

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 287 ------DAIKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAH 334

Query: 164 CFDENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN-------SC 211
           CF    SG+ +  D GP +  +     T+ + +      Y+VG+    +G        S 
Sbjct: 335 CFPARSSGTGYL-DFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSV 393

Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEM 269
            T +G   +VDSG   T LP   Y+ +   F   ++++  + +   +    CY+ +    
Sbjct: 394 FTTAG--TIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQ 451

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           + +P + L+F    S  V      +  +             D D GI+G   +    +V+
Sbjct: 452 VAIPTVSLLFQGGASLDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVY 511

Query: 330 DRENLKLAWSHSKC 343
           D     + +S   C
Sbjct: 512 DIGKKVVGFSPGAC 525


>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 452

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 128/318 (40%), Gaps = 40/318 (12%)

Query: 56  VSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           V C   LC         +C S  D C Y  +Y+ +  SS G LV D +      +    S
Sbjct: 114 VQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYA-DHGSSLGVLVRDYIPF----QFTNGS 168

Query: 111 SVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
            V+  V  GCG  Q  S  +   A  GV+GLG G  S+ S L   GLI N    C     
Sbjct: 169 VVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSARG 228

Query: 170 SGSVFFGDQGPATQQ--STSFLP-IGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGAS 226
            G +FFGD    +     TS LP   EK+  Y  G              G + + DSG+S
Sbjct: 229 GGFLFFGDDFIPSSGIVWTSMLPSSSEKH--YSSGPAELVFNGKATVVKGLELIFDSGSS 286

Query: 227 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEEMLKVPDMRLIFS 280
           +T+  ++ Y  VV    + +  K++    +       WK   +  S   +K     L  S
Sbjct: 287 YTYFNSQAYQAVVDLVTQDLKGKQLKRATDDPSLPICWKGAKSFKSLSDVKKYFKPLALS 346

Query: 281 KNQSFVVRNHIFSFPENEGFTVF------CLTVMSTDG------DYGIIGQNFMMGHRIV 328
             ++ +++ H+      E + +       CL ++  DG      +  IIG   +    ++
Sbjct: 347 FTKTKILQMHL----PPEAYLIITKHGNVCLGIL--DGTEVGLENLNIIGDISLQDKMVI 400

Query: 329 FDRENLKLAWSHSKCEEV 346
           +D E  ++ W  S C+ +
Sbjct: 401 YDNEKQQIGWVSSNCDRL 418


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 78/295 (26%), Positives = 128/295 (43%), Gaps = 37/295 (12%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
           C Y  +Y+ + +SS G L  D LHL      A  S  +  ++ GC   Q G  L+  A  
Sbjct: 390 CDYEIEYA-DHSSSMGVLASDDLHLML----ANGSLTKLGIMFGCAYDQQGLLLNSLAKT 444

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI- 191
           DG++GL    VS+PS LA   +I N    C   D    G +F GD         +++P+ 
Sbjct: 445 DGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPML 503

Query: 192 ---GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLV 246
                 Y +  + +       S   Q G   + + D+G+S+T+ P E Y  +V    K V
Sbjct: 504 NSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDV 562

Query: 247 SSKRISLQGN--SWKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENE 298
           S + +   G+  +   C+ A    +  V D++  F       +++ ++V    F  P  E
Sbjct: 563 SDEGLIQDGSDPTLPVCWRAKF-PIRSVIDVKQFFQPLTLQFRSKWWIVSTK-FRIPP-E 619

Query: 299 GFTVF------CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           G+ +       CL ++      DG   I+G   + G  +V+D  N K+ W+ S C
Sbjct: 620 GYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 674


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 75/324 (23%), Positives = 133/324 (41%), Gaps = 45/324 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++PS S S + V C+   C+S          C S    C Y+ +Y  + + +SG +  + 
Sbjct: 106 FNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYG-DGSYTSGEVGMEH 164

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L+L +        +  ++ I GCGRK  G +       G++GLG  D+S+ S ++   + 
Sbjct: 165 LNLGN--------TTVNNFIFGCGRKNQGLF---GGASGLVGLGRTDLSLISQISP--MF 211

Query: 158 QNSFSICFDEND---SGSVFFGDQGPATQQSTS----------FLPIGEKYDAYFVGVES 204
              FS C    +   SGS+  G      + +T            LP       YF+ +  
Sbjct: 212 GGVFSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPF------YFLNLTG 265

Query: 205 YCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
             +G   +    F   + ++DSG   + LP  IY  +  +F K  S    +        C
Sbjct: 266 ITVGGVEVQAPSFGKDRMIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSC 325

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQ 319
           +N S  + +K+PD+++ F  +    V      +      +  CL + S   + + GIIG 
Sbjct: 326 FNLSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGN 385

Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
                 RI++D +   L ++   C
Sbjct: 386 YQQKNQRIIYDTKGSMLGFAEEAC 409


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 77/326 (23%), Positives = 133/326 (40%), Gaps = 21/326 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS++  +SCS   C      S + C S  + C Y   Y  + + +SGY V D
Sbjct: 127 LNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSD 185

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           +L+  +    +  +S  +S++ GC   QTG       A DG+ G G  D+SV S ++  G
Sbjct: 186 LLNFDAIVGSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQG 244

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +    FS C   +          G   ++   + P+      Y + ++S  +    L   
Sbjct: 245 ITPKVFSHCLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAID 303

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                T +    +VDSG +  +L  E Y   V    + VS     L     + CY  +S 
Sbjct: 304 PEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSS 362

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMMG 324
                P + L F+   S  ++   +   +N      V+C+      G    I+G   +  
Sbjct: 363 VKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKD 422

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
              V+D    ++ W++  C   ++ S
Sbjct: 423 KIFVYDLAGQRIGWANYDCSMSVNVS 448


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 137/326 (42%), Gaps = 41/326 (12%)

Query: 34  ASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----PCPYIADYSTEDTSS 89
           AS   ++ L  +DPS+SSS  ++ CS P C++   C    D    PC Y   Y  + + S
Sbjct: 121 ASACFNQTLPLFDPSASSSFASLPCSSPACETTPPCGGGNDATSRPCNYSISYG-DGSVS 179

Query: 90  SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
            G +  ++   AS +     ++V   ++ GCG    G +       G+ G G G +S+PS
Sbjct: 180 RGEIGREVFTFASGTGEGSSAAV-PGLVFGCGHANRGVFTSNET--GIAGFGRGSLSLPS 236

Query: 150 LLAKAGLIQNSFSICFDE---NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 206
            L K G    +FS CF     + + +V  G  G A   ++   P+G +  +Y        
Sbjct: 237 QL-KVG----NFSHCFTTITGSKTSAVLLGLPGVAPPSAS---PLGRRRGSY-------- 280

Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS- 265
               C +        +SG S T LP   Y  V  +F   V    +         C++A  
Sbjct: 281 ---RCRSTPRSS---NSGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPL 334

Query: 266 SEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGF----TVFCLTVMSTDGDYGIIGQN 320
                 VP M L F          N++F   +++       + CL V+  +G   I+G  
Sbjct: 335 RGPKPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVI--EGGEIILGNI 392

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
                 +++D +N KL++  ++C+++
Sbjct: 393 QQQNMHVLYDLQNSKLSFVPAQCDQL 418


>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Glycine max]
          Length = 454

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 126/316 (39%), Gaps = 36/316 (11%)

Query: 56  VSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           V C   LC         +C S  DPC Y  +Y+ +  SS G LV D +      +    S
Sbjct: 114 VQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEYA-DHGSSLGVLVRDYIPF----QFTNGS 168

Query: 111 SVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
            V+  V  GCG  Q  S  +   A  GV+GLG G  S+ S L   GLI+N    C     
Sbjct: 169 VVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQG 228

Query: 170 SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 228
            G +FFGD   P++    + +        Y  G              G + + DSG+S+T
Sbjct: 229 GGFLFFGDDFIPSSGIVWTSMLSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGSSYT 288

Query: 229 FLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEEMLKVPDMRLIFSKN 282
           +  ++ Y  VV    K +  K++    +       WK   +  S   +K     L  S  
Sbjct: 289 YFNSQAYQAVVDLVTKDLKGKQLKRATDDPSLPICWKGAKSFESLSDVKKYFKPLALSFK 348

Query: 283 QSFVVRNHIFSFPENEGFTVF------CLTVMSTDG------DYGIIGQNFMMGHRIVFD 330
           +S  ++ H+      E + +       CL ++  DG      +  IIG   +    +++D
Sbjct: 349 KSXNLQMHL----PPESYLIITKHGNVCLGIL--DGTEVGLENLNIIGDITLQDKMVIYD 402

Query: 331 RENLKLAWSHSKCEEV 346
            E  ++ W  S C+ +
Sbjct: 403 NEKQQIGWVSSNCDRL 418


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 81/325 (24%), Positives = 139/325 (42%), Gaps = 17/325 (5%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
            L+ +D SSSSSS  VSCS P+C S      + C +  + C Y   Y  + + +SGY V 
Sbjct: 122 QLNFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYG-DGSGTSGYYVS 180

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
           + ++       +  ++  +SV+ GC   Q+G       A DG+ G G GD+SV S L+  
Sbjct: 181 ESMYFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSAR 240

Query: 155 GLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIG 208
           G+    FS C   + N  G +  G+        +  +P    Y+ Y     V  ++  I 
Sbjct: 241 GITPKVFSHCLKGEGNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPID 300

Query: 209 NSCLTQS-GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
            S    S     ++DSG +  +L  E Y   V      V S+ ++   +    CY  S+ 
Sbjct: 301 PSVFATSINRGTIIDSGTTLAYLVEEAYTPFVSAITAAV-SQSVTPTISKGNQCYLVSTS 359

Query: 268 EMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
                P + L F+ + S V++   ++      +G  ++C+          I+G   M   
Sbjct: 360 VGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDK 419

Query: 326 RIVFDRENLKLAWSHSKCEEVIDKS 350
             V+D    ++ W+   C + ++ S
Sbjct: 420 IFVYDLARQRIGWASYDCSQAVNVS 444


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 83/311 (26%), Positives = 134/311 (43%), Gaps = 31/311 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +++PSSSS+ +NVSCS P+C+   SC +    C Y   Y  + + + G+L  +   L   
Sbjct: 174 KFNPSSSSTYQNVSCSSPMCEDAESCSA--SNCVYSIGYG-DKSFTQGFLAKEKFTLT-- 228

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FS 162
                 S V   V  GCG    G +      DGV GL        SL A+     N+ FS
Sbjct: 229 -----NSDVLEDVYFGCGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277

Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQS 215
            C   F  N +G + FG  G    +S  F PI     A+  G++     +G+    +T +
Sbjct: 278 YCLPSFTSNSTGHLTFGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPN 335

Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
            F    A++DSG  FT LPT++YAE+   F + +SS + +     +  CY+ +  + +  
Sbjct: 336 SFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTY 395

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
           P +   F+      +     S P     +  CL     D    I G        +V+D  
Sbjct: 396 PTIAFSFAGGTVVELDGSGISLPIK--ISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVA 453

Query: 333 NLKLAWSHSKC 343
             ++ ++ + C
Sbjct: 454 GGRVGFAPNGC 464


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 136/316 (43%), Gaps = 34/316 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-------CPYIADYSTEDTSSSGYLVDDI 97
           ++PS+S++ + + CS   C S     +L DP       C Y A Y  + + S GYL  D+
Sbjct: 163 FEPSASNTYRPLYCSSSEC-SLLKAATLNDPLCTASGVCVYTASYG-DASYSMGYLSRDL 220

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
           L L         S    S   GCG+   G +   A   G++GL    +S+ + L+ K G 
Sbjct: 221 LTLTP-------SQTLPSFTYGCGQDNEGLFGKAA---GIVGLARDKLSMLAQLSPKYGY 270

Query: 157 IQNSFSICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS--C 211
              +FS C   + S   G +  G   P++ + T  +   +    YF+ + +  +      
Sbjct: 271 ---AFSYCLPTSTSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVG 327

Query: 212 LTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEE 268
           +  +G+Q   ++DSG   T LP  IYA +   F K++S +       S    C+  S + 
Sbjct: 328 VAAAGYQVPTIIDSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKS 387

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
           M   P++R+IF       +R        ++G  + CL   S++    IIG +    + I 
Sbjct: 388 MSGAPEIRMIFQGGADLSLRAPNILIEADKG--IACLAFASSN-QIAIIGNHQQQTYNIA 444

Query: 329 FDRENLKLAWSHSKCE 344
           +D    K+ ++   C 
Sbjct: 445 YDVSASKIGFAPGGCR 460


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 83/379 (21%), Positives = 145/379 (38%), Gaps = 76/379 (20%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +L+ +D +SSS++  VSCS P+C      + S C S  + C Y   Y  + + +SGY V 
Sbjct: 114 DLNYFDTASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYG-DGSGTSGYYVY 172

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
           D ++       +  S+  S+V+ GC   Q+G       A DG+ G G G +SV S ++  
Sbjct: 173 DAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQ 232

Query: 155 GLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           G+    FS C     SG   +  G+        T  +P+   Y+   + ++S  +    L
Sbjct: 233 GMAPKVFSHCLKGQGSGGGILVLGEILEPNIVYTPLVPLQPHYN---LNLQSIAVNGQIL 289

Query: 213 --------TQSGFQALVDSGASFTFLPTEIY----------------------------- 235
                   T +    +VDSG +  +L  E Y                             
Sbjct: 290 PIDQDVFATGNNRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTHFNEPTNNIKYEDGN 349

Query: 236 ----------------AEVVVKFDKLVS------SKRISLQGNSWKYCYNASSEEMLKVP 273
                             +V+K   +++      SK I  +GN    CY   +      P
Sbjct: 350 NNHQSRVKRHYYDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQ---CYLVPTSLGDIFP 406

Query: 274 DMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
            + L F    S V++   ++  +   +G  ++C+        Y I+G   +     V+D 
Sbjct: 407 LVSLNFMGGASMVLKPEQYLIHYGFLDGAAMWCIGFQKVQKGYTILGDLVLKDKIFVYDL 466

Query: 332 ENLKLAWSHSKCEEVIDKS 350
            N ++ W+   C   ++ S
Sbjct: 467 ANQRIGWTDYDCSLAVNVS 485


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/323 (25%), Positives = 135/323 (41%), Gaps = 39/323 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++PS+S + K V CS   C        +  +C    + C Y A Y  + + S GYL  D+
Sbjct: 146 FNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDV 204

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L L         S   SS + GCG+   G +      DG++GL   ++S+ S L  +G  
Sbjct: 205 LTLT-------PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKY 252

Query: 158 QNSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCI 207
            N+FS C        +    G +  G        S  F P+ +  +    YF+ +ES  +
Sbjct: 253 GNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITV 312

Query: 208 GNSCL--TQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY 262
               L    S ++   ++DSG   T LPT +Y  +   +  ++S K     G S    C+
Sbjct: 313 AGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCF 372

Query: 263 NASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
             S   + +V PD+R+IF       ++ H        G T  CL  M+      IIG   
Sbjct: 373 KGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETGIT--CL-AMAGSSSIAIIGNYQ 429

Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
               ++ +D  N ++ ++   C+
Sbjct: 430 QQTVKVAYDVGNSRVGFAPGGCQ 452


>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
 gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
          Length = 420

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/349 (24%), Positives = 140/349 (40%), Gaps = 53/349 (15%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           R L    P    SS  + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV
Sbjct: 72  RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 129

Query: 95  DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
            D+     FS +  Q   +   + +GCG  Q          DGV+GLG G VS+ S L  
Sbjct: 130 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 184

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
            G ++N    C      G +FFGD    + +  S+ P+  +Y  ++   +G E    G  
Sbjct: 185 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 242

Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNA---- 264
                    + DSG+S+T+  ++ Y  V     + +S K +  +   ++   C+      
Sbjct: 243 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPF 302

Query: 265 -SSEEMLKV---------------------PDMRLIFSKNQSF-VVRNHIFSFPENEGFT 301
            S EE+ K                      P+  LI S   S  +++       + +G  
Sbjct: 303 MSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISVWFSHTMLKGRFIKMLQMKGNV 362

Query: 302 VFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
             CL +++       +  +IG   M    I++D E   + W    C+E+
Sbjct: 363 --CLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 409


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 77/326 (23%), Positives = 133/326 (40%), Gaps = 21/326 (6%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS++  +SCS   C      S + C S  + C Y   Y  + + +SGY V D
Sbjct: 112 LNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSD 170

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           +L+  +    +  +S  +S++ GC   QTG       A DG+ G G  D+SV S ++  G
Sbjct: 171 LLNFDAIVGSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQG 229

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +    FS C   +          G   ++   + P+      Y + ++S  +    L   
Sbjct: 230 ITPKVFSHCLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAID 288

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                T +    +VDSG +  +L  E Y   V    + VS     L     + CY  +S 
Sbjct: 289 PEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSS 347

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMMG 324
                P + L F+   S  ++   +   +N      V+C+      G    I+G   +  
Sbjct: 348 VKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKD 407

Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
              V+D    ++ W++  C   ++ S
Sbjct: 408 KIFVYDLAGQRIGWANYDCSMSVNVS 433


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 86/349 (24%), Positives = 146/349 (41%), Gaps = 55/349 (15%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLC---------KSRSSCKSLKDPCPYIADYSTEDTSSS 90
           R+   +DP++S S + V C   LC          S   C +    C Y   Y  +  +S+
Sbjct: 30  RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSAACTYSLSYG-DSRNST 88

Query: 91  GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
           G    D++ L S +  + Q+     V  GC     G  +D  +  G++G   G++S+PS 
Sbjct: 89  GDFSQDVIFLNS-TNSSSQAVQFRDVAFGCAHSPQGFLVDLGSL-GIVGFNRGNLSLPSQ 146

Query: 151 LAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGE------KYDAYF 199
           L K  L  + FS CF         +G +F GD G  ++   S+ P+ +      +   Y+
Sbjct: 147 L-KDRLGGSKFSYCFPSQPWQPRATGVIFLGDSG-LSKSKVSYTPLLDNPVTPARSQLYY 204

Query: 200 VGVESYCIGNSCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
           VG+ S  +    L   +S F+          ++DSG +FT +  + Y      F    +S
Sbjct: 205 VGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF---AAS 261

Query: 249 KRISLQGN-----SWKYCYNASSEEML-KVPDMRLIFSKNQSFVVR-NHIF---SFPENE 298
            R  L+        +  CYN S+   L  VP++RL    N    +R  H+F   S   NE
Sbjct: 262 NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNE 321

Query: 299 GFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                CL ++S+     G   ++G      + + +D E  ++ +  + C
Sbjct: 322 --VTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 76/321 (23%), Positives = 129/321 (40%), Gaps = 23/321 (7%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
           L  ++P SSS+S  + CS   C +          S  S   PC Y   Y  + + +SG+ 
Sbjct: 133 LEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYG-DGSGTSGFY 191

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLA 152
           V D ++  +   +   ++  +SV+ GC   Q+G  +    A DG+ G G   +SV S L 
Sbjct: 192 VSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLY 251

Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
             G+   +FS C   +D+G       G   +    F P+      Y + +ES        
Sbjct: 252 SLGVSPKTFSHCLKGSDNGGGIL-VLGEIVEPGLVFTPLVPSQPHYNLNLESIAVSGQKL 310

Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
            I +S    S  Q  +VDSG +  +L    Y   +      VS    S+     + C+  
Sbjct: 311 PIDSSLFATSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVT 369

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           +S      P   L F    S  V+  N++      +   ++C+    + G   I+G   +
Sbjct: 370 TSSVDSSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQG-ITILGDLVL 428

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
                V+D  N+++ W+   C
Sbjct: 429 KDKIFVYDLANMRMGWADYDC 449


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 141/338 (41%), Gaps = 40/338 (11%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYL 93
           +N   YDP  SSS KN+ C  P C   SS      CK+    CPY   Y     ++  + 
Sbjct: 229 QNGPYYDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFA 288

Query: 94  VDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           ++   ++L S +  +    V+ +V+ GCG    G +   A    ++GLG G +S  S L 
Sbjct: 289 LETFTVNLTSPAGKSEFKRVE-NVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL- 343

Query: 153 KAGLIQNSFSICF-----DENDSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVG 201
              L  +SFS C      D N S  + FG D+        +F  L  G++      Y+V 
Sbjct: 344 -QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQ 402

Query: 202 VESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
           ++S  +G   L           +     +VDSG + ++     Y  +   F K V    +
Sbjct: 403 IKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPV 462

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMS 309
                    CYN S  E +++P+ R++F      +F V N+       E   + CL ++ 
Sbjct: 463 IKDFPILDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEE---IVCLAILG 519

Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           T      IIG        I++D +  +L ++  KC +V
Sbjct: 520 TPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 557


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/323 (25%), Positives = 135/323 (41%), Gaps = 39/323 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++PS+S + K V CS   C        +  +C    + C Y A Y  + + S GYL  D+
Sbjct: 146 FNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDV 204

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L L         S   SS + GCG+   G +      DG++GL   ++S+ S L  +G  
Sbjct: 205 LTLT-------PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKY 252

Query: 158 QNSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCI 207
            N+FS C        +    G +  G        S  F P+ +  +    YF+ +ES  +
Sbjct: 253 GNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITV 312

Query: 208 GNSCL--TQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY 262
               L    S ++   ++DSG   T LPT +Y  +   +  ++S K     G S    C+
Sbjct: 313 AGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCF 372

Query: 263 NASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
             S   + +V PD+R+IF       ++ H        G T  CL  M+      IIG   
Sbjct: 373 KGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETGIT--CL-AMAGSSSIAIIGNYQ 429

Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
               ++ +D  N ++ ++   C+
Sbjct: 430 QQTVKVAYDVGNSRVGFAPGGCQ 452


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 124/312 (39%), Gaps = 30/312 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+SSSS   + C  P C++        D C Y   Y         Y V D    A+ +
Sbjct: 202 FDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYG-----DGSYTVGD---FATET 253

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                S     V IGCG    G ++  A   G+ G  L      SL ++  +  +SFS C
Sbjct: 254 VSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPL------SLTSQ--IKASSFSYC 305

Query: 165 F---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 219
               D  DS ++ F    P+   +       +    Y+VG+    +G   L    S F+ 
Sbjct: 306 LVNRDSVDSSTLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEV 365

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
                   +VD G + T L T+ Y  +   F KL      +     +  CYN SS   ++
Sbjct: 366 DGSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVR 425

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
           VP +  +F   +S  +    +  P +   T FCL    T     IIG     G R+ +D 
Sbjct: 426 VPTVAFLFDGGKSLPLPPSNYLIPVDSAGT-FCLAFAPTTASLSIIGNVQQQGTRVTYDL 484

Query: 332 ENLKLAWSHSKC 343
            N ++++S  KC
Sbjct: 485 ANSQVSFSSRKC 496


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 83/311 (26%), Positives = 135/311 (43%), Gaps = 31/311 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +++PSSSS+ +NVSCS P+C+   SC +    C Y   Y  + + + G+L  +   L   
Sbjct: 174 KFNPSSSSTYQNVSCSSPMCEDAESCSA--SNCVYSIVYG-DKSFTQGFLAKEKFTLT-- 228

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FS 162
                 S V   V  GCG    G +      DGV GL        SL A+     N+ FS
Sbjct: 229 -----NSDVLEDVYFGCGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277

Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQS 215
            C   F  N +G + FG  G    +S  F PI     A+  G++     +G+    +T +
Sbjct: 278 YCLPSFTSNSTGHLTFGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPN 335

Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
            F    A++DSG  FT LPT++YAE+   F + +SS + +     +  CY+ +  + +  
Sbjct: 336 SFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTY 395

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
           P +   F+ +    +     S P     +  CL     D    I G        +V+D  
Sbjct: 396 PTIAFSFAGSTVVELDGSGISLPIK--ISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVA 453

Query: 333 NLKLAWSHSKC 343
             ++ ++ + C
Sbjct: 454 GGRVGFAPNGC 464


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 59/250 (23%), Positives = 101/250 (40%), Gaps = 30/250 (12%)

Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----- 170
           V  GCG +  GS++      GV+GLG G +S  S    A   +N F+ C     S     
Sbjct: 149 VAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYA--FENKFAYCLTSYLSPTSVF 203

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGNSCL--TQSGFQ------- 218
            S+ FGD   +T     F P+         Y+V +   C G   L    S ++       
Sbjct: 204 SSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGNG 263

Query: 219 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
             + DSG + T+   + YA ++  F+K V   R          C N S  +    P   +
Sbjct: 264 GTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPIYPSFTI 323

Query: 278 IFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENL 334
            F +  ++     N+      N    + CL ++ +  D + +IG      + + +DRE  
Sbjct: 324 EFDQGATYRPNQGNYFIEVSPN----IDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEH 379

Query: 335 KLAWSHSKCE 344
           ++ ++H+ C+
Sbjct: 380 RIGFAHANCD 389


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 78/295 (26%), Positives = 128/295 (43%), Gaps = 37/295 (12%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
           C Y  +Y+ + +SS G L  D LHL      A  S  +  ++ GC   Q G  L+  A  
Sbjct: 177 CDYEIEYA-DHSSSMGVLASDDLHLML----ANGSLTKLGIMFGCAYDQQGLLLNSLAKT 231

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI- 191
           DG++GL    VS+PS LA   +I N    C   D    G +F GD         +++P+ 
Sbjct: 232 DGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPML 290

Query: 192 ---GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLV 246
                 Y +  + +       S   Q G   + + D+G+S+T+ P E Y  +V    K V
Sbjct: 291 NSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDV 349

Query: 247 SSKRISLQGN--SWKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENE 298
           S + +   G+  +   C+ A    +  V D++  F       +++ ++V    F  P  E
Sbjct: 350 SDEGLIQDGSDPTLPVCWRAKF-PIRSVIDVKQFFQPLTLQFRSKWWIVSTK-FRIPP-E 406

Query: 299 GFTVF------CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           G+ +       CL ++      DG   I+G   + G  +V+D  N K+ W+ S C
Sbjct: 407 GYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 461


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/346 (24%), Positives = 141/346 (40%), Gaps = 61/346 (17%)

Query: 46  DPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           DP++SS+   V C  P+C++       R      +  C Y+  Y  + + + G L  D  
Sbjct: 138 DPAASSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYG-DKSITVGKLASDRF 196

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
                         +  +  GCG    G +   A   G+ G G G  S+PS L       
Sbjct: 197 TFGPGDNADGGGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV----- 249

Query: 159 NSFSICFD---ENDSGSVFFGDQGPAT------QQSTSFLPIGEKYDAYFVGVESYCIGN 209
            SFS CF    E+ S  V  G   PA        QST  L    +   YF+ +++  +G 
Sbjct: 250 TSFSYCFTSMFESTSSLVTLG-VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGA 308

Query: 210 SCLTQSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
           + +     +       A++DSGAS T LP ++Y  V  +F   V     +++G++   C+
Sbjct: 309 TRIPIPERRQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCF 368

Query: 263 NASSEEM-----------------LKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFT 301
              S                    ++VP  RL+F      +      N++F   E+ G  
Sbjct: 369 ALPSAAAPKSAFGWRWRGRGRAMPVRVP--RLVFHLGGGADWELPRENYVF---EDYGAR 423

Query: 302 VFCLTV--MSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCE 344
           V CL +   +  GD  ++  N+   +  +V+D EN  L+++ ++CE
Sbjct: 424 VMCLVLDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARCE 469


>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
          Length = 427

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/325 (23%), Positives = 128/325 (39%), Gaps = 31/325 (9%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++Y P+ ++    + CSH LC          C   +D C Y   YS +  SS G LV D 
Sbjct: 103 TKYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYS-DHASSIGALVTDE 157

Query: 98  LHLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
           + L    K A  S +   +  GCG  +Q           G++GLG G V + + L   G+
Sbjct: 158 VPL----KLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGI 213

Query: 157 IQNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
            +N    C      G +  GD+  P++  + + L        Y  G       +      
Sbjct: 214 TKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVK 273

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS------- 266
           G   + DSG+S+T+   E Y  ++    K ++ K +  +    S   C+           
Sbjct: 274 GINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDE 333

Query: 267 -EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEG---FTVFCLTVMSTDGDYGIIGQNF 321
            ++  K   +R    KN Q F V    +     +G     +   T +  +G Y IIG   
Sbjct: 334 VKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEG-YNIIGDIS 392

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
             G  +++D E  ++ W  S C+++
Sbjct: 393 FQGIMVIYDNEKQRIGWISSDCDKL 417


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 99/206 (48%), Gaps = 23/206 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDIL 98
           Y P+++S    V C++ LC +  S     + CP      Y   Y T+  SS G L++D  
Sbjct: 97  YRPTANSL---VPCANALCTALHSGHGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-- 150

Query: 99  HLASFSKHAPQSSVQSSVIIGCGR-KQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKA 154
              +FS     S+++  +  GCG  +Q G   +GA   A DG++GLG G VS+ S L + 
Sbjct: 151 ---NFSLPMRSSNIRPGLTFGCGYDQQVGK--NGAVQAATDGMLGLGRGSVSLVSQLKQQ 205

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLT 213
           G+ +N    C   N  G +FFGD    T + T ++P+ +    Y+  G  +       L 
Sbjct: 206 GITKNVLGHCLSTNGGGFLFFGDDIVPTSRVT-WVPMAKISGNYYSPGSGTLYFDRRSLG 264

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVV 239
               + + DSG+++T+   + Y  VV
Sbjct: 265 VKPMEVVFDSGSTYTYFTAQPYQAVV 290


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 126/313 (40%), Gaps = 30/313 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  NVSC+ P C    +       C Y   Y  + + S G+   D L L+S+ 
Sbjct: 225 FDPARSSTYANVSCAAPACSDLYTRGCSGGHCLYSVQYG-DGSYSIGFFAMDTLTLSSY- 282

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 283 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 330

Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
           C     SG+ +  FG   PA   +    P+    G  +  Y+VG+    +G   L+  QS
Sbjct: 331 CLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQS 388

Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
            F     +VDSG   T LP   Y+ +   F   ++++  + +   +    CY+ +    +
Sbjct: 389 VFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEV 448

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
            +P + L+F       V      +  +             D D GI+G   +    +V+D
Sbjct: 449 AIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYD 508

Query: 331 RENLKLAWSHSKC 343
                + +S   C
Sbjct: 509 IGKKTVGFSPGAC 521


>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
 gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
          Length = 603

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 83/329 (25%), Positives = 134/329 (40%), Gaps = 64/329 (19%)

Query: 74  DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-A 132
           D C Y  +Y+ + +SS G L  D L L      A  S  + + I GC   Q G  L    
Sbjct: 264 DQCDYEIEYA-DHSSSMGVLATDKLLLMV----ANGSLTKLNFIFGCAYDQQGLLLKTLV 318

Query: 133 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLP 190
             DG++GL    VS+PS LA  G+I N    C   D    G +F GD     +   +++P
Sbjct: 319 KTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGDDF-VPRWGMAWVP 377

Query: 191 IGE--KYDAYFVGVESYCIGNSCLTQSGFQA-----LVDSGASFTFLPTEIYAEVVVKFD 243
           + +    + Y   V     G+S L+  G ++     L DSG+S+T+ P E Y+E+V   +
Sbjct: 378 MLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSELVASLN 437

Query: 244 KLVSSKRI-SLQGNSWKYCYNAS-------SEEMLKVP---------------------- 273
           ++  +  + S    +   C+ A+           L  P                      
Sbjct: 438 EVSGAGLVQSTSDTTLPLCWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRRRRRRQHI 497

Query: 274 --DMR-----LIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMS----TDGDYGI 316
             D++     L F     ++V +  F  P  EG+ +       CL ++      DG   I
Sbjct: 498 KGDVKKFFKTLTFQFGTKWLVISTKFRIPP-EGYLMMSDKGNVCLGILEGSKVHDGSTII 556

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
           +G   + G  +V+D  N K+ W+ S C +
Sbjct: 557 LGDISLRGQLVVYDNVNKKIGWTPSDCAK 585


>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
          Length = 424

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 128/312 (41%), Gaps = 33/312 (10%)

Query: 56  VSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQ 109
           V C  P+C         C+   + C Y  +Y+ +  SS G LV D+  L+  +  + AP+
Sbjct: 117 VICKDPMCAXLHPPGYKCEH-PEQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR 174

Query: 110 SSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 167
                 + +GCG  Q    SY      DGV+GLG G  S+ S L   G+I+N    C   
Sbjct: 175 ------LALGCGYDQIPGXSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSS 225

Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 227
           +  G +FFGD    + +      + +++  Y  G     +G             DSG+S+
Sbjct: 226 HGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSY 285

Query: 228 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSK-NQS 284
           T+L +  Y  +V    K +S K  R +L   +   C+         V D+R  F     S
Sbjct: 286 TYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRG-KRPFKSVRDVRKFFKPLALS 344

Query: 285 FVVRNHI---FSFPENEGFTV---FCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENL 334
           F         +  P      +    CL +++       D+ +IG   M    +V+D E  
Sbjct: 345 FAGGGRTKTQYDIPLESYLIISGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKN 404

Query: 335 KLAWSHSKCEEV 346
           ++ W+ + C+ +
Sbjct: 405 QIGWAPTNCDRL 416


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 78/342 (22%), Positives = 136/342 (39%), Gaps = 46/342 (13%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKS----------RSSCKSLKDPCPYIADYSTEDTSS 89
           + L   DP++SS+   + C  P C++          RSS  +    C YI  Y  + + +
Sbjct: 129 QGLPLLDPAASSTYAALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYG-DKSVT 187

Query: 90  SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
            G +  D       +           +  GCG    G +       G+ G G G  S+PS
Sbjct: 188 VGEIATDRFTFGGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNET--GIAGFGRGRWSLPS 245

Query: 150 LLAKAGLIQNSFSICFD---ENDSGSVFFGDQGPATQ-------------QSTSFLPIGE 193
            L        +FS CF    E+ S  V  G   PA               ++T  L    
Sbjct: 246 QLNV-----TTFSYCFTSMFESKSSLVTLGG-APAAALLYSHAAHISGEVRTTPLLKNPS 299

Query: 194 KYDAYFVGVESYCIGNSCLTQSGFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
           +   YF+ ++   +G + L     +    ++DSGAS T LP  +Y  V  +F   V    
Sbjct: 300 QPSLYFLSLKGISVGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPP 359

Query: 251 ISL-QGNSWKYCYNASSEEMLK---VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCL 305
             + +G++   C+      + +   VP + L        + R N++F   E+    V C+
Sbjct: 360 TGVVEGSALDLCFALPVTALWRRPPVPSLTLHLDGADWELPRGNYVF---EDLAARVMCV 416

Query: 306 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
            + +  GD  +IG        +V+D EN  L+++ ++C+ ++
Sbjct: 417 VLDAAPGDQTVIGNFQQQNTHVVYDLENDWLSFAPARCDSLV 458


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 110/243 (45%), Gaps = 43/243 (17%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           ++   ++ PS SS+ KN+ CS  LCKS                         G L  D L
Sbjct: 123 NQTTPKFKPSKSSTYKNIPCSSDLCKS----------------------GQQGNLSVDTL 160

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            L S + H P S  ++  +IGCG   T S+ +GA+  G++GLG G  S+ + L  +  I 
Sbjct: 161 TLESSTGH-PISFPKT--VIGCGTDNTVSF-EGAS-SGIVGLGGGPASLITQLGSS--ID 213

Query: 159 NSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSC 211
             FS C      + N +  + FGD    +       PI +K     Y++ +E++ +GN  
Sbjct: 214 AKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKR 273

Query: 212 LTQSGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           +   G          ++DSG + T +PT++Y  +     +LV  KR++     +  CY+ 
Sbjct: 274 IEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYSV 333

Query: 265 SSE 267
           +S+
Sbjct: 334 TSD 336


>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 432

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/324 (23%), Positives = 127/324 (39%), Gaps = 31/324 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +Y P+ ++    + CSH LC          C   +D C Y   YS +  SS G LV D +
Sbjct: 109 QYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYS-DHASSIGALVTDEV 163

Query: 99  HLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
            L    K A  S +   +  GCG  +Q           G++GLG G V + + L   G+ 
Sbjct: 164 PL----KLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGIT 219

Query: 158 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
           +N    C      G +  GD+  P++  + + L        Y  G       +      G
Sbjct: 220 KNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKG 279

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS-------- 266
              + DSG+S+T+   E Y  ++    K ++ K +  +    S   C+            
Sbjct: 280 INVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEV 339

Query: 267 EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEG---FTVFCLTVMSTDGDYGIIGQNFM 322
           ++  K   +R    KN Q F V    +     +G     +   T +  +G Y IIG    
Sbjct: 340 KKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEG-YNIIGDISF 398

Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
            G  +++D E  ++ W  S C+++
Sbjct: 399 QGIMVIYDNEKQRIGWISSDCDKL 422


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 140/330 (42%), Gaps = 40/330 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DPS SS+  +V CS P C      ++ C +    C Y   Y  E + + G L ++   L
Sbjct: 166 FDPSKSSTYVDVPCSAPECHIGGVQQTRCGATS--CEYSVKYGDE-SETHGSLAEETFTL 222

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           +  S  AP +   + V+ GC  +    + D G    G++GLG GD S+   L++     N
Sbjct: 223 SPPSPLAPAA---TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSI---LSQTRRSIN 276

Query: 160 S----FSICFDENDS--GSVFFGDQGPATQQ---STSFLP----IGEKYDAYFVGVESYC 206
           S    FS C     S  G +  G    A QQ   + SF P    I +   AY V +    
Sbjct: 277 SGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVS 336

Query: 207 IGNSC--LTQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 260
           +  +   +  S F   A++DSG   T +P   Y  +  +F   + S ++  +G+      
Sbjct: 337 VNGAAVDIPASAFSLGAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDT 396

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEG----FTVFCLTVMSTD-GD 313
           CY+ + ++++  P + L F       V     +   P  +G     T+ CL  + T+   
Sbjct: 397 CYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAG 456

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             I+G      + +VFD +  ++ +  + C
Sbjct: 457 LVIVGNMQQRAYNVVFDVDGGRIGFGPNGC 486


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 129/330 (39%), Gaps = 25/330 (7%)

Query: 23  TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 82
           TT + C     A   Q   L  +DP+SSS+  NVSC+ P C            C Y   Y
Sbjct: 206 TTWVQCQPCVVACYEQREKL--FDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY 263

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
             + + S G+   D L L+S+               GCG +  G + + A   G++GLG 
Sbjct: 264 G-DGSYSIGFFAMDTLTLSSY-------DAVKGFRFGCGERNDGLFGEAA---GLLGLGR 312

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFV 200
           G  S+P  +   G     F+ C     +G+ +  FG   P    +T  L  G     Y+V
Sbjct: 313 GKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTPML-TGNGPTFYYV 369

Query: 201 GVESYCIGNSCL--TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISL 253
           G+    +G   L    S F A   +VDSG   T LP   Y+ +   F   ++++  R + 
Sbjct: 370 GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAA 429

Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
             +    CY+ +    + +P + L+F    +  V      +  +              GD
Sbjct: 430 AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 489

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            GI+G   +    + +D     + +S   C
Sbjct: 490 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|281200780|gb|EFA74998.1| putative aspartyl protease [Polysphondylium pallidum PN500]
          Length = 394

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 158/365 (43%), Gaps = 58/365 (15%)

Query: 1   MLGAICFGSHANAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSC-- 58
           ++G   F    +  ++L+ +P+     C          DR    YDP+ S  SK VSC  
Sbjct: 46  IVGNHTFTVQVDTGSSLMAIPMVNCNTC---------HDR--PSYDPTHSQYSKVVSCFS 94

Query: 59  --------SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
                   + P CK+R+     +D C ++  Y  + +  SG +  D+++L+  S  A   
Sbjct: 95  EHCLGSGSAPPQCKNRA-----EDDCDFVILYG-DGSRVSGKIYQDVVNLSGLSGIAN-- 146

Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG-DVSVPSL---LAKAGLIQNSFSICFD 166
                   G  R +TG + +    DG++G G      VP++   L +A  ++N F++  D
Sbjct: 147 -------FGANRIETGDF-EYPRADGIVGFGRSCKTCVPTVFESLVQAHGLKNIFAMSMD 198

Query: 167 ENDSGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS--GFQALVDS 223
               G++  G+  P+       + P+ E    Y +   ++ + ++ +     G Q +VDS
Sbjct: 199 YEGRGTLSLGELNPSNHIGEIQYTPLFEDGPFYNIKPTNFKVDDTVILPRLLGRQVIVDS 258

Query: 224 GASFTFLPTEIYAEVVVKFDK-------LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
           G+S   L +  Y  +V  F K       +  S  I L G+    CYN++S   L +P + 
Sbjct: 259 GSSALSLASGAYDALVHHFRKNYCHVAGICDSPSI-LDGS---ICYNSASSLDL-LPTIY 313

Query: 277 LIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
           L F       V  +N++   P   G + +C  +   D    I+G  FM G+  VFD E  
Sbjct: 314 LTFEGGVKVAVPPKNYLTKAPLTNGASGYCWMIDRADPSTTILGDVFMRGYYTVFDNEEK 373

Query: 335 KLAWS 339
           ++ ++
Sbjct: 374 RIGFA 378


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 129/330 (39%), Gaps = 25/330 (7%)

Query: 23  TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 82
           TT + C     A   Q   L  +DP+SSS+  NVSC+ P C            C Y   Y
Sbjct: 203 TTWVQCQPCVVACYEQREKL--FDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY 260

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
             + + S G+   D L L+S+               GCG +  G + + A   G++GLG 
Sbjct: 261 G-DGSYSIGFFAMDTLTLSSY-------DAVKGFRFGCGERNDGLFGEAA---GLLGLGR 309

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFV 200
           G  S+P  +   G     F+ C     +G+ +  FG   P    +T  L  G     Y+V
Sbjct: 310 GKTSLP--VQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPATTTTPML-TGNGPTFYYV 366

Query: 201 GVESYCIGNSCL--TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISL 253
           G+    +G   L    S F A   +VDSG   T LP   Y+ +   F   ++++  R + 
Sbjct: 367 GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAA 426

Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
             +    CY+ +    + +P + L+F    +  V      +  +              GD
Sbjct: 427 AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 486

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            GI+G   +    + +D     + +S   C
Sbjct: 487 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 129/330 (39%), Gaps = 25/330 (7%)

Query: 23  TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 82
           TT + C     A   Q   L  +DP+SSS+  NVSC+ P C            C Y   Y
Sbjct: 202 TTWVQCQPCVVACYEQREKL--FDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY 259

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
             + + S G+   D L L+S+               GCG +  G + + A   G++GLG 
Sbjct: 260 G-DGSYSIGFFAMDTLTLSSY-------DAVKGFRFGCGERNDGLFGEAA---GLLGLGR 308

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFV 200
           G  S+P  +   G     F+ C     +G+ +  FG   P    +T  L  G     Y+V
Sbjct: 309 GKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTPML-TGNGPTFYYV 365

Query: 201 GVESYCIGNSCL--TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISL 253
           G+    +G   L    S F A   +VDSG   T LP   Y+ +   F   ++++  R + 
Sbjct: 366 GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAA 425

Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
             +    CY+ +    + +P + L+F    +  V      +  +              GD
Sbjct: 426 AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 485

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            GI+G   +    + +D     + +S   C
Sbjct: 486 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 440

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 135/329 (41%), Gaps = 51/329 (15%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
           P    S+  V C HPLC S     + +    +  DY  E     SS G LV+D+ ++ +F
Sbjct: 126 PLYRPSNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDV-YVLNF 184

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           +       ++  + +GCG  Q          DG++GLG G  S+ S L   GL++N    
Sbjct: 185 TNGV---QLKVRMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGH 241

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQALVD 222
           C      G +FFGD   +++   ++ P+  + Y  Y  G     +G          A+ D
Sbjct: 242 CLSAQGGGYIFFGDVYDSSR--LAWTPMSSRDYKHYSAGAAELVLGGKRTGFGNLLAVFD 299

Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
           +G+S+T+  +  Y     +  K ++ K I  + +  +   C+        K P  R ++ 
Sbjct: 300 AGSSYTYFNSNAY-----QLTKELAGKPIKEAPEDQTLPLCWYG------KRP-FRSVYE 347

Query: 281 KNQSFVVRNHIFSFPEN-----------EGFTVF------CLTVMSTDG------DYGII 317
             + F  +    SFP +           E + +       CL ++  DG      D  +I
Sbjct: 348 VKKYF--KPIALSFPGSRRSKAQFEIPPEAYLIISNMGNVCLGIL--DGSEVGVEDLNLI 403

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G   M+   +VFD E   + W+ + C  V
Sbjct: 404 GDISMLDKVMVFDNEKQLIGWTAADCNRV 432


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 81/334 (24%), Positives = 140/334 (41%), Gaps = 43/334 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI- 97
           YDP  SSS +N+SC  P C+  S+      CK+    CPY   Y     ++  + ++   
Sbjct: 239 YDPKDSSSFRNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFT 298

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           ++L + +  +    V+ +V+ GCG    G +   A   G+    L   S         L 
Sbjct: 299 VNLTTPNGTSELKHVE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLY 352

Query: 158 QNSFSICF-DENDSGSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYC 206
             SFS C  D N + SV     FG D+   +  + +F   G   D      Y+V ++S  
Sbjct: 353 GQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVM 412

Query: 207 IGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
           + +  L          ++     ++DSG + T+     Y  +   F + +   ++     
Sbjct: 413 VDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLP 472

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNH-IFSFPENEGFTVFCLTVMST-DG 312
             K CYN S  E +++PD  ++F+     +F V N+ I+  PE     V CL ++     
Sbjct: 473 PLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPE-----VVCLAILGNPRS 527

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              IIG        I++D +  +L ++  KC +V
Sbjct: 528 ALSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 561


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 127/313 (40%), Gaps = 31/313 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP+ SS+   V C+ P C+   SRS  +  K  C Y   Y  + + + G L  D L L 
Sbjct: 188 FDPARSSTYSAVPCASPECQGLDSRSCSRDKK--CRYEVVYG-DQSQTDGALARDTLTLT 244

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                  QS V    + GCG + TG  L G A DG++GLG   VS+ S  A        F
Sbjct: 245 -------QSDVLPGFVFGCGEQDTG--LFGRA-DGLVGLGREKVSLSSQAASK--YGAGF 292

Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YF-----VGVESYCIGNSCLT 213
           S C   + S + +    GPA   +  F  +  ++D+   Y+     V V    +  S + 
Sbjct: 293 SYCLPSSPSAAGYLSLGGPAPANA-RFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIV 351

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSWKYCYNASSEEML 270
            S    ++DSG   T LP  +YA +   F + +     KR     +    CY+ +    +
Sbjct: 352 FSAAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPAL-SILDTCYDFTGHTTV 410

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           ++P + L+F+   +  +      +                  D GIIG        +V+D
Sbjct: 411 RIPSVALVFAGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYD 470

Query: 331 RENLKLAWSHSKC 343
               K+ +  + C
Sbjct: 471 VARQKIGFGANGC 483


>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 401

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 96/219 (43%), Gaps = 18/219 (8%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           R L    P    SS  + C+ PLCK     S   C++  + C Y  +Y+ +  SS G LV
Sbjct: 91  RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 148

Query: 95  DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
            D+     FS +  Q   +   + +GCG  Q          DGV+GLG G VS+ S L  
Sbjct: 149 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 203

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
            G ++N    C      G +FFGD    + +  S+ P+  +Y  ++   +G E    G  
Sbjct: 204 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 261

Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
                    + DSG+S+T+  ++ Y  V     + +S K
Sbjct: 262 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGK 300


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 85/338 (25%), Positives = 139/338 (41%), Gaps = 38/338 (11%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
           ++N   YDP  SSS +N+ C    C   SS      CK+    CPY   Y     ++  +
Sbjct: 217 EQNGPHYDPGQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDF 276

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
            ++      + S   P+     +V+ GCG    G +   A    ++GLG G +S  S L 
Sbjct: 277 ALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQLQ 333

Query: 153 KAGLIQNSFSICF-----DENDSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVG 201
              L  +SFS C      D N S  + FG D+   +    +F  L  G++      Y+V 
Sbjct: 334 S--LYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQ 391

Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
           ++S  +G   +          T      ++DSG + ++     Y  +   F   V    +
Sbjct: 392 IKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPV 451

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMS 309
                  + CYN +  E   +PD  ++FS     +F V N+   F E E   V CL ++ 
Sbjct: 452 VKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENY---FIEIEPREVVCLAILG 508

Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           T      IIG        I++D +  +L ++ +KC +V
Sbjct: 509 TPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCADV 546


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 140/335 (41%), Gaps = 47/335 (14%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           D++   +DP  SSS   V C+ PLC+   S  C   +  C Y   Y  + + ++G    +
Sbjct: 176 DQSGPVFDPRRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYG-DGSVTAGDFATE 234

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L  A  ++ A        V +GCG    G ++  A    ++GLG G +S P+ +++   
Sbjct: 235 TLTFAGGARVA-------RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--R 282

Query: 157 IQNSFSICFDENDSG------------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVG 201
              SFS C  +  S             +V FG   P +  + SF P+         Y+V 
Sbjct: 283 YGKSFSYCLVDRTSSSSSGAASRSRSSTVTFG---PPSASAASFTPMVRNPRMETFYYVQ 339

Query: 202 VESYCIGNS---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
           +    +G +    + +S  +          +VDSG S T L    Y+ +   F    +  
Sbjct: 340 LVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGL 399

Query: 250 RISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
           R+S  G S +  CY+    +++KVP + + F+      +    +  P +   T FC    
Sbjct: 400 RLSPGGFSLFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFA 458

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            TDG   IIG     G R+VFD +  ++ ++   C
Sbjct: 459 GTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 493


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 133/315 (42%), Gaps = 35/315 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP++S++   V C   +C++ R+S       C Y   Y  + + + G L  + L L   
Sbjct: 169 FDPATSATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYG-DGSYTKGALALETLTLG-- 225

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                  +    V IGCG +  G ++  A   G++GLG G +S+   L  A     +FS 
Sbjct: 226 ------GTAVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSY 274

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC--------- 211
           C     +GS+  G +  A  +   ++P+     A   Y+VG+    +G+           
Sbjct: 275 CLASRGAGSLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQ 333

Query: 212 LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
           LT+ G   +V D+G + T LP E YA +   F   V +   +   +    CY+ S    +
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSV 393

Query: 271 KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
           +VP +   F    +  +  RN +    E +G  ++CL    +     I+G     G +I 
Sbjct: 394 RVPTVSFYFDGAATLTLPARNLLL---EVDG-GIYCLAFAPSSSGPSILGNIQQEGIQIT 449

Query: 329 FDRENLKLAWSHSKC 343
            D  N  + +  + C
Sbjct: 450 VDSANGYIGFGPTTC 464


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 129/320 (40%), Gaps = 44/320 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS SSSS+N+ C  P CK     +C + K  C +   Y      +S  L  D L LA 
Sbjct: 131 FDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKS-CGFNMTYGGSTIEAS--LTQDTLTLA- 186

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  + V  S   GC  K TG+ L      G+MGLG G +S+ S      L  ++FS
Sbjct: 187 -------NDVIKSYTFGCISKATGTSLPA---QGLMGLGRGPLSLIS--QTQNLYMSTFS 234

Query: 163 ICF----DENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
            C       N SGS+  G +  P   ++T  L    +   Y+V +    +GN  +     
Sbjct: 235 YCLPNSKSSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 294

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                  +G   + DSG  FT L    Y  V  +F + + +   +  G  +  CY+ S  
Sbjct: 295 ALAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLG-GFDTCYSGS-- 351

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMM 323
             +  P +  +F+     +  +++     +   +  CL + +   +      +I      
Sbjct: 352 --VVYPSVTFMFAGMNVTLPPDNLLI--HSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQ 407

Query: 324 GHRIVFDRENLKLAWSHSKC 343
            HR++ D  N +L  S   C
Sbjct: 408 NHRVLIDLPNSRLGISRETC 427


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 143/322 (44%), Gaps = 33/322 (10%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +++PS SSS KN+SCS  LC+S   +SC   K+ C Y  +Y  + + S G L  + L L 
Sbjct: 128 KFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKN-CEYSINYGNQ-SHSQGDLSLETLTLE 185

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV---PSLLAKAGLIQ 158
           S +   P S  ++  +IGCG    GS+   ++    +G G   +     PS+  K     
Sbjct: 186 S-TTGRPVSFPKT--VIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCL 242

Query: 159 NSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQ 214
              SI       GS  + FGD    +  +    PI +K  +  Y++ +E++ +G+  +  
Sbjct: 243 VRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEF 302

Query: 215 SGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
           +G          ++DS    TF+P+++Y ++      LV+ +R+      +  CYN SS+
Sbjct: 303 AGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSD 362

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMM 323
           E    P M   F      +   + F     +   V C     ++G   I G    Q+FM+
Sbjct: 363 EEYDFPYMTAHFKGADILLYATNTFVEVARD---VLCFAFAPSNGG-AIFGSFSQQDFMV 418

Query: 324 GHRIVFDRENLKLAWSHSKCEE 345
           G    +D +   +++    C E
Sbjct: 419 G----YDLQQKTVSFKSVDCTE 436


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 79/333 (23%), Positives = 135/333 (40%), Gaps = 41/333 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI- 97
           YDP  SSS +N+SC  P C+  SS      CK+    CPY   Y     ++  + ++   
Sbjct: 237 YDPKDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFT 296

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           ++L + +  +    V+ +V+ GCG    G +   A   G+    L   S         L 
Sbjct: 297 VNLTTPNGKSELKHVE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLY 350

Query: 158 QNSFSICF-DENDSGSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYC 206
             SFS C  D N + SV     FG D+   +  + +F   G   D      Y+V + S  
Sbjct: 351 GQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVM 410

Query: 207 IGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
           + +  L          ++     ++DSG + T+     Y  +   F + +    +     
Sbjct: 411 VDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLP 470

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGD 313
             K CYN S  E +++PD  ++F+     +F V N+      +    V CL ++      
Sbjct: 471 PLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDPD----VVCLAILGNPRSA 526

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
             IIG        I++D +  +L ++  KC +V
Sbjct: 527 LSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 559


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 147/359 (40%), Gaps = 55/359 (15%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLC---------KSRSSCKSLKDPCPYIADYSTEDTSSS 90
           R+   +DP++S S + V C   LC          S   C +    C Y   Y  +  +S+
Sbjct: 131 RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYG-DSRNST 189

Query: 91  GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
           G    D++ L S +  + Q+     V  GC     G  +D  +  G++G   G++S+PS 
Sbjct: 190 GDFSQDVIFLNS-TNSSGQAVQFRDVAFGCAHSPQGFLVDLGSL-GIVGFNRGNLSLPSQ 247

Query: 151 LAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGE------KYDAYF 199
           L K  L  + FS CF         +G +F GD G  ++    + P+ +      +   Y+
Sbjct: 248 L-KDRLGGSKFSYCFPSQPWQPRATGVIFLGDSG-LSKSKVGYTPLLDNPVTPARSQLYY 305

Query: 200 VGVESYCIGNSCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
           VG+ S  +    L   +S F+          ++DSG +FT +  + Y      F    +S
Sbjct: 306 VGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF---AAS 362

Query: 249 KRISLQGN-----SWKYCYNASSEEML-KVPDMRLIFSKNQSFVVR-NHIF---SFPENE 298
            R  L+        +  CYN S+   L  VP++RL    N    +R  H+F   S   NE
Sbjct: 363 NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNE 422

Query: 299 GFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
                CL ++S+     G   ++G      + + +D E  ++ +  + C        VH
Sbjct: 423 --VTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADCSGAAGSFLVH 479


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 128/318 (40%), Gaps = 40/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS SSSS+ + C  P CK     SC ++   C +   Y    ++   YL  D L LA 
Sbjct: 128 FDPSKSSSSRTLQCEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSTIEAYLTQDTLTLA- 183

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S V  +   GC  K +G+ L      G+MGLG G +S+ S      L Q++FS
Sbjct: 184 -------SDVIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFS 231

Query: 163 ICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
            C       N SGS+  G +  P   ++T  L    +   Y+V +    +GN  +     
Sbjct: 232 YCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 291

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                  +G   + DSG  +T L    Y  V  +F + V +   +  G  +  CY+ S  
Sbjct: 292 ALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG-GFDTCYSGS-- 348

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGH 325
             +  P +  +F+     +  +++         +   +    ++ +    +I       H
Sbjct: 349 --VVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNH 406

Query: 326 RIVFDRENLKLAWSHSKC 343
           R++ D  N +L  S   C
Sbjct: 407 RVLIDVPNSRLGISRETC 424


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 69/281 (24%), Positives = 111/281 (39%), Gaps = 23/281 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SS ++  +SCS   C      S S C    + C Y   Y  + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
           +L        +   +  + V+ GC   QTG  +    A DG+ G G   +SV S LA  G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243

Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           +    FS C   EN  G +     G   + +  F P+      Y V + S  +    L  
Sbjct: 244 IAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
                 T +G   ++D+G +  +L    Y   V      VS   + +  +GN    CY  
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL 305
           ++      P + L F+   S  +    +   +N   +  C 
Sbjct: 359 TTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVASALCF 399


>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
 gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 538

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 130/296 (43%), Gaps = 36/296 (12%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA-P 134
           C Y   Y+ + +SS G L  D + L +    A         + GCG  Q G+ L   A  
Sbjct: 233 CDYEITYA-DRSSSMGILARDNMQLIT----ADGERENLDFVFGCGYDQQGNLLSSPANT 287

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIG 192
           DG++GL    +S+P+ LA  G+I N F  C   D ++ G +F GD     +   +++PI 
Sbjct: 288 DGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFLGDDY-VPRWGMTWMPIR 346

Query: 193 E-KYDAYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
               + Y   V+    G+  L    ++G   Q + DSG+S+T+LP + Y  ++     L 
Sbjct: 347 NGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDSGSSYTYLPHDDYTNLIASLKSLS 406

Query: 247 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN-----EGFT 301
            S        +  +C   +   +  + D++ +F K  S V +  +F  P       E + 
Sbjct: 407 PSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KPLSLVFKKRLFILPRTFVIPPEDYL 464

Query: 302 V------FCLTVMSTDG-DYG-----IIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
           +       CL V+  DG + G     +IG   + G  +V++ +  ++ W  S C +
Sbjct: 465 IISDKNNICLGVL--DGTEIGHDSAIVIGDVSLRGKLVVYNNDEKQIGWVQSDCAK 518


>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
          Length = 538

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 130/296 (43%), Gaps = 36/296 (12%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA-P 134
           C Y   Y+ + +SS G L  D + L +    A         + GCG  Q G+ L   A  
Sbjct: 233 CDYEITYA-DRSSSMGILARDNMQLIT----ADGERENLDFVFGCGYDQQGNLLSSPANT 287

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIG 192
           DG++GL    +S+P+ LA  G+I N F  C   D ++ G +F GD     +   +++PI 
Sbjct: 288 DGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFLGDDY-VPRWGMTWMPIR 346

Query: 193 E-KYDAYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
               + Y   V+    G+  L    ++G   Q + DSG+S+T+LP + Y  ++     L 
Sbjct: 347 NGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDSGSSYTYLPHDDYTNLIASLKSLS 406

Query: 247 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN-----EGFT 301
            S        +  +C   +   +  + D++ +F K  S V +  +F  P       E + 
Sbjct: 407 PSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KPLSLVFKKRLFILPRTFVIPPEDYL 464

Query: 302 V------FCLTVMSTDG-DYG-----IIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
           +       CL V+  DG + G     +IG   + G  +V++ +  ++ W  S C +
Sbjct: 465 IISDKNNICLGVL--DGTEIGHDSAIVIGDVSLRGKLVVYNNDEKQIGWVQSDCAK 518


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 128/318 (40%), Gaps = 40/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS SSSS+ + C  P CK     SC ++   C +   Y    ++   YL  D L LA 
Sbjct: 128 FDPSKSSSSRTLQCEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSTIEAYLTQDTLTLA- 183

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S V  +   GC  K +G+ L      G+MGLG G +S+ S      L Q++FS
Sbjct: 184 -------SDVIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFS 231

Query: 163 ICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
            C       N SGS+  G +  P   ++T  L    +   Y+V +    +GN  +     
Sbjct: 232 YCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 291

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                  +G   + DSG  +T L    Y  V  +F + V +   +  G  +  CY+ S  
Sbjct: 292 ALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG-GFDTCYSGS-- 348

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGH 325
             +  P +  +F+     +  +++         +   +    ++ +    +I       H
Sbjct: 349 --VVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNH 406

Query: 326 RIVFDRENLKLAWSHSKC 343
           R++ D  N +L  S   C
Sbjct: 407 RVLIDVPNSRLGISRETC 424


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 76/292 (26%), Positives = 118/292 (40%), Gaps = 31/292 (10%)

Query: 69  CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
           C +    C Y A Y   DTS S GYL  D+L L       P ++  S  + GCG+   G 
Sbjct: 181 CSNATGACVYKASYG--DTSFSIGYLSQDVLTLT------PSAAPSSGFVYGCGQDNQGL 232

Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFG-DQGP 180
           +   A   G++GL    +S+   L+      N+FS C         N S S F       
Sbjct: 233 FGRSA---GIIGLANDKLSMLGQLSNK--YGNAFSYCLPSSFSAQPNSSVSGFLSIGASS 287

Query: 181 ATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPTE 233
            +     F P+ +       YF+G+ +  +    L  S        ++DSG   T LP  
Sbjct: 288 LSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVA 347

Query: 234 IYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 292
           IY  +   F  ++S K     G S    C+  S +EM  VP++R+IF       ++ H  
Sbjct: 348 IYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNS 407

Query: 293 SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
                +G T  CL + ++     IIG        + +D  N K+ ++   C+
Sbjct: 408 LVEIEKGTT--CLAIAASSNPISIIGNYQQQTFTVAYDVANSKIGFAPGGCQ 457


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 125/317 (39%), Gaps = 38/317 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  N+SC+ P C    +       C Y   Y  + + S G+   D L L+S+ 
Sbjct: 223 FDPARSSTYANISCAAPACSDLDTRGCSGGNCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 280

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 281 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 328

Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
           C     SG+ +  FG   PA   +    P+    G  +  Y+VG+    +G   L+  QS
Sbjct: 329 CLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQS 386

Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASS 266
            F     +VDSG   T LP   Y+ +   F   ++++       +SL       CY+ + 
Sbjct: 387 VFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSL----LDTCYDFTG 442

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
              + +P + L+F       V      +  +              GD GI+G   +    
Sbjct: 443 MSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFG 502

Query: 327 IVFDRENLKLAWSHSKC 343
           + +D     + +S   C
Sbjct: 503 VAYDIGKKVVGFSPGAC 519


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 134/323 (41%), Gaps = 44/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS SS+   ++CS   C       +C +  + C Y   Y  + + + GY         
Sbjct: 67  FDPSKSSTYNKIACSSSACADLLGTQTCSAAAN-CIYAYGYG-DGSVTRGY--------- 115

Query: 102 SFSKHA--PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
            FSK       +    V  G     TG++ D    +G++GLG G VS+PS L    ++ N
Sbjct: 116 -FSKETITATDTAGEEVKFGASVYNTGTFGDTGG-EGILGLGQGPVSMPSQLGS--VLGN 171

Query: 160 SFSICFDE-----NDSGSVFFGDQG-PATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
            FS C  +     +++ +++FGD   P+ + Q T  +P  +    Y++ V+   +G S L
Sbjct: 172 KFSYCLVDWLSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLL 231

Query: 213 --TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
              QS ++         ++DSG + T+L  E++  +V  +   V     +        C+
Sbjct: 232 DIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTT-SATGLDLCF 290

Query: 263 NASSEEMLKVPDMRL-IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQN 320
           N         P M + +   +      N   S   N    + CL   S  D    I G  
Sbjct: 291 NTRGTGSPVFPAMTIHLDGVHLELPTANTFISLETN----IICLAFASALDFPIAIFGNI 346

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
                 IV+D +N+++ ++ + C
Sbjct: 347 QQQNFDIVYDLDNMRIGFAPADC 369


>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 440

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 129/328 (39%), Gaps = 43/328 (13%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
           P    S+  V C H LC S     +     P+  DY  +     SS G L+ D+  L +F
Sbjct: 120 PLYRPSNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NF 178

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           +       ++  + +GCG  Q          DG++GLG G  S+ S L   GL++N    
Sbjct: 179 TNGV---QLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGH 235

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLTQSGFQALV 221
           C      G +FFGD   + +   ++ P+  + Y  Y V G      G          A+ 
Sbjct: 236 CLSAQGGGYIFFGDVYDSFR--LTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVF 293

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           D+G+S+T+  +  Y  ++    K    K +  +    +   C+             R I+
Sbjct: 294 DTGSSYTYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRP-------FRSIY 346

Query: 280 SKNQSFVVRNHIFSFPEN-----------EGFTV------FCLTVMSTD----GDYGIIG 318
              + F  +  + SF  N           E + +       CL +++      GD  +IG
Sbjct: 347 EVRKYF--KPIVLSFTSNGRSKAQFEMLPEAYLIVSNMGNVCLGILNGSEVGMGDLNLIG 404

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              M+   +VFD +   + W+ + C++V
Sbjct: 405 DISMLNKVMVFDNDKQLIGWAPADCDQV 432


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 78/322 (24%), Positives = 130/322 (40%), Gaps = 51/322 (15%)

Query: 56  VSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQ 109
           V C  P+C S       C+   + C Y  +Y+ +  SS G LV D+  L+  +  + AP+
Sbjct: 117 VICKDPMCASLHPPGYKCEH-PEQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR 174

Query: 110 SSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 167
                 + +GCG  Q    SY      DGV+GLG G  S+ S L   G+I+N    C   
Sbjct: 175 ------LALGCGYDQIPGQSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSS 225

Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 227
              G +FFGD    + +      + +++  Y  G     +G             DSG+S+
Sbjct: 226 RGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSY 285

Query: 228 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 285
           T+L +  Y  +V    K +S K  R +L   +   C+         V D++  F      
Sbjct: 286 TYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRG-KRPFKSVRDVKKFF------ 338

Query: 286 VVRNHIFSFPEN-----------EGFTVF------CLTVMSTD----GDYGIIGQNFMMG 324
             +    SFP             E + +       CL +++       D+ +IG   M  
Sbjct: 339 --KPLALSFPGGGRTKTQYDIPLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQD 396

Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
             +V+D E  ++ W+ + C+ +
Sbjct: 397 KMVVYDNEKNQIGWAPTNCDRL 418


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 142/314 (45%), Gaps = 58/314 (18%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR--SSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS S + KN+ CS   CKS   +SC S  +  C +  +Y  + + S G L+ + + L 
Sbjct: 130 FDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYK-DGSHSQGDLIVETVTLG 188

Query: 102 SFSK---HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           S++    H P++      +IGC R    S+       G++GLG G VS+   L+ +  I 
Sbjct: 189 SYNDPFVHFPRT------VIGCIRNTNVSF----DSIGIVGLGGGPVSLVPQLSSS--IS 236

Query: 159 NSFSICFD--ENDSGSVFFGD----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
             FS C     + S  + FGD     G  T  +       +K+  Y++ +E++ +GN+ +
Sbjct: 237 KKFSYCLAPISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKF--YYLTLEAFSVGNNRI 294

Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
                   +      ++DSG +FT LP ++Y+++      +V  +R       +  CY  
Sbjct: 295 EFRSSSSRSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYK- 353

Query: 265 SSEEMLKVPDMRLIFSKN-------QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           S+ + + VP +   FS          +F+V +H           V CL  +S+     I 
Sbjct: 354 STYDKVDVPVITAHFSGADVKLNALNTFIVASH----------RVVCLAFLSSQSG-AIF 402

Query: 318 G----QNFMMGHRI 327
           G    QNF++G+ +
Sbjct: 403 GNLAQQNFLVGYDL 416


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 129/315 (40%), Gaps = 36/315 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           YDP+ SS+   + C  P CK   S     C    D C YI +Y  +  +++G  V D L 
Sbjct: 200 YDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTDECKYIVNYG-DGKATTGTYVTDTLT 258

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           ++        + V      GC     GS+ +  A  G++ LG G  S+  L   A    N
Sbjct: 259 MSP-------TIVVKDFRFGCSHAVRGSFSNQNA--GILALGGGRGSL--LEQTADAYGN 307

Query: 160 SFSICFDENDSGSVFFGDQGPATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCL--- 212
           +FS C  +  S   F    GP       S+ P+ +   A   Y V +E+  +    L   
Sbjct: 308 AFSYCIPKPSSAG-FLSLGGPVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVP 366

Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 270
            T     A++DSGA  T LP ++YA +   F   + +   ++    +   CY+ +    +
Sbjct: 367 PTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDV 426

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIV 328
           KVP + L+F+   +  +          +G    CL   +T G+   G IG      + ++
Sbjct: 427 KVPKVSLVFAGGATLDLEPASIIL---DG----CLAFAATPGEESVGFIGNVQQQTYEVL 479

Query: 329 FDRENLKLAWSHSKC 343
           +D    K+ +    C
Sbjct: 480 YDVGGGKVGFRRGAC 494


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 137/318 (43%), Gaps = 32/318 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 97
           +DPS SSS  N++C+  LC        +S C S  D  C Y A Y  ++++S G+L  + 
Sbjct: 89  FDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDASCIYDAKYG-DNSTSVGFLSQER 147

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L + +       + +    + GCG+   G + +G+A  G+MGLG   +S+  +   +   
Sbjct: 148 LTITA-------TDIVDDFLFGCGQDNEGLF-NGSA--GLMGLGRHPISI--VQQTSSNY 195

Query: 158 QNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL 212
              FS C     S  G + FG    AT  S  + P+     D  F G++  S  +G + L
Sbjct: 196 NKIFSYCLPATSSSLGHLTFG-ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKL 254

Query: 213 ---TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
              + S F A   ++DSG   T L   +YA +   F + +    ++ +      CY+ S 
Sbjct: 255 PAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSG 314

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
            + + VP +   FS   +  + +      E+E           +D D  + G        
Sbjct: 315 YKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLE 374

Query: 327 IVFDRENLKLAWSHSKCE 344
           +V+D +  ++ +  + C+
Sbjct: 375 VVYDVKGGRIGFGAAGCK 392


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 81/306 (26%), Positives = 130/306 (42%), Gaps = 33/306 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS S + K + CS  +C+S    +SC S  D C Y   Y  +++ S G L  + L L 
Sbjct: 136 FDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYG-DNSHSQGDLSVETLTLG 194

Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           S       SSVQ    +IGCG    G++      +G   +GLG   V  +   +  I   
Sbjct: 195 S----TDGSSVQFPKTVIGCGHNNKGTF----QREGSGIVGLGGGPVSLISQLSSSIGGK 246

Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLT 213
           FS C        N S  + FGD+   + + T   PI  K     YF+ +E++ +G++ + 
Sbjct: 247 FSYCLAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIE 306

Query: 214 QSGF---------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
                          ++DSG + T LP + Y  +       +  +R+       + CY  
Sbjct: 307 FGSSSFESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRT 366

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIG-QNFM 322
           +S + L VP +   F      V  N I +F E +EG   F          +G +  QN +
Sbjct: 367 TSSDELNVPVITAHFKGAD--VELNPISTFIEVDEGVVCFAFRSSKIGPIFGNLAQQNLL 424

Query: 323 MGHRIV 328
           +G+ +V
Sbjct: 425 VGYDLV 430


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 130/317 (41%), Gaps = 33/317 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ S++   V C HP C +     S    C Y   Y  + +S++G L  + L L+S +
Sbjct: 204 FDPTKSATYSAVPCGHPQCAAAGGKCSNSGTCLYKVTYG-DGSSTAGVLSHETLSLSS-T 261

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
           +  P          GCG+   G +        ++GLG G +S+PS    A     +FS C
Sbjct: 262 RDLP------GFAFGCGQTNLGEFGGVDG---LVGLGRGALSLPS--QAAATFGATFSYC 310

Query: 165 FDENDS--GSVFFGDQGPATQ------QSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
               D+  G +  G   PA        Q T+ +   +    YFV V S  IG   L    
Sbjct: 311 LPSYDTTHGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPP 370

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T+ G   L DSG   T+LP E YA +  +F   ++  + +   + +  CY+ +    
Sbjct: 371 TVFTRDG--TLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNA 428

Query: 270 LKVPDMRLIFSKNQSFVVRN-HIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHR 326
           + +P +   FS    F +    I  +P++      CL  +       + IIG     G  
Sbjct: 429 IFMPAVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTE 488

Query: 327 IVFDRENLKLAWSHSKC 343
           +++D    K+ +    C
Sbjct: 489 VIYDVAAEKIGFGQFTC 505


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 90/339 (26%), Positives = 143/339 (42%), Gaps = 40/339 (11%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
           ++N   YDP  SSS +N+ C  P C   SS      CK+    CPY   Y     ++  +
Sbjct: 126 EQNGPYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDF 185

Query: 93  LVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
             +   ++L S +  +    V+ +V+ GCG    G +  GA+  G++GLG G +S  S L
Sbjct: 186 ATETFTVNLTSPTGKSEFKRVE-NVMFGCGHWNRGLF-HGAS--GLLGLGRGPLSFSSQL 241

Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG-DQGPATQQSTSFLP-IGEKYDA----YFV 200
               L  +SFS C      D N S  + FG D+        +F   +G K +     Y+V
Sbjct: 242 QS--LYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYV 299

Query: 201 GVESYCIGNSCL---------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
            ++S  +G   L         T  G    +VDSG + ++     Y  +   F K V    
Sbjct: 300 QIKSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYP 359

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVM 308
           I         CYN S  E + +PD  ++F+     +F V N+       E   V CL ++
Sbjct: 360 IVQDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEE---VVCLAIL 416

Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            T      IIG        +++D +  +L ++   C +V
Sbjct: 417 GTPRSALSIIGNYQQQNFHVLYDTKKSRLGYAPMNCADV 455


>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
 gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
           nagariensis]
          Length = 475

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 62/262 (23%), Positives = 104/262 (39%), Gaps = 27/262 (10%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
            + C Y   Y+ E +SS G++V+D           P       ++ GC   +TG      
Sbjct: 4   NEKCYYSRTYA-ERSSSEGWMVEDAFGF-------PDDQPPVRMVFGCENGETGEIYRQL 55

Query: 133 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 192
           A DG+MG+G    +  S L   G+I++ FS+CF     G +  GD       +T + P+ 
Sbjct: 56  A-DGIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLL 114

Query: 193 EKYDAYFVGVESYCIG--------NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 244
                ++  V    I         N+ +   G+  ++DSG +FT+LPTE +  +      
Sbjct: 115 NNLHLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGS 174

Query: 245 LVSSKRI-SLQGNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPE 296
              S  + S  G   +Y   C+  + +    +    P    +F  N    +    + F  
Sbjct: 175 YALSHGLQSTPGADPQYNDICWKGAPDNFQGLENHFPSAEFVFGDNARLSLPPLRYLFVS 234

Query: 297 NEGFTVFCLTVMSTDGDYGIIG 318
             G   +CL V    G   +IG
Sbjct: 235 RPG--EYCLGVFDNGGSGTLIG 254


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 141/328 (42%), Gaps = 34/328 (10%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
           +++L  YD S SS+    SC    CK   S + C       C Y   YS  D S++   +
Sbjct: 71  NQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFL 128

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
           D  +   SF   A   SV   V+ GCG   TG +       G+ G G G +S+PS L K 
Sbjct: 129 D--VETVSFVAGA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KV 179

Query: 155 GLIQNSFSICFDENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
           G   + F+       S  +F         G  T Q+T  +        Y++ ++   +G+
Sbjct: 180 GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGS 239

Query: 210 SCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
           + L   +S F         ++DSG +FT LP  +Y  V  +F   V    +         
Sbjct: 240 TRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL 299

Query: 261 CYNASS-EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
           C++A    +   VP + L F      + R N++F   ++ G    CL ++  +G+  IIG
Sbjct: 300 CFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIG 356

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                   +++D +N KL++  +KC+++
Sbjct: 357 NFQQQNMHVLYDLKNSKLSFVRAKCDKL 384


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 141/328 (42%), Gaps = 34/328 (10%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
           +++L  YD S SS+    SC    CK   S + C       C Y   YS  D S++   +
Sbjct: 127 NQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFL 184

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
           D  +   SF   A   SV   V+ GCG   TG +       G+ G G G +S+PS L K 
Sbjct: 185 D--VETVSFVAGA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KV 235

Query: 155 GLIQNSFSICFDENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
           G   + F+       S  +F         G  T Q+T  +        Y++ ++   +G+
Sbjct: 236 GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGS 295

Query: 210 SCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
           + L   +S F         ++DSG +FT LP  +Y  V  +F   V    +         
Sbjct: 296 TRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL 355

Query: 261 CYNASS-EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
           C++A    +   VP + L F      + R N++F   ++ G    CL ++  +G+  IIG
Sbjct: 356 CFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIG 412

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                   +++D +N KL++  +KC+++
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKCDKL 440


>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 438

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 74/328 (22%), Positives = 128/328 (39%), Gaps = 43/328 (13%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
           P    S+  V C H LC S     +     P+  DY  +     SS G L+ D+  L +F
Sbjct: 118 PLYRPSNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NF 176

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           +       ++  + +GCG  Q          DG++GLG G  S+ S L   GL++N    
Sbjct: 177 TNGV---QLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGH 233

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAY-FVGVESYCIGNSCLTQSGFQALV 221
           C      G +FFGD   +++   ++ P+  + Y  Y   G      G          A+ 
Sbjct: 234 CLSAQGGGYIFFGDVYDSSR--LTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVF 291

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           D+G+S+T+     Y  ++    K    K  + +    +   C+             R I+
Sbjct: 292 DTGSSYTYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRP-------FRSIY 344

Query: 280 SKNQSFVVRNHIFSFPEN-----------EGFTVF------CLTVMSTD----GDYGIIG 318
              + F  +  + SF  N           E + +       CL +++      GD  +IG
Sbjct: 345 EVRKYF--KPIVLSFTSNGRSKAQFEMPPEAYLIISNMGNVCLGILNGSEVGMGDLNLIG 402

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              M+   +VFD +   + W+ + C++V
Sbjct: 403 DISMLNKVMVFDNDKQLIGWTPADCDQV 430


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 73/293 (24%), Positives = 118/293 (40%), Gaps = 33/293 (11%)

Query: 69  CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
           C +    C Y A Y   DTS S GYL  D+L L       P  +  S  + GCG+   G 
Sbjct: 187 CSNATGACVYKASYG--DTSFSIGYLSQDVLTLT------PSEAPSSGFVYGCGQDNQGL 238

Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFGDQG 179
           +       G++GL    +S+   L+K     N+FS C   +         SG +  G   
Sbjct: 239 F---GRSSGIIGLANDKISMLGQLSKK--YGNAFSYCLPSSFSAPNSSSLSGFLSIGASS 293

Query: 180 PATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPT 232
             T     F P+ +       YF+ + +  +    L  S        ++DSG   T LP 
Sbjct: 294 -LTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPV 352

Query: 233 EIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 291
            +Y  +   F  ++S K     G S    C+  S +EM  VP++++IF       ++ H 
Sbjct: 353 AVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAHN 412

Query: 292 FSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
                 +G T  CL + ++     IIG       ++ +D  N K+ ++   C+
Sbjct: 413 SLVEIEKGTT--CLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGCQ 463


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 33/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS S+S  +V+C +P C     ++C++    C Y   Y  + + + G    + L L  
Sbjct: 205 FDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG- 262

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S+  SSV IGCG    G ++  A    + G  L   S PS ++       +FS
Sbjct: 263 ------DSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFS 308

Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGF 217
            C  + DS S   + FGD   A + +   +        Y+VG+    +G   L+   S F
Sbjct: 309 YCLVDRDSPSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAF 367

Query: 218 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                     +VDSG + T L +  YA +   F +   S   +   + +  CY+ S    
Sbjct: 368 AMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTS 427

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           ++VP + L F+      +    +  P + G   +CL    T+    IIG     G R+ F
Sbjct: 428 VEVPAVSLRFAGGGELRLPAKNYLIPVD-GAGTYCLAFAPTNAAVSIIGNVQQQGTRVSF 486

Query: 330 DRENLKLAWSHSKC 343
           D     + ++ +KC
Sbjct: 487 DTAKSTVGFTSNKC 500


>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
          Length = 424

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 73/321 (22%), Positives = 127/321 (39%), Gaps = 28/321 (8%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
           P    S+  + C+ PLCK+     + +   P   DY  E     SS G LV D+  L   
Sbjct: 98  PLYQPSNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSL--- 154

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
             +     +   + +GCG  Q          DGV+GLG G VS+ S L   G ++N    
Sbjct: 155 -NYTKGLRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGH 213

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQAL 220
           C      G +FFG+    + +  S+ P+  +   ++   +G E    G           +
Sbjct: 214 CLSSLGGGILFFGNDLYDSSR-VSWTPMARENSKHYSPAMGGE-LLFGGRTTGLKNLLTV 271

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS----SEEMLKVPD 274
            DSG+S+T+  ++ Y  V     + +S K +  +   ++   C+       S E +K   
Sbjct: 272 FDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYF 331

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGH 325
             L  S    +  +  +F  P      +      CL +++       +  +IG   M   
Sbjct: 332 KPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 390

Query: 326 RIVFDRENLKLAWSHSKCEEV 346
            I++D E   + W  + C+E+
Sbjct: 391 MIIYDNEKQSIGWIPADCDEI 411


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 73/299 (24%), Positives = 122/299 (40%), Gaps = 28/299 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  N+SC+ P C            C Y   Y  + + S G+   D L L+S+ 
Sbjct: 204 FDPARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 261

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G Y + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 262 ------DAIKGFRFGCGERNEGLYGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAH 309

Query: 164 CFDENDSGSVFFGDQGPA-----TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 216
           CF    SG+ +  D GP      + + T+ + +      Y+VG+    +G   L+  QS 
Sbjct: 310 CFPARSSGTGYL-DFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSV 368

Query: 217 FQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLK 271
           F     +VDSG   T LP   Y+ +   F   ++ +  + +   +    CY+ +    + 
Sbjct: 369 FTTSGTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVA 428

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           +P + L+F    S  V      +  +             D D GI+G   +    +V+D
Sbjct: 429 IPTVSLLFQGGASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYD 487


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 139/333 (41%), Gaps = 41/333 (12%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGY 92
           D+ L  +DPS+SS+    SC   LC+    +SC S K      C Y   Y  + + ++G+
Sbjct: 118 DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGF 176

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L  D           P       V  GCG    G +       G+ G G G +S+PS L 
Sbjct: 177 LEVDKFTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL- 227

Query: 153 KAGLIQNSFSICFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
           K G    +FS CF   +     +V           G    QST  +        Y++ ++
Sbjct: 228 KVG----NFSHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLK 283

Query: 204 SYCIGNSCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
              +G++ L   +S F         ++DSG + T LPT +Y  V   F   V    +S  
Sbjct: 284 GITVGSTRLPVPESEFTLKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGN 343

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD 313
                +C +A       VP + L F      + R N++F   E+ G ++ CL ++   G+
Sbjct: 344 TTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGE 401

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              IG        +++D +N KL++  ++C+++
Sbjct: 402 VTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDKL 434


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 33/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS S+S  +V+C +P C     ++C++    C Y   Y  + + + G    + L L  
Sbjct: 209 FDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG- 266

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S+  SSV IGCG    G ++  A    + G  L   S PS ++       +FS
Sbjct: 267 ------DSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFS 312

Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGF 217
            C  + DS S   + FGD   A + +   +        Y+VG+    +G   L+   S F
Sbjct: 313 YCLVDRDSPSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAF 371

Query: 218 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                     +VDSG + T L +  YA +   F +   S   +   + +  CY+ S    
Sbjct: 372 AMDSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTS 431

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           ++VP + L F+      +    +  P  +G   +CL    T+    IIG     G R+ F
Sbjct: 432 VEVPAVSLRFAGGGELRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSF 490

Query: 330 DRENLKLAWSHSKC 343
           D     + ++ +KC
Sbjct: 491 DTAKSTVGFTTNKC 504


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 140/321 (43%), Gaps = 45/321 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DPS SS+  ++S   P+C +    K +  + C Y A Y+   TSS     +DI+    F
Sbjct: 101 FDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----F 156

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 +   SSV+ GCG    G + DG    G++GL  GD S+ S L       + FS 
Sbjct: 157 ETSDQGTVTVSSVVFGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSY 208

Query: 164 C----FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
           C    FD + + +      G   + S++  P       Y+V +E   +G + L       
Sbjct: 209 CIGDLFDPHYTHNQLVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVF 266

Query: 213 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE 267
             T+SG   +V DSG + TFL  + +  +  +  +LV    +++  +      CY     
Sbjct: 267 QRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVN 326

Query: 268 EMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF 321
           E L+  P++   F++    V+  +     +N+   VFCL V+ ++        GI+ Q  
Sbjct: 327 EDLRGFPELAFHFAEGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQH 384

Query: 322 ------MMGHRIVFDRENLKL 336
                 ++G R+ F R + +L
Sbjct: 385 YNVAYDLIGKRVYFQRTDCEL 405


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 63/242 (26%), Positives = 108/242 (44%), Gaps = 19/242 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+ +N+SC+   C   SS       C Y   Y  + +S+ G+L  +   LA+  
Sbjct: 59  FDPTLSSTYRNISCTSAACTGLSSRGCSGSTCVYGVTYG-DGSSTVGFLATETFTLAA-- 115

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                 +V ++ I GCG+   G +  GAA  G++GLG    S+ S LA +  + N FS C
Sbjct: 116 -----GNVFNNFIFGCGQNNQGLF-TGAA--GLIGLGRSPYSLNSQLATS--LGNIFSYC 165

Query: 165 FDENDSGSVFFGDQGP-ATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQA-- 219
                S + +     P  T   T+ L        YF+ +    +G +   L+ + FQ+  
Sbjct: 166 LPSTSSATGYLNIGNPLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVG 225

Query: 220 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
            ++DSG   T LP   Y  +   F   ++    +   +    CY+ S    +  P ++L 
Sbjct: 226 TIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLH 285

Query: 279 FS 280
           ++
Sbjct: 286 YT 287


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 140/321 (43%), Gaps = 45/321 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DPS SS+  ++S   P+C +    K +  + C Y A Y+   TSS     +DI+    F
Sbjct: 101 FDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----F 156

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 +   SSV+ GCG    G + DG    G++GL  GD S+ S L       + FS 
Sbjct: 157 ETSDQGTVTVSSVVFGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSY 208

Query: 164 C----FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
           C    FD + + +      G   + S++  P       Y+V +E   +G + L       
Sbjct: 209 CIGDLFDPHYTHNQLVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVF 266

Query: 213 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE 267
             T+SG   +V DSG + TFL  + +  +  +  +LV    +++  +      CY     
Sbjct: 267 QRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVN 326

Query: 268 EMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF 321
           E L+  P++   F++    V+  +     +N+   VFCL V+ ++        GI+ Q  
Sbjct: 327 EDLRGFPELAFHFAEGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQH 384

Query: 322 ------MMGHRIVFDRENLKL 336
                 ++G R+ F R + +L
Sbjct: 385 YNVAYDLIGKRVYFQRTDCEL 405


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 140/321 (43%), Gaps = 45/321 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DPS SS+  ++S   P+C +    K +  + C Y A Y+   TSS     +DI+    F
Sbjct: 133 FDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----F 188

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 +   SSV+ GCG    G + DG    G++GL  GD S+ S L       + FS 
Sbjct: 189 ETSDQGTVTVSSVVFGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSY 240

Query: 164 C----FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
           C    FD + + +      G   + S++  P       Y+V +E   +G + L       
Sbjct: 241 CIGDLFDPHYTHNQLVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVF 298

Query: 213 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE 267
             T+SG   +V DSG + TFL  + +  +  +  +LV    +++  +      CY     
Sbjct: 299 QRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVN 358

Query: 268 EMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF 321
           E L+  P++   F++    V+  +     +N+   VFCL V+ ++        GI+ Q  
Sbjct: 359 EDLRGFPELAFHFAEGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQH 416

Query: 322 ------MMGHRIVFDRENLKL 336
                 ++G R+ F R + +L
Sbjct: 417 YNVAYDLIGKRVYFQRTDCEL 437


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 79/312 (25%), Positives = 132/312 (42%), Gaps = 42/312 (13%)

Query: 56  VSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
           V C   LC+  S  SC +  D C Y+  Y  + +S+SG L D+   ++S S         
Sbjct: 93  VLCQSSLCQPPSIFSCNNDGD-CEYVYPYG-DRSSTSGILSDETFSISSQSL-------- 142

Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 169
            ++  GCG    G   D     G++G G G +S+ S L  +  + N FS C     D + 
Sbjct: 143 PNITFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSK 196

Query: 170 SGSVFFGDQG--PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--------QSGFQA 219
           +  +F G+     AT   ++ L      + Y++ +E   +G   L         QS    
Sbjct: 197 TSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSG 256

Query: 220 --LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
             ++DSG + TFL    Y  V    + +VSS  +         C+N         P M  
Sbjct: 257 GLIIDSGTTLTFLQQTAYDAVK---EAMVSSINLPQADGQLDLCFNQQGSSNPGFPSMTF 313

Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENL 334
            F K   + V    + FP++    + CL +M T+   G+  I G      ++I++D EN 
Sbjct: 314 HF-KGADYDVPKENYLFPDSTS-DIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENN 371

Query: 335 KLAWSHSKCEEV 346
            L+++ + C+ +
Sbjct: 372 VLSFAPTACDTL 383


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 139/333 (41%), Gaps = 41/333 (12%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGY 92
           D+ L  +DPS+SS+    SC   LC+    +SC S K      C Y   Y  + + ++G+
Sbjct: 118 DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGF 176

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L  D           P       V  GCG    G +       G+ G G G +S+PS L 
Sbjct: 177 LEVDKFTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL- 227

Query: 153 KAGLIQNSFSICFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
           K G    +FS CF   +     +V           G    QST  +        Y++ ++
Sbjct: 228 KVG----NFSHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLK 283

Query: 204 SYCIGNSCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
              +G++ L   +S F         ++DSG + T LPT +Y  V   F   V    +S  
Sbjct: 284 GITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGN 343

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD 313
                +C +A       VP + L F      + R N++F   E+ G ++ CL ++   G+
Sbjct: 344 TTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGE 401

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              IG        +++D +N KL++  ++C+++
Sbjct: 402 VTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDKL 434


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 138/320 (43%), Gaps = 36/320 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLA 101
           +DP SS + ++ SC    C    +S+C    + C Y   YS  D S + G +  D + L 
Sbjct: 137 FDPKSSKTYRDFSCDARQCSLLDQSTCSG--NICQY--QYSYGDRSYTMGNVASDTITLD 192

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           S +  +P S  ++  +IGCG +  G++ D  +  G++GLG G +S+ S +  +  +   F
Sbjct: 193 S-TTGSPVSFPKT--VIGCGHENDGTFSDKGS--GIVGLGAGPLSLISQMGSS--VGGKF 245

Query: 162 SICF-----DENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN--- 209
           S C         +S  + FG      GP  Q ST  L        YF+ +E+  +GN   
Sbjct: 246 SYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQ-STPLLSSETMSSFYFLTLEAMSVGNERI 304

Query: 210 ----SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
               S L       ++DSG + T +P + ++ +       V  +R          CY+A+
Sbjct: 305 KFGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSAT 364

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
           S+  LKVP +   F+     +   + F    ++   V CL   ST     I G    M  
Sbjct: 365 SD--LKVPAITAHFTGADVKLKPINTFVQVSDD---VVCLAFASTTSGISIYGNVAQMNF 419

Query: 326 RIVFDRENLKLAWSHSKCEE 345
            + ++ +   L++  + C +
Sbjct: 420 LVEYNIQGKSLSFKPTDCTK 439


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 129/314 (41%), Gaps = 34/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DP+ S+S  NVSCS PLC S  S      +     C Y   Y  + + S G+L  + L 
Sbjct: 168 FDPTKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYG-DGSYSIGFLGKERLT 226

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           + S       + + ++   GCG+   G +   A   G++GLG   +SV S  A       
Sbjct: 227 IGS-------TDIFNNFYFGCGQDVDGLFGKAA---GLLGLGRDKLSVVSQTAPK--YNQ 274

Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSC 211
            FS C     S S  F   G +  +S  F P+     +++        VG +   I  S 
Sbjct: 275 LFSYCLPS--SSSTGFLSFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSV 332

Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            + +G   ++DSG   T LP   Y+ +   F K ++S  +    +    CY+ S  + +K
Sbjct: 333 FSTAG--TIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIK 390

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVF 329
           VP + + FS      V +    F  N G    CL      G  D  I G        +V+
Sbjct: 391 VPKIVISFSGGVDVDV-DQAGIFVAN-GLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVY 448

Query: 330 DRENLKLAWSHSKC 343
           D    K+ ++ + C
Sbjct: 449 DVSGGKVGFAPASC 462


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 141/328 (42%), Gaps = 41/328 (12%)

Query: 45  YDPSSSSSSKNVSCSH----PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           YDPS+SS+   V CS     P+ +SR +C +    C Y   YS +   S+G L  + L L
Sbjct: 119 YDPSASSTFSPVPCSSATCLPVLRSR-NCSTPSSLCRYGYSYS-DGAYSAGILGTETLTL 176

Query: 101 ASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
            S     P  +V  S V  GCG    G  L+     G +GLG G +   SLLA+ G+   
Sbjct: 177 GS---SVPGQAVSVSDVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--G 225

Query: 160 SFSIC----FDENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
            FS C    F+         G       GP   QST  L        Y V ++   +G+ 
Sbjct: 226 KFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDV 285

Query: 211 CL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWK 259
            L            S    +VDSG +F+ LP   +  VV    +++    ++    +S  
Sbjct: 286 RLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSPC 345

Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
           +   A   ++  +PD+ L F+      + R++  S+  N+  + FCL ++ T   + ++G
Sbjct: 346 FPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSY--NQEDSSFCLNIVGTTSTWSMLG 403

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                  +++FD    +L++  + C ++
Sbjct: 404 NFQQQNIQMLFDMTVGQLSFLPTDCSKL 431


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 76/318 (23%), Positives = 128/318 (40%), Gaps = 40/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS SSSS+ + C  P CK     SC ++   C +   Y    ++   YL  D L LA 
Sbjct: 128 FDPSKSSSSRTLQCEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSAIEAYLTQDTLTLA- 183

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  + V  +   GC  K +G+ L      G+MGLG G +S+ S      L Q++FS
Sbjct: 184 -------TDVIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFS 231

Query: 163 ICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
            C       N SGS+  G +  P   ++T  L    +   Y+V +    +GN  +     
Sbjct: 232 YCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 291

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                  +G   + DSG  +T L    Y  +  +F + V +   +  G  +  CY+ S  
Sbjct: 292 ALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLG-GFDTCYSGS-- 348

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGH 325
             +  P +  +F+     +  +++         +   +    T+ +    +I       H
Sbjct: 349 --VVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNH 406

Query: 326 RIVFDRENLKLAWSHSKC 343
           R++ D  N +L  S   C
Sbjct: 407 RVLIDVPNSRLGISRETC 424


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 134/343 (39%), Gaps = 47/343 (13%)

Query: 40  RNLSEYDPSSS---SSSKNVSCSH---------PLCKS-RSSCKSLKDPCPYIADYSTED 86
           RN + + P S+     S   S +H         PL K  R +   L  PC Y  +YS  D
Sbjct: 121 RNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRY--EYSYGD 178

Query: 87  TS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGL 142
            S +SG+   +   L + S    + +    +  GC  + +G  + GA+     GVMGLG 
Sbjct: 179 GSKTSGFFSKETTTLNTSSG---REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGR 235

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDEND-----SGSVFFG----DQGPATQQSTSFLPIGE 193
           G +S+ S L       N FS C  ++D     +  +  G    D  P  ++   F P+  
Sbjct: 236 GPISLSSQLGHR--FGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPG-KRRMRFTPLHI 292

Query: 194 KYDA---YFVGVESYCI-GNSCLTQSGFQAL---------VDSGASFTFLPTEIYAEVVV 240
              +   Y++G+ES  + G          AL         VDSG + TFLP   Y +++ 
Sbjct: 293 NPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILT 352

Query: 241 KFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF 300
              + V     +     +  C N S  E  ++P +      +  F      +    +E  
Sbjct: 353 VIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDV 412

Query: 301 TVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
               L  + T   + +IG     G  + FD++  +L +S   C
Sbjct: 413 KCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 137/325 (42%), Gaps = 19/325 (5%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGYLVDDI 97
            LS +DP  SSS+  VSCS   C S    +S   P   C Y   Y  + + +SGY + D 
Sbjct: 127 QLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGYYISDF 185

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGL 156
           +   +        +  +  + GC   Q+G       A DG+ GLG G +SV S LA  GL
Sbjct: 186 MSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGL 245

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
               FS C   + SG       G   +  T + P+      Y V ++S  +    L    
Sbjct: 246 APRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDP 304

Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                 +G   ++D+G +  +LP E Y+  +      VS     +   S++ C+  ++ +
Sbjct: 305 SVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ-CFEITAGD 363

Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGH 325
           +   P + L F+   S V+  R ++  F  + G +++C+     +     I+G   +   
Sbjct: 364 VDVFPQVSLSFAGGASMVLGPRAYLQIF-SSSGSSIWCIGFQRMSHRRITILGDLVLKDK 422

Query: 326 RIVFDRENLKLAWSHSKCEEVIDKS 350
            +V+D    ++ W+   C   ++ S
Sbjct: 423 VVVYDLVRQRIGWAEYDCSLEVNVS 447


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 82/329 (24%), Positives = 131/329 (39%), Gaps = 40/329 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSS---GYLVDDIL 98
           +DP  S+S   ++   P C++  RS     K   C Y   Y     S+S   G LV++ L
Sbjct: 176 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETL 235

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
             A   +       Q+ + IGCG    G  L GA   G++GLG G +S+P  +A  G   
Sbjct: 236 TFAGGVR-------QAYLSIGCGHDNKG--LFGAPAAGILGLGRGQISIPHQIAFLGY-N 285

Query: 159 NSFSICFDENDSG------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVESYC 206
            SFS C  +  SG      ++ FG     T    SF P      +   Y    +GV    
Sbjct: 286 ASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGG 345

Query: 207 IGNSCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGN 256
           +    +T+   Q          ++DSG + T L    Y      F    +S  ++S  G 
Sbjct: 346 VRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGP 405

Query: 257 S--WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
           S  +  CY       +KVP + + F+      ++   +  P +   TV      + D   
Sbjct: 406 SGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSV 465

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            +IG     G R+V+D    ++ ++ + C
Sbjct: 466 SVIGNILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
 gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
          Length = 490

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/391 (23%), Positives = 158/391 (40%), Gaps = 65/391 (16%)

Query: 26  LWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 85
           ++C   F    +QD   S   P+ SSS K + C +  C +     S K    Y   Y+ E
Sbjct: 63  MFCSFFF----LQDPRFS---PALSSSYKPLECGNE-CSTGFCDGSRK----YQRQYA-E 109

Query: 86  DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
            ++SSG L  D++  ++ S    Q      ++ GC   +TG   D  A DG++GLG G +
Sbjct: 110 KSTSSGVLGKDVISFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGIIGLGRGPL 163

Query: 146 SVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV 202
           S+   L +   +++ FS+C+   DE     +  G Q P     TS  P    Y  Y + +
Sbjct: 164 SIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPY--YNLML 221

Query: 203 ESYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
           +   +G S L          +  ++DSG ++ + P   +        + V S +  + G 
Sbjct: 222 KGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLK-EVPGP 280

Query: 257 SWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTV 307
             K+   CY  +   +  +    P +  +F   QS  +   N++F   +  G   +CL V
Sbjct: 281 DEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISG--AYCLGV 338

Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLP 367
                   ++G   +    + ++R    + +  +KC ++  +                LP
Sbjct: 339 FENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKCNDLWSR----------------LP 382

Query: 368 TTEQ--QSTSNGQAAAPPSTAKTAPSKSIAA 396
            T +   ST   Q   PP     APS S+ A
Sbjct: 383 ETNEPGHSTQPAQFLLPP-----APSPSVGA 408


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 78/332 (23%), Positives = 134/332 (40%), Gaps = 48/332 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DP  SS+ + V CS P C++       S  +    C Y+  Y  + +SS+G L  D L 
Sbjct: 128 FDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGDLATDKLA 186

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
            A+        +  ++V +GCGR   G + D AA  G++G+G G +S+ + +A A    +
Sbjct: 187 FAN-------DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVGRGKISISTQVAPA--YGS 234

Query: 160 SFSICFDENDSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
            F  C  +  S S      VF     P +   T+ L    +   Y+V +  + +G   +T
Sbjct: 235 VFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT 294

Query: 214 QSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGN 256
             GF                +VDSG + +    + YA +   FD    +  +     + +
Sbjct: 295 --GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHS 352

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD 311
            +  CY+         P + L F+      +    +  P + G         CL   + D
Sbjct: 353 VFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAAD 412

Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
               +IG     G R+VFD E  ++ ++   C
Sbjct: 413 DGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 76/338 (22%), Positives = 139/338 (41%), Gaps = 30/338 (8%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADY 82
           L+W   +   S  + +N   +DPS S+S K VSC    C+     SC   +  C +   Y
Sbjct: 114 LMWTQCLPCLSCYKQKN-PMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY 172

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
             + + + G +  + L L S   ++ Q     +++ GCG   +G++ +     G+ G G 
Sbjct: 173 G-DGSLAQGVIATETLTLNS---NSGQPXSIXNIVFGCGHNNSGTFNENEM--GLFGTGG 226

Query: 143 GDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 197
             +S+ S +         FS C      D + +  + FG +   +       P+  K D 
Sbjct: 227 RPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDP 286

Query: 198 --YFVGVESYCIGN--------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
             YFV ++   +G+        S +   G    +D+G   T LP + Y  +V    + + 
Sbjct: 287 TYYFVTLDGISVGDKLFPFSSSSPMATKG-NVFIDAGTPPTLLPRDFYNRLVQGVKEAIP 345

Query: 248 SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
            + +       + CY +++  ++  P +   F      +   + F  P+ EG  V+C  +
Sbjct: 346 MEPVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFISPK-EG--VYCFAM 400

Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
              DGD GI G    M   I FD +  K+++    C +
Sbjct: 401 QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 80/349 (22%), Positives = 138/349 (39%), Gaps = 65/349 (18%)

Query: 47  PSSSSSSKNVSCSHPLCKS-------------RSSCKSLKDPCP-YIADYSTEDTSSSGY 92
           P  SSS   V+C+   CK+               S K+  + CP Y   Y     S++G 
Sbjct: 33  PRMSSSLHLVTCADSNCKTLYGNNTELLCQSCAGSLKNCSETCPPYGIQYGR--GSTAGL 90

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L+ + L+L    ++   +   +   +GC      S +    P G+ G G G +S+PS L 
Sbjct: 91  LLTETLNLPL--ENGEGARAITHFAVGC------SIVSSQQPSGIAGFGRGALSMPSQLG 142

Query: 153 KAGLIQNSFSIC-----FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDA--------- 197
           +  + ++ F+ C     FDE +  S+   GD+        ++ P      A         
Sbjct: 143 EH-IGKDRFAYCLQSHRFDEENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVY 201

Query: 198 YFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
           Y++G+    IG   L           T+     ++DSG +FT    EI+  +   F   +
Sbjct: 202 YYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQI 261

Query: 247 SSKRIS--LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTV 302
             +R            CY+ +  E + +P+    F      V  V N+   F     F  
Sbjct: 262 GYRRAGEVEDKTGMGLCYDVTGLENIVLPEFAFHFKGGSDMVLPVANYFSYFSS---FDS 318

Query: 303 FCLTVMSTDG----DYG---IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
            CLT++S+ G    D G   I+G +      +++DRE  +L ++   C+
Sbjct: 319 ICLTMISSRGLLEVDSGPAVILGNDQQQDFYLLYDREKNRLGFTQQTCK 367


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 86/326 (26%), Positives = 140/326 (42%), Gaps = 42/326 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++PS SSS KN+ C   LC+S   +SC   K+ C Y + Y  +++ S G L  D L L S
Sbjct: 129 FNPSKSSSYKNIPCPSKLCQSMEDTSCND-KNYCEY-STYYGDNSHSGGDLSVDTLTLES 186

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            +          +++IGCG     SY +GA+  G++G G G  S  + L  +      FS
Sbjct: 187 TNGLTVSF---PNIVIGCGTNNILSY-EGAS-SGIVGFGSGPASFITQLGSS--TGGKFS 239

Query: 163 ICF---------DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSC 211
            C            N +  + FGD    +       PI +K     Y++ +E++ +GN  
Sbjct: 240 YCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRR 299

Query: 212 LTQSGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           +   G          ++DSG + T L  + Y+ +      LV  +R+     +   CY+ 
Sbjct: 300 VEIGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSV 359

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QN 320
            +E      D  +I    +   V  H  S   +    VFCL   S+  D+ I G    QN
Sbjct: 360 KAEGY----DFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQ-DHAIFGNLAQQN 414

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
            M+G    +D +   +++  S C +V
Sbjct: 415 LMVG----YDLQQKIVSFKPSDCTKV 436


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 76/338 (22%), Positives = 140/338 (41%), Gaps = 30/338 (8%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADY 82
           L+W   +   S  + +N   +DPS S+S K VSC    C+     SC   +  C +   Y
Sbjct: 114 LMWTQCLPCLSCYKQKN-PMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY 172

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
             + + + G +  + L L S   ++ Q +   +++ GCG   +G++ +     G+ G G 
Sbjct: 173 G-DGSLAQGVIATETLTLNS---NSGQPTSILNIVFGCGHNNSGTFNENEM--GLFGTGG 226

Query: 143 GDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 197
             +S+ S +         FS C      D + +  + FG +   +       P+  K D 
Sbjct: 227 RPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDP 286

Query: 198 --YFVGVESYCIGN--------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
             YFV ++   +G+        S +   G    +D+G   T LP + Y  +V    + + 
Sbjct: 287 TYYFVTLDGISVGDKLFPFSSSSPMATKG-NVFIDAGTPPTLLPRDFYNRLVQGVKEAIP 345

Query: 248 SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
            + +       + CY +++  ++  P +   F      +   + F  P+ EG  V+C  +
Sbjct: 346 MEPVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFISPK-EG--VYCFAM 400

Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
              DGD GI G    M   I FD +  K+++    C +
Sbjct: 401 QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 129/314 (41%), Gaps = 35/314 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP++SSS   ++C    C+    S+C++ K  C Y   Y     +   Y+ + +    S
Sbjct: 199 FDPTASSSYNPLTCDAQQCQDLEMSACRNGK--CLYQVSYGDGSFTVGEYVTETV----S 252

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           F   +      + V IGCG    G +        V   GL  +    L   + +   SFS
Sbjct: 253 FGAGS-----VNRVAIGCGHDNEGLF--------VGSAGLLGLGGGPLSLTSQIKATSFS 299

Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 213
            C  + DSG   ++ F    P        L   +    Y+V +    +G   +T      
Sbjct: 300 YCLVDRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETF 359

Query: 214 ---QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              QSG    +VDSG + T L T+ Y  V   F +  S+ R +     +  CY+ SS + 
Sbjct: 360 AVDQSGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQS 419

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           ++VP +   FS ++++ +    +  P  +G   +C     T     IIG     G R+ F
Sbjct: 420 VRVPTVSFHFSGDRAWALPAKNYLIPV-DGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSF 478

Query: 330 DRENLKLAWSHSKC 343
           D  N  + +S +KC
Sbjct: 479 DLANSLVGFSPNKC 492


>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 873

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/307 (21%), Positives = 140/307 (45%), Gaps = 23/307 (7%)

Query: 52  SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 111
           ++K+ S +   CK    C + +D    I    +E +     ++ D++ + +      +  
Sbjct: 90  ATKSTSINFVQCKYEEGCDTCRDNLCVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMI 149

Query: 112 VQSSVI---IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDE 167
           ++   I    GC  ++TG ++     +G+MGLG+G  ++ + + KA  ++ + F++CF +
Sbjct: 150 MRRYGIRFKFGCQTRETGLFI-TQVENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQ 208

Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT------QSGFQAL 220
                V  G          ++ P+ +   + Y + V+   IG   L       +SG  A+
Sbjct: 209 KGGSFVIGGVDYSHHTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSGRGAI 268

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNSWKYCYNASSEEMLKVPDMRLIF 279
           VDSG + T+ P+         F      KRI+ ++ N  K   N + E +  +P++ LI 
Sbjct: 269 VDSGTTDTYFPSAAATPFQEAF------KRITGVEYNENKM--NLTPEMVETLPNVSLII 320

Query: 280 S--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           +    + F +  +   +  N+    F  T+  ++    ++G + MMG+ ++FD E  ++ 
Sbjct: 321 AGEDGEDFEISLNASDYILNDSNHHFFGTLHFSERRGAVLGASIMMGYDVIFDLEKKRVG 380

Query: 338 WSHSKCE 344
           ++ + C+
Sbjct: 381 FAEATCD 387


>gi|298707682|emb|CBJ25999.1| aspartyl protease [Ectocarpus siliculosus]
          Length = 547

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/316 (24%), Positives = 134/316 (42%), Gaps = 24/316 (7%)

Query: 45  YDPSSSSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA-- 101
           +DPS SS++  V+C     C     C+S K  C  + ++ TE +S     VDD+L +   
Sbjct: 150 WDPSQSSTAHIVTCDETERCHGAYKCQSDKK-C-VLREHYTEGSSWRAKQVDDLLWVGER 207

Query: 102 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-Q 158
             S S+    S+       GC    TG +    A DG+MGL     ++ + LA AG I +
Sbjct: 208 TLSDSQKHDDSAFSVDFTFGCIESLTGLFKTQLA-DGIMGLNADSRTLITQLATAGKISE 266

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLT-- 213
             FS+CF E   G++  G   P   +  S   + P   +  A  V V    +    +T  
Sbjct: 267 RKFSLCFSET-GGTMVIGGYDPLLNKPGSEMQYTPSTGEISAPTVKVTDVTLNGVSITTD 325

Query: 214 ----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
               Q G    + SG + T+LP  +       ++    S   + + N  ++C   ++ E+
Sbjct: 326 ASVFQKGTGIKIVSGTTNTYLPRAVAEGFSAAWEAATGSPYATCKMN--EFCMTRTTVEL 383

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIV 328
             +P + +         VR   +    ++   V+  L    + G  G++G N +  H +V
Sbjct: 384 EALPVLMIHMDGGVEVNVRPEAYMDASSDEENVYPSLPPPCSMG--GVLGANLLRDHNVV 441

Query: 329 FDRENLKLAWSHSKCE 344
           FD +N  + ++   C+
Sbjct: 442 FDYDNHVVGFADGACD 457


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/355 (23%), Positives = 125/355 (35%), Gaps = 69/355 (19%)

Query: 40  RNLSEYDPSS------SSSSKNVSCSHPLCK-----SRSSCK--SLKDPCPYIADYSTED 86
           RN S + PSS      SSS     C  P C+         C    L  PC ++  Y+ + 
Sbjct: 120 RNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYA-DG 178

Query: 87  TSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGA---APDGVMGLGL 142
           + SSG+   +   L S S     S +    +  GCG + +G  + GA      GVMGLG 
Sbjct: 179 SLSSGFFSKETTTLKSLSG----SEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGR 234

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----- 197
           G +S  S L +     N FS C  +              +   TSFL IG    +     
Sbjct: 235 GSISFSSQLGRR--FGNKFSYCLMDYT-----------LSPPPTSFLMIGGGLHSLPLTN 281

Query: 198 ------------------YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 229
                             Y++ + S  I    L           Q     +VDSG + T+
Sbjct: 282 ATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTY 341

Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVR 288
           L    Y EV+    + V     +     +  C NAS E     +P +R        F   
Sbjct: 342 LTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPP 401

Query: 289 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              +     EG     +  + +   + +IG     G  + FD+E  +L ++   C
Sbjct: 402 PRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 78/334 (23%), Positives = 129/334 (38%), Gaps = 40/334 (11%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTEDTSS 89
            D+    +DPSSS S   V C+   C         S  +C      C Y   Y  + + S
Sbjct: 146 HDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYR-DGSYS 204

Query: 90  SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
            G L  D L LA          +Q   + GCG    G +       G+MGLG   +S+ S
Sbjct: 205 RGVLAHDRLSLAG-------EDIQG-FVFGCGTSNQGPF---GGTSGLMGLGRSQLSLIS 253

Query: 150 -LLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFV 200
             + + G +   FS C    +   SGS+  GD     + ST  +      D      Y  
Sbjct: 254 QTMDQFGGV---FSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLA 310

Query: 201 GVESYCIGNSCLTQSGF------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
            +    +G   +   GF      +A+VDSG   T L   +YA V  +F   ++    +  
Sbjct: 311 NLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAP 370

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
            +    C++ +    ++VP ++L+F       V +    +      +  CL + S   +Y
Sbjct: 371 FSILDTCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEY 430

Query: 315 G--IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              IIG       R++FD    ++ ++   C+ +
Sbjct: 431 DTPIIGNYQQKNLRVIFDTVGSQIGFAQETCDYI 464


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 136/329 (41%), Gaps = 47/329 (14%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +N + +DP +SSS + +SCS P CK     +C S  + C Y   Y  + + + G L  D 
Sbjct: 51  QNDAVFDPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASDS 109

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
             L S  + +P       V+ GCG    G ++  A   G+    L   S PS L+     
Sbjct: 110 F-LVSRGRTSP-------VVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLS----- 153

Query: 158 QNSFSICFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN 209
              FS C    D+G     ++ FGD    T  S ++  +    K D  Y+ G+    IG 
Sbjct: 154 SRKFSYCLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGG 213

Query: 210 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQ 254
           + L+   + F+          ++DSG S T LPT  Y  +   F     KL  +   SL 
Sbjct: 214 TLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL- 272

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
              +  CY+ S+   + +P +   F    S  +    +  P +   T FC     T  D 
Sbjct: 273 ---FDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGT-FCFAFSKTSLDL 328

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            IIG       R+  D ++ ++ ++  +C
Sbjct: 329 SIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 135/324 (41%), Gaps = 44/324 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP+SS++   VSC   +C++ R+S       C Y   Y  + + + G L  + L L   
Sbjct: 167 FDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYG-DGSYTKGTLALETLTLG-- 223

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                  +    V IGCG +  G ++  A   G++GLG G +S+   L  A     +FS 
Sbjct: 224 ------GTAVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSY 272

Query: 164 CFDE---------NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC 211
           C            + +GS+  G +  A  +   ++P+     A   Y+VGV    +G+  
Sbjct: 273 CLASRGGSGSGAADAAGSLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDER 331

Query: 212 ---------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
                    LT+ G   +V D+G + T LP E YA +   F   V +   +   +    C
Sbjct: 332 LPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTC 391

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
           Y+ S    ++VP +   F    +  +  RN +    E +G  ++CL    +     I+G 
Sbjct: 392 YDLSGYTSVRVPTVSFYFDGAATLTLPARNLLL---EVDG-GIYCLAFAPSSSGLSILGN 447

Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
               G +I  D  N  + +  + C
Sbjct: 448 IQQEGIQITVDSANGYIGFGPATC 471


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 134/327 (40%), Gaps = 51/327 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS---CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           Y+   SSS+ +V C  P C++  S   C    + C Y  +Y    +S+  + V+ +    
Sbjct: 172 YNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTF-- 229

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 P       V IGCG    G +   AA  G++GLG G +S PS +  AG    SF
Sbjct: 230 ------PPGVRVPGVAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQI--AGRYGRSF 279

Query: 162 SICFDENDSG----SVFFGDQGPA------TQQSTSFLPIGEKYDAYFVGVESYCIGN-- 209
           S C     +G    ++ FG    A          T  L     Y  Y+VG+    +G   
Sbjct: 280 SYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVR 339

Query: 210 -SCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNS 257
              +T+S  +          +VDSG + T L    YA     F ++ + K +     G  
Sbjct: 340 VRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF-RVAAVKELGWPSPGGP 398

Query: 258 WKY---CYNA-SSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTD 311
           + +   CY++     M KVP + + F+      +  +N++     N+G   F     +  
Sbjct: 399 FAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAF---AGS 455

Query: 312 GDYG--IIGQNFMMGHRIVFDRENLKL 336
           GD G  IIG   + G R+V+D +  ++
Sbjct: 456 GDRGVSIIGNIQLQGFRVVYDVDGQRV 482


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/360 (22%), Positives = 147/360 (40%), Gaps = 40/360 (11%)

Query: 12  NAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSC 69
           +A N+++CL +       ++          L  +D S+SS+    SC   LC+    +SC
Sbjct: 144 DAGNSIICLAINKGDETTIIGNFQQQNMHALPYFDRSTSSTLLLTSCDSTLCQGLLVASC 203

Query: 70  KSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 125
            + K      C Y   Y+ +  ++       +L +  F+  A  S     V  GCG    
Sbjct: 204 GNTKFWPNQTCVYTYYYNDKSVTTG------LLEVDKFTFGAGASV--PGVAFGCGLFNN 255

Query: 126 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS---GSVFFG------ 176
           G +       G+ G G G +S+PS L K G    +FS CF   +     +V         
Sbjct: 256 GVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGLKQSTVLLDLLADLY 308

Query: 177 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS---------CLTQSGFQALVDSGASF 227
             G    QST  +        Y++ ++   +G++          LT      ++DSG S 
Sbjct: 309 KNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSI 368

Query: 228 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 287
           T LP ++Y  V  +F   +    +         C++A S+    VP + L F      + 
Sbjct: 369 TSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLP 428

Query: 288 R-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           R N++F  P++ G ++ CL +     +   IG        +++D +N  L++  ++C+++
Sbjct: 429 RENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 488



 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 54/124 (43%), Gaps = 6/124 (4%)

Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
           LT      ++DSG S T LP ++Y  V  +F   +    +         C++A S+    
Sbjct: 58  LTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPD 117

Query: 272 VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM---MGHRI 327
           VP + L F      + R N++F  P++ G ++ CL +    GD   I  NF    M    
Sbjct: 118 VPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI--NKGDETTIIGNFQQQNMHALP 175

Query: 328 VFDR 331
            FDR
Sbjct: 176 YFDR 179


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 84/328 (25%), Positives = 141/328 (42%), Gaps = 34/328 (10%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
           +++L  YD S SS+    SC    CK   S + C       C +   YS  D S++   +
Sbjct: 127 NQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAF--SYSYGDKSATIGFL 184

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
           D  +   SF   A   SV   V+ GCG   TG +       G+ G G G +S+PS L K 
Sbjct: 185 D--VETVSFVAGA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KV 235

Query: 155 GLIQNSFSICFDENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
           G   + F+       S  +F         G  T Q+T  +        Y++ ++   +G+
Sbjct: 236 GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGS 295

Query: 210 SCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
           + L   +S F         ++DSG +FT LP  +Y  V  +F   V    +         
Sbjct: 296 TRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL 355

Query: 261 CYNASS-EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
           C++A    +   VP + L F      + R N++F   ++ G    CL ++  +G+  IIG
Sbjct: 356 CFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIG 412

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                   +++D +N KL++  +KC+++
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKCDKL 440


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/317 (23%), Positives = 125/317 (39%), Gaps = 38/317 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  NVSC+ P C    +       C Y   Y  + + S G+   D L L+S+ 
Sbjct: 222 FDPARSSTYANVSCAAPACFDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 279

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 280 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 327

Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
           C     SG+ +  FG   PA   +    P+    G  +  Y+VG+    +G   L+  QS
Sbjct: 328 CLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQS 385

Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASS 266
            F     +VDSG   T LP   Y+ +   F   ++++       +SL       CY+ + 
Sbjct: 386 VFATAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSL----LDTCYDFTG 441

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
              + +P + L+F       V      +  +              GD GI+G   +    
Sbjct: 442 MSQVAIPTVSLLFQGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFG 501

Query: 327 IVFDRENLKLAWSHSKC 343
           + +D     + +S   C
Sbjct: 502 VAYDIGKKVVGFSPGAC 518


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 86/344 (25%), Positives = 144/344 (41%), Gaps = 68/344 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + +DPS SSS  ++ CSHPLCK R       +SC S +  C Y   Y+ + T + G LV 
Sbjct: 122 TSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVK 179

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           +    ++     P       +I+GC ++ T          G++G+ LG +   S +++A 
Sbjct: 180 EKFTFSNSQTTPP-------LILGCAKESTDE-------KGILGMNLGRL---SFISQAK 222

Query: 156 LIQNSFSICFDEN-----DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
           + + S+ I    N      +GS + GD               P +Q+  +  P+     A
Sbjct: 223 ISKFSYCIPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPL-----A 277

Query: 198 YFVGVESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
           Y V ++   IG   L   G           Q +VDSG+ FT L    Y +V  +  +LV 
Sbjct: 278 YTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVG 337

Query: 248 S--KRISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 303
           S  K+  + G++   C+  N S E    + D+   F +    +V     S   N G  + 
Sbjct: 338 SRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQ--SLLVNVGGGIH 395

Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           C+ +  +        IIG        + FD  N ++ +S ++C 
Sbjct: 396 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAECR 439


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/316 (23%), Positives = 125/316 (39%), Gaps = 37/316 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++   S++ K V C  P CK   + K     C +   Y +   +++  L  D++ LA+ S
Sbjct: 135 FNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLATDS 192

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                     S   GC  + TGS +    P G++GLG G +S+  L     L Q++FS C
Sbjct: 193 I--------PSYTFGCLTEATGSSIP---PQGLLGLGRGPMSL--LSQTQNLYQSTFSYC 239

Query: 165 FDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
                  N SGS+  G  G P   ++T  L    +   Y+V + +  +G   +       
Sbjct: 240 LPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSAL 299

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                +G   + DSG  FT L    Y  V   F K V +  ++  G  +  CY +     
Sbjct: 300 AFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGG-FDTCYTSP---- 354

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRI 327
           +  P +  +FS     +  +++         T   +     + +    +I       HRI
Sbjct: 355 IVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 414

Query: 328 VFDRENLKLAWSHSKC 343
           +FD  N +L  +   C
Sbjct: 415 LFDVPNSRLGVAREPC 430


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 85/316 (26%), Positives = 134/316 (42%), Gaps = 36/316 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP+SSS+ K+++CS P C S   S+C+S K  C Y   Y     +   Y  D +     
Sbjct: 206 FDPTSSSTFKSLTCSDPKCASLDVSACRSNK--CLYQVSYGDGSFTVGNYATDTVTF--- 260

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                 +S   + V +GCG    G +   A   G+ G  L   +   + AK      SFS
Sbjct: 261 -----GESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTN--QIKAK------SFS 307

Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSG 216
            C  + DS    S+ F         +T+ L    K D  Y+VG+  + +G     +  S 
Sbjct: 308 YCLVDRDSAKSSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSL 367

Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSE 267
           F+         ++D G + T L T+ Y  +   F KL +  K+ +   + +  CY+ SS 
Sbjct: 368 FEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSL 427

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             +KVP +   F+  +S  +    +  P ++  T FC     T     IIG     G RI
Sbjct: 428 STVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGT-FCFAFAPTSSSLSIIGNVQQQGTRI 486

Query: 328 VFDRENLKLAWSHSKC 343
            +D  N  +  S +KC
Sbjct: 487 TYDLANNLIGLSANKC 502


>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 134/322 (41%), Gaps = 32/322 (9%)

Query: 47  PSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LH 99
           P    +   ++C+ P+C      S+  CK+  + C Y   Y+ +  SS G LV DI  L 
Sbjct: 76  PPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYA-DHGSSLGVLVHDIFSLQ 134

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGL 156
           L + +  AP+      +  GCG  Q  SY    AP   DGV+GLG G  S+ + L   GL
Sbjct: 135 LTNGTLAAPR------LAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGL 186

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQ 214
           I++    C      G +F GD   +T     + P+  K    AY +G             
Sbjct: 187 IRSIVGHCLSGRGGGFLFLGDGL-STTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGV 245

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKV 272
            G + + DSG+S+T+   + Y   +    K ++ K       S   C+  +   + + +V
Sbjct: 246 KGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEV 305

Query: 273 PD----MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMG 324
            +      L F+K +S  ++    S+         CL +++      GD  +IG      
Sbjct: 306 KNYFKPFALSFTKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQD 365

Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
             +++D E  ++ W    C ++
Sbjct: 366 KMVIYDNERQQIGWVPKDCNKL 387


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 147/371 (39%), Gaps = 54/371 (14%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
            + P+ SSS K + C    C +     S K    Y   Y+ E ++SSG L  D++  ++ 
Sbjct: 76  RFSPALSSSYKPLECGSE-CSTGFCDGSRK----YQRQYA-EKSTSSGVLGKDVIGFSNS 129

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S    Q      ++ GC   +TG   D  A DG++GLG G +S+   L +   +++ FS+
Sbjct: 130 SDLGGQR-----LVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAMEDVFSL 183

Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
           C+   DE     +  G Q P     T+  P    Y  Y + ++   +G S L        
Sbjct: 184 CYGGMDEGGGAMILGGFQPPKDMVFTASDPHRSPY--YNLMLKGIRVGGSPLRLKPEVFD 241

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLK 271
             +  ++DSG ++ + P   +        + V S +  + G   K+   CY  +   +  
Sbjct: 242 GKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLK-EVPGPDEKFKDICYAGAGTNVSN 300

Query: 272 V----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
           +    P +  +F   QS  +    + F   +    +CL V        ++G   +    +
Sbjct: 301 LSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLV 360

Query: 328 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQ--QSTSNGQAAAPPST 385
            ++R    + +  +KC ++  +                LP T +   ST   Q   PP  
Sbjct: 361 TYNRGKASIGFLKTKCNDLWSR----------------LPETNEPGHSTQPAQFLLPP-- 402

Query: 386 AKTAPSKSIAA 396
              APS S+ A
Sbjct: 403 ---APSPSVGA 410


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 86/329 (26%), Positives = 137/329 (41%), Gaps = 47/329 (14%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +N + +DP +SSS + +SCS P CK     +C S  + C Y   Y  + + + G L  D 
Sbjct: 51  QNDAVFDPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASD- 108

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
               SFS    ++   S V+ GCG    G ++  A   G+    L   S PS L+     
Sbjct: 109 ----SFSVSRGRT---SPVVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLSS---- 154

Query: 158 QNSFSICFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN 209
              FS C    D+G     ++ FGD    T  S ++  +    K D  Y+ G+    IG 
Sbjct: 155 -RKFSYCLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGG 213

Query: 210 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQ 254
           + L+   + F+          ++DSG S T LPT  Y  +   F     KL  +   SL 
Sbjct: 214 TLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL- 272

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
              +  CY+ S+   + +P +   F    S  +    +  P +   T FC     T  D 
Sbjct: 273 ---FDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGT-FCFAFSKTSLDL 328

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            IIG       R+  D ++ ++ ++  +C
Sbjct: 329 SIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 86/338 (25%), Positives = 139/338 (41%), Gaps = 54/338 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DP++SSS +NV+C    C         R+  +  +D CPY   Y  +  ++        
Sbjct: 191 FDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGD------ 244

Query: 98  LHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
           L L SF+ +  AP +S +   V+ GCG +  G +   A   G+    L   S   L A  
Sbjct: 245 LALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVY 302

Query: 155 GLIQNSFSICFDEN--DSGS-VFFGDQ----GPATQQSTSFLPIGEKYDA-YFVGVESYC 206
           G   ++FS C  E+  D+GS V FG+          + T+F P     D  Y+V ++   
Sbjct: 303 G---HTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVL 359

Query: 207 IGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG- 255
           +G   L  S              ++DSG + ++     Y  +   F  L+S     +   
Sbjct: 360 VGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDF 419

Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMS 309
                CYN S  E  +VP++ L+F+          ++ FP    F       + CL V  
Sbjct: 420 PVLNPCYNVSGVERPEVPELSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVRG 472

Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           T      IIG        +V+D +N +L ++  +C EV
Sbjct: 473 TPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 510


>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 453

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 136/320 (42%), Gaps = 34/320 (10%)

Query: 56  VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAP 108
           ++C+ P+C      S+  CK+  + C Y   Y+ +  SS G LV DI  L L + +  AP
Sbjct: 118 ITCNDPMCSALHWPSKPPCKASHEQCDYEVSYA-DHGSSLGVLVHDIFSLQLTNGTLAAP 176

Query: 109 QSSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
           +      +  GCG  Q  SY    AP   DGV+GLG G  S+ + L   GLI++    C 
Sbjct: 177 R------LAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCL 228

Query: 166 DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQALVDS 223
                G +F GD   +T     + P+  K    AY +G              G + + DS
Sbjct: 229 SGRGGGFLFLGDGL-STTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDS 287

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPD----MRL 277
           G+S+T+   + Y   +    K ++ K       S   C+  +   + + +V +      L
Sbjct: 288 GSSYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFAL 347

Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDREN 333
            F+K +S  ++    S+         CL +++      GD  +IG        +++D E 
Sbjct: 348 SFTKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKMVIYDNER 407

Query: 334 LKLAWSHSKCEEV--IDKSH 351
            ++ W    C ++  +D+ +
Sbjct: 408 QQIGWVPKDCNKLPKVDRDY 427


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 97/398 (24%), Positives = 155/398 (38%), Gaps = 68/398 (17%)

Query: 7   FGSHANAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR 66
            GSH +    L     + L+W        I+ +   +   P + + S  VSC  P C + 
Sbjct: 25  LGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSHRVSCQSPACSTA 84

Query: 67  SSCKSLKDPCPY----IADYSTEDTSSSG-----YLVDDILHLASFSKHAPQSSVQSSVI 117
            S  S  D C      + +  T D SS+      Y   D     SF  H  + ++  S +
Sbjct: 85  HSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGD----GSFIAHLHRDTLSMSQL 140

Query: 118 ------IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-----F 165
                  GC           A P GV G G G +S+P+ LA  +  + N FS C     F
Sbjct: 141 FLKNFTFGCAHTAL------AEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVSHSF 194

Query: 166 DE---NDSGSVFFGDQGPATQQSTSFL---PIGEKYDAYF--VGVESYCIGNSCL----- 212
           D+        +  G     + +   F+    +     +YF  VG+    +G   +     
Sbjct: 195 DKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTILAPEM 254

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRIS--LQGNSWKYCYN 263
                 +     +VDSG +FT LP  +Y  VV +FD+ V    KR S   +      CY 
Sbjct: 255 LRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLGPCYF 314

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPENEG---FTVFCLTVMS-------T 310
              E +++VP +   F  N S V+    N+ + F + E      V CL +M+       +
Sbjct: 315 L--EGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGDDTELS 372

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
            G   I+G     G  +V+D EN ++ ++  +C  + D
Sbjct: 373 GGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCASLWD 410


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 77/307 (25%), Positives = 122/307 (39%), Gaps = 23/307 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS SS+ +NVSC+ P C   S+       C Y   Y  + +S+ G+L  D   L    
Sbjct: 59  FDPSLSSTYRNVSCTEPACVGLSTRGCSSSTCLYGVFYG-DGSSTIGFLAMDTFMLTPAQ 117

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAK-AGLIQNSFS 162
           K         + I GCG+  TG +       G  GL GLG  S  SL ++ A  + N FS
Sbjct: 118 KF-------KNFIFGCGQNNTGLF------QGTAGLVGLGRSSTYSLNSQVAPSLGNVFS 164

Query: 163 ICFDENDSGSVFFGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA 219
            C     S + +     P  T   T+ L        YF+ +    +G + L+ S   FQ+
Sbjct: 165 YCLPSTSSATGYLNIGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQS 224

Query: 220 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
              ++DSG   T LP   Y+ +       ++   ++        CY+ S    +  P + 
Sbjct: 225 VGTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIV 284

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
           L F+     +    +F F  N           +     GIIG    +   + +D E  ++
Sbjct: 285 LHFAGLDVRIPATGVF-FVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRI 343

Query: 337 AWSHSKC 343
            +S   C
Sbjct: 344 GFSAGAC 350


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 76/330 (23%), Positives = 135/330 (40%), Gaps = 42/330 (12%)

Query: 46  DPSSSSSSKNVSCSHPLCKSR--SSC--KSLKD-PCPYIADYSTEDTSSSGYLVDDILHL 100
           DP++SS+   + C  PLC++   +SC  +S  D  C Y+  Y  + + + G L  D    
Sbjct: 134 DPAASSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYG-DRSLTVGQLATDSFTF 192

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
                    ++ +  V  GCG    G +   A   G+ G G G  S+PS L        S
Sbjct: 193 GGDDNAGGLAARR--VTFGCGHINKGIFQ--ANETGIAGFGRGRWSLPSQLNV-----TS 243

Query: 161 FSICF----DENDSGSVFFGDQGP-----------ATQQSTSFLPIGEKYDAYFVGVESY 205
           FS CF    D   S  V  G                  ++T  +    +   YFV +   
Sbjct: 244 FSYCFTSMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGI 303

Query: 206 CIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
            +G +   + +S  ++  ++DSGAS T LP ++Y  V  +F   V     +    +   C
Sbjct: 304 SVGGARVAVPESRLRSSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLC 363

Query: 262 YNASSEEMLKVP-----DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
           +      + + P      + L    +      N++F   E+    V C+ + +  G+  +
Sbjct: 364 FALPVAALWRRPAVPALTLHLDGGADWELPRGNYVF---EDYAARVLCVVLDAAAGEQVV 420

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           IG        +V+D EN  L+++ ++C+++
Sbjct: 421 IGNYQQQNTHVVYDLENDVLSFAPARCDKL 450


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 56/209 (26%), Positives = 99/209 (47%), Gaps = 21/209 (10%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +S +DP  S++  ++SC+   C     +  C   +  CPY   Y  + +S++GY ++D+ 
Sbjct: 85  MSTFDPRKSTTKISISCTDAECGVLNKKLQCSPERLSCPYSLLYG-DGSSTAGYYLNDVF 143

Query: 99  HLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
                 S ++   S  + ++ GCG  QTGS+    + DG++G G   VS+P+ LA+  + 
Sbjct: 144 TFNQVPSDNSTAKSGTARLVFGCGGTQTGSW----SVDGLLGFGPTTVSLPNQLAQQNIS 199

Query: 158 QNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQ 214
            N F+ C   D +  GS+  G      +    + P+    D Y V + +  I G +  T 
Sbjct: 200 VNIFAHCLQGDVSGRGSLVIGT---IREPDLVYTPMVFGEDHYNVQLLNIGISGRNVTTP 256

Query: 215 SGFQ------ALVDSGASFTFLPTEIYAE 237
           + F        ++DSG + T+L    Y E
Sbjct: 257 ASFDLEYTGGVIIDSGTTLTYLVQPAYDE 285


>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
           partial [Brachypodium distachyon]
          Length = 354

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 69/300 (23%), Positives = 126/300 (42%), Gaps = 34/300 (11%)

Query: 64  KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 123
           + +  CK   + C Y   Y+  + SS G L+ D   L       P    + ++  GCG  
Sbjct: 66  RFKHDCKENPNQCDYDVRYAGGE-SSLGVLIADKFSL-------PGRDARPTLTFGCGYD 117

Query: 124 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPAT 182
           Q G   +    DGV+G+G G   + S L + G I +N    C      G +FFG +    
Sbjct: 118 QEGGKAEMPV-DGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHE-KVP 175

Query: 183 QQSTSFLPIGEKYDAYFVGVESY----CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEV 238
               +++P+      Y  G+ +      +GN  ++ +  + ++DSG+++T++PTE Y  +
Sbjct: 176 SSVVTWVPMVPNNHYYSPGLAALHFNGNLGNP-ISVAPMEVVIDSGSTYTYMPTETYRRL 234

Query: 239 V-VKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF--- 294
           V V    L  S    ++  +   C+ A  E    + D++  F   +   ++    +    
Sbjct: 235 VFVVIASLSKSSLTLVRDPALPVCW-AGKEPFKXIGDVKDKFKPLELAFIQGTSQAIMEI 293

Query: 295 -PEN----EGFTVFCLTVMSTDG------DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            PEN     G    C+ ++  DG         +IG   M    +++D E  ++ W  + C
Sbjct: 294 PPENYLIISGEGNVCMGIL--DGTQAGLRKLNVIGDISMQNQLVIYDNERARIGWVRAPC 351


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 81/315 (25%), Positives = 128/315 (40%), Gaps = 45/315 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP +SS+ ++ SC    C    K RS  K  K  C +   Y+ + + + G L  + L +
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKK--CTFRYSYA-DGSFTGGNLASETLTV 190

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S    A +         GCG    G +    +  G++GLG G++S+ S L     I   
Sbjct: 191 DS---TAGKPVSFPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQLKST--INGL 243

Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
           FS C      D + S  + FG  G  +   T   P+   Y  Y    E          + 
Sbjct: 244 FSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGYSKKTE---------VEE 294

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
           G   +VDSG ++TFLP E Y+++       +  KR+      +  CYN ++E  +  P +
Sbjct: 295 G-NIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE--INAPII 351

Query: 276 RLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFD 330
              F   N      N      E+    + C TV  T  D G++G     NF++G    FD
Sbjct: 352 TAHFKDANVELQPLNTFMRMQED----LVCFTVAPTS-DIGVLGNLAQVNFLVG----FD 402

Query: 331 RENLKLAWSHSKCEE 345
               +     ++ EE
Sbjct: 403 LRKKRGFSKKAEVEE 417



 Score = 41.6 bits (96), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 15/129 (11%)

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           +VDSG ++T+LP E Y ++       +  KR+         CYN + ++ +  P +   F
Sbjct: 421 IVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQ-IDAPIITAHF 479

Query: 280 SK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFDRENL 334
              N      N      E+    + C TV+ T  D GI+G     NF++G    FD    
Sbjct: 480 KDANVELQPWNTFLRMQED----LVCFTVLPTS-DIGILGNLAQVNFLVG----FDLRKK 530

Query: 335 KLAWSHSKC 343
           ++++  + C
Sbjct: 531 RVSFKAADC 539


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 138/317 (43%), Gaps = 41/317 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P+SS+S   +SC+   C+S    +   D C Y   Y  + + + G  V + + L S  
Sbjct: 191 FEPASSASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYG-DGSYTVGDFVTETITLGS-- 247

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
             AP  +V     IGCG    G ++  A    ++GLG G +S PS +        SFS C
Sbjct: 248 --APVDNVA----IGCGHNNEGLFVGAAG---LLGLGGGSLSFPSQINAT-----SFSYC 293

Query: 165 FDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ- 218
             + DS S   + F    P    S   L        Y+VG+    +G   ++  +S FQ 
Sbjct: 294 LVDRDSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQI 353

Query: 219 -------ALVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
                   +VDSG + T L T++Y  +   F K    L S+  I+L    +  CY+ SS+
Sbjct: 354 DESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL----FDTCYDLSSK 409

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
             ++VP +   F   +   +    +  P ++EG   FC     T     IIG     G R
Sbjct: 410 GNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEG--TFCFAFAPTASSLSIIGNVQQQGTR 467

Query: 327 IVFDRENLKLAWSHSKC 343
           +V+D  N  + +  +KC
Sbjct: 468 VVYDLVNHLVGFVPNKC 484


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 81/320 (25%), Positives = 134/320 (41%), Gaps = 32/320 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP  SS+  N+SC  PLC K      S +  C Y   Y+ + + + G L  + + L S 
Sbjct: 106 FDPLKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYA-DSSLTKGVLAQETVTLTSN 164

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSF 161
           +      S+Q  ++ GCG   TG++ D     G++GLG G     SL+++ G +     F
Sbjct: 165 TGKP--ISLQ-GILFGCGHNNTGNFNDHEM--GLIGLGGGPT---SLVSQIGPLFGGKKF 216

Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGV---ESYCI 207
           S C      D   S  + FG       +     P+ ++      Y    +G+   ++Y  
Sbjct: 217 SQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLP 276

Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
            NS + +     LVDSG     LP ++Y  V V+    V  + I+   +   + CY   +
Sbjct: 277 MNSTIEKGNM--LVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQT 334

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGH 325
              LK P +   F      +     F  P  E   VFCL + +  + D GI G      +
Sbjct: 335 N--LKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNY 392

Query: 326 RIVFDRENLKLAWSHSKCEE 345
            I FD +   +++  + C +
Sbjct: 393 LIGFDLDRQIVSFKPTDCTK 412


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 129/325 (39%), Gaps = 36/325 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPC----PYIADYS---TEDTSSSGY 92
           +DPSSS S   V C  P C +      +   +   PC    P    Y+    + + S G 
Sbjct: 183 FDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGV 242

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           L  D L LA          V    + GCG    G    G +  G+MGLG   +S+ S   
Sbjct: 243 LAHDRLSLA--------GEVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTV 292

Query: 153 K--AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--------YFVGV 202
               G+      +  + + SGS+  GD   A + ST  +      ++        Y V +
Sbjct: 293 DQFGGVFSYCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNL 352

Query: 203 ESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
               +G   +  +GF  +A+VDSG   T L   +Y  V  +F   ++    +   +    
Sbjct: 353 TGITVGGQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDT 412

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIG 318
           C+N +  + ++VP + L+F       V +    +  +   +  CL V S  ++ +  IIG
Sbjct: 413 CFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIG 472

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                  R+VFD    ++ ++   C
Sbjct: 473 NYQQKNLRVVFDTSASQVGFAQETC 497


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 143/363 (39%), Gaps = 49/363 (13%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           + ++P  S++  +V C+   C+  +  +C +    C Y   Y     +++G L  +    
Sbjct: 132 APFNPVRSTTVADVPCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTF 191

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
                     +    V+ GCG K  G +   +   GV+GLG G++S+ S L       + 
Sbjct: 192 GD--------TRIDGVVFGCGLKNVGDF---SGVSGVIGLGRGNLSLVSQLQV-----DR 235

Query: 161 FSICFDENDS----GSVFFGDQG-PATQQ--STSFLPIGEKYDAYFVGVESYCI-GNSCL 212
           FS  F  +DS      + FGD   P T    ST  L        Y+V +    + G    
Sbjct: 236 FSYHFAPDDSVDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLA 295

Query: 213 TQSG-FQALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRISL---QGNSW--KYCY 262
             SG F      G+   FL       +  E   K  +   + +I L    G++     CY
Sbjct: 296 IPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCY 355

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM-STDGDYGIIGQN 320
              S    KVP M L+F+      +   + F      G    CLT++ S+ GD  ++G  
Sbjct: 356 TGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLA--CLTILPSSAGDGSVLGSL 413

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAA 380
             +G  +++D    KL +     E +   +     PPP+G S      T QQ+     A+
Sbjct: 414 IQVGTHMMYDINGSKLVF-----ESLAQAA----APPPSGSSQQTSSKTNQQAGGRRSAS 464

Query: 381 APP 383
           APP
Sbjct: 465 APP 467


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 87/388 (22%), Positives = 152/388 (39%), Gaps = 71/388 (18%)

Query: 19  CLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 78
           C P T    C+L  G       N S       + S+ + C+ P C +  S     D C  
Sbjct: 113 CAPFT----CMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSAPPADLCAA 168

Query: 79  ----IADYSTEDTSSSG------YLVDDILHLASFSKHAPQSSVQSSVII-----GCGRK 123
               + D  T   ++S       Y   D   +A   +   +  + +SV +      C   
Sbjct: 169 ARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRG--RVGIAASVAVENFTFACAHT 226

Query: 124 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFF 175
             G       P GV G G G +S+P+ LA A L    FS C   +            +  
Sbjct: 227 ALGE------PVGVAGFGRGPLSLPAQLAPAAL-SGRFSYCLVAHSFRADRPIRPSPLIL 279

Query: 176 GD---QGPATQQSTSFLPI--GEKYDAYF-VGVESYCIGNSCLT---------QSGFQAL 220
           G    + PA++    + P+    K+  ++ V +E+  +G + +          ++G   +
Sbjct: 280 GRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGRAGDGGM 339

Query: 221 V-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---------WKYCYNASSEE-- 268
           V DSG +FT LP E YA V  +F + +++ R      +         + Y ++AS+ E  
Sbjct: 340 VVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDASAAEEG 399

Query: 269 -MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQN 320
               VP + + F    + V+  RN+   F   E   V CL +M+       G  G +G  
Sbjct: 400 SARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPAGTLGNF 459

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID 348
              G  +V+D +  ++ ++  +C ++ D
Sbjct: 460 QQQGFEVVYDVDAGRVGFARRRCTDLWD 487


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 77/332 (23%), Positives = 133/332 (40%), Gaps = 48/332 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DP  SS+ + V CS P C++       S  +    C Y+  Y  + +SS+G L  D L 
Sbjct: 128 FDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGELATDKLA 186

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
            A+        +  ++V +GCGR   G + D AA  G++G+  G +S+ + +A A    +
Sbjct: 187 FAN-------DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVARGKISISTQVAPA--YGS 234

Query: 160 SFSICFDENDSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
            F  C  +  S S      VF     P +   T+ L    +   Y+V +  + +G   +T
Sbjct: 235 VFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT 294

Query: 214 QSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGN 256
             GF                +VDSG + +    + YA +   FD    +  +     + +
Sbjct: 295 --GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHS 352

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD 311
            +  CY+         P + L F+      +    +  P + G         CL   + D
Sbjct: 353 VFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAAD 412

Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
               +IG     G R+VFD E  ++ ++   C
Sbjct: 413 DGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
          Length = 245

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 96/237 (40%), Gaps = 21/237 (8%)

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 194
           DG++GLG G  S+ S L   GL++N    C      G +FFGD   +++   ++ P+  +
Sbjct: 13  DGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGDVYDSSR--LTWTPMSSR 70

Query: 195 -YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
               Y  G      G       G   + D+G+S+T+  +  Y  V+    K ++ K +  
Sbjct: 71  DLKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAGKPLKE 130

Query: 254 QGNS------W--KYCYNASSEEMLKVPDMRLIFS----KNQSFVVRNHIFSFPENEGFT 301
             +       W  K  + +  E       M L F+     N  F +    +    N G  
Sbjct: 131 APDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVSNMGNV 190

Query: 302 VFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHL 354
             CL ++       GD  +IG   M+   +VFD E   + W+ + C  V +  HV +
Sbjct: 191 --CLGILDGSEVGMGDLNLIGDISMLDKVMVFDNEKRLIGWAPADCNRVPNSRHVSI 245


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 81/338 (23%), Positives = 131/338 (38%), Gaps = 60/338 (17%)

Query: 58  CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 102
           C  PLC    S  +  DPC  +A  S                  T  +G +V   L   +
Sbjct: 92  CVSPLCSDVHSSDNSYDPCA-VAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 150

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            + H    S    V   C      +Y +   P G+ G G G +S+PS L   G +Q  FS
Sbjct: 151 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 204

Query: 163 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
            CF       + N S  +  GD   ++    Q TS L      + Y++G+E+  +GN+  
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 264

Query: 213 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 259
            Q                ++DSG ++T LP   Y +++     +++  R   Q     + 
Sbjct: 265 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 324

Query: 260 YCY------NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF--PENEGFTVFCLTVMS 309
            CY      N  ++    +P +   FS N S V+   NH ++   P N    V CL + +
Sbjct: 325 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST-VVKCLLLQN 383

Query: 310 TD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            D    G  G+ G       ++V+D E  ++ +    C
Sbjct: 384 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 88/326 (26%), Positives = 139/326 (42%), Gaps = 42/326 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           YDPS+SS+   + CS   C    SR+   S    C Y   Y  +   S+G L  + L L 
Sbjct: 113 YDPSASSTFSPLPCSSATCLPIWSRNCTPS--SLCRYRYAYG-DGAYSAGILGTETLTLG 169

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
             S  AP S     V  GCG    G  L+     G +GLG G +S   LLA+ G+    F
Sbjct: 170 PSS--APVSV--GGVAFGCGTDNGGDSLNST---GTVGLGRGTLS---LLAQLGV--GKF 217

Query: 162 SIC----FDENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           S C    F+         G       GP+T QST  L   +    YFV ++   +G+  L
Sbjct: 218 SYCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRL 277

Query: 213 T----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
                            +VDSG +FT L    + EVV +  +++    ++        C+
Sbjct: 278 PIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP-CF 336

Query: 263 NASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
            A + E   +PD+ L F+      + R++  S+  NE  + FCL +  T  +   +  NF
Sbjct: 337 PAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSY--NEEDSSFCLNIAGTTPESTSVLGNF 394

Query: 322 MMGH-RIVFDRENLKLAWSHSKCEEV 346
              + +++FD    +L++  + C ++
Sbjct: 395 QQQNIQMLFDTTVGQLSFLPTDCSKL 420


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 85/316 (26%), Positives = 142/316 (44%), Gaps = 36/316 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P+SSS+ K+++CS P C     S+C+S K  C Y   Y  + + + G L  D +   +
Sbjct: 204 FNPTSSSTYKSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGN 260

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
             K        + V +GCG    G +       G++GLG G +S+ + +        SFS
Sbjct: 261 SGKI-------NDVALGCGHDNEGLF---TGAAGLLGLGGGALSITNQMKAT-----SFS 305

Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQ---- 214
            C  + DSG   S+ F      +  +T+ L   +K D  Y+VG+  + +G   +      
Sbjct: 306 YCLVDRDSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAI 365

Query: 215 -----SGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSE 267
                SG   ++ D G + T L T+ Y  +   F KL ++ K+ +   + +  CY+ SS 
Sbjct: 366 FDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSL 425

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             +KVP +   F+  +S  +    +  P ++  T FC     T     IIG     G RI
Sbjct: 426 SSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGT-FCFAFAPTSSSLSIIGNVQQQGTRI 484

Query: 328 VFDRENLKLAWSHSKC 343
            +D  N  +  S +KC
Sbjct: 485 TYDLANKIIGLSGNKC 500


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 136/313 (43%), Gaps = 33/313 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P+SS+S  ++SC    CKS    +     C Y   Y  + + + G  V + + L S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTS 251

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                     ++ IGCG    G ++  A    ++GLG G +S PS L  +     SFS C
Sbjct: 252 L--------GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYC 295

Query: 165 FDENDSGSVFFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCLT--QSGFQA 219
             + DS S    D   P T  + T+ L      D +F +G+    +G + L   ++ FQ 
Sbjct: 296 LVDRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQM 355

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
                   +VDSG + T L T +Y  +   F K     + +     +  CY+ SS+  ++
Sbjct: 356 SEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVE 415

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           VP +   F+      +    +  P ++EG   FC     TD    I+G     G R+ FD
Sbjct: 416 VPTVSFHFANGNELPLPAKNYLIPVDSEG--TFCFAFAPTDSTLSILGNAQQQGTRVGFD 473

Query: 331 RENLKLAWSHSKC 343
             N  + +S +KC
Sbjct: 474 LANSLVGFSPNKC 486


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 136/313 (43%), Gaps = 33/313 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P+SS+S  ++SC    CKS    +     C Y   Y  + + + G  V + + L S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTS 251

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                     ++ IGCG    G ++  A    ++GLG G +S PS L  +     SFS C
Sbjct: 252 L--------GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYC 295

Query: 165 FDENDSGSVFFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCLT--QSGFQA 219
             + DS S    D   P T  + T+ L      D +F +G+    +G + L   ++ FQ 
Sbjct: 296 LVDRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQM 355

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
                   +VDSG + T L T +Y  +   F K     + +     +  CY+ SS+  ++
Sbjct: 356 SEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVE 415

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           VP +   F+      +    +  P ++EG   FC     TD    I+G     G R+ FD
Sbjct: 416 VPTVSFHFANGNELPLPAKNYLIPVDSEG--TFCFAFAPTDSTLSILGNAQQQGTRVGFD 473

Query: 331 RENLKLAWSHSKC 343
             N  + +S +KC
Sbjct: 474 LANSLVGFSPNKC 486


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 81/338 (23%), Positives = 131/338 (38%), Gaps = 60/338 (17%)

Query: 58  CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 102
           C  PLC    S  +  DPC  +A  S                  T  +G +V   L   +
Sbjct: 75  CVSPLCSDVHSSDNSYDPCA-VAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 133

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            + H    S    V   C      +Y +   P G+ G G G +S+PS L   G +Q  FS
Sbjct: 134 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 187

Query: 163 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
            CF       + N S  +  GD   ++    Q TS L      + Y++G+E+  +GN+  
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 247

Query: 213 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 259
            Q                ++DSG ++T LP   Y +++     +++  R   Q     + 
Sbjct: 248 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 307

Query: 260 YCY------NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF--PENEGFTVFCLTVMS 309
            CY      N  ++    +P +   FS N S V+   NH ++   P N    V CL + +
Sbjct: 308 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST-VVKCLLLQN 366

Query: 310 TD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            D    G  G+ G       ++V+D E  ++ +    C
Sbjct: 367 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 141/311 (45%), Gaps = 48/311 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SS+ + VSCS   C++   +SC + ++ C Y   Y  +++ + G +  D + + S
Sbjct: 128 FDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYG-DNSYTKGDVAVDTVTMGS 186

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            S   P S    ++IIGCG + TG++    A  G++GLG G  S+ S L K+  I   FS
Sbjct: 187 -SGRRPVS--LRNMIIGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFS 239

Query: 163 ICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCL--T 213
            C      +   +  + FG  G  +        + +K  A  YF+ +E+  +G+  +  T
Sbjct: 240 YCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFT 299

Query: 214 QSGF-----QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
            + F       ++DSG + T LP+  Y E+       + ++R+         CY  SS  
Sbjct: 300 STIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSS- 358

Query: 269 MLKVPDMRLIFSKN-------QSFVVRNH---IFSFPENEGFTVFCLTVMSTDGDYGIIG 318
             KVPD+ + F           +FV  +     F+F  NE  T+F           G + 
Sbjct: 359 -FKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANEQLTIF-----------GNLA 406

Query: 319 Q-NFMMGHRIV 328
           Q NF++G+  V
Sbjct: 407 QMNFLVGYDTV 417


>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 407

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 86/343 (25%), Positives = 134/343 (39%), Gaps = 51/343 (14%)

Query: 33  GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDT 87
           G ++ +DR   +Y P  +     V C  PLC +  S     C +  + C Y  +Y+ +  
Sbjct: 82  GCTLPRDR---QYKPHGNL----VKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYA-DQG 133

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVS 146
           SS G LV DI+ L    K    +   S +  GCG  QT   +    +  GV+GLG G  S
Sbjct: 134 SSLGVLVRDIIPL----KLTNGTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRAS 189

Query: 147 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDA 197
           + S L   GLI+N    C      G +FFGDQ          P  Q S+S L        
Sbjct: 190 ILSQLNSKGLIRNVVGHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLL------KH 243

Query: 198 YFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 257
           Y  G           +  G +   DSG+S+T+  +  +  +V      +  K +S     
Sbjct: 244 YKTGPADMFFNGKATSVKGLELTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATED 303

Query: 258 ------WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLT 306
                 WK      S   +      L+ S  +S   +N +F  P      V      CL 
Sbjct: 304 PSLPICWKGPKPFKSLHDVTSNFKPLVLSFTKS---KNSLFQVPPEAYLIVTKHGNVCLG 360

Query: 307 VMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
           ++       G+  IIG   +    +++D E  ++ W+ + C+ 
Sbjct: 361 ILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQRIGWASANCDR 403


>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 242

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 111/248 (44%), Gaps = 35/248 (14%)

Query: 87  TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
           +SSSG L +DI+     S+   Q +V      GC   +TG      A DG+MGLG G +S
Sbjct: 2   SSSSGVLGEDIVSFGRESELKAQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLS 55

Query: 147 VPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
           +   L + G+I +SFS+C+   D G    V  G   P+    +   P+   Y  Y + ++
Sbjct: 56  IMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY--YNIELK 113

Query: 204 SYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--- 254
              +    L        S    ++DSG ++ +LP + +    + F   V+SK  SL+   
Sbjct: 114 EIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAF----MAFKDAVTSKVHSLKKIR 169

Query: 255 --GNSWK-YCYNASSEEMLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCL 305
               S+K  C+  +   + K+    PD+ ++F   Q  S    N++F   + +G   +CL
Sbjct: 170 GPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCL 227

Query: 306 TVMSTDGD 313
            V     D
Sbjct: 228 GVFQNGKD 235


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 74/315 (23%), Positives = 124/315 (39%), Gaps = 34/315 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +DP++SS++  V C  P C+S        S +S    C Y+ +YS +D +++G  + D L
Sbjct: 179 FDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSANAECRYLIEYS-DDRATAGTYMTDTL 237

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            ++        ++   +   GC     G + D  A  G M LG G  S+ +  A++  + 
Sbjct: 238 TISG-------TTAVRNFRFGCSHAVRGRFSDLTA--GTMSLGGGAQSLLAQTARS--LG 286

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------YFVGVESYCIGNSCL 212
           N+FS C  +  S S F    GPAT  ST+         +      Y V ++   +    L
Sbjct: 287 NAFSYCVPQA-SASGFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRL 345

Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                     A++DS A  T LP   Y  +   F   + +   S    +   CY+     
Sbjct: 346 GIPPVAFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLGLT 405

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            ++VP + L+F      V+       P          T  S+D   G IG      H ++
Sbjct: 406 NVRVPAVSLVFGGGAVVVLDP-----PAVMIGGCLAFTATSSDLALGFIGNVQQQTHEVL 460

Query: 329 FDRENLKLAWSHSKC 343
           +D     + +    C
Sbjct: 461 YDVAAGGVGFRRGAC 475


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 82/323 (25%), Positives = 126/323 (39%), Gaps = 33/323 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLC-KSRSSCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           + PSSSS+   V C  P C ++R SC S    D CPY   Y  + + + G+L +D L L 
Sbjct: 129 FAPSSSSTFSAVRCGEPECPRARQSCSSSPGDDRCPYEVVYG-DKSRTVGHLGNDTLTLG 187

Query: 102 ---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
              S +     S+     + GCG   TG  L G A DG+ GLG G VS+ S    AG   
Sbjct: 188 TTPSTNASENNSNKLPGFVFGCGENNTG--LFGKA-DGLFGLGRGKVSLSS--QAAGKYG 242

Query: 159 NSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL 212
             FS C      N  G +  G   PA   +  F P+  + +    Y+V +    +    +
Sbjct: 243 EGFSYCLPSSSSNAHGYLSLGTPAPAPAHA-RFTPMLNRSNTPSFYYVKLVGIRVAGRAI 301

Query: 213 TQSGFQAL------VDSGASFTFLPTEIYAEVVVKFDKLVS------SKRISLQGNSWKY 260
             S   AL      VDSG   T L    Y+ +   F   +       + R+S+      Y
Sbjct: 302 KVSSRPALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTC--Y 359

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
            + A +   + +P + L+F+   +  V      +                    GI+G  
Sbjct: 360 DFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNT 419

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
                 +V+D    K+ ++   C
Sbjct: 420 QQRTVAVVYDVGRQKIGFAAKGC 442


>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
 gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
          Length = 379

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 131/325 (40%), Gaps = 38/325 (11%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
           P    S+  V+C  P+C+S  +    +   P   DY  E     SS G LV D  +L +F
Sbjct: 61  PYYKPSNNLVACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNL-NF 119

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           +    QS + +  + G  +   G+Y      DGV+GLG G  S+ S L+  GL++N    
Sbjct: 120 TSEKRQSPLLALGLCGYDQLPGGTY---HPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGH 176

Query: 164 CFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
           C     SG              +S   + P+      Y  G             +GF+ L
Sbjct: 177 CL----SGRGGGFLFFGDDLYDSSRVAWTPMSPNAKHYSPGFAELTFDGKT---TGFKNL 229

Query: 221 V---DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDM 275
           +   DSGAS+T+L +++Y  ++    + +S+K  R +L   +   C+    +    V D+
Sbjct: 230 IVAFDSGASYTYLNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKG-RKPFKSVRDV 288

Query: 276 RLIFSKNQSFVVRNH-----IFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNF 321
           +  F K  +    N         FP      V      CL V++       D  +IG   
Sbjct: 289 KKYF-KTFALSFANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDIS 347

Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
           M    +++D E   + W+   C+ +
Sbjct: 348 MQDRVVIYDNEKQLIGWAPRNCDRI 372


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 69/309 (22%), Positives = 119/309 (38%), Gaps = 25/309 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS SS+   V+C  P C+    S C S    C Y   Y  + + + G LV D L L++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA 248

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S      + GCG +  G +      DG+ GLG   VS+PS  A +      F+
Sbjct: 249 -------SDTLPGFVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFT 296

Query: 163 ICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQ 214
            C   + SG  +   G   PA  Q T+ L  G     Y++ +    +G   +        
Sbjct: 297 YCLPSSSSGRGYLSLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFA 355

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
           +    ++DSG   T LP   YA +   F + ++  + +   +    CY+ +     ++P 
Sbjct: 356 AAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPT 415

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
           + L F+   +  +      +              + D    I+G        + +D  N 
Sbjct: 416 VELAFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQ 475

Query: 335 KLAWSHSKC 343
           ++ +    C
Sbjct: 476 RIGFGAKGC 484


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 137/316 (43%), Gaps = 36/316 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P+SSS+ K+++CS P C     S+C+S K  C Y   Y  + + + G L  D +   +
Sbjct: 204 FNPTSSSTYKSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGN 260

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
             K        ++V +GCG    G +   A   G+         V S+  +  +   SFS
Sbjct: 261 SGKI-------NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFS 305

Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSG 216
            C  + DSG   S+ F         +T+ L   +K D  Y+VG+  + +G     L  + 
Sbjct: 306 YCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAI 365

Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSE 267
           F          ++D G + T L T+ Y  +   F KL V+ K+ S   + +  CY+ SS 
Sbjct: 366 FDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSL 425

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             +KVP +   F+  +S  +    +  P ++  T FC     T     IIG     G RI
Sbjct: 426 STVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRI 484

Query: 328 VFDRENLKLAWSHSKC 343
            +D     +  S +KC
Sbjct: 485 TYDLSKNVIGLSGNKC 500


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 89/349 (25%), Positives = 137/349 (39%), Gaps = 60/349 (17%)

Query: 40  RNLSEYDPSS------SSSSKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTE 85
           RN S + P++      SS+     C  P+C+         R +   +   CPY  +Y   
Sbjct: 115 RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPY--EYGYA 172

Query: 86  DTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLG 141
           D S +SG    +   L + S    + +   SV  GCG + +G  + G +    +GVMGLG
Sbjct: 173 DGSLTSGLFARETTSLKTSSG---KEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLG 229

Query: 142 LGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGSVFFGDQGPATQQ--STSFL--PIG 192
            G +S  S L +     N FS C  +       +  +  GD G A  +   T  L  P+ 
Sbjct: 230 RGPISFASQLGRR--FGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLS 287

Query: 193 EKYDAYFVGVESYCIGNSCLT---------QSGFQALV-DSGASFTFLPTEIYAEVVVKF 242
             +  Y+V ++S  +  + L           SG    V DSG +  FL    Y  V+   
Sbjct: 288 PTF--YYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAV 345

Query: 243 DKLVSSKRISLQGNSWKYCYNASS----EEMLKVPDMRLIFSKNQSFVV--RNHIFSFPE 296
            + +           +  C N S     E++L  P ++  FS    FV   RN+     E
Sbjct: 346 KQRIKLPNADELTPGFDLCVNVSGVTKPEKIL--PRLKFEFSGGAVFVPPPRNYFIETEE 403

Query: 297 NEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                + CL + S D   G  +IG     G    FDR+  +L +S   C
Sbjct: 404 Q----IQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448


>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
 gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
          Length = 681

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 70/317 (22%), Positives = 138/317 (43%), Gaps = 28/317 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---A 101
           +  ++SS+  +++C+         C    D C     Y  E +S    +V+DI++L   +
Sbjct: 109 FQAANSSTLVHITCAQKSLFQCKECHVQSDTCGISQSY-MEGSSWKASVVEDIVYLGGES 167

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNS 160
           SF     ++   +    GC   + G ++   A DG+MGL   +  + + L +   I  N 
Sbjct: 168 SFDDKEMRNRYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNL 226

Query: 161 FSICFDENDSGSVFFGDQGPATQQ-STSFLPI------GEKYDAYF----VGVESYCIGN 209
           FS+CF EN  G++  G    A  +   S++ +      G  Y+ +     +G +S     
Sbjct: 227 FSLCFTEN-GGTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKE 285

Query: 210 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T+  +  +VDSG + ++LP  +  E +  F ++    R    GNS   C   +++++
Sbjct: 286 EAYTRGHY--IVDSGTTDSYLPRALKTEFLQMFKEIAG--RDYQVGNS---CKGFTNKDL 338

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
             +P ++L+            +   PE    E    +C  +  ++   G+IG N MM   
Sbjct: 339 ASLPTIQLVMEAYGDENAEVILDVPPEQYLLESNGAYCGGIYLSENSGGVIGANLMMNRD 398

Query: 327 IVFDRENLKLAWSHSKC 343
           ++FD  + ++ +  + C
Sbjct: 399 VIFDLGDQRVGFVDADC 415


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 137/316 (43%), Gaps = 36/316 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P+SSS+ K+++CS P C     S+C+S K  C Y   Y  + + + G L  D +   +
Sbjct: 204 FNPTSSSTYKSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGN 260

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
             K        ++V +GCG    G +   A   G+         V S+  +  +   SFS
Sbjct: 261 SGKI-------NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFS 305

Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSG 216
            C  + DSG   S+ F         +T+ L   +K D  Y+VG+  + +G     L  + 
Sbjct: 306 YCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAI 365

Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSE 267
           F          ++D G + T L T+ Y  +   F KL V+ K+ S   + +  CY+ SS 
Sbjct: 366 FDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSL 425

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             +KVP +   F+  +S  +    +  P ++  T FC     T     IIG     G RI
Sbjct: 426 STVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRI 484

Query: 328 VFDRENLKLAWSHSKC 343
            +D     +  S +KC
Sbjct: 485 TYDLSKNVIGLSGNKC 500


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 76/318 (23%), Positives = 145/318 (45%), Gaps = 34/318 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP +SS+ K+VSCS   C   ++++SC +    C Y+  Y+ + + + G    D L L 
Sbjct: 136 FDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYA-DGSYTMGKFAVDTLTLG 194

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNS 160
           S      Q     ++IIGCG+    ++ + ++    +G         SL+ + G  I   
Sbjct: 195 STDNRPVQ---LKNIIIGCGQNNAVTFRNKSSGVVGLG-----GGAVSLIKQLGDSIDGK 246

Query: 161 FSICF-DENDSGS-VFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           FS C   END  S + FG      GP T  +   L +  +   Y++ ++S  +G+  +  
Sbjct: 247 FSYCLVPENDQTSKINFGTNAVVSGPGTVSTP--LVVKSRDTFYYLTLKSISVGSKNMQT 304

Query: 213 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
             +      ++DSG + T LP + Y E+      L+++ +   +      CYNA+++  L
Sbjct: 305 PDSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATAD--L 362

Query: 271 KVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIV 328
            +P + + F   +      N  F   E+     F ++    +G YG + Q NF++G    
Sbjct: 363 NIPVITMHFEGADVKLYPYNSFFKVTEDLVCLAFGMSFYR-NGIYGNVAQKNFLVG---- 417

Query: 329 FDRENLKLAWSHSKCEEV 346
           +D  +  +++  + C ++
Sbjct: 418 YDTASKTMSFKPTDCAKM 435


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 79/303 (26%), Positives = 129/303 (42%), Gaps = 39/303 (12%)

Query: 57  SCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 114
           +C  P+C+   S  C   ++ C Y   Y  + + ++G    + L  A  ++      VQ 
Sbjct: 177 NCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------VQR 229

Query: 115 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSV 173
            V IGCG    G ++   A  G++GLG G +S PS +A++     SFS C  D   S   
Sbjct: 230 -VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSRRA 283

Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS---CLTQSGFQ---------ALV 221
               +   T +  +F         Y+V +  + +G +    ++QS  +          ++
Sbjct: 284 RPSRRWGGTPRMATF---------YYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVIL 334

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFS 280
           DSG S T L   +Y  V   F       R+S  G S +  CYN S   ++KVP + +  +
Sbjct: 335 DSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLA 394

Query: 281 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
              S  +    +  P +   T FC  +  TDG   IIG     G R+VFD +  ++ +  
Sbjct: 395 GGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 453

Query: 341 SKC 343
             C
Sbjct: 454 KSC 456


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 69/309 (22%), Positives = 119/309 (38%), Gaps = 25/309 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS SS+   V+C  P C+    S C S    C Y   Y  + + + G LV D L L++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA 248

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S      + GCG +  G +      DG+ GLG   VS+PS  A +      F+
Sbjct: 249 -------SDTLPGFVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFT 296

Query: 163 ICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQ 214
            C   + SG  +   G   PA  Q T+ L  G     Y++ +    +G   +        
Sbjct: 297 YCLPSSSSGRGYLSLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFA 355

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
           +    ++DSG   T LP   YA +   F + ++  + +   +    CY+ +     ++P 
Sbjct: 356 AAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPT 415

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
           + L F+   +  +      +              + D    I+G        + +D  N 
Sbjct: 416 VELAFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQ 475

Query: 335 KLAWSHSKC 343
           ++ +    C
Sbjct: 476 RIGFGAKGC 484


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 137/340 (40%), Gaps = 33/340 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP SSS+  N++     C     +SC   ++ C Y   Y  +D+ + G L  + L L S
Sbjct: 101 FDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYE-DDSITEGVLAQETLTLTS 159

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            +    +      VI GCG    G + D     G++GLG G +S+ S +  +      FS
Sbjct: 160 TTG---KPVALKGVIFGCGHNNNGVFNDKEM--GIIGLGRGPLSLVSQIGSS-FGGKMFS 213

Query: 163 IC---FDENDS--GSVFFGDQGPATQQSTSFLPIGEK--YDAYF------VGVESYCI-- 207
            C   F  N S    + FG             P+  K  + A++      + VE   +  
Sbjct: 214 QCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPF 273

Query: 208 --GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNA 264
             G+S    +    ++DSG   T LP + Y  +V +    V+   I +     ++ CY  
Sbjct: 274 NDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRT 333

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMM 323
            +   LK   +   F      +    IF  P  +G  +FC    ST   +YGI G +   
Sbjct: 334 PTN--LKGTTLTAHFEGADVLLTPTQIF-IPVQDG--IFCFAFTSTFSNEYGIYGNHAQS 388

Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP 363
            + I FD E   +++  + C  + D   ++ V P    +P
Sbjct: 389 NYLIGFDLEKQLVSFKATDCTNLQDAPSINGVLPNVLSAP 428


>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 535

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 72/332 (21%), Positives = 140/332 (42%), Gaps = 37/332 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P+ ++ +  +  S PLC+   +     + C Y   Y+   +S   Y+ D +  +    
Sbjct: 204 YRPARTADA--LPASDPLCEG--AQHENPNQCDYEISYADGSSSMGVYVRDSMQFVGEDG 259

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           +        + ++ GCG  Q G  L+     DGV+GL    +S+P+ LA  G+I N+F  
Sbjct: 260 ERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGH 314

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-- 216
           C   + SG+   +F GD     +   +++PI  G   D     V+    G+  L   G  
Sbjct: 315 CMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKL 373

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDM 275
            Q + D+G+++T+ P E    ++    +  S + +    + +  +C   S   +  V D+
Sbjct: 374 TQVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFCMK-SDFPVRSVEDV 432

Query: 276 RLIFSK-----------NQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYG---IIGQN 320
           +  F             +++F +R   +    ++G    CL V++ T   Y    I+G  
Sbjct: 433 KHFFKPLSLQFEKRFFFSRTFNIRPEHYLVISDKGNV--CLGVLNGTTIGYDSVVIVGDV 490

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
            + G  + +D +  ++ W    C     +S +
Sbjct: 491 SLRGKLVAYDNDKNEVGWVDFDCTNPRKRSRI 522


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 78/353 (22%), Positives = 136/353 (38%), Gaps = 65/353 (18%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPC---------PYIAD 81
           ++   +  + P  SSSS  + C +  C      K +S C+   DP          PY+  
Sbjct: 134 IEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQEC-DPTTQNCTQSCPPYVIQ 192

Query: 82  YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 141
           Y     S++G L+ + L         P        ++GC      S      P+G+ G G
Sbjct: 193 YGLG--STAGLLLSETLDF-------PHKKTIPGFLVGC------SLFSIRQPEGIAGFG 237

Query: 142 LGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST----SFLPIGEK--- 194
               S+PS L          S  FD+  + S    D G  +  +     S+ P  +    
Sbjct: 238 RSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTA 297

Query: 195 --YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
              D Y+V + +  IG++ +          +      +VDSG +FTF+   +Y  V  +F
Sbjct: 298 AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEF 357

Query: 243 DKLVSSKRISLQ---GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENE 298
           +K V+   ++ +       + C+N S E+ + VP+    F       +   + FSF ++ 
Sbjct: 358 EKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDS- 416

Query: 299 GFTVFCLTVMSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              V CLT++S +        G   I+G        + FD +N +  +    C
Sbjct: 417 --GVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 144/345 (41%), Gaps = 55/345 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLC------------KSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
           +DP++SSS +NV+C    C              R+  +  +DPCPY   Y  +  ++   
Sbjct: 193 FDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGD- 251

Query: 93  LVDDILHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
                L L SF+ +  AP +S +   V+ GCG +  G +   A   G+    L   S   
Sbjct: 252 -----LALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--Q 304

Query: 150 LLAKAGLIQNSFSICFDEN--DSGS-VFFGDQGPATQ-------QSTSFLPIGEKYDA-- 197
           L A  G   ++FS C  ++  D GS V FG+   A         + T+F P         
Sbjct: 305 LRAVYG---HTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPAD 361

Query: 198 --YFVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKF-DK 244
             Y+V ++   +G   L  S              ++DSG + ++     Y  +   F D+
Sbjct: 362 TFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDR 421

Query: 245 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTV 302
           +  S  +  +      CYN S  E  +VP++ L+F+      F   N+     + +G ++
Sbjct: 422 MSRSYPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRL-DPDGGSI 480

Query: 303 FCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            CL V+ T      IIG        +V+D +N +L ++  +C EV
Sbjct: 481 MCLAVLGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 525


>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
 gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
          Length = 297

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 65/139 (46%), Gaps = 8/139 (5%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YDP  S S + V+C    C +       SC S   PC Y   Y  + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTS-TSPCEYSISYG-DGSSTAGFFVTD 191

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
            L     S     +   +SV  GCG K  G       A DG++G G  + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251

Query: 156 LIQNSFSICFDENDSGSVF 174
            ++  F+ C D  + G +F
Sbjct: 252 KVRKMFAHCLDTVNGGGIF 270


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 77/316 (24%), Positives = 130/316 (41%), Gaps = 34/316 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS S+S   VSC  P C+    ++C++    C Y   Y  + + + G    + L L  
Sbjct: 211 FDPSLSASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG- 268

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S+  ++V IGCG    G ++  A    + G  L   S PS ++      ++FS
Sbjct: 269 ------DSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFS 314

Query: 163 ICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSG 216
            C  + DS    ++ FG  G      T+ L    +    Y+V +    +G   L+   S 
Sbjct: 315 YCLVDRDSPAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSA 374

Query: 217 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
           F           +VDSG + T L +  YA +   F +   S   +   + +  CY+ S  
Sbjct: 375 FAMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDR 434

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             ++VP + L F    +  +    +  P  +G   +CL    T+    IIG     G R+
Sbjct: 435 TSVEVPAVSLRFEGGGALRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRV 493

Query: 328 VFDRENLKLAWSHSKC 343
            FD     + ++ +KC
Sbjct: 494 SFDTAKGVVGFTPNKC 509


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 138/339 (40%), Gaps = 53/339 (15%)

Query: 40  RNLSEYDPSSSSSSKNVSC-----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           ++L  +DPS S + +N SC     S P  +  +  +S    C Y   Y  + T S G L 
Sbjct: 122 QSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRS----CEYSMRY-MDGTGSKGILA 176

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            ++L   +    +  +++   V+ GCG    G  L G    G++GLG G+    SL+ + 
Sbjct: 177 KEMLMFNTIYDESSSAALH-DVVFGCGHDNYGEPLVGT---GILGLGYGEF---SLVHRF 229

Query: 155 GLIQNSFSICFDENDSGS-----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
           G     FS CF   D  S     +  GD G      T+ L I   +  Y+V +E+  +  
Sbjct: 230 G---TKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIYNGF--YYVTIEAISVDG 284

Query: 210 SCL----------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QG 255
             L           Q+G    ++D+G S T L  E Y  +  K +     +  +    Q 
Sbjct: 285 IILPIDPWVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQD 344

Query: 256 NSWKY-CYNASSEEML---KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMS 309
           + +K  CYN + E  L     P +   FS     S  V++       N    VFCL V  
Sbjct: 345 DMFKVECYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPN----VFCLAV-- 398

Query: 310 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
           T G+   IG      + I +D E  K+++    C  + D
Sbjct: 399 TPGNMNSIGATAQQSYNIGYDLEAKKISFERIDCGVLFD 437


>gi|306015415|gb|ADM76761.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015421|gb|ADM76764.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015423|gb|ADM76765.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015427|gb|ADM76767.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015429|gb|ADM76768.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015445|gb|ADM76776.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015449|gb|ADM76778.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015455|gb|ADM76781.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015457|gb|ADM76782.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015469|gb|ADM76788.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015475|gb|ADM76791.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015479|gb|ADM76793.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015485|gb|ADM76796.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015487|gb|ADM76797.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015489|gb|ADM76798.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015491|gb|ADM76799.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015505|gb|ADM76806.1| aspartyl protease-like protein, partial [Picea sitchensis]
          Length = 114

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 371
           IIGQNFM  +R+VFDRENLKL WS S C + +D++   + P P+ Q    +  PL   +Q
Sbjct: 1   IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWRTRTPL---QQ 56

Query: 372 QSTSNGQAAAPPSTAKT 388
           Q TS G+A AP    +T
Sbjct: 57  QQTSPGRAVAPAIAGRT 73


>gi|306015413|gb|ADM76760.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015419|gb|ADM76763.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015425|gb|ADM76766.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015431|gb|ADM76769.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015433|gb|ADM76770.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015435|gb|ADM76771.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015437|gb|ADM76772.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015439|gb|ADM76773.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015441|gb|ADM76774.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015443|gb|ADM76775.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015447|gb|ADM76777.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015451|gb|ADM76779.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015453|gb|ADM76780.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015459|gb|ADM76783.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015461|gb|ADM76784.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015463|gb|ADM76785.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015465|gb|ADM76786.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015467|gb|ADM76787.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015471|gb|ADM76789.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015473|gb|ADM76790.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015477|gb|ADM76792.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015481|gb|ADM76794.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015483|gb|ADM76795.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015493|gb|ADM76800.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015495|gb|ADM76801.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015497|gb|ADM76802.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015499|gb|ADM76803.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015501|gb|ADM76804.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015503|gb|ADM76805.1| aspartyl protease-like protein, partial [Picea sitchensis]
 gi|306015507|gb|ADM76807.1| aspartyl protease-like protein, partial [Picea sitchensis]
          Length = 114

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 371
           IIGQNFM  +R+VFDRENLKL WS S C + +D++   + P P+ Q    +  PL   +Q
Sbjct: 1   IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWKTRTPL---QQ 56

Query: 372 QSTSNGQAAAPPSTAKT 388
           Q TS G+A AP    +T
Sbjct: 57  QQTSPGRAVAPAIAGRT 73


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 82/349 (23%), Positives = 138/349 (39%), Gaps = 60/349 (17%)

Query: 57  SCSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLA 101
           SC+ P C    S  +  DPC  +A  S                  T  +G +V   L   
Sbjct: 74  SCASPYCTDIHSSDNSFDPCT-VAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTRD 132

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +   H   + V   +   C      +Y +   P G+ G   G +S PS L   GL++  F
Sbjct: 133 TLRVHEGPARVTKDIPKFCFGCVGSTYHE---PIGIAGFVRGTLSFPSQL---GLLKKGF 186

Query: 162 SICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCIGNSC 211
           S CF       + N S  +  GD   +++ +  F P+ +     + Y++G+E+  +GN  
Sbjct: 187 SHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVGNVS 246

Query: 212 LT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISLQGNS 257
            T           Q     L+DSG ++T LP   Y++++  F  +++  R   + ++   
Sbjct: 247 ATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVEMRA-G 305

Query: 258 WKYCY------NASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVF-CLTVM 308
           +  CY      N  +++    P +   F  N SFV+   NH ++       TV  CL   
Sbjct: 306 FDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTVVKCLLFQ 365

Query: 309 S-TDGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
           S  D DY   G+ G       +IV+D E  ++ +    C        +H
Sbjct: 366 SMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCASAAVSQGLH 414


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 90/326 (27%), Positives = 129/326 (39%), Gaps = 39/326 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DP +SSS + + C  PLCK     S S  +     C Y   Y  + + S G    D+  
Sbjct: 171 FDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYG-DGSFSVGDFSSDLFT 229

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           L + SK         SV  GCG    G +   A   G+    L   S     +      N
Sbjct: 230 LGTGSKAM-------SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTAN 282

Query: 160 SFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL 212
           SFS C  +        S S+ FG     +  + S L    K D  Y+  +    +G + L
Sbjct: 283 SFSYCLVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQL 342

Query: 213 ---------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFD----KLVSSKRISLQGNSW 258
                    +QSG    ++DSG S T  PT +YA +   F      L S+ R SL    +
Sbjct: 343 PISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSL----F 398

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
             CYN S +  + VP + L F       +    +  P N   + FCL    T  + GIIG
Sbjct: 399 DTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGS-FCLAFAPTSMELGIIG 457

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCE 344
                  RI FD +   LA++  +C+
Sbjct: 458 NIQQQSFRIGFDLQKSHLAFAPQQCK 483


>gi|306015417|gb|ADM76762.1| aspartyl protease-like protein, partial [Picea sitchensis]
          Length = 114

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 371
           IIGQNFM  +R+VFDRENLKL WS S C + +D++   + P P+ Q    +  PL   +Q
Sbjct: 1   IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWRTRTPL---QQ 56

Query: 372 QSTSNGQAAAPPSTAKT 388
           Q TS G+A AP    +T
Sbjct: 57  QQTSPGRAVAPAIAGRT 73


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/333 (23%), Positives = 133/333 (39%), Gaps = 49/333 (14%)

Query: 46  DPSSSSSSKNVSCSHPLCKSR----SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDI 97
           +PS     +      PL  SR    +SC S K      C Y   Y  + + ++G+L  D 
Sbjct: 24  NPSPECFEQAFPYFEPLTFSRGLPFASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDK 82

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
                     P       V  GCG    G +       G+ G G G +S+PS L K G  
Sbjct: 83  FTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG-- 131

Query: 158 QNSFSICFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVE 203
             +FS CF             +    +F   QG    Q+T  +   +       Y++ ++
Sbjct: 132 --NFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLYYLSLK 187

Query: 204 SYCIGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
              +G++          LT      ++DSG S T LP ++Y  V  +F   +    +   
Sbjct: 188 GITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 247

Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD 313
                 C++A S+    VP + L F      + R N++F  P++ G ++ CL +   D +
Sbjct: 248 ATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-E 306

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
             IIG        +++D +N  L++  ++C+++
Sbjct: 307 TTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 339


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 139/333 (41%), Gaps = 56/333 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P+ S+S  ++ CS  +C +  S    ++ C Y A Y  +  SS+G L ++     +F 
Sbjct: 130 FEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFG 185

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            ++ + +V   V  GCG    G+  +G+   G++G G G +S+ S L         FS C
Sbjct: 186 TNSTRVAVP-RVSFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYC 236

Query: 165 ---FDENDSGSVFFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
              F    +  ++FG             GP   QST F+        YF+ +    +   
Sbjct: 237 LTSFMSPATSRLYFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGD 294

Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSW 258
            L           T      ++DSG + TFL    YA V   F   V   R  +   +++
Sbjct: 295 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTF 354

Query: 259 KYCYN--ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
             C+        M+ +P+M L F   +    + N++     + G    CL ++ +D D  
Sbjct: 355 DTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMV---MDGGTGNLCLAMLPSD-DGS 410

Query: 316 IIG----QNFMMGHRIVFDRENLKLAWSHSKCE 344
           IIG    QNF M    ++D EN  L++  + C 
Sbjct: 411 IIGSFQHQNFHM----LYDLENSLLSFVPAPCN 439


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 133/337 (39%), Gaps = 45/337 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           ++P  SSS  +  C+  +C  RS      +C      C +   Y  + + + G +  +I 
Sbjct: 41  FNPGLSSSFISEPCTSSVCLGRSKLGFQSACNRSTGSCSFQVAY-LDGSEAYGVIAREIF 99

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL---AKAG 155
            L S+   A   S    VI GC  K     +D ++  G +GL  G  S P+ +   +K+G
Sbjct: 100 SLQSWDGAA---STLGDVIFGCASKDLQRPVDFSS--GTLGLNRGSFSFPAQIGSRSKSG 154

Query: 156 LIQNSFSICFDE-----NDSGSVFFGDQG-PATQQSTSFL----PIGEKYDAYFVGVESY 205
           L  + FS CF       N SG + FGD G PA       L    PI    D Y+VG++  
Sbjct: 155 L-SDRFSYCFPNRAEHLNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGI 213

Query: 206 CIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLV-SSKRISLQ 254
            +G   L   +S F+           DSG + +FL    +  +V  F + V    R S  
Sbjct: 214 SVGGELLHIPRSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGS 273

Query: 255 GNSWKYCYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMS- 309
             + + CY+  A    +   P + L F  N    +R      P          CL  ++ 
Sbjct: 274 DFTKELCYDVAAGDARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNA 333

Query: 310 ---TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                G   +IG      + I  D E  ++ ++ + C
Sbjct: 334 GAVAQGGVNVIGNYQQQDYLIEHDLERSRIGFAPANC 370


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 86/335 (25%), Positives = 141/335 (42%), Gaps = 45/335 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDIL 98
           YD ++S+S   V C+   C      SR+   +   PC Y   Y+ +D + S+G L  + L
Sbjct: 137 YDTAASASFSPVPCASATCLPIWRSSRNCTATTTSPCRY--RYAYDDGAYSAGVLGTETL 194

Query: 99  HLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
             A  S  AP   V    V  GCG    G   +     G +GLG G +S   L+A+ G+ 
Sbjct: 195 TFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNST---GTVGLGRGSLS---LVAQLGV- 247

Query: 158 QNSFSIC----FDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVES 204
              FS C    F+ +    V FG           G A  QST  +        Y+V +E 
Sbjct: 248 -GKFSYCLTDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEG 306

Query: 205 YCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
             +G++ L                 +VDSG  FT L    +  VV     +++   ++  
Sbjct: 307 ISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNAS 366

Query: 255 G-NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDG 312
             +S  +   A  +++  +PDM L F+      + R++  SF  N+  + FCL +     
Sbjct: 367 SLDSPCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSF--NQESSSFCLNIAGAPS 424

Query: 313 DYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
            YG I  NF   + +++FD    +L++  + C ++
Sbjct: 425 AYGSILGNFQQQNIQMLFDITVGQLSFVPTDCSKL 459


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 132/329 (40%), Gaps = 41/329 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCP----------YIADYSTEDTSSSGYL 93
           +DPSSS S   V C+   C + R +  +   PC           Y   Y  + + S G L
Sbjct: 160 FDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYR-DGSYSRGVL 218

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLA 152
             D L LA               + GCG    G+   G +  G+MGLG   VS V   + 
Sbjct: 219 ARDKLRLAGQDIEG--------FVFGCGTSNQGAPFGGTS--GLMGLGRSHVSLVSQTMD 268

Query: 153 KAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-------YFVGV 202
           + G +   FS C    +   SGS+  GD   A + ST  +      D+       YF+ +
Sbjct: 269 QFGGV---FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNL 325

Query: 203 ESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
               +G   +    F A   ++DSG   T L   +Y  V  +F   ++    +   +   
Sbjct: 326 TGITVGGQEVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILD 385

Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFPENEGFTV-FCLTVMSTDGDYGII 317
            C+N +  + ++VP ++ +F  +    V +  +  F  ++   V   L  + ++ D  II
Sbjct: 386 TCFNLTGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSII 445

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G       R++FD    ++ ++   C+ +
Sbjct: 446 GNYQQKNLRVIFDTLGSQIGFAQETCDYI 474


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/340 (22%), Positives = 144/340 (42%), Gaps = 46/340 (13%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
           ++L  ++PS S +   + C   +C+  + SSC         C Y   Y+ + + ++G+L 
Sbjct: 148 QSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYA-DHSITTGHLD 206

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            D    AS + HA   +    +  GCG    G ++      G+ G   G +S+P     A
Sbjct: 207 SDTFSFAS-ADHAIGGASVPDLTFGCGLFNNGIFVSNET--GIAGFSRGALSMP-----A 258

Query: 155 GLIQNSFSICFDE---NDSGSVFFG----------DQGPATQQSTSFLPI-GEKYDAYFV 200
            L  ++FS CF     ++   VF G            G    QST+ +     +  AY++
Sbjct: 259 QLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYI 318

Query: 201 GVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
            ++   +G + L   +S F          +VDSG   T LP  +Y  V    D  V+  +
Sbjct: 319 SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVC---DAFVAQTK 375

Query: 251 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLT 306
           +++  ++    + C++        VP + L F      + R N++F   E  G  + CL 
Sbjct: 376 LTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLA 435

Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           + + + D  +IG        +++D  N  L++  ++C ++
Sbjct: 436 INAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474


>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
 gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
          Length = 408

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 135/328 (41%), Gaps = 50/328 (15%)

Query: 51  SSSKNVSCSHPLCK-------SRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +  K V C+ PLC        +   C  + K+ C Y   Y    +S    L+D       
Sbjct: 88  TRKKLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLD------- 140

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI- 157
             K +  +    ++  GCG  Q       A      DG++GLG G V + S L  +G + 
Sbjct: 141 --KFSLPTGGARNIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVS 198

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI-----GEKYDAYFVGVESYCIGNSCL 212
           +N    C      G +F G++   +   T ++P+     GE  + Y  G  +  + ++ +
Sbjct: 199 KNVIGHCLSSKGGGYLFIGEENVPSSHVT-WVPMAPTTPGEP-NHYSPGQATLHLDSNPI 256

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
                +A+ DSG+++T+LP  ++A+       LVS+ + SL  +S K   + +     K 
Sbjct: 257 GTKPLKAIFDSGSTYTYLPENLHAQ-------LVSALKASLSKSSLKQVSDPALPLCWKG 309

Query: 273 PD-MRLIFSKNQSFV--------VRNHIFSFPEN----EGFTVFCLTVMSTDG-DYGIIG 318
           P   + +    + F         +   +   PEN     G    C  ++   G D  IIG
Sbjct: 310 PKPFKTVHDTPKEFKSLVTLKFDLGVTMIIPPENYLIITGHGNACFGILDMPGLDQYIIG 369

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              M    +++D E  +LAW  S C+++
Sbjct: 370 DITMQEQLVIYDNEKGRLAWMPSPCDKI 397


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 85/182 (46%), Gaps = 23/182 (12%)

Query: 118 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFG 176
            GCGR   G +  GA  DG++GLG G +S  S  A     +  FS C  E DS GS+ FG
Sbjct: 171 FGCGRNNEGDFGSGA--DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFG 226

Query: 177 DQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDS 223
           ++   +Q S  F  +         E+   YFV +    +GN  L    S F +   ++DS
Sbjct: 227 EKA-TSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNVPSSVFASPGTIIDS 285

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
           G   T LP   Y+ +   F K ++   +S     +G+    CYN S  + + +P++ L F
Sbjct: 286 GTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHF 345

Query: 280 SK 281
            +
Sbjct: 346 GE 347


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 142/342 (41%), Gaps = 57/342 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLC--------KSRSSCKSL-KDPCPYIADYSTEDTSSSGYLVD 95
           +DP++SSS +N++C  P C         +  +C+   +DPCPY   Y  +  S+      
Sbjct: 188 FDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGD---- 243

Query: 96  DILHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
             L L SF+ +  AP +S +   V+ GCG +  G +   A    ++GLG G +S  S L 
Sbjct: 244 --LALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL- 297

Query: 153 KAGLIQNSFSICFDENDS---GSVFFGDQ------GPATQQSTSFLPIGEKYDA-YFVGV 202
           +A    ++FS C  ++ S     V FG+            + T+F P     D  Y+V +
Sbjct: 298 RAVYGGHTFSYCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRL 357

Query: 203 ESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRI 251
               +G   L  S              ++DSG + ++     Y  +   F D++  S   
Sbjct: 358 TGVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPP 417

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCL 305
                    CYN S  E  +VP++ L+F+          ++ FP    F       + CL
Sbjct: 418 VPDFPVLSPCYNVSGVERPEVPELSLLFADGA-------VWDFPAENYFIRLDPDGIMCL 470

Query: 306 TVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            V+ T      IIG        + +D  N +L ++  +C EV
Sbjct: 471 AVLGTPRTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRCAEV 512


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/316 (24%), Positives = 129/316 (40%), Gaps = 35/316 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP++SS+   V+C    C S   SSC+S +  C Y  +Y         Y   D    A+
Sbjct: 203 FDPTASSTYAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNYG-----DGSYTFGD---FAT 252

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            S     S    +V +GCG    G +        V   GL  +    L     L   SFS
Sbjct: 253 ESVSFGNSGSVKNVALGCGHDNEGLF--------VGAAGLLGLGGGPLSLTNQLKATSFS 304

Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSG 216
            C    DS    ++ F          T+ L    K D  Y+VG+    +G   ++  +S 
Sbjct: 305 YCLVNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPEST 364

Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
           F+         +VD G + T L T+ Y  +   F ++  + +++     +  CY+ S + 
Sbjct: 365 FRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQA 424

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            ++VP +   F+  +S+ +    +  P +   T +C     T     IIG     G R+ 
Sbjct: 425 SVRVPTVSFHFADGKSWNLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVT 483

Query: 329 FDRENLKLAWSHSKCE 344
           FD  N ++ +S +KC+
Sbjct: 484 FDLANNRMGFSPNKCQ 499


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 126/315 (40%), Gaps = 35/315 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           YDPS S+S   V C  P C+    ++C++    C Y   Y  + + + G    + L L  
Sbjct: 205 YDPSVSTSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYG-DGSYTVGDFATETLTLG- 262

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S+  S+V IGCG    G ++  A    + G  L   S PS ++       +FS
Sbjct: 263 ------DSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFS 308

Query: 163 ICFDENDSGS---VFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 216
            C  + DS S   + FGD + PA        P    +  Y+V +    +G   L+   S 
Sbjct: 309 YCLVDRDSPSSSTLQFGDSEQPAVTAPLIRSPRTNTF--YYVALSGISVGGEALSIPSSA 366

Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
           F          +VDSG + T L +  Y  +   F +   S   +   + +  CY+ +   
Sbjct: 367 FAMDDAGSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRS 426

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            ++VP + L F       +    +  P +   T +CL    T G   IIG     G R+ 
Sbjct: 427 SVQVPAVALWFEGGGELKLPAKNYLIPVDAAGT-YCLAFAGTSGPVSIIGNVQQQGVRVS 485

Query: 329 FDRENLKLAWSHSKC 343
           FD     + ++  KC
Sbjct: 486 FDTAKNTVGFTADKC 500


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 79/302 (26%), Positives = 136/302 (45%), Gaps = 29/302 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP +SS+ K+VSCS   C   ++++SC +  + C Y   Y  + + + G +  D L L 
Sbjct: 136 FDPKASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTSYG-DRSYTKGNIAVDTLTLG 194

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           S      Q     ++IIGCG    G++       G++GLG G VS+ + L  +  I   F
Sbjct: 195 STDTRPVQ---LKNIIIGCGHNNAGTF--NKKGSGIVGLGGGAVSLITQLGDS--IDGKF 247

Query: 162 SICF----DENDSGS-VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQ 214
           S C      END  S + FG     +       P+  K     Y++ ++S  +G+  +  
Sbjct: 248 SYCLVPLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQY 307

Query: 215 SGFQA-------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
            G  +       ++DSG + T LPTE Y+E+       + +++          CY+A+ +
Sbjct: 308 PGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGD 367

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHR 326
             LKVP + + F      +  ++ F    +E    F      +   YG + Q NF++G+ 
Sbjct: 368 --LKVPAITMHFDGADVNLKPSNCF-VQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYD 424

Query: 327 IV 328
            V
Sbjct: 425 TV 426


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/340 (22%), Positives = 144/340 (42%), Gaps = 46/340 (13%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
           ++L  ++PS S +   + C   +C+  + SSC         C Y   Y+ + + ++G+L 
Sbjct: 148 QSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYA-DHSITTGHLD 206

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            D    AS + HA   +    +  GCG    G ++      G+ G   G +S+P     A
Sbjct: 207 SDTFSFAS-ADHAIGGASVPDLTFGCGLFNNGIFVSNET--GIAGFSRGALSMP-----A 258

Query: 155 GLIQNSFSICFDE---NDSGSVFFG----------DQGPATQQSTSFLPI-GEKYDAYFV 200
            L  ++FS CF     ++   VF G            G    QST+ +     +  AY++
Sbjct: 259 QLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYI 318

Query: 201 GVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
            ++   +G + L   +S F          +VDSG   T LP  +Y  V    D  V+  +
Sbjct: 319 SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVC---DAFVAQTK 375

Query: 251 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLT 306
           +++  ++    + C++        VP + L F      + R N++F   E  G  + CL 
Sbjct: 376 LTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLA 435

Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           + + + D  +IG        +++D  N  L++  ++C ++
Sbjct: 436 INAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/340 (22%), Positives = 144/340 (42%), Gaps = 46/340 (13%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
           ++L  ++PS S +   + C   +C+  + SSC         C Y   Y+ + + ++G+L 
Sbjct: 122 QSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYA-DHSITTGHLD 180

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            D    AS + HA   +    +  GCG    G ++      G+ G   G +S+P     A
Sbjct: 181 SDTFSFAS-ADHAIGGASVPDLTFGCGLFNNGIFVSNET--GIAGFSRGALSMP-----A 232

Query: 155 GLIQNSFSICFDE---NDSGSVFFG----------DQGPATQQSTSFLPI-GEKYDAYFV 200
            L  ++FS CF     ++   VF G            G    QST+ +     +  AY++
Sbjct: 233 QLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYI 292

Query: 201 GVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
            ++   +G + L   +S F          +VDSG   T LP  +Y  V    D  V+  +
Sbjct: 293 SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVC---DAFVAQTK 349

Query: 251 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLT 306
           +++  ++    + C++        VP + L F      + R N++F   E  G  + CL 
Sbjct: 350 LTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLA 409

Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           + + + D  +IG        +++D  N  L++  ++C ++
Sbjct: 410 INAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 448


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 135/324 (41%), Gaps = 37/324 (11%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           LS Y+ S+SS+S   SCS PLC       SRS   S    C Y++ Y  +  S   Y+ D
Sbjct: 127 LSIYNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNS---ACAYVSSYQDKSASVGAYVRD 183

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D+ ++     H   ++  S +  GC    TGS+      DG+MG GL   +VP+ +A   
Sbjct: 184 DMHYVL----HGGNATT-SRIFFGCATNITGSW----PVDGIMGFGLISKTVPNQIATQR 234

Query: 156 LIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            +   FS C   E   G +    + P T +   F P+      Y V + S  + +  L  
Sbjct: 235 NMSRVFSHCLGGEKHGGGILEFGEAPNTTEMV-FTPLLNVTTHYNVDLLSISVNSKVLPI 293

Query: 213 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYC 261
                     + +    ++DSG +F  L T+    +  +   L ++K    L+G    Y 
Sbjct: 294 DPKEFSYVRNSTNNTGVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKLGPKLEGLECFYL 353

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
            +  + E    P++ L FS   +  ++  N++      +    +C    S DG   I G+
Sbjct: 354 KSGLTMET-SFPNVTLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADG-LTIFGE 411

Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
             +    + +D EN ++ W    C
Sbjct: 412 IVLKDKLVFYDVENRRIGWKGQNC 435


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 139/333 (41%), Gaps = 56/333 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P+ S+S  ++ CS  +C +  S    ++ C Y A Y  +  SS+G L ++     +F 
Sbjct: 127 FEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFG 182

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
            ++ + +V   V  GCG    G+  +G+   G++G G G +S+ S L         FS C
Sbjct: 183 TNSTRVAVP-RVSFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYC 233

Query: 165 ---FDENDSGSVFFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
              F    +  ++FG             GP   QST F+        YF+ +    +   
Sbjct: 234 LTSFMSPATSRLYFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGD 291

Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSW 258
            L           T      ++DSG + TFL    YA V   F   V   R  +   +++
Sbjct: 292 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTF 351

Query: 259 KYCYN--ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
             C+        M+ +P+M L F   +    + N++     + G    CL ++ +D D  
Sbjct: 352 DTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMV---MDGGTGNLCLAMLPSD-DGS 407

Query: 316 IIG----QNFMMGHRIVFDRENLKLAWSHSKCE 344
           IIG    QNF M    ++D EN  L++  + C 
Sbjct: 408 IIGSFQHQNFHM----LYDLENSLLSFVPAPCN 436


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/312 (23%), Positives = 133/312 (42%), Gaps = 21/312 (6%)

Query: 45  YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP+ SS+  +V C    C    +++  C S K  C Y+  Y T D+ + G L  D +  
Sbjct: 130 FDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQ-CIYLHQYGT-DSFTIGRLGYDTISF 187

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +S       ++   SV  GC      ++      +G +GLG G +S+ S L     I + 
Sbjct: 188 SSTGMGQGGATFPKSVF-GCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHK 244

Query: 161 FSIC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIG-NSCLT-Q 214
           FS C   F    +G + FG   P  +  ++   I   Y +Y+V  +E   +G    LT Q
Sbjct: 245 FSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQ 304

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
            G   ++DS    T L   IY + +    + ++ +        ++YC    +   L  P+
Sbjct: 305 IGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTN--LNFPE 362

Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
               F+     +   ++F   +N    + C+TV+ + G   I G    +  ++ +D    
Sbjct: 363 FVFHFTGADVVLGPKNMFIALDNN---LVCMTVVPSKG-ISIFGNWAQVNFQVEYDLGEK 418

Query: 335 KLAWSHSKCEEV 346
           K++++ + C  +
Sbjct: 419 KVSFAPTNCSTI 430


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)

Query: 46  DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHL 100
           +PS+S+S KN+SCS  LCK  +S K     C      Y   Y  + + S G+   + L L
Sbjct: 175 NPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTL 233

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +S       S+V  + + GCG++  G +   A   G+    L   ++PS  AK    +  
Sbjct: 234 SS-------SNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKL 281

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QS 215
           FS C   + S   +    G    +S  F P+   +D+   Y + +    +G   L+  +S
Sbjct: 282 FSYCLPASSSSKGYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDES 340

Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
            F A  ++DSG   T L    Y+E+   F  L++    +   + +  CY+ S  + +++P
Sbjct: 341 AFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 400

Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDR 331
            + + F       +      +P N G    CL       D D  I G      +++V+D 
Sbjct: 401 KVGVTFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDG 459

Query: 332 ENLKLAWSHSKC 343
              ++ ++   C
Sbjct: 460 AKGRVGFAPGGC 471


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 140/333 (42%), Gaps = 53/333 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + P +SSS + + C+  LC      SC+   D C Y   Y  + T++ G    +    +S
Sbjct: 146 FSPGASSSYEPMRCAGELCNDILHHSCQR-PDTCTYRYSYG-DGTTTRGVYATERFTFSS 203

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            S     + + + +  GCG    GS  +G+   G++G G   +S+ S LA        FS
Sbjct: 204 SSSGGETTKLSAPLGFGCGTMNKGSLNNGS---GIVGFGRAPLSLVSQLAI-----RRFS 255

Query: 163 ICFDENDSG---SVFFG-------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
            C     SG   ++ FG       D   AT Q+T  L   +    Y+V      +G   L
Sbjct: 256 YCLTPYASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRL 315

Query: 213 TQ--SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK--- 259
               S F         A+VDSG + T  P  + AEVV  F    S  R+    N      
Sbjct: 316 RIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFR---SQLRLPFAANGSSGPD 372

Query: 260 --YCYNASSEEMLK---VPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
              C+ A++  + +   VP  R++F     +     RN++    +++     CL +++  
Sbjct: 373 DGVCFAAAASRVPRPAVVP--RMVFHLQGADLDLPRRNYVL---DDQRKGNLCL-LLADS 426

Query: 312 GDYGIIGQNFM-MGHRIVFDRENLKLAWSHSKC 343
           GD G    NF+    R+++D E   L+++ ++C
Sbjct: 427 GDSGTTIGNFVQQDMRVLYDLEADTLSFAPAQC 459


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 89/354 (25%), Positives = 144/354 (40%), Gaps = 79/354 (22%)

Query: 45  YDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCP--------YIADYSTEDTSSS 90
           + P SSSSSK + C +P C      K +S C+  +   P        Y+  Y +  T   
Sbjct: 139 FIPKSSSSSKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITG-- 196

Query: 91  GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
           G ++ + L L    K  P      + I+GC      S L  + P G+ G G G  S+PS 
Sbjct: 197 GIMLSETLDLPG--KGVP------NFIVGC------SVLSTSQPAGISGFGRGPPSLPSQ 242

Query: 151 LAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQST--SFLPIGEKYDA----- 197
           L   GL    FS C       D  +S S+    +  + +++   S+ P  +         
Sbjct: 243 L---GL--KKFSYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHA 297

Query: 198 ----YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
               Y++G+    +G   +                 ++DSG +FT++  EI+  V  +F+
Sbjct: 298 FSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFE 357

Query: 244 KLVSSKRIS-LQG-NSWKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEG 299
           K V SKR + ++G    + C+N S       P++ L F         + N++       G
Sbjct: 358 KQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFL---GG 414

Query: 300 FTVFCLTVMSTDGDYG--------IIGQNFMMGHRIV-FDRENLKLAWSHSKCE 344
             V CLT++ TDG  G        II  NF   +  V +D  N +L +    C+
Sbjct: 415 DDVVCLTIV-TDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSCK 467


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 91/354 (25%), Positives = 138/354 (38%), Gaps = 61/354 (17%)

Query: 25  LLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY- 78
           LLW     CL  +     QD  L  Y PS+SS+   V C  P C    + +    PC + 
Sbjct: 88  LLWVQCAPCLQCY----AQDTPL--YAPSNSSTFNPVPCLSPECLLIPATEGF--PCDFH 139

Query: 79  -----IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 133
                  +Y   DTS S  +        ++            V  GCGR   GS+   AA
Sbjct: 140 YPGACAYEYRYADTSLSKGVF-------AYESATVDDVRIDKVAFGCGRDNQGSF---AA 189

Query: 134 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSV----FFGDQGPATQQSTSF 188
             GV+GLG G +S  S +  A    N F+ C  +  D  SV     FGD+  +T     F
Sbjct: 190 AGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDPTSVSSWLIFGDELISTIHDLQF 247

Query: 189 LPI---GEKYDAYFVGVESYCIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIY 235
            PI         Y+V +E   +G   L  + S +         ++ DSG + T+     Y
Sbjct: 248 TPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAY 307

Query: 236 AEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR--NHIF 292
             ++  FDK V   R  S+QG     C + +  +    P   ++      F  +  N+  
Sbjct: 308 RNILAAFDKNVRYPRAASVQG--LDLCVDVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFV 365

Query: 293 SFPENEGFTVFCLTVM---STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
               N    V CL +    S+ G +  IG        + +DRE  ++ ++ +KC
Sbjct: 366 DVAPN----VQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKC 415


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)

Query: 46  DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHL 100
           +PS+S+S KN+SCS  LCK  +S K     C      Y   Y  + + S G+   + L L
Sbjct: 163 NPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTL 221

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +S       S+V  + + GCG++  G +   A   G+    L   ++PS  AK    +  
Sbjct: 222 SS-------SNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKL 269

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QS 215
           FS C   + S   +    G    +S  F P+   +D+   Y + +    +G   L+  +S
Sbjct: 270 FSYCLPASSSSKGYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDES 328

Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
            F A  ++DSG   T L    Y+E+   F  L++    +   + +  CY+ S  + +++P
Sbjct: 329 AFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 388

Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDR 331
            + + F       +      +P N G    CL       D D  I G      +++V+D 
Sbjct: 389 KVGVTFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDG 447

Query: 332 ENLKLAWSHSKC 343
              ++ ++   C
Sbjct: 448 AKGRVGFAPGGC 459


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 133/327 (40%), Gaps = 45/327 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDD 96
           ++PS+SSS  ++ C+ P C        S   C +     C Y  DY  + + S G L   
Sbjct: 106 FNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL--- 161

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
                 F K     +   + I GCGR   G +       G+MGL   ++S+ S    + L
Sbjct: 162 -----GFEKLTLGKTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSL 211

Query: 157 IQNSFSICFDEN---DSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESY 205
             + FS C        SGS+  G    +  ++ S  PI         +  + YF+ +   
Sbjct: 212 FGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGI 269

Query: 206 CIGNSCL------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
            IG   L      +  G  +L+DSG   T L   IY     +F+K  S  R +   +   
Sbjct: 270 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILN 329

Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGI 316
            C+N +  E + +P ++ IF  N   +V    +F F +++   + CL   S   +    I
Sbjct: 330 TCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTMI 388

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IG       R++++ +  K+ ++   C
Sbjct: 389 IGNYQQKNQRVIYNSKESKVGFAGEPC 415


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/312 (25%), Positives = 124/312 (39%), Gaps = 31/312 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++PSSSSS + +SC  P C +    +     C Y   Y  + + + G    + L + S  
Sbjct: 190 FEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIGS-- 246

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                 ++  +V +GCG    G +        V   GL  +    L   + L   SFS C
Sbjct: 247 ------TLVQNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYC 292

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA 219
             + DS S    D G +        P+   +     Y++G+    +G   L   QS F+ 
Sbjct: 293 LVDRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEM 352

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
                   ++DSG + T L TEIY  +   F K       +     +  CYN S++  ++
Sbjct: 353 DESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVE 412

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
           VP +   F   +   +    +  P +   T FCL    T     IIG     G R+ FD 
Sbjct: 413 VPTVAFHFPGGKMLALPAKNYMIPVDSVGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDL 471

Query: 332 ENLKLAWSHSKC 343
            N  + +S +KC
Sbjct: 472 ANSLIGFSSNKC 483


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/335 (23%), Positives = 128/335 (38%), Gaps = 55/335 (16%)

Query: 58  CSHPLCKSRSSCKSLKDPCPYIA-DYST-------------EDTSSSGYLVDDILHLASF 103
           C+ P C    S  +  DPC       ST               T  +G +V   L   + 
Sbjct: 143 CTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTRDTL 202

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
             H     V   +   C      SY +   P G+ G G G +S+PS L   G ++  FS 
Sbjct: 203 RVHGRNLGVTQEIPRFCFGCVASSYRE---PIGIAGFGRGALSLPSQL---GFLRKGFSH 256

Query: 164 CF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCIGNSCLT 213
           CF       + N S  +  GD    ++    F P+ +     + Y+VG+E+  +GN   T
Sbjct: 257 CFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVSAT 316

Query: 214 Q-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKY 260
           +                LVDSG ++T LP   Y++V+     +++  R +       +  
Sbjct: 317 EVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMRTGFDL 376

Query: 261 CYNA--SSEEMLK---VPDMRLIFSKNQSFVVRN--HIFSFPENEGFTVF-CLTVMST-D 311
           CY     +  +L    +P +   F  N S V+    H ++       TV  CL   S  D
Sbjct: 377 CYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLFQSMDD 436

Query: 312 GDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           GDY   G++G        +V+D E  ++ +    C
Sbjct: 437 GDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 90/377 (23%), Positives = 148/377 (39%), Gaps = 71/377 (18%)

Query: 17  LLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC----KSRSSCKS- 71
           ++  P T+   C     +S      +  + P  SSSSK + C +P C     S  +C   
Sbjct: 90  IVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLGCKNPKCSWIHHSNINCDQD 149

Query: 72  ------LKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 124
                 L   CP Y+  Y +  T   G  + + LHL S SK         + ++GC    
Sbjct: 150 CSIKSCLNQTCPPYMIFYGSGTTG--GVALSETLHLHSLSK--------PNFLVGC---- 195

Query: 125 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPA 181
             S      P G+ G G G  S+PS L          S  FD++   S   V   +Q  +
Sbjct: 196 --SVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHRFDDDTKKSSSLVLDMEQLDS 253

Query: 182 TQQSTS--FLPI--GEKYDA-------YFVGVESYCIGNSCLT----------QSGFQAL 220
            +++ +  + P     K D        Y++G+    +G   +                 +
Sbjct: 254 DKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYKYLSPGEDGNGGVI 313

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYCYNASSEEMLKVPDMRL 277
           +DSG +FTF+  E +  +  +F + +   R   +       + C+N S  + +  P++RL
Sbjct: 314 IDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNVSDAKTVSFPELRL 373

Query: 278 IFS--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG---------IIGQNFMMGHR 326
            F    + +  V N+ F+F   E   V CLTV+ TDG  G         I+G   M    
Sbjct: 374 YFKGGADVALPVENY-FAFVGGE---VACLTVV-TDGVAGPERVGGPGMILGNFQMQNFY 428

Query: 327 IVFDRENLKLAWSHSKC 343
           + +D  N +L +   KC
Sbjct: 429 VEYDLRNERLGFKQEKC 445


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 133/327 (40%), Gaps = 45/327 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDD 96
           ++PS+SSS  ++ C+ P C        S   C +     C Y  DY  + + S G L   
Sbjct: 185 FNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL--- 240

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
                 F K     +   + I GCGR   G +       G+MGL   ++S+ S    + L
Sbjct: 241 -----GFEKLTLGKTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSL 290

Query: 157 IQNSFSICFDEN---DSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESY 205
             + FS C        SGS+  G    +  ++ S  PI         +  + YF+ +   
Sbjct: 291 FGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGI 348

Query: 206 CIGNSCL------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
            IG   L      +  G  +L+DSG   T L   IY     +F+K  S  R +   +   
Sbjct: 349 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILN 408

Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGI 316
            C+N +  E + +P ++ IF  N   +V    +F F +++   + CL   S   +    I
Sbjct: 409 TCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTMI 467

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IG       R++++ +  K+ ++   C
Sbjct: 468 IGNYQQKNQRVIYNSKESKVGFAGEPC 494


>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
          Length = 393

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 135/318 (42%), Gaps = 43/318 (13%)

Query: 56  VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHL--ASFSKHAPQS 110
           V C  P+C+S  S    +   P   DY  E     SS G LV D  +L   S  +H+P  
Sbjct: 84  VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPL- 142

Query: 111 SVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 168
                + +GCG  Q   GS+      DGV+GLG G  S+ S L+  GL++N    C   +
Sbjct: 143 -----LALGCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGH 194

Query: 169 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGA 225
             G +FFGD    + +  ++ P+      Y  G+            +GF+ L+   DSGA
Sbjct: 195 GGGFLFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGA 250

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 283
           S+T+L ++ Y  ++    K +S K  R +L   +   C+    +    + D++  F K  
Sbjct: 251 SYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG-RKPFKSIRDVKKYF-KTF 308

Query: 284 SFVVRNHI-----FSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIV 328
           +    N         FP  E + +       CL +++       D  +IG   M    ++
Sbjct: 309 ALSFTNERKSKTELEFPP-EAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVI 367

Query: 329 FDRENLKLAWSHSKCEEV 346
           +D E  ++ W+   C  +
Sbjct: 368 YDNEKERIGWAPGNCNRL 385


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)

Query: 46  DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHL 100
           +PS+S+S KN+SCS  LCK  +S K     C      Y   Y  + + S G+   + L L
Sbjct: 115 NPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTL 173

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +S       S+V  + + GCG++  G +   A   G+    L   ++PS  AK    +  
Sbjct: 174 SS-------SNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKL 221

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QS 215
           FS C   + S   +    G    +S  F P+   +D+   Y + +    +G   L+  +S
Sbjct: 222 FSYCLPASSSSKGYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDES 280

Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
            F A  ++DSG   T L    Y+E+   F  L++    +   + +  CY+ S  + +++P
Sbjct: 281 AFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 340

Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDR 331
            + + F       +      +P N G    CL       D D  I G      +++V+D 
Sbjct: 341 KVGVTFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDG 399

Query: 332 ENLKLAWSHSKC 343
              ++ ++   C
Sbjct: 400 AKGRVGFAPGGC 411


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 130/318 (40%), Gaps = 35/318 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++PS S+S  N+SCS P C        +  SC +    C Y   Y  + + S G+   D 
Sbjct: 181 FNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSA--STCVYGIQYG-DQSYSVGFFAQDK 237

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
           L L S       + V ++ + GCG+   G ++  A   G++GLG   +S+ S  A K G 
Sbjct: 238 LALTS-------TDVFNNFLFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQKYGK 287

Query: 157 IQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSC 211
           +   FS C     S  G + FG  G  T ++  F P     +    YF+ + +  +G   
Sbjct: 288 L---FSYCLPSTSSSTGYLTFGSGG-GTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRK 343

Query: 212 LTQSG-----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
           L+ S         ++DSG   + LP   Y+++   F + +S    +   +    CY+ S 
Sbjct: 344 LSTSASVFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQ 403

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
            + + VP + L FS      +      +  N           S   D  I+G        
Sbjct: 404 YDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFD 463

Query: 327 IVFDRENLKLAWSHSKCE 344
           +V+D    ++ ++   CE
Sbjct: 464 VVYDVAGGRIGFAPGGCE 481


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/341 (24%), Positives = 140/341 (41%), Gaps = 52/341 (15%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
           R L   DPS+SS+   + CS P+C   + SSC         C Y+  Y      + G + 
Sbjct: 452 RALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY------ADGSIT 505

Query: 95  DDILHLASFSKHAPQSSVQSSV---IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
              L   +F+  A   + Q++V     GCG    G +       G+ G G G +S+PS L
Sbjct: 506 TGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNET--GIAGFGRGALSLPSQL 563

Query: 152 AKAGLIQNSFSICFDE---NDSGSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGV 202
                  ++FS CF     ++  SV  G             QST  +       AY++ +
Sbjct: 564 KV-----DNFSHCFTAITGSEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSL 618

Query: 203 ESYCIGNS---------CLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
           +   +G++          L Q G    ++DSG   T LP + Y  V    D   +  R+ 
Sbjct: 619 KGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLV---HDAFTAQVRLP 675

Query: 253 LQGNS----WKYCYNASSEEMLK--VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCL 305
           +   +     + C++ S     K  VP + L F      + R N++F F E+ G +V CL
Sbjct: 676 VDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEF-EDAGGSVTCL 734

Query: 306 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            + + D D  IIG        +++D     L++  ++C  +
Sbjct: 735 AINAGD-DLTIIGNYQQQNLHVLYDLVRNMLSFVPAQCNRL 774


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 133/320 (41%), Gaps = 39/320 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLA 101
           ++P  SSS + VSC+   C+S  S  C      C Y   YS  D S + G L  D + + 
Sbjct: 132 FNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSY--GYSYGDRSFTYGDLASDQITIG 189

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           SF    P++      +IGCG +  G++  G     +   G     V  +   AG ++  F
Sbjct: 190 SFK--LPKT------VIGCGHQNGGTF-GGVTSGIIGLGGGSLSLVSQMRTIAG-VKPRF 239

Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGN----- 209
           S C      + N +G++ FG +   + +     P+  +     YF+ +E+  +G      
Sbjct: 240 SYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKA 299

Query: 210 ----SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
               S +T  G   ++DSG + T LP  +Y  V     +++ +KR+       + CY+A 
Sbjct: 300 ANGISAMTNHG-NIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAG 358

Query: 266 SEEMLKVPDMRLIFS--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
             + L +P +   F+   +   +  N      +N    V CLT  +      I G    +
Sbjct: 359 QVDDLNIPIITAHFAGGADVKLLPVNTFAPVADN----VTCLT-FAPATQVAIFGNLAQI 413

Query: 324 GHRIVFDRENLKLAWSHSKC 343
              + +D  N +L++    C
Sbjct: 414 NFEVGYDLGNKRLSFEPKLC 433


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 134/327 (40%), Gaps = 40/327 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSS-GYLVD 95
           + P++S S   + CS   CKS         S+  +   PC Y  DY  +D SS+ G +  
Sbjct: 157 FRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGY--DYRYKDKSSARGVVGT 214

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D   +A     + + +    V++GC     G     +  DGV+ LG  ++S  S    A 
Sbjct: 215 DAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSS--DGVLSLGNSNISFASR--AAA 270

Query: 156 LIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGN 209
                FS C        N +  + FG  G A   S + L +  +   ++ V V++  +  
Sbjct: 271 RFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAG 330

Query: 210 SCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKY 260
             L         +    A++DSG S T L T  Y  VV    K L    R+++  + ++Y
Sbjct: 331 KALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM--DPFEY 388

Query: 261 CYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GI 316
           CYN  ++     VP + + F+   S  +R    S+  +    V C+ +   +G +    +
Sbjct: 389 CYNWTATRRPPAVPRLEVRFAG--SARLRPPTKSYVIDAAPGVKCIGLQ--EGVWPGVSV 444

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IG      H   FD  N  L +  S+C
Sbjct: 445 IGNILQQEHLWEFDLANRWLRFQESRC 471


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 124/312 (39%), Gaps = 31/312 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++PSSSSS + +SC  P C +    +     C Y   Y  + + + G    + L + S  
Sbjct: 193 FEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIGS-- 249

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                 ++  +V +GCG    G +        V   GL  +    L   + L   SFS C
Sbjct: 250 ------TLVQNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYC 295

Query: 165 FDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 219
             + DS S   V FG   P        L   +    Y++G+    +G   L   QS F+ 
Sbjct: 296 LVDRDSDSASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEM 355

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
                   ++DSG + T L T IY  +   F K  S    +     +  CYN S++  ++
Sbjct: 356 DESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIE 415

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
           VP +   F   +   +    +  P +   T FCL    T     IIG     G R+ FD 
Sbjct: 416 VPTVAFHFPGGKMLALPAKNYMIPVDSVGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDL 474

Query: 332 ENLKLAWSHSKC 343
            N  + +S +KC
Sbjct: 475 ANSLIGFSSNKC 486


>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Cucumis sativus]
          Length = 418

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 74/321 (23%), Positives = 130/321 (40%), Gaps = 29/321 (9%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           P    S+  V C  PLC S  S    +    D C Y  +Y+ +  SS G LV D+  L +
Sbjct: 98  PLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYA-DGGSSLGVLVRDVFPL-N 155

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            +   P   ++  + +GCG  Q          DG++GLG G VS+ S L   G+++N   
Sbjct: 156 LTNGDP---IRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVG 212

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALV 221
            CF+        F   G        + P+   Y  ++  G                  + 
Sbjct: 213 HCFNSKGG-GYXFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVF 271

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
           DSG+S+T+   + Y  +    ++ ++ K  R ++  ++   C+    + +  + D+R  F
Sbjct: 272 DSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG-RKPIKSLRDVRKYF 330

Query: 280 S----KNQSFVVRNHIFSFPENEGFTVF------CLTVMS-TD---GDYGIIGQNFMMGH 325
                   S      +F  P  EG+ +       CL +++ TD    +  IIG   M   
Sbjct: 331 KPLALSFSSGGRSKAVFEIP-TEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389

Query: 326 RIVFDRENLKLAWSHSKCEEV 346
            +V++ E   + W+ + C+ V
Sbjct: 390 MVVYNNEKQAIGWATANCDRV 410


>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Brachypodium distachyon]
          Length = 436

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/299 (22%), Positives = 118/299 (39%), Gaps = 41/299 (13%)

Query: 60  HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 119
           H +C +  S     D C Y   Y+    +++GY V D +H   F  +   +S  +SVI G
Sbjct: 149 HAICHTSHSSG---DQCGYNQIYADGVLATTGYYVSDDIHFDIFMGNESFASSSASVIFG 205

Query: 120 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-NDSGSVFFGDQ 178
           C + ++G        DGV+G G    S+ S L   G + ++FS C D+ +D G V   D+
Sbjct: 206 CSKSRSGH----LQADGVIGFGKDAPSLISQLNSQG-VSHAFSRCLDDSDDGGGVLILDE 260

Query: 179 GPATQQSTSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQA-LVDSGASFTFL 230
               +    F  +      Y + ++S  + N       S  T S  Q   +DSG S  + 
Sbjct: 261 --VGEPGLEFTSLVASRPCYNLNMKSIAVNNQNVPIDSSLFTTSSTQGTFLDSGTSLAYF 318

Query: 231 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--R 288
           P  +Y  V+     +  S R                      P +   F    +  V   
Sbjct: 319 PDGVYDPVIRAILFIYFSTR-----------------SFSSFPTVTXYFEGGAAMKVGPE 361

Query: 289 NHIFSFPENEGFTVFCLTVMSTDGDYG---IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           N++      +  +  C+    ++GDY    I+G   +     V++ + +++ W +  C+
Sbjct: 362 NYLLRRGSYDNDSYMCIAFQRSEGDYKQTTILGDLILHDKIFVYNLKKMQIGWVNYNCK 420


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 90/325 (27%), Positives = 128/325 (39%), Gaps = 39/325 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DP +SSS + + C  PLCK     S S  +     C Y   Y  + + S G    D+  
Sbjct: 96  FDPRNSSSFQRIPCLSPLCKALEVHSCSGSRGATSRCSYQVAYG-DGSFSVGDFSSDLFT 154

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           L + SK         SV  GCG    G +   A   G+    L   S     +      N
Sbjct: 155 LGTGSKAM-------SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTAN 207

Query: 160 SFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL 212
           SFS C  +        S S+ FG     +  + S L    K D  Y+  +    +G + L
Sbjct: 208 SFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQL 267

Query: 213 ---------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSW 258
                    +QSG    ++DSG S T  PT +YA +   F      L S+ R SL    +
Sbjct: 268 PISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSL----F 323

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
             CYN S +  + VP + L F       +    +  P N   + FCL    T  + GIIG
Sbjct: 324 DTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGS-FCLAFAPTSMELGIIG 382

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                  RI FD +   LA++  +C
Sbjct: 383 NIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
 gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
          Length = 649

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 98/211 (46%), Gaps = 26/211 (12%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSS---CKSLK----DPCPYIADYSTEDTSSSGYLVD 95
           + +DP+     K ++C    CK+      C   +    + C Y   Y+ E +  SG LV 
Sbjct: 154 TRFDPTG----KWLTCQEKQCKAAGGPGICAGGRGAAANRCTYSRTYA-EGSGVSGDLVR 208

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD-VSVPSLLAKA 154
           D +H       AP ++    V+ GC   ++G+  D  A DG++GLG     S+P+ LA  
Sbjct: 209 DKMHFGG--DIAPATNGTLDVVFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADT 265

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF----LPIGEKYDAYFV-GVESYCIGN 209
             +   FS+CF   + G      + PAT  +       + + E + AY+V    +  IG+
Sbjct: 266 HGLPRVFSLCFGSFEGGGALSFGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGD 325

Query: 210 SCLTQS-----GFQALVDSGASFTFLPTEIY 235
             +        G+  ++DSG +FT++PT+++
Sbjct: 326 VAVATPSDLAVGYGTVMDSGTTFTYVPTKVF 356


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/321 (24%), Positives = 134/321 (41%), Gaps = 46/321 (14%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           ++N   +DPS SS+ K   C                 CPY  DY  + T + G L  + +
Sbjct: 101 EQNAPIFDPSKSSTFKEKRCD-------------GHSCPYEVDYF-DHTYTMGTLATETI 146

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            L S S    +  V    IIGCG     S+   +   G++GL  G  S+  +    G   
Sbjct: 147 TLHSTSG---EPFVMPETIIGCGHNN--SWFKPSF-SGMVGLNWGPSSL--ITQMGGEYP 198

Query: 159 NSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
              S CF    +  + FG      G     +T F+    K   Y++ +++  +GN+ +  
Sbjct: 199 GLMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTA-KPGFYYLNLDAVSVGNTRIET 257

Query: 215 SG--FQAL-----VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
            G  F AL     +DSG + T+ P      V    + +V++ R +    +   CYN+ + 
Sbjct: 258 MGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTI 317

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQ-NFM 322
           ++  V  M   FS     V+  +      N G  VFCL ++    + +  +G   Q NF+
Sbjct: 318 DIFPVITMH--FSGGVDLVLDKYNMYMESNNG-GVFCLAIICNSPTQEAIFGNRAQNNFL 374

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
           +G    +D  +L +++S + C
Sbjct: 375 VG----YDSSSLLVSFSPTNC 391


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 132/315 (41%), Gaps = 35/315 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP++SS+   V+C    C S   SSC+S +  C Y  +Y         Y   D    A+
Sbjct: 62  FDPTASSTYAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNYG-----DGSYTFGD---FAT 111

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            S     S    +V +GCG    G ++  A   G+ G  L      SL  +  L   SFS
Sbjct: 112 ESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPL------SLTNQ--LKATSFS 163

Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSG 216
            C    DS    ++ F          T+ L    K D  Y+VG+    +G   ++  +S 
Sbjct: 164 YCLVNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPEST 223

Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
           F+         +VD G + T L T+ Y  +   F ++  + +++     +  CY+ S + 
Sbjct: 224 FRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQA 283

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            ++VP +   F+  +S+ +    +  P +   T +C     T     IIG     G R+ 
Sbjct: 284 SVRVPTVSFHFADGKSWNLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVT 342

Query: 329 FDRENLKLAWSHSKC 343
           FD  N ++ +S +KC
Sbjct: 343 FDLANNRMGFSPNKC 357


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 77/352 (21%), Positives = 142/352 (40%), Gaps = 59/352 (16%)

Query: 35  SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS---------CKSLKDPC-----PYIA 80
           S  + + +  ++P  SSSSK + C +P C + SS         C      C     PY  
Sbjct: 127 SDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSL 186

Query: 81  DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 140
            Y T   SS  +L++++        + P  ++    ++GC     G     A    + G 
Sbjct: 187 QYGT-GASSGDFLLENL--------NFPGKTIH-EFLVGCTTSAVGEVTSAA----LAGF 232

Query: 141 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD---- 196
           G    S+P  +          S  +D+  + S    D      +  S+ P  +       
Sbjct: 233 GRSMFSLPMQMGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPI 292

Query: 197 AYFVGVESYCIGNSCLT-QSGFQA---------LVDSGASFTFLPTEIYAEVVVKFDKLV 246
            Y++GV+   IGN  L   S + A         ++DSG ++ ++   ++ +V  +  K +
Sbjct: 293 YYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRM 352

Query: 247 SSKRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFT 301
           S  R SL+  +      CYN + ++ +K+PD+   F    + VV  +N+    PE    +
Sbjct: 353 SKYRRSLEAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPE---IS 409

Query: 302 VFCL---------TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           + C          T+  T G   I+G +  + + + FD +N +L +    C+
Sbjct: 410 LACFPLTTDAGTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTCQ 461


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 89/348 (25%), Positives = 140/348 (40%), Gaps = 65/348 (18%)

Query: 38  QDRNLSEYDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYL 93
           Q R    YDP+ SSS     C   LC++ S    +C   ++ C Y  +Y +  T   G L
Sbjct: 124 QHREKPLYDPAKSSSFAAAPCDGRLCETGSFNTKNCS--RNKCIYTYNYGSATT--KGEL 179

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             +     +F +H     V  S+  GCG+  +GS L GA+  G++G+    +S+ S L  
Sbjct: 180 ASETF---TFGEH---RRVSVSLDFGCGKLTSGS-LPGAS--GILGISPDRLSLVSQLQI 230

Query: 154 AGLIQNSFSIC----FDENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAYF- 199
                  FS C     D N +  +FFG           GP    S    P G  Y  Y  
Sbjct: 231 P-----RFSYCLTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVP 285

Query: 200 ------------VGVESYCIGNSCLTQSGFQALVDSGASFTFLPT---EIYAEVVVKFDK 244
                       V V S+ IG      SG    VDSG +   LP+   E   E +V+  K
Sbjct: 286 LIGISVGTKRLNVPVSSFAIGRD---GSG-GTFVDSGDTTGMLPSVVMEALKEAMVEAVK 341

Query: 245 LVSSKRISLQGNSWKYCYN------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 298
           L         G  ++ C+        + E  ++VP +   F    + ++R   +    + 
Sbjct: 342 LPVVNATD-HGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSA 400

Query: 299 GFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G    CL V+S+     IIG        ++FD EN + +++ ++C ++
Sbjct: 401 G--RMCL-VISSGARGAIIGNYQQQNMHVLFDVENHEFSFAPTQCNQI 445


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 131/330 (39%), Gaps = 46/330 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SSS   V C   LC+   S  C   +  C Y   Y  + + ++G  V + L  A 
Sbjct: 28  FDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYG-DGSVTAGDFVTETLTFAG 86

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            ++ A        V +GCG    G ++  A   G+       +S P+ +++      SFS
Sbjct: 87  GARVA-------RVALGCGHDNEGLFVAAAGLLGLGRG---GLSFPTQISR--RYGRSFS 134

Query: 163 ICF-DENDSG-----------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCI 207
            C  D   SG           +V FG  G     S SF P+         Y+V +    +
Sbjct: 135 YCLVDRTSSGAGAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISV 193

Query: 208 GNS---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQ 254
           G +    + +S  +          +VDSG S T L    Y+ +   F    +   R+S  
Sbjct: 194 GGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPG 253

Query: 255 GNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
           G S +  CY+     ++KVP + + F+      +    +  P +   T FC     TDG 
Sbjct: 254 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGG 312

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             IIG     G R+VFD +  ++ ++   C
Sbjct: 313 VSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 131/330 (39%), Gaps = 46/330 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SSS   V C   LC+   S  C   +  C Y   Y  + + ++G  V + L  A 
Sbjct: 171 FDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYG-DGSVTAGDFVTETLTFAG 229

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            ++ A        V +GCG    G ++  A   G+       +S P+ +++      SFS
Sbjct: 230 GARVA-------RVALGCGHDNEGLFVAAAGLLGLGRG---GLSFPTQISR--RYGRSFS 277

Query: 163 ICF-DENDSG-----------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCI 207
            C  D   SG           +V FG  G     S SF P+         Y+V +    +
Sbjct: 278 YCLVDRTSSGAGAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISV 336

Query: 208 GNS---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQ 254
           G +    + +S  +          +VDSG S T L    Y+ +   F    +   R+S  
Sbjct: 337 GGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPG 396

Query: 255 GNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
           G S +  CY+     ++KVP + + F+      +    +  P +   T FC     TDG 
Sbjct: 397 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGG 455

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             IIG     G R+VFD +  ++ ++   C
Sbjct: 456 VSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/323 (23%), Positives = 126/323 (39%), Gaps = 42/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS+S +  N+SC+   C    S       C S    C Y   Y  + + + G+   D 
Sbjct: 197 FDPSASKTYSNISCTSTACSGLKSATGNSPGCSS--SNCVYGIQYG-DSSFTVGFFAKDT 253

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L L        Q+ V    + GCG+   G +   A   G++GLG   +S+    A+    
Sbjct: 254 LTLT-------QNDVFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQTAQK--F 301

Query: 158 QNSFSICF--DENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDA--YFVGVESYCIG 208
              FS C       +G + FG+        A +   +F P      A  YF+ V    +G
Sbjct: 302 GKYFSYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVG 361

Query: 209 NSCLTQSG--FQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
              L+ S   FQ    ++DSG   T LP+ +Y  +   F + +S    +   +    CY+
Sbjct: 362 GKALSISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYD 421

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQN 320
            S+   + +P +   F+ N +  +  N I       G +  CL       D   GI G  
Sbjct: 422 LSNYTSISIPKISFNFNGNANVDLEPNGILI---TNGASQVCLAFAGNGDDDTIGIFGNI 478

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
                 +V+D    +L + +  C
Sbjct: 479 QQQTLEVVYDVAGGQLGFGYKGC 501


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 133/327 (40%), Gaps = 46/327 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPSSSS+   + CS  LC    S K     C Y   Y  + +S+ G L  +   LA   
Sbjct: 144 FDPSSSSTYAALPCSSTLCSDLPSSKCTSAKCGYTYTYG-DSSSTQGVLAAETFTLA--- 199

Query: 105 KHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 +    V  GCG    G  +  GA   G++GLG G +   SL+++ GL  N FS 
Sbjct: 200 -----KTKLPDVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--NKFSY 246

Query: 164 C---FDENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
           C    D+     +  G     ++        Q+T  +    +   Y+V ++   +G++ +
Sbjct: 247 CLTSLDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHI 306

Query: 213 T--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
           T   S F          +VDSG S T+L  + Y  +   F   +        G     C+
Sbjct: 307 TLPSSAFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCF 366

Query: 263 NASSEEMLKVPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
            A +  + +V   +L+F     +      N++     + G    CLTVM + G   IIG 
Sbjct: 367 EAPASGVDQVEVPKLVFHLDGADLDLPAENYMV---LDSGSGALCLTVMGSRG-LSIIGN 422

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEV 346
                 + V+D     L+++  +C ++
Sbjct: 423 FQQQNIQFVYDVGENTLSFAPVQCAKL 449


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 86/369 (23%), Positives = 154/369 (41%), Gaps = 62/369 (16%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------ 68
           N L CLP      C   F       +N + YDP +S+S KN++C+ P C   SS      
Sbjct: 186 NWLQCLP------CYDCF------HQNEAFYDPKTSASFKNITCNDPRCSLISSPEPPVQ 233

Query: 69  CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
           CKS    CPY   Y     ++  + V+   ++L +    + +  V+ +++ GCG    G 
Sbjct: 234 CKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVE-NMMFGCGHWNRGL 292

Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPAT 182
           +   +    ++GLG G +S  S L    L  +SFS C      D N S  + FG+     
Sbjct: 293 FSGASG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL 347

Query: 183 QQS----TSFLPIGEK--YDAYFVGVESYCIGNSCLT----------QSGFQALVDSGAS 226
             +    TSF+   E      Y++ ++S  +G   L                 ++DSG +
Sbjct: 348 NHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTT 407

Query: 227 FTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS--SEEMLKVPDMRLIFSKNQ 283
            ++     Y  +  KF +K+  +  +         C+N S   E  + +P++ + F+   
Sbjct: 408 LSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGA 467

Query: 284 SFVVRNHIFSFPENEGFT-----VFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLA 337
                  +++FP    F      + CL ++ T    + IIG        I++D +  +L 
Sbjct: 468 -------VWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSRLG 520

Query: 338 WSHSKCEEV 346
           ++ +KC ++
Sbjct: 521 FTPTKCADI 529


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/337 (22%), Positives = 132/337 (39%), Gaps = 51/337 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DP++S S +NV+C  P C         R+  +   DPCPY   Y  +  ++     D  
Sbjct: 194 FDPATSLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLA 249

Query: 98  LHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
           L   + +  AP +S +   V+ GCG    G +   A   G+    L   S   L A  G 
Sbjct: 250 LEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG- 306

Query: 157 IQNSFSICFDENDS---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCI 207
             ++FS C  ++ S     + FGD       P    +          D  Y+V ++   +
Sbjct: 307 --HAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLV 364

Query: 208 GNSCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGN 256
           G   L  S              ++DSG + ++     Y  +   F +++  +  +     
Sbjct: 365 GGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFP 424

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST 310
               CYN S  E ++VP+  L+F+          ++ FP    F       + CL V+ T
Sbjct: 425 VLSPCYNVSGVERVEVPEFSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVLGT 477

Query: 311 -DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                 IIG        +++D +N +L ++  +C EV
Sbjct: 478 PRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAEV 514


>gi|301119613|ref|XP_002907534.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
 gi|262106046|gb|EEY64098.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
          Length = 350

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 125/278 (44%), Gaps = 43/278 (15%)

Query: 93  LVDDILHLASFSKHAPQSSVQSSVI-------IGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
           +VD+++ +  FS   P   ++  +        +GC  K+TG ++     +G+MGLG    
Sbjct: 1   MVDELVWVGGFS--TPSDEMEGILKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRS 57

Query: 146 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYF---- 199
           +V S +  AG + QN F++CF   D G + FG    +   S   + P+ +   AY+    
Sbjct: 58  TVMSYMLNAGRVTQNLFTLCF-AGDGGELVFGGVDYSHHTSDVGYTPLLDDKSAYYPVHV 116

Query: 200 --VGVESYCIG-NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD----KLVSSKRIS 252
             + +    +G ++    SG   +VDSG + TF  ++     +  F     +  S KR+ 
Sbjct: 117 KDIRMNGVSLGIDAGTINSGRGVIVDSGTTDTFFDSKGSRAFMKAFQNAAGREYSEKRMD 176

Query: 253 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
           L           +++E+  +P + +I S  +     +     P +   T     V S +G
Sbjct: 177 L-----------TADELAALPTISIILSGMKGDGTEDIQLDIPASSYLTP-SDKVGSYNG 224

Query: 313 DY-------GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           ++       G++G + M+G  ++FD EN ++ ++ S C
Sbjct: 225 NFHFSERSGGVLGASTMIGFDVIFDTENKRVGFAESDC 262


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 85/326 (26%), Positives = 129/326 (39%), Gaps = 42/326 (12%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYL 93
           D+    ++PS S+S  NVSCS   C S SS       C      Y   Y  + + S G+L
Sbjct: 170 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFL 228

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             D   L S       S V   V  GCG    G +   A   G++GLG   +S PS  A 
Sbjct: 229 AKDKFTLTS-------SDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTAT 278

Query: 154 AGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVG 201
           A      FS C   + S  G + FG  G    +S  F PI    D          A  VG
Sbjct: 279 A--YNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVG 334

Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
            +   I ++  +  G  AL+DSG   T LP + YA +   F   +S    +   +    C
Sbjct: 335 GQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 392

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVM--STDGDYGII 317
           ++ S  + + +P +   FS      +  +   ++F  ++     CL     S D +  I 
Sbjct: 393 FDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQ----VCLAFAGNSDDSNAAIF 448

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G        +V+D    ++ ++ + C
Sbjct: 449 GNVQQQTLEVVYDGAGGRVGFAPNGC 474


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 77/337 (22%), Positives = 132/337 (39%), Gaps = 51/337 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DP++S S +NV+C  P C         R+  +   DPCPY   Y  +  ++     D  
Sbjct: 194 FDPAASLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLA 249

Query: 98  LHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
           L   + +  AP +S +   V+ GCG    G +   A   G+    L   S   L A  G 
Sbjct: 250 LEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG- 306

Query: 157 IQNSFSICFDENDS---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCI 207
             ++FS C  ++ S     + FGD       P    +          D  Y+V ++   +
Sbjct: 307 --HAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLV 364

Query: 208 GNSCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGN 256
           G   L  S              ++DSG + ++     Y  +   F +++  +  +     
Sbjct: 365 GGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFP 424

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST 310
               CYN S  E ++VP+  L+F+          ++ FP    F       + CL V+ T
Sbjct: 425 VLSPCYNVSGVERVEVPEFSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVLGT 477

Query: 311 -DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                 IIG        +++D +N +L ++  +C EV
Sbjct: 478 PRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAEV 514


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 139/316 (43%), Gaps = 49/316 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS SS+ K + C            +    CPY   Y  + + + G LV + + + S S
Sbjct: 107 FDPSKSSTFKEIRC-----------DTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTS 154

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
               Q  V    IIGCGR  +G +  G A  GV+GL  G  S+  +    G      S C
Sbjct: 155 G---QPFVMPETIIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYC 206

Query: 165 FDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ 218
           F    +  + FG      G     +T F+    K   Y++ +++  +GN+ +   G  F 
Sbjct: 207 FAGKGTSKINFGANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFH 265

Query: 219 AL-----VDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
           AL     +DSG++ T+ P E Y  +V K  +++V++ R      S   CY + + ++  V
Sbjct: 266 ALKGNIVIDSGSTLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPV 321

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIG----QNFMMGHRI 327
             M   FS     V+  +      N G  VFCL ++ ++  +  I G     NF++G   
Sbjct: 322 ITMH--FSGGADLVLDKYNMYVASNTG-GVFCLAIICNSPIEEAIFGNRAQNNFLVG--- 375

Query: 328 VFDRENLKLAWSHSKC 343
            +D  +L +++  + C
Sbjct: 376 -YDSSSLLVSFKPTNC 390


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 91/364 (25%), Positives = 143/364 (39%), Gaps = 81/364 (22%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYS 83
           S + P  SS+S   SC+   C    S  +  D                   PCP  A   
Sbjct: 133 SVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTY 192

Query: 84  TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
            E    SG L  DIL   + ++  P+ S       GC    T +Y +   P G+ G G G
Sbjct: 193 GEGGLISGILTRDILK--ARTRDVPRFS------FGC---VTSTYRE---PIGIAGFGRG 238

Query: 144 DVSVPSLLAKAGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPI 191
            +S+PS L   G ++  FS CF       + N S  +  G    +       Q T  L  
Sbjct: 239 LLSLPSQL---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNT 295

Query: 192 GEKYDAYFVGVESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVV 239
               ++Y++G+ES  IG +       LT   F +      LVDSG ++T LP   Y++++
Sbjct: 296 PMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLL 355

Query: 240 VKFDKLVSSKRI--SLQGNSWKYCY-------NASSEE---MLKVPDMRLIFSKNQSFVV 287
                 ++  R   +     +  CY       N +S E   M+  P +   F  N + ++
Sbjct: 356 TTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLL 415

Query: 288 RN----HIFSFPENEGFTVFCLTVMST-DGDY---GIIGQNFMMGHRIVFDRENLKLAWS 339
                 +  S P ++G  V CL   +  DGDY   G+ G       ++V+D E  ++ + 
Sbjct: 416 PQGNSFYAMSAP-SDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQ 474

Query: 340 HSKC 343
              C
Sbjct: 475 AMDC 478


>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
          Length = 320

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 69/142 (48%), Gaps = 12/142 (8%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
            L++YDP+ S ++  V C    C + S      +C S   PC +   Y  + ++++G+ V
Sbjct: 127 ELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYV 183

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLA 152
            D +     S +   ++  +S+  GCG  Q G  L  +  A DG++G G  D S+ S LA
Sbjct: 184 TDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLA 242

Query: 153 KAGLIQNSFSICFDENDSGSVF 174
            A  ++  F+ C D    G +F
Sbjct: 243 AARRVRKIFAHCLDTVRGGGIF 264


>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
          Length = 507

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 139/321 (43%), Gaps = 47/321 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSC------KSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           Y PSS+S+   V+CS   CK   S        S  + C +   Y  + +  SGY+ +D++
Sbjct: 161 YHPSSTST--KVACSSDQCKGSGSTPPSCSRTSSGESCDFQIRYG-DGSHVSGYIYEDVV 217

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VP----SLLAK 153
           +LA          +Q     G   ++TG + +    DG++G G    S VP    SL++ 
Sbjct: 218 NLAG---------LQGKANFGANDEETGDF-EYPRADGIIGFGRTCSSCVPTVWDSLVSD 267

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYF------VGVESYC 206
            GL +N F +  +    GS+  G+   +       + P+ +K   ++      + +  Y 
Sbjct: 268 LGL-KNQFGMLLNYEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKSTGIRINDYT 326

Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-----NSWKYC 261
           I  S L Q   + +VDSG++   L +  Y ++   F     +   S+QG     N ++  
Sbjct: 327 IPGSKLGQ---EVIVDSGSTALSLASGAYDQLRNYFQ----THYCSIQGVCENPNIFQGS 379

Query: 262 YNASSEEML-KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
              SS+++L K P +   F       +  +N++   P   G   +C  +   D    I+G
Sbjct: 380 ICYSSDDVLSKFPTLYFTFDGGVQVAIPPKNYLVKAPLTNGKYGYCFMIERADSTMTILG 439

Query: 319 QNFMMGHRIVFDRENLKLAWS 339
             FM G+  VFD  N ++ ++
Sbjct: 440 DVFMRGYYTVFDNVNDRVGFA 460


>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
          Length = 656

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 95/409 (23%), Positives = 172/409 (42%), Gaps = 51/409 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---A 101
           ++ + SSS + +SC+H    S + C +  +PC        E +S S  +++DI++L   A
Sbjct: 137 FNTNLSSSIQPISCNHRTYFSCAYCTNPTEPCRTY----MEGSSWSAKVMEDIVYLGDVA 192

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAKAGLIQNS 160
           S        S  +  + GC  K+TG ++   A DG+MG+   G+  V  L  +  +  N+
Sbjct: 193 SAKDTNLHHSYSTRYMFGCQNKETGLFIPQVA-DGIMGIHNNGNDIVTKLFREKKIPSNT 251

Query: 161 FSICFDENDSGSVFFGDQGPATQQ-STSFLPI----GEKYDAYF---VGVESYCIGNSCL 212
           F++CF     G    G    +      ++  I    GE Y A F   + V  + I     
Sbjct: 252 FTLCFSPR-GGYFALGAMDTSRHAGEVTYARINDAYGENYYAVFMTDIRVGGHSIDIDMK 310

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
             + ++ +VDSG + + +       ++  +  L   K   L  N    C   S  ++ ++
Sbjct: 311 ATNSYRYIVDSGTTNSIISGRAGQALMDLYRNLTHLKN-PLNDND---CILLSPSQIEQL 366

Query: 273 PDMRLIFS-----KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
           P ++ +       +    ++ +      EN   T F + V  T    G+IG + MM H +
Sbjct: 367 PTLQFVMEGVNGDRAILEILASQYLQKGENNK-TCFNILV-DTRKIGGVIGASMMMNHDV 424

Query: 328 VFDRENLKLAWSHSKCEEVID---KSHVHLVP--------PPAGQSPNP--LPTTEQQST 374
           +FDR   K+ +  + C    D    SH + +P        P + QS N       E++  
Sbjct: 425 IFDRSQNKVGFVPANCTFAGDTEPNSHKNAIPSDDANGALPVSKQSNNKSNENAEEKKGL 484

Query: 375 SNGQAAAPPSTAKTAPS-----KSIAASAQQLDS----VLRVACSLLVL 414
           SN     P     ++PS     KS     Q+++     ++++  +LLVL
Sbjct: 485 SNDTHTDPVVEPVSSPSLEGETKSANVKLQEVEKERPIIVKLVGTLLVL 533


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 82/339 (24%), Positives = 139/339 (41%), Gaps = 39/339 (11%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
           ++N   YDP  S S +N++C+ P C+  SS      CK     CPY   Y     ++  +
Sbjct: 232 EQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDF 291

Query: 93  LVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
            ++   ++L S +    +     +V+ GCG    G +   A    ++GLG G +S  S L
Sbjct: 292 ALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL 348

Query: 152 AKAGLIQNSFSICFDENDSGS------VFFGDQGPATQQSTSFL--------PIGEKY-- 195
               L  +SFS C  + DS +      +F  D+   T    +F         P+   Y  
Sbjct: 349 --QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYL 406

Query: 196 --DAYFVGVESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
              + FVG E   I       S   A   ++DSG + ++     Y  +   F + V   +
Sbjct: 407 QIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYK 466

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCLTVM 308
           +         CYN S  + L  P+  + F+     +F V N+   F   +   + CL ++
Sbjct: 467 LVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENY---FIRIQQLDIVCLAML 523

Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            T      IIG        I++D +N +L ++  +C E+
Sbjct: 524 GTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 139/316 (43%), Gaps = 49/316 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS SS+ K + C            +    CPY   Y  + + + G LV + + + S S
Sbjct: 101 FDPSKSSTFKEIRC-----------DTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTS 148

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
               Q  V    IIGCGR  +G +  G A  GV+GL  G  S+  +    G      S C
Sbjct: 149 G---QPFVMPETIIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYC 200

Query: 165 FDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ 218
           F    +  + FG      G     +T F+    K   Y++ +++  +GN+ +   G  F 
Sbjct: 201 FAGKGTSKINFGANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFH 259

Query: 219 AL-----VDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
           AL     +DSG++ T+ P E Y  +V K  +++V++ R      S   CY + + ++  V
Sbjct: 260 ALKGNIVIDSGSTLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPV 315

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIG----QNFMMGHRI 327
             M   FS     V+  +      N G  VFCL ++ ++  +  I G     NF++G   
Sbjct: 316 ITMH--FSGGADLVLDKYNMYVASNTG-GVFCLAIICNSPIEEAIFGNRAQNNFLVG--- 369

Query: 328 VFDRENLKLAWSHSKC 343
            +D  +L +++  + C
Sbjct: 370 -YDSSSLLVSFKPTNC 384


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 132/328 (40%), Gaps = 47/328 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVD 95
           +DP+SS S   + C+   C +          +C   + P C Y   Y  + + S G L  
Sbjct: 166 FDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAH 224

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKA 154
           D L LA          V    + GCG    G +       G+MGLG   +S+ S  + + 
Sbjct: 225 DKLSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQF 273

Query: 155 GLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVES 204
           G +   FS C    +   SGS+  GD     + ST  +       P+   +  YFV +  
Sbjct: 274 GGV---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTG 328

Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKY 260
             IG   +  S  + +VDSG   T L   +Y    AE + +F +   +   S+       
Sbjct: 329 ITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDT 384

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIG 318
           C+N +    +++P ++ +F  N    V +    +  +   +  CL + S   +Y   IIG
Sbjct: 385 CFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIG 444

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                  R++FD    ++ ++   C+ +
Sbjct: 445 NYQQKNLRVIFDTLGSQIGFAQETCDYI 472


>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
          Length = 284

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 66/128 (51%), Gaps = 12/128 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  SS+ + V C+        +C   ++ C Y  +Y+ E +SS G L +D++   + 
Sbjct: 134 KFQPEMSSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNE 187

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S+  PQ +V      GC   +TG      A DG++GLG GD+S+   L   GLI NSF +
Sbjct: 188 SQLTPQRAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGL 241

Query: 164 CFDENDSG 171
           C+   D G
Sbjct: 242 CYGGMDVG 249


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 132/328 (40%), Gaps = 47/328 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVD 95
           +DP+SS S   + C+   C +          +C   + P C Y   Y  + + S G L  
Sbjct: 167 FDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAH 225

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKA 154
           D L LA          V    + GCG    G +       G+MGLG   +S+ S  + + 
Sbjct: 226 DKLSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQF 274

Query: 155 GLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVES 204
           G +   FS C    +   SGS+  GD     + ST  +       P+   +  YFV +  
Sbjct: 275 GGV---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTG 329

Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKY 260
             IG   +  S  + +VDSG   T L   +Y    AE + +F +   +   S+       
Sbjct: 330 ITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDT 385

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIG 318
           C+N +    +++P ++ +F  N    V +    +  +   +  CL + S   +Y   IIG
Sbjct: 386 CFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIG 445

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                  R++FD    ++ ++   C+ +
Sbjct: 446 NYQQKNLRVIFDTLGSQIGFAQETCDYI 473


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPSSSS+   V CS   C     S C S    C Y   Y  + +S+ G L  +   LA 
Sbjct: 137 FDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYG-DSSSTQGVLATETFTLAK 194

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                   S    V+ GCG    G      A  G++GLG G +S   L+++ GL  + FS
Sbjct: 195 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 239

Query: 163 ICF---DENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS- 210
            C    D+ ++  +  G     ++        Q+T  +    +   Y+V +++  +G++ 
Sbjct: 240 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 299

Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
             L  S F          +VDSG S T+L  + Y  +   F   ++       G     C
Sbjct: 300 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 359

Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           + A ++ + +V   RL+F      +      N++     + G    CLTVM + G   II
Sbjct: 360 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 415

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G       + V+D  +  L+++  +C ++
Sbjct: 416 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 444


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 134/325 (41%), Gaps = 43/325 (13%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           D+  S +DPS SSS   +SC    C     SSC S    C Y   Y  + T++ G L+++
Sbjct: 223 DQPDSIFDPSQSSSYTLLSCETKHCNLLPNSSC-SDDGYCRYNITYK-DGTNTEGVLINE 280

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            +   S       S     V +GC  K  G ++     DG  GLG G +S PS +  +  
Sbjct: 281 TVSFES-------SGWVDRVSLGCSNKNQGPFV---GSDGTFGLGRGSLSFPSRINAS-- 328

Query: 157 IQNSFSICFDENDSG----SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG---- 208
              S S C  E+  G    ++ F     +       L   +  + Y+VG++   +G    
Sbjct: 329 ---SMSYCLVESKDGYSSSTLEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKI 385

Query: 209 ---NSCLTQSGFQ---ALVDSGASFTFLPTEIYAEV----VVKFDKLVSSKRISLQGNSW 258
              NS  T   +     +V S +  T L  + Y  V    V K   L   K   LQ ++ 
Sbjct: 386 DVPNSTFTIDPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAF-LQFDT- 443

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
             CYN SS   +++P +    +  +S+++    + +  ++  T FC     + G + I+G
Sbjct: 444 --CYNLSSNNTVELPILEFEVNDGKSWLLPKESYLYAVDKNGT-FCFAFAPSKGSFSILG 500

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                G R+ FD  N    + H+ C
Sbjct: 501 TLQQYGTRVTFDLVN-SFVYLHTLC 524


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 91/364 (25%), Positives = 143/364 (39%), Gaps = 81/364 (22%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYS 83
           S + P  SSSS   SC+   C    S  +  D                   PCP  A   
Sbjct: 61  SIFSPLHSSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTY 120

Query: 84  TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
            E    SG L  DIL   + ++  P+ S       GC    T +Y +   P G+ G G G
Sbjct: 121 GEGGLVSGILTRDILK--ARTRDVPRFS------FGC---VTSTYHE---PIGIAGFGRG 166

Query: 144 DVSVPSLLAKAGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPI 191
            +S+PS L   G ++  FS CF       + N S  +  G    +       Q T  L  
Sbjct: 167 LLSLPSQL---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNT 223

Query: 192 GEKYDAYFVGVESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVV 239
               ++Y++G+ES  IG +       LT   F +      LVDSG ++T LP   Y++++
Sbjct: 224 PVYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLL 283

Query: 240 VKFDKLVSSKRI--SLQGNSWKYCY-------NASSEE---MLKVPDMRLIFSKNQSFVV 287
                 ++  R   +     +  CY       N +S E   M+  P +   F  N + ++
Sbjct: 284 TILQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLL 343

Query: 288 RN----HIFSFPENEGFTVFCLTVMST-DGDY---GIIGQNFMMGHRIVFDRENLKLAWS 339
                 +  S P ++G  V CL   +  DG+Y   G+ G       ++V+D E  ++ + 
Sbjct: 344 PQGNSFYAMSAP-SDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQ 402

Query: 340 HSKC 343
              C
Sbjct: 403 AMDC 406


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 127/318 (39%), Gaps = 43/318 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP SS+S   + C  P CKS    +     C Y   Y  + + + G    + + L    
Sbjct: 191 FDPISSNSYSPIRCDEPQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--- 246

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                S+   +V IGCG    G ++  A   G+ G  L   S P     A +   SFS C
Sbjct: 247 -----SAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYC 293

Query: 165 FDENDSGSVF---FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA 219
               DS +V    F    P    +   +   E    Y++G++   +G   L   +S F+ 
Sbjct: 294 LVNRDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEV 353

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
                   ++DSG + T L +E+Y  +   F K    +  +  +SL    +  CY+ SS 
Sbjct: 354 DAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSR 409

Query: 268 EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
           E +++P +   F + +   +  RN++      +    FC     T     IIG     G 
Sbjct: 410 ESVEIPTVSFRFPEGRELPLPARNYLIPV---DSVGTFCFAFAPTTSSLSIIGNVQQQGT 466

Query: 326 RIVFDRENLKLAWSHSKC 343
           R+ FD  N  + +S   C
Sbjct: 467 RVGFDIANSLVGFSVDSC 484


>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
          Length = 310

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 115/248 (46%), Gaps = 19/248 (7%)

Query: 113 QSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEND 169
           ++S ++G    Q G  L   A   G++GL    +S+PS LA  G+I N F  C   + N 
Sbjct: 11  KASFVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFGHCITRETNG 70

Query: 170 SGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGAS 226
            G +F GD     +   ++ PI G   + Y    +    G+  L  +G   Q +   G S
Sbjct: 71  GGYMFLGDD-YVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQEL-HAGIPVQVISRCGTS 128

Query: 227 FTFLPTEIYAEVV--VKFD--KLVSSKRISLQGNSWKYCYNASS-EEMLKVPDMRLIFSK 281
           +T+LP E+Y  ++  +K D    V     +     WK  ++  S  + L +   R  F  
Sbjct: 129 YTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRWFVV 188

Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLA 337
            ++F +    +    ++G    CL +++ T+ ++G   I+G   + G  +V+D E  ++ 
Sbjct: 189 PKTFTIVPDDYLIISDKGNV--CLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQIG 246

Query: 338 WSHSKCEE 345
           W++S+C +
Sbjct: 247 WANSECTK 254


>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
          Length = 140

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 66/135 (48%), Gaps = 4/135 (2%)

Query: 116 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSV 173
           +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      G +
Sbjct: 1   IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60

Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPT 232
           +FGD  P ++  T ++P+ E    Y  G+    I N  +     F+A+ DSG+++T +P 
Sbjct: 61  YFGDFNPPSRGVT-WVPMKESXXYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119

Query: 233 EIYAEVVVKFDKLVS 247
           +IY E+V K    +S
Sbjct: 120 QIYNEIVSKVRGTLS 134


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 82/339 (24%), Positives = 139/339 (41%), Gaps = 39/339 (11%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
           ++N   YDP  S S +N++C+ P C+  SS      CK     CPY   Y     ++  +
Sbjct: 232 EQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDF 291

Query: 93  LVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
            ++   ++L S +    +     +V+ GCG    G +   A    ++GLG G +S  S L
Sbjct: 292 ALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL 348

Query: 152 AKAGLIQNSFSICFDENDSGS------VFFGDQGPATQQSTSFL--------PIGEKY-- 195
               L  +SFS C  + DS +      +F  D+   T    +F         P+   Y  
Sbjct: 349 --QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYL 406

Query: 196 --DAYFVGVESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
              + FVG E   I       S   A   ++DSG + ++     Y  +   F + V   +
Sbjct: 407 QIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYK 466

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCLTVM 308
           +         CYN S  + L  P+  + F+     +F V N+   F   +   + CL ++
Sbjct: 467 LVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENY---FIRIQQLDIVCLAML 523

Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            T      IIG        I++D +N +L ++  +C E+
Sbjct: 524 GTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 129/319 (40%), Gaps = 44/319 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           YDP +SS+   V CS   C        + S+C S+++ C Y A Y  + + S GYL  D 
Sbjct: 177 YDPRASSTYATVPCSASQCDELQAATLNPSAC-SVRNVCIYQASYG-DSSFSVGYLSRDT 234

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +   S S          +   GCG+   G +   A   G++GL    +S+   LA +  +
Sbjct: 235 VSFGSGSY--------PNFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--L 281

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDA--YFVGVESYCIGNSCLT- 213
             SFS C       S  +   GP T    S+ P+     DA  YFV +    +G S L  
Sbjct: 282 GYSFSYCLPT--PASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAV 339

Query: 214 ----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG----NSWKYCYNAS 265
                S    ++DSG   T LPT +Y        K V++  + +Q     +    C+   
Sbjct: 340 SPAEYSSLPTIIDSGTVITRLPTAVY----TALSKAVAAAMVGVQSAPAFSILDTCFQGQ 395

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
           + + L+VP + + F+   +  +         ++  T  CL    TD    IIG       
Sbjct: 396 ASQ-LRVPAVAMAFAGGATLKLATQNVLIDVDDSTT--CLAFAPTDSTT-IIGNTQQQTF 451

Query: 326 RIVFDRENLKLAWSHSKCE 344
            +V+D    ++ ++   C 
Sbjct: 452 SVVYDVAQSRIGFAAGGCS 470


>gi|452821303|gb|EME28335.1| aspartyl protease isoform 2 [Galdieria sulphuraria]
          Length = 532

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 151/356 (42%), Gaps = 70/356 (19%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSL--------------KDPCPYIADY 82
           V DR    Y+ ++S++   +SC+ P C + ++C                    C +  +Y
Sbjct: 193 VPDR----YNLANSTTGTVISCNSPTCGA-NTCNQQICSSCSSSQACCSENGICGFFIEY 247

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
             + T+++G L  DI+ +  +S  A  +   +         +T ++L G A  GV+GL  
Sbjct: 248 G-DGTTATGALYQDIVTVGEYSVQATFAGADT---------ETANFLVGKAA-GVLGLAY 296

Query: 143 GDVS--------VPSLLAKAGLIQNSFSICFDENDSGSVFFGD------QGPATQQSTSF 188
             +S        V   L ++  + N FS+  ++ D G+   G       +GP    S + 
Sbjct: 297 SSLSCNPTCISPVFHQLVESFSLPNIFSVLINQ-DIGAFVVGGVNSSLYEGPIEYSSLAN 355

Query: 189 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
               + YD   V +ES  + ++ L+   F A+VD+G +       I+  +   F     +
Sbjct: 356 EQNPQFYD---VTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCN 412

Query: 249 -----KRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENE 298
                   S  G +W    YC N + EE+ ++PD+    +   +  +   +++F    N 
Sbjct: 413 VPGLCPSSSNPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNN 472

Query: 299 GFTV----FCLTVM--------STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            F+     +CL +         ++DG+  I+G    + + +VFDREN ++ ++  K
Sbjct: 473 IFSAASGSYCLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFAKGK 528


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPSSSS+   V CS   C     S C S    C Y   Y  + +S+ G L  +   LA 
Sbjct: 116 FDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYG-DSSSTQGVLATETFTLAK 173

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                   S    V+ GCG    G      A  G++GLG G +S   L+++ GL  + FS
Sbjct: 174 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 218

Query: 163 ICF---DENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS- 210
            C    D+ ++  +  G     ++        Q+T  +    +   Y+V +++  +G++ 
Sbjct: 219 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 278

Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
             L  S F          +VDSG S T+L  + Y  +   F   ++       G     C
Sbjct: 279 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 338

Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           + A ++ + +V   RL+F      +      N++     + G    CLTVM + G   II
Sbjct: 339 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 394

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G       + V+D  +  L+++  +C ++
Sbjct: 395 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 423


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPSSSS+   V CS   C     S C S    C Y   Y  + +S+ G L  +   LA 
Sbjct: 147 FDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYG-DSSSTQGVLATETFTLAK 204

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                   S    V+ GCG    G      A  G++GLG G +S   L+++ GL  + FS
Sbjct: 205 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 249

Query: 163 ICF---DENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS- 210
            C    D+ ++  +  G     ++        Q+T  +    +   Y+V +++  +G++ 
Sbjct: 250 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 309

Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
             L  S F          +VDSG S T+L  + Y  +   F   ++       G     C
Sbjct: 310 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 369

Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           + A ++ + +V   RL+F      +      N++     + G    CLTVM + G   II
Sbjct: 370 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 425

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G       + V+D  +  L+++  +C ++
Sbjct: 426 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 454


>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 362

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 65/128 (50%), Gaps = 12/128 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++ P  SS+ + V C+        +C   K+ C Y  +Y+ E +SS G L +D++   + 
Sbjct: 163 KFQPELSSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSSKGVLGEDLISFGNE 216

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           S   PQ +V      GC   +TG      A DG++GLG GD+S+   L   GLI NSF +
Sbjct: 217 SHLTPQRAV-----FGCKTVETGDLYSQRA-DGIIGLGQGDLSLVGQLVDKGLISNSFGL 270

Query: 164 CFDENDSG 171
           C+   D G
Sbjct: 271 CYGGLDVG 278


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPSSSS+   V CS   C     S C S    C Y   Y  + +S+ G L  +   LA 
Sbjct: 209 FDPSSSSTYATVPCSSASCSDLPTSKCTSASK-CGYTYTYG-DSSSTQGVLATETFTLAK 266

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                   S    V+ GCG    G      A  G++GLG G +S   L+++ GL  + FS
Sbjct: 267 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 311

Query: 163 ICF---DENDSGSVFFGD--------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS- 210
            C    D+ ++  +  G            ++ Q+T  +    +   Y+V +++  +G++ 
Sbjct: 312 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 371

Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
             L  S F          +VDSG S T+L  + Y  +   F   ++       G     C
Sbjct: 372 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 431

Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           + A ++ + +V   RL+F      +      N++     + G    CLTVM + G   II
Sbjct: 432 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 487

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G       + V+D  +  L+++  +C ++
Sbjct: 488 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 516


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 78/327 (23%), Positives = 124/327 (37%), Gaps = 43/327 (13%)

Query: 48  SSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA 101
           ++S S   ++CS   C S      ++C S   PC Y  DY   D S++ G +  D   +A
Sbjct: 150 AASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAY--DYRYRDGSAARGVVGTDSATIA 207

Query: 102 --------SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
                          + +    V++GC     G     +  DGV+ LG  ++S  S    
Sbjct: 208 LSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSS--DGVLSLGNSNISFASR--A 263

Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
           A      FS C        N +  + FG    A    T  L        Y V V++  + 
Sbjct: 264 AARFGGRFSYCLVDHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVA 323

Query: 209 NSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWK 259
              L              A++DSG S T L T  Y  VV    K L    R+++  + ++
Sbjct: 324 GEALDIPADVWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM--DPFE 381

Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GI 316
           YCYN +    L++P M + F+ +         +      G  V C+ V   +G +    +
Sbjct: 382 YCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPG--VKCIGVQ--EGSWPGVSV 437

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IG      H   FD  +  L + H++C
Sbjct: 438 IGNILQQEHLWEFDLRDRWLRFKHTRC 464


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 83/346 (23%), Positives = 145/346 (41%), Gaps = 67/346 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
           S +DPS SSS   + C+HPLCK R    +L   C      + + +  + T + G LV + 
Sbjct: 122 SVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREK 181

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +   +FS+    S     +I+GC  + + +        G++G+ LG +S  S   +A L 
Sbjct: 182 I---TFSR----SQSTPPLILGCAEESSDA-------KGILGMNLGRLSFAS---QAKLT 224

Query: 158 QNSFSICFDEND-------SGSVFFGDQGPA-------------TQQSTSFLPIGEKYDA 197
           +  FS C            +GS + G+   +             +Q+  +  P+     A
Sbjct: 225 K--FSYCVPTRQVRPGFTPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPL-----A 277

Query: 198 YFVGVESYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
           Y V ++   IGN  L    S F        Q ++DSG+ FT+L  E Y +V  +  +LV 
Sbjct: 278 YTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVG 337

Query: 248 S--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 304
           +  K+  + G     C+N ++ E+ + + +M   F K    VV         + G  V C
Sbjct: 338 ARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKGVEIVVEKE--RVLADVGGGVHC 395

Query: 305 LTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           + +  ++       IIG        + FD  N ++ +  + C   +
Sbjct: 396 VGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVGFGKADCSRSV 441


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/326 (23%), Positives = 124/326 (38%), Gaps = 36/326 (11%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           L  +D ++S++ ++V+CS PLC + S        C Y++ Y     S   +L D      
Sbjct: 132 LPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSF---- 187

Query: 102 SFSKHAPQSSVQSSVI-IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +F        V    I  GCG    G +L      G+ G G G +S+PS L         
Sbjct: 188 TFDDGKGGGKVTVPDIGFGCGMYNAGRFLQ--TETGIAGFGRGPLSLPSQLK-----VRQ 240

Query: 161 FSICFD---ENDSGSVFFGDQGPATQQ------STSF---LPIGEKYDAYFVGVESYCIG 208
           FS CF    E  S  VF G  G           ST F   LP G     Y +  +   +G
Sbjct: 241 FSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVG 300

Query: 209 NSCLTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
            + L     +A       +DSG   T  P  ++ ++   F    +   ++   +    C+
Sbjct: 301 KTRLPVPEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALP-VNKTADEDDICF 359

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQN 320
           +   ++   +P +          + R +  +     G    C+ V ST G  D  +IG  
Sbjct: 360 SWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESG--QVCVAV-STSGQMDRTLIGNF 416

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
                 IV+D    KL    ++C+++
Sbjct: 417 QQQNTHIVYDLAAGKLLLVPAQCDKL 442


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/250 (27%), Positives = 105/250 (42%), Gaps = 22/250 (8%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           S YDPS S SS   SCS P C +     +    + C Y+  Y  + +S+SG  + D+L L
Sbjct: 188 SFYDPSRSPSSAPFSCSSPTCTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTL 246

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +        +  S    GC   + GS+   AA  G+M LG G  S+  L   A    N+
Sbjct: 247 DA-------GNAVSGFKFGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNA 295

Query: 161 FSICFDENDSGSVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQ 214
           FS C     S S FF    P    S    T  +   +    Y V + +  +G   L    
Sbjct: 296 FSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAP 355

Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
           + F A  ++DS  + T LP   Y  +   F   ++  R +        CY+ +    +++
Sbjct: 356 AVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRL 415

Query: 273 PDMRLIFSKN 282
           P + L+F +N
Sbjct: 416 PKISLVFDRN 425


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 137/343 (39%), Gaps = 59/343 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DP++SSS +NV+C    C         R+  +  +D CPY   Y  +  ++        
Sbjct: 193 FDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGD------ 246

Query: 98  LHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
           L L SF+ +  AP +S +   V+ GCG    G +   A   G+    L   S   L A  
Sbjct: 247 LALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFAS--QLRAVY 304

Query: 155 GLIQNSFSICFDENDS---GSVFFGDQGPATQQS-------TSFLPIGEKYDA-YFVGVE 203
           G   ++FS C  ++ S     V FG+       +       T+F P     D  Y+V ++
Sbjct: 305 G---HTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLK 361

Query: 204 SYCIGNSCLTQSG------------FQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKR 250
              +G   L  S                ++DSG + ++     Y  +   F D++  S  
Sbjct: 362 GVLVGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYP 421

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFC 304
           +         CYN S  +  +VP++ L+F+          ++ FP    F       + C
Sbjct: 422 LIPDFPVLSPCYNVSGVDRPEVPELSLLFADGA-------VWDFPAENYFIRLDPDGIMC 474

Query: 305 LTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           L V+ T      IIG        +V+D +N +L ++  +C EV
Sbjct: 475 LAVLGTPRTGMSIIGNFQQQNFHVVYDLKNNRLGFAPRRCAEV 517


>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
 gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
 gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
 gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
 gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
          Length = 410

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/324 (20%), Positives = 133/324 (41%), Gaps = 42/324 (12%)

Query: 54  KNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
           K V+C+  LC            C S K  C Y+  Y   D+SS G LV D      FS  
Sbjct: 87  KLVTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--DSSSMGVLVID-----RFSLS 138

Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
           A   +  +++  GCG  Q     +   P D ++GL  G V++ S L   G+I ++    C
Sbjct: 139 ASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 198

Query: 165 FDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 223
                 G +FFGD Q P +  + + +    KY +   G   +   +  ++ +    + DS
Sbjct: 199 ISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDS 258

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           GA++T+   + Y   +      ++S+      ++ +  +   C+    ++++ + +++  
Sbjct: 259 GATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGK-DKIVTIDEVKKC 317

Query: 279 FS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY------GIIGQNFM 322
           F           K  +  +    +     EG    CL ++    ++       +IG   M
Sbjct: 318 FRSLSLEFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEHLSLAGTNLIGGITM 375

Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
           +   +++D E   L W + +C+ +
Sbjct: 376 LDQMVIYDSERSLLGWVNYQCDRI 399


>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 410

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 142/339 (41%), Gaps = 47/339 (13%)

Query: 33  GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDT 87
           G ++  DR    Y P ++     V C  PLC      S+S CK+  D C Y  +Y+ +  
Sbjct: 89  GCTLPHDR---LYKPHNNV----VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYA-DHG 140

Query: 88  SSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLG 143
           SS G LV D   L L + +  AP      ++  GCG  Q   GS L      GV+GLG  
Sbjct: 141 SSIGVLVKDPVPLRLTNGTILAP------NLGFGCGYDQHNGGSQLPPLT-AGVLGLGNS 193

Query: 144 DVSVPSLLAKAGLIQNSFSIC-FDENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAY 198
             ++ + L+    ++N    C   +      F GD  P++    S++PI    G KY A 
Sbjct: 194 KATMATQLSALSHVRNVLGHCFSGQGGGFLFFGGDLVPSS--GMSWMPILRTPGGKYSA- 250

Query: 199 FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGN 256
             G      G + +   G     DSG+S+T+  +++Y  V+      +  +  R + +  
Sbjct: 251 --GPAEVYFGGNPVGIRGLILTFDSGSSYTYFNSQVYGAVLNLLRNGLKGQPLRDAPEDK 308

Query: 257 SWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTV-----FCLTVMST 310
           +   C+   S+    V D+R  F     SF      F  P      +      CL +++ 
Sbjct: 309 TLPICWKG-SKAFKSVADVRNFFKPLALSFGNSKVQFQIPPEAYLIISNLGNVCLGILNG 367

Query: 311 D----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
                G+  +IG   M+   +V+D E  ++ W+ + C +
Sbjct: 368 SQVGLGNVNLIGDISMLDKMMVYDNERQQIGWAPANCSK 406


>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
          Length = 290

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/136 (33%), Positives = 69/136 (50%), Gaps = 7/136 (5%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP SSS+S  +SC    C+S      +SC    + C Y   Y  + + +SGY V D
Sbjct: 121 LNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSD 179

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
           ++H AS  +    ++  +SV+ GC   QTG       A DG+ G G   +SV S L+  G
Sbjct: 180 LMHFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQG 239

Query: 156 LIQNSFSICFDENDSG 171
           +    FS C   ++SG
Sbjct: 240 IAPRVFSHCLKGDNSG 255


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/314 (23%), Positives = 125/314 (39%), Gaps = 31/314 (9%)

Query: 44  EYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           ++DP+ S+S  NVSCS   C     S   C +    C Y   Y  + + S G+   + L 
Sbjct: 177 KFDPTKSTSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYG-DQSYSQGFFATETLT 235

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           ++S       S V ++ + GCG+   G +   A   G+        SV      A   Q 
Sbjct: 236 ISS-------SDVFTNFLFGCGQSNNGLFGQAAGLLGLS-----SSSVSLPSQTAEKYQK 283

Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGV----ESYCIGNSC 211
            FS C     S + +  + G    Q+  F PI   + +++    VG+        I  S 
Sbjct: 284 QFSYCLPSTPSSTGYL-NFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSI 342

Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            T SG  A++DSG   T LP   Y  +   FD+ +S+   +        CY+ S+   + 
Sbjct: 343 FTTSG--AIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVS 400

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVF 329
            P + + F       +      +  N G  + CL   +   D ++GI G +    + +V+
Sbjct: 401 FPKVSVSFKGGVEVDIDASGILYLVN-GVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVY 459

Query: 330 DRENLKLAWSHSKC 343
           D     + ++   C
Sbjct: 460 DGAKGMIGFAAGAC 473


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 103/243 (42%), Gaps = 22/243 (9%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
           L  ++P +SS+S  + CS   C      S + C++  + PC Y   Y  + + +SGY V 
Sbjct: 135 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 193

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D ++  +   +   ++  +S++ GC   Q+G       A DG+ G G   +SV S L   
Sbjct: 194 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
           G+    FS C   +D+G       G   +    + P+      Y + +ES         I
Sbjct: 254 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312

Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
            +S  T S  Q  +VDSG +  +L    Y   V      VS    SL  +GN    C+  
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 369

Query: 265 SSE 267
           SS 
Sbjct: 370 SSR 372


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 85/354 (24%), Positives = 141/354 (39%), Gaps = 75/354 (21%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPC-----PYIADYSTEDTS 88
            +S + P  SSSSK + C +P C          + C +    C     PY+  Y +  T 
Sbjct: 119 RISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTT- 177

Query: 89  SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
             G  + + LHL           +  + ++GC      S      P G+ G G G  S+P
Sbjct: 178 -GGVALSETLHLHGL--------IVPNFLVGC------SVFSSRQPAGIAGFGRGPSSLP 222

Query: 149 SLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQSTSFL-------PIGEKY 195
           S L   GL +  FS C       D  +S S+    Q  + +++ + +       P  +  
Sbjct: 223 SQL---GLTK--FSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDK 277

Query: 196 DA----YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVK 241
            A    Y+V +    IG   +                 ++DSG +FT++ TE +  +  +
Sbjct: 278 PAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNE 337

Query: 242 FDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFS--KNQSFVVRNHIFSFPE 296
           F   V +   +L   +    K C+N S  + L++P +RL F    +    + N+      
Sbjct: 338 FISQVKNYERALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGS 397

Query: 297 NEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV-FDRENLKLAWSHSKCE 344
            E   V C TV+ TDG       G+I  NF M +  V +D +N +L +    C+
Sbjct: 398 RE---VACFTVV-TDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESCK 447


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 136/318 (42%), Gaps = 39/318 (12%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           + +DP++S+S + V C  PLC     ++C      C +   Y+  D+S    L  D L +
Sbjct: 152 APFDPAASASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAV 209

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           A  +  A           GC ++ TG+    A P G++GLG G +S   L     + + +
Sbjct: 210 AGNAVKA--------YTFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEAT 256

Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           FS C       N SG++  G  G P   ++T  L    +   Y+V +    +G   +   
Sbjct: 257 FSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIP 316

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                +G   ++DSG  FT L    Y  V  +  + V +   SL G  +  C+N ++   
Sbjct: 317 AFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA--- 371

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMMGH 325
           +  P M L+F   Q  +   ++     +   T+ CL + +  DG      +I       H
Sbjct: 372 VAWPPMTLLFDGMQVTLPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNH 429

Query: 326 RIVFDRENLKLAWSHSKC 343
           R++FD  N ++ ++  +C
Sbjct: 430 RVLFDVPNGRVGFARERC 447


>gi|452821304|gb|EME28336.1| aspartyl protease isoform 1 [Galdieria sulphuraria]
          Length = 456

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 151/356 (42%), Gaps = 70/356 (19%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------------CPYIADY 82
           V DR    Y+ ++S++   +SC+ P C + ++C                    C +  +Y
Sbjct: 117 VPDR----YNLANSTTGTVISCNSPTCGA-NTCNQQICSSCSSSQACCSENGICGFFIEY 171

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
             + T+++G L  DI+ +  +S  A  +   +         +T ++L G A  GV+GL  
Sbjct: 172 G-DGTTATGALYQDIVTVGEYSVQATFAGADT---------ETANFLVGKAA-GVLGLAY 220

Query: 143 GDVS--------VPSLLAKAGLIQNSFSICFDENDSGSVFFGD------QGPATQQSTSF 188
             +S        V   L ++  + N FS+  ++ D G+   G       +GP    S + 
Sbjct: 221 SSLSCNPTCISPVFHQLVESFSLPNIFSVLINQ-DIGAFVVGGVNSSLYEGPIEYSSLAN 279

Query: 189 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
               + YD   V +ES  + ++ L+   F A+VD+G +       I+  +   F     +
Sbjct: 280 EQNPQFYD---VTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCN 336

Query: 249 -----KRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENE 298
                   S  G +W    YC N + EE+ ++PD+    +   +  +   +++F    N 
Sbjct: 337 VPGLCPSSSNPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNN 396

Query: 299 GFTV----FCLTVM--------STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            F+     +CL +         ++DG+  I+G    + + +VFDREN ++ ++  K
Sbjct: 397 IFSAASGSYCLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFAKGK 452


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 142/380 (37%), Gaps = 76/380 (20%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
           C+L  G +  ++   S   P  SS++++V C    C +  S     D C  IAD   E  
Sbjct: 116 CILCEGKA--ENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCA-IADCPLESI 172

Query: 88  SSS----------------GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 131
            +S                G LV  + H +     A  S    +   GC           
Sbjct: 173 ETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATPSLSLHNFTFGCAHTAL------ 226

Query: 132 AAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGS--------VFFGDQGPAT 182
           A P GV G G G +S+P+ LA  A  + N FS C   +   S        +  G      
Sbjct: 227 AEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKE 286

Query: 183 QQ---------STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF----------QALVDS 223
           ++          TS L   +    Y VG+E   IG   +    F            +VDS
Sbjct: 287 KRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDS 346

Query: 224 GASFTFLPTEIYAEVVVKFDKLV-----SSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           G +FT LP  +Y  VV +FD  V      +K +         CY    + ++ +P + L 
Sbjct: 347 GTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVE-DKTGLGPCYYY--DTVVNIPSLVLH 403

Query: 279 FSKNQSFVV---RNHIFSFPE-----NEGFTVFCLTVMS-------TDGDYGIIGQNFMM 323
           F  N+S VV   +N+ + F +          V CL +M+       T G    +G     
Sbjct: 404 FVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQH 463

Query: 324 GHRIVFDRENLKLAWSHSKC 343
           G  +V+D E  ++ ++  KC
Sbjct: 464 GFEVVYDLEQRRVGFARRKC 483


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/323 (24%), Positives = 136/323 (42%), Gaps = 43/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP++S+S ++V C  PLC     ++C      C +   Y+  D+S    L  D L +A 
Sbjct: 152 FDPAASTSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVA- 208

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                       +   GC +K TG+    A P G++GLG G +S   L     + Q +FS
Sbjct: 209 -------GDAVKTYTFGCLQKATGT---AAPPQGLLGLGRGPLSF--LSQTRDMYQGTFS 256

Query: 163 ICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
            C       N SG++  G  G P   ++T  L    +   Y+V +    +G   +     
Sbjct: 257 YCLPSFKSLNFSGTLRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPP 316

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                  +G   ++DSG  FT L    Y  V  +  + V +   SL G  +  C+N ++ 
Sbjct: 317 ALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA- 373

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMM 323
             +  P + L+F   Q  +   ++     +   T+ CL + +  DG      +I      
Sbjct: 374 --VAWPPVTLLFDGMQVTLPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQ 429

Query: 324 GHRIVFDRENLKLAWSHSKCEEV 346
            HR++FD  N ++ ++  +C  V
Sbjct: 430 NHRVLFDVPNGRVGFARERCTAV 452


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 138/316 (43%), Gaps = 26/316 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLK-------DPCPYIADYSTEDTSSSGYLVDDI 97
           +DP++SS+   V C    C+  +S  S +         CPY   Y  +D+ + G L  D 
Sbjct: 181 FDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYD-DDSHTVGDLARDT 239

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L L+     +P  +V    + GCG    G++ +    DG++GLGLG  S+PS +  A   
Sbjct: 240 LTLSPSPSPSPADTVPG-FVFGCGHSNAGTFGE---VDGLLGLGLGKASLPSQV--AARY 293

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLT-- 213
             +FS C   + S + +    G A + +  F  +  G+   +Y++ +    +    +   
Sbjct: 294 GAAFSYCLPSSPSAAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVP 353

Query: 214 QSGFQ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSE 267
            S F      ++DSG +F+ LP   YA +   F   +   R     +S  +  CY+ +  
Sbjct: 354 ASAFATAAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGH 413

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
           E +++P + L+F+   +  +      +  N+     CL  +  + D GI+G        +
Sbjct: 414 ETVRIPAVELVFADGATVHLHPSGVLYTWND-VAQTCLAFVP-NHDLGILGNTQQRTLAV 471

Query: 328 VFDRENLKLAWSHSKC 343
           ++D  + ++ +    C
Sbjct: 472 IYDVGSQRIGFGRKGC 487


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 67/250 (26%), Positives = 105/250 (42%), Gaps = 22/250 (8%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           S YDPS S +S   SCS P C +     +    + C Y+  Y  + +S+SG  + D+L L
Sbjct: 58  SFYDPSRSPTSAAFSCSSPTCTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTL 116

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +        +  S    GC   + GS+   AA  G+M LG G  S+  L   A    N+
Sbjct: 117 DA-------GNAVSGFKFGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNA 165

Query: 161 FSICFDENDSGSVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQ 214
           FS C     S S FF    P    S    T  +   +    Y V + +  +G   L    
Sbjct: 166 FSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAP 225

Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
           + F A  ++DS  + T LP   Y  +   F   ++  R +        CY+ +    +++
Sbjct: 226 AVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRL 285

Query: 273 PDMRLIFSKN 282
           P + L+F +N
Sbjct: 286 PKISLVFDRN 295


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 127/318 (39%), Gaps = 41/318 (12%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVD 95
           L  +D S+SS+    SC   LC+    +SC + K      C Y   Y+ +  ++    VD
Sbjct: 22  LPYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVD 81

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
                A  S           V  GCG    G +       G+ G G G +S+PS L K G
Sbjct: 82  KFTFGAGASV--------PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG 130

Query: 156 LIQNSFSICFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYC 206
               +FS CF   +     +V           G    QST  +        Y++ ++   
Sbjct: 131 ----NFSHCFTAVNGLKQSTVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGIT 186

Query: 207 IGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 257
           +G++          LT      ++DSG S T LP ++Y  V  +F   +    +      
Sbjct: 187 VGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATG 246

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGI 316
              C++A S+    VP + L F      + R N++F  P++ G ++ CL +   D +  I
Sbjct: 247 PYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTI 305

Query: 317 IGQNFMMGHRIVFDRENL 334
           IG        +++D +N+
Sbjct: 306 IGNFQQQNMHVLYDLQNM 323


>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
          Length = 775

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 68/324 (20%), Positives = 133/324 (41%), Gaps = 42/324 (12%)

Query: 54  KNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
           K V+C+  LC            C S K  C Y+  Y   D+SS G LV D      FS  
Sbjct: 452 KLVTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--DSSSMGVLVID-----RFSLS 503

Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
           A   +  +++  GCG  Q     +   P D ++GL  G V++ S L   G+I ++    C
Sbjct: 504 ASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 563

Query: 165 FDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 223
                 G +FFGD Q P +  + + +    KY +   G   +   +  ++ +    + DS
Sbjct: 564 ISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDS 623

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNASSEEMLKVPDMRLI 278
           GA++T+   + Y   +      ++S+      ++ +  +   C+    ++++ + +++  
Sbjct: 624 GATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGK-DKIVTIDEVKKC 682

Query: 279 FS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY------GIIGQNFM 322
           F           K  +  +    +     EG    CL ++    ++       +IG   M
Sbjct: 683 FRSLSLEFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEHLSLAGTNLIGGITM 740

Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
           +   +++D E   L W + +C+ +
Sbjct: 741 LDQMVIYDSERSLLGWVNYQCDRI 764



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 118/318 (37%), Gaps = 31/318 (9%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ-TGSYLDGAAP 134
           C Y   Y+ +  S+ G L+ D   L       P+ + + ++  GCG  Q  G      +P
Sbjct: 29  CDYEIKYA-DGASTIGALIVDQFSL-------PRIATRPNLPFGCGYNQGIGENFQQTSP 80

Query: 135 -DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 192
            +G++GL  G VS  S L   G+I ++    C      G +F GD         + + + 
Sbjct: 81  VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGDG------DGNLVLLH 134

Query: 193 EKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
             Y  Y  G  +       L  +    + DSG+++T+   + Y   V      +SS  + 
Sbjct: 135 ANY--YSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSLE 192

Query: 253 -LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLT 306
            +   S   C+    +    V D++  F   Q     N +   P      V      CL 
Sbjct: 193 QVSDPSLPLCWKGQ-KAFESVFDVKKEFKSLQLNFGNNAVMEIPPENYLIVTEYGNVCLG 251

Query: 307 VM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 365
           ++     ++ IIG   M    +++D E  +L W    C    D S       P+ +    
Sbjct: 252 ILHGCRLNFNIIGDITMQDQMVIYDNEREQLGWIRGSC----DGSQEAPTQAPSAEEVVG 307

Query: 366 LPTTEQQSTSNGQAAAPP 383
                + S + G   APP
Sbjct: 308 AAARREASQATGSYLAPP 325


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/308 (23%), Positives = 134/308 (43%), Gaps = 42/308 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP SS + +++SC    C++    SSC S +  C Y + Y  + + ++G L  D + L 
Sbjct: 135 FDPKSSKTYRDLSCDTRQCQNLGESSSCSS-EQLCQY-SYYYGDRSFTNGNLAVDTVTLP 192

Query: 102 SFSK---HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           S +    + P++      +IGCGR+  G++       G++GLG G +S+ S +  +  + 
Sbjct: 193 STNGGPVYFPKT------VIGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VG 242

Query: 159 NSFSICF------DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNS 210
             FS C          +S  + FG     +       P+  K     Y++ +E+  +G+ 
Sbjct: 243 GKFSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDK 302

Query: 211 CL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCY 262
            +         S    ++DSG S T  P   + E     +  +++ +R         +CY
Sbjct: 303 KIEFGGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCY 362

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQ- 319
             + +  LKVP +   F+     +   + F    ++   V CL   ST     +G + Q 
Sbjct: 363 RPTPD--LKVPVITAHFNGADVVLQTLNTFILISDD---VLCLAFNSTQSGAIFGNVAQM 417

Query: 320 NFMMGHRI 327
           NF++G+ I
Sbjct: 418 NFLIGYDI 425


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 89/317 (28%), Positives = 146/317 (46%), Gaps = 40/317 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP SSSS   +SC+   CK   +++C S  D C Y   Y  + + ++G L  + L   +
Sbjct: 193 FDPKSSSSYSPLSCNSQQCKLLDKANCNS--DTCIYQVHYG-DGSFTTGELATETLSFGN 249

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            S   P      ++ IGCG    G +  GA    ++GLG G +S+ S L  +     SFS
Sbjct: 250 -SNSIP------NLPIGCGHDNEGLFAGGAG---LIGLGGGAISLSSQLKAS-----SFS 294

Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL------ 212
            C    D + S ++ F    P+    TS L   +++ +Y +V V    +G   L      
Sbjct: 295 YCLVNLDSDSSSTLEFNSNMPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTR 353

Query: 213 ---TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
               +SG    +VDSG   + LP+++Y  +   F KL SS   +   + +  CYN S + 
Sbjct: 354 FEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQS 413

Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
            ++VP +  + S+  S  +  RN++    +  G   +CL  + T     IIG     G R
Sbjct: 414 NVEVPTIAFVLSEGTSLRLPARNYLIML-DTAG--TYCLAFIKTKSSLSIIGSFQQQGIR 470

Query: 327 IVFDRENLKLAWSHSKC 343
           + +D  N  + +S +KC
Sbjct: 471 VSYDLTNSLVGFSTNKC 487


>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
          Length = 154

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 70/138 (50%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
           + ++  GCG KQ        +P DG++GLG+G     + L    +I +N    C      
Sbjct: 6   KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P T+  T ++P+ E    Y  G+ +  I    +  +  F+A+ DSG+++T+
Sbjct: 66  GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTY 124

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E+V K    +S
Sbjct: 125 MPAQIYNELVSKIRGTLS 142


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 140/371 (37%), Gaps = 74/371 (19%)

Query: 21  PVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------------ 68
           P TT   C      S    + +  ++P  SSS K + C  P C + SS            
Sbjct: 114 PCTTHYTCT---NCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCNG 170

Query: 69  -CKSLKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
             K     CP Y   Y T   ++SG+ + + L     + H          ++GC    T 
Sbjct: 171 NSKKCSHACPQYTLQYGTG--AASGFFLLENLDFPGKTIH--------KFLVGC----TT 216

Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGP 180
           S     + D + G G    S+P  +         F+ C + +D      SG +   D   
Sbjct: 217 SADREPSSDALAGFGRTMFSLPMQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYSD 270

Query: 181 ATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQ----------ALVDSGAS 226
              Q  S+ P  +        Y++GV+   IGN  L   G             ++DSG +
Sbjct: 271 GETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFA 330

Query: 227 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQ 283
           + ++   ++  V  +  K +S  R SL+  +      CYN +  + +K+PD+   F+   
Sbjct: 331 YGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGA 390

Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMS---------TDGDYGIIGQNFMMGHRIVFDRE 332
           + VV   N+   F E    ++ C  V +         T G   I+G    + H + FD +
Sbjct: 391 NMVVPGMNYFLLFSEA---SLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLK 447

Query: 333 NLKLAWSHSKC 343
           N +L +    C
Sbjct: 448 NERLGFRQQTC 458


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 130/315 (41%), Gaps = 50/315 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           Y P+ S++  NVSC  P+C++     S C      C Y   Y  + TS+ G L  +   L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTL 193

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S        +    V  GCG +  GS  + +   G++G+G G +   SL+++ G+ +  
Sbjct: 194 GS-------DTAVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTRPR 240

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
            S        G        P T      + +G+      + ++      + +   G   +
Sbjct: 241 RSCRARAAARGGGA-----PTTTSPLEGITVGDT----LLPIDPAVFRLTPMGDGGV--I 289

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRL 277
           +DSG +FT L    +   V     L S  R+ L   +      C+ A+S E ++VP + L
Sbjct: 290 IDSGTTFTALEERAF---VALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVL 346

Query: 278 IFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
            F       + +S+VV        E+    V CL ++S  G   ++G        I++D 
Sbjct: 347 HFDGADMELRRESYVV--------EDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDL 397

Query: 332 ENLKLAWSHSKCEEV 346
           E   L++  +KC E+
Sbjct: 398 ERGILSFEPAKCGEL 412


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/322 (25%), Positives = 127/322 (39%), Gaps = 44/322 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           YDP +SS+   V CS P C        + SSC S    C Y A Y  + + S GYL  D 
Sbjct: 151 YDPRASSTYAAVPCSAPQCAELQAATLNPSSC-SGSGVCQYQASYG-DGSFSFGYLSKDT 208

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           + L+S       S        GCG+   G +   A   G++GL    +S+ S LA +  +
Sbjct: 209 VSLSS-------SGSFPGFYYGCGQDNVGLFGRAA---GLIGLARNKLSLLSQLAPS--V 256

Query: 158 QNSFSICFDEN---DSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
            NSF+ C   +    +G + FG    ++ P     TS +        YFV +    +  S
Sbjct: 257 GNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGS 316

Query: 211 CLT-----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW---KYCY 262
            L            ++DSG   T LPT +Y        K V +   +    ++   + C+
Sbjct: 317 PLAVPSSEYGSLPTIIDSGTVITRLPTPVY----TALSKAVGAALAAPSAPAYSILQTCF 372

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
                + L VP + + F+   +  +         NE  T  CL    TD    IIG    
Sbjct: 373 KGQVAK-LPVPAVNMAFAGGATLRLTPGNVLVDVNE--TTTCLAFAPTDST-AIIGNTQQ 428

Query: 323 MGHRIVFDRENLKLAWSHSKCE 344
               +V+D +  ++ ++   C 
Sbjct: 429 QTFSVVYDVKGSRIGFAAGGCS 450


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 68/313 (21%), Positives = 123/313 (39%), Gaps = 33/313 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  NVSC+   C    +       C Y   Y  + + + G+   D L +A   
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--- 261

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                         GCG K  G +   A   G+MGLG G  S+   +        +F+ C
Sbjct: 262 -----HDAIKGFRFGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYC 311

Query: 165 FDENDSGSVFFGDQGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCL 212
                +G+ +  D GP +  + + L P+    G+ +  Y+VG+    +G        S  
Sbjct: 312 LPALTTGTGYL-DFGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVF 368

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
           + +G   LVDSG   T LP   Y  +   FDK++ ++  + +   +    CY+ +    +
Sbjct: 369 STAG--TLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDV 426

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           ++P + L+F       V      +  +E            D    I+G      + +++D
Sbjct: 427 ELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYD 486

Query: 331 RENLKLAWSHSKC 343
                + ++   C
Sbjct: 487 LGKKTVGFAPGSC 499


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 132/316 (41%), Gaps = 33/316 (10%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++DP+ SSS   V C  P+C +    C      C Y   Y  + +S++G L  D L   S
Sbjct: 179 DFDPAKSSSYAAVPCGTPVCAAAGGMCNGTT--CLYGVQYG-DGSSTTGVLSRDTLTFNS 235

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            SK        +    GCG K  G   D    DG++GLG G +S+PS  A +      FS
Sbjct: 236 SSKF-------TGFTFGCGEKNIG---DFGEVDGLLGLGRGKLSLPSQAAPS--FGGVFS 283

Query: 163 ICFDENDS--GSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGN-------S 210
            C    ++  G +  G   P +    Q T+ +   +    YF+ + S  IG        S
Sbjct: 284 YCLPSYNTTPGYLNIGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPS 343

Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
             T++G   L+DSG   T+LP   Y  +  +F   +   + +        CY+ + +  +
Sbjct: 344 VFTKTG--TLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAI 401

Query: 271 KVPDMRLIFSKNQSFVVRNH-IFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRI 327
            +P +   FS    F +  + I  FP++    + CL  +S      + I+G        +
Sbjct: 402 VIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEV 461

Query: 328 VFDRENLKLAWSHSKC 343
           ++D  + K+ +    C
Sbjct: 462 IYDVPSQKIGFIPISC 477


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/318 (22%), Positives = 116/318 (36%), Gaps = 41/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ S++  N+SCS   C            C Y   Y  + + + G+   D L LA   
Sbjct: 204 FDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--- 259

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                     +   GCG K  G +   A   G++GLG G  S+P     K G +   F+ 
Sbjct: 260 -----YDTIKNFRFGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAY 308

Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG----- 216
           C     +G+ F   G   PA     + + +      Y+VG+    +G   L   G     
Sbjct: 309 CLPATSAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFST 368

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---------YCYNASSE 267
              LVDSG   T LP   YA +   F K       ++QG  +           CY+ +  
Sbjct: 369 AGTLVDSGTVITRLPPSAYAPLRSAFSK-------AMQGLGYSAAPAFSILDTCYDLTGH 421

Query: 268 E--MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
           +   + +P + L+F       V      +  +           + D D  I+G      H
Sbjct: 422 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTH 481

Query: 326 RIVFDRENLKLAWSHSKC 343
            +++D     + ++   C
Sbjct: 482 GVLYDIGKKIVGFAPGAC 499


>gi|145351657|ref|XP_001420185.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580418|gb|ABO98478.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 498

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 63/283 (22%), Positives = 121/283 (42%), Gaps = 39/283 (13%)

Query: 91  GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVP 148
           GY+ +D   L    + AP     + +  GCG      Y DG+    DG+ G   G+ +  
Sbjct: 166 GYMAEDTFTLGD--ELAP-----AKITFGCGGMY---YPDGSNLRQDGMAGFSRGNTAFH 215

Query: 149 SLLAKAGLIQ-NSFSICFDENDS-------GSVFFGDQGPATQQSTSFLPIGEKYDAYFV 200
           + LAKAG+I  + F  C +  ++       G   FG + P    +     +GE  D   V
Sbjct: 216 TQLAKAGVIDAHVFGFCSEGMETSTAMLTLGRYNFGRRVPELAWTRM---LGE--DDLAV 270

Query: 201 GVESYCIGNSCL-TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
              S+ +G+  + + S    ++DSG + T LP+ ++ + +   ++   S  +S+      
Sbjct: 271 RTMSWKLGDKTIASSSNVYTVLDSGTTLTVLPSAMHHDFMTHLNETARSAGLSVVVRGTH 330

Query: 260 YCYNASSEEMLK-------VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-- 310
             Y    +  L         P + + +  + + V+R   + F +      FC  +MS   
Sbjct: 331 CFYENQRQSSLTQYTLTRWFPSLTITYDPDVTLVLRPENYLFADTVNLHAFCAGIMSASD 390

Query: 311 ----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
               +G+  I+GQ  +    + +D EN ++  +  +CE++ +K
Sbjct: 391 AALANGEQIILGQQTLRNTFVEYDLENSRVGMATVQCEKLREK 433


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 68/313 (21%), Positives = 123/313 (39%), Gaps = 33/313 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  NVSC+   C    +       C Y   Y  + + + G+   D L +A   
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--- 261

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                         GCG K  G +   A   G+MGLG G  S+   +        +F+ C
Sbjct: 262 -----HDAIKGFRFGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYC 311

Query: 165 FDENDSGSVFFGDQGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCL 212
                +G+ +  D GP +  + + L P+    G+ +  Y+VG+    +G        S  
Sbjct: 312 LPALTTGTGYL-DFGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVF 368

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
           + +G   LVDSG   T LP   Y  +   FDK++ ++  + +   +    CY+ +    +
Sbjct: 369 STAG--TLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDV 426

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           ++P + L+F       V      +  +E            D    I+G      + +++D
Sbjct: 427 ELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYD 486

Query: 331 RENLKLAWSHSKC 343
                + ++   C
Sbjct: 487 LGKKTVGFAPGSC 499


>gi|340500865|gb|EGR27703.1| plasmepsin 5, putative [Ichthyophthirius multifiliis]
          Length = 602

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 76/348 (21%), Positives = 135/348 (38%), Gaps = 53/348 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA--- 101
           YD   S ++K   C       +  C +    C +   Y+ E +S SGY+  D + L    
Sbjct: 88  YDLEKSLTAKKEKCKSTKLSCQGYCNNFSQECNWSVSYA-EGSSISGYMAGDYVVLGDEM 146

Query: 102 ----------SFSKHAPQSSV----QSSVII--GCGRKQTGSYLDGAAPDGVMGLGLGDV 145
                       S+   Q  +      SV +  GC   +T  +L    PDG++GL   D 
Sbjct: 147 QDYIEKLTKNQISEKEEQEYLTYIKHESVFLNFGCTTNETNLFL-SQVPDGIIGLAPSDK 205

Query: 146 S--------VPSLLAKAGLIQNS----FSICFDENDSGSVFFGDQGPATQQS---TSFLP 190
           S        V  +  K    QN+    FS+C +    G +  G       +    T  +P
Sbjct: 206 SGRANTGNIVDEIFKKHK--QNNETHVFSLCLNAEKGGYMSVGGYNYELHEKNARTQIIP 263

Query: 191 IGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
                  Y V ++   I N+ +  +    ++DSG +    P+ I   ++ K ++L  S++
Sbjct: 264 FDSDSGYYSVSIKQILIQNNVIVTNIGYTIIDSGTTIVLGPSRIINPIIQKINELCESEQ 323

Query: 251 ISLQGNSW-------KYCYNASSEE------MLKVPDMRLIFSKNQSFVVR--NHIFSFP 295
            S  G+         K+ YN S  E          P++   F   Q  V +   +++   
Sbjct: 324 YSCGGSKKNGDKQQSKFLYNPSKYENNVNNFFDSFPNIDFKFENGQVIVWKPSAYLYIDR 383

Query: 296 ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +N    ++     + +     +G  FM  + I+FDR+N ++ ++ SKC
Sbjct: 384 KNGYKNLYQFGFEAYESGKLYLGGPFMKNYDILFDRDNQEIHFTASKC 431


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 134/325 (41%), Gaps = 38/325 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DPS SS+  +V C  P CK       +C      C Y   Y  + + + G L  +   L
Sbjct: 169 FDPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTT--CEYSVKYG-DQSVTRGNLAQEAFTL 225

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGL 156
                 +P +   + V+ GC  + + S + GA  +    G++GLG GD S+ S   + G 
Sbjct: 226 ------SPSAPPAAGVVFGCSHEYS-SGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGN 277

Query: 157 IQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNS 210
             + FS C     S  G +  G   P  Q + SF P+     +    Y V +    +  +
Sbjct: 278 SGDVFSYCLPPRGSSAGYLTIGAAAPP-QSNLSFTPLVTDNSQLSSVYVVNLVGISVSGA 336

Query: 211 CL--TQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNA 264
            L    S F    ++DSG   T +P   Y  +  +F + +    +  +G+  S   CY+ 
Sbjct: 337 ALPIDASAFYIGTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDV 396

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNH----IFSF-PENEGFTVFCLTVMSTD-GDYGIIG 318
           +  +++  P + L F       V       +F+     +  T+ CL  + T+   + IIG
Sbjct: 397 TGHDVVTAPPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIG 456

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                 + +VFD E  ++ +  + C
Sbjct: 457 NMQQRAYNVVFDVEGRRIGFGANGC 481


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 87/349 (24%), Positives = 157/349 (44%), Gaps = 68/349 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK---DP---CPYIADYSTEDTSSSGYLVDD 96
           S ++P +S +   + CS   CK+R+S  +L    DP   C +I  Y+ + +S  G+L  +
Sbjct: 103 SIFNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYA-DASSVEGHLAFE 161

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
                S ++ A         + GC    + S  +  A   G+MG+  G +S    + + G
Sbjct: 162 TFRFGSLTRPA--------TVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLS---FVNQMG 210

Query: 156 LIQNSFSICFDENDS-GSVFFGDQ----------GPATQQSTSFLPIGEKYDAYFVGVES 204
                FS C    DS G +  G+            P  Q ST  LP  ++  AY V +E 
Sbjct: 211 F--RKFSYCISGLDSTGFLLLGEARYSWLKPLNYTPLVQISTP-LPYFDRV-AYSVQLEG 266

Query: 205 YCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-------DKLV 246
             + N  L           T +G Q +VDSG  FTFL   +Y+ +  +F        +++
Sbjct: 267 IKVNNKVLPLPKSVFVPDHTGAG-QTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVL 325

Query: 247 SSKRISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFP-ENEGF-T 301
           +  +   QG +   CY  +++S  +  +P ++L+F   +  V  +  ++  P E  G  +
Sbjct: 326 NEPQYVFQG-AMDLCYLIDSTSSTLPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDS 384

Query: 302 VFCLTVMSTDGDYGIIGQNFMMGHR------IVFDRENLKLAWSHSKCE 344
           V+C T  ++D + GI   +F++GH       + +D EN ++ ++  +C+
Sbjct: 385 VWCFTFGNSD-ELGI--SSFLIGHHQQQNVWMEYDLENSRIGFAELRCD 430


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/315 (23%), Positives = 126/315 (40%), Gaps = 23/315 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           ++P  SS+ K  +C    C     S+  C  L   C Y   Y  + + S G L  + L  
Sbjct: 131 FEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQ-CIYGIMYG-DKSFSVGILGTETLSF 188

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            S      Q+    + I GCG     +        G+ GLG G +S+ S L     I + 
Sbjct: 189 GS--TGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHK 244

Query: 161 FSIC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT- 213
           FS C   +D   +  + FG +   T       P+  K      YF+ +E+  IG   ++ 
Sbjct: 245 FSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVST 304

Query: 214 -QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
            Q+    ++DSG   T+L    Y   V    + +  K +    +  K C+   +   L +
Sbjct: 305 GQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCF--PNRANLAI 362

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDR 331
           PD+   F+   S  +R      P  +   + CL V+ + G    + G       ++ +D 
Sbjct: 363 PDIAFQFT-GASVALRPKNVLIPLTDS-NILCLAVVPSSGIGISLFGSIAQYDFQVEYDL 420

Query: 332 ENLKLAWSHSKCEEV 346
           E  K++++ + C +V
Sbjct: 421 EGKKVSFAPTDCAKV 435


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 76/291 (26%), Positives = 119/291 (40%), Gaps = 39/291 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP+ SSS   V C    C       S+C + +  C Y+  Y  + ++++G    D L L
Sbjct: 181 FDPAQSSSYAAVPCGRSACAGLGIYASACSAAQ--CGYVVSYG-DGSNTTGVYSSDTLTL 237

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
           A+       ++VQ   + GCG  Q+G    G   DG++G G      PSL+ + AG    
Sbjct: 238 AA------NATVQ-GFLFGCGHAQSGGLFTGI--DGLLGFGR---EQPSLVQQTAGAYGG 285

Query: 160 SFSICFDENDSGSVFFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
            FS C     S + +    GP+       +T  LP       Y V +    +G   L+  
Sbjct: 286 VFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVP 345

Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            S F A  +VD+G   T LP   YA +   F   ++S   +        CY+ +    + 
Sbjct: 346 ASAFAAGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVN 405

Query: 272 VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQ 319
           +  + L FS   +  +  + I SF         CL   S  +DG   I+G 
Sbjct: 406 LTSVALTFSSGATMTLGADGIMSF--------GCLAFASSGSDGSMAILGN 448


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 74/324 (22%), Positives = 121/324 (37%), Gaps = 34/324 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           +DP+ S++   V C+   C +          SC    + C Y   Y  + + S G L  D
Sbjct: 232 FDPAGSATYAAVRCNASACAASLKAATGTPGSCGGGNERCYYALAYG-DGSFSRGVLATD 290

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS--LLAKA 154
            + L   S            + GCG    G +       G+MGLG  ++S+ S   L   
Sbjct: 291 TVALGGAS--------LDGFVFGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTALRYG 339

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGN 209
           G+           + SGS+  G    + + +T         D      YF+ V    +G 
Sbjct: 340 GVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG 399

Query: 210 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNS-WKYCYNA 264
           + L   G  A   L+DSG   T L   +Y  V  +F +   ++   +  G S    CY+ 
Sbjct: 400 TALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDL 459

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFM 322
           +  + +KVP + L         V      F   +  +  CL +  +S +    IIG    
Sbjct: 460 TGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQ 519

Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
              R+V+D    +L ++   C  V
Sbjct: 520 KNKRVVYDTVGSRLGFADEDCNYV 543


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 92/357 (25%), Positives = 137/357 (38%), Gaps = 66/357 (18%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY----IADYSTEDTS-----SSGYLVDDI 97
           P + S S  +SC    C +  +  S  D C      + +  T D S     S  Y   D 
Sbjct: 139 PLNVSKSSLISCKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDG 198

Query: 98  LHLASFSKH---APQSSVQ----SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
             +A   KH    P +S +         GC     G       P GV G G G +S+P+ 
Sbjct: 199 SLIAKLHKHNLIMPSTSNKPFSLKDFTFGCAHSALGE------PIGVAGFGFGSLSLPAQ 252

Query: 151 LAKAGL-IQNSFSIC-----FDENDS--------GSVFFGDQGPATQQSTSFLPIGEKYD 196
           LA     + N FS C     FD            G V   D    TQ   + +    K+ 
Sbjct: 253 LANLSPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHP 312

Query: 197 AYF-VGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 245
            ++ V +E+  +G+S +                 +VDSG ++T LPT  Y  V  + D+ 
Sbjct: 313 YFYSVSMEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRR 372

Query: 246 V------SSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQSFVV--RNHIFSF- 294
           V      +S+  S  G S  Y    +  E L   VP +   F  N S V+  RN+ + F 
Sbjct: 373 VGRVFKRASETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFL 432

Query: 295 ---PENEGFTVFCLTVM-----STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
               E +G  V CL +M     S  G    +G     G ++V+D E  ++ ++  KC
Sbjct: 433 DGEDEKKGRKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 75/303 (24%), Positives = 136/303 (44%), Gaps = 30/303 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP +SS+ K+VSCS   C   ++++SC +  + C Y   Y  +++ + G +  D L L 
Sbjct: 132 FDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYG-DNSYTKGNIAVDTLTLG 190

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           S      Q     ++IIGCG    G++       G++GLG G VS+   L  +  I   F
Sbjct: 191 SSDTRPMQ---LKNIIIGCGHNNAGTF--NKKGSGIVGLGGGPVSLIKQLGDS--IDGKF 243

Query: 162 SICF-----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCL- 212
           S C       ++ +  + FG     +     ST  +    +   Y++ ++S  +G+  + 
Sbjct: 244 SYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQ 303

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                   S    ++DSG + T LPTE Y+E+       + +++     +    CY+A+ 
Sbjct: 304 YSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG 363

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGH 325
           +  LKVP + + F      +  ++ F    +E    F      +   YG + Q NF++G+
Sbjct: 364 D--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGY 420

Query: 326 RIV 328
             V
Sbjct: 421 DTV 423


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 70/300 (23%), Positives = 121/300 (40%), Gaps = 30/300 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  NVSC+ P C   +        C Y   Y  + + S G+   D L L+S+ 
Sbjct: 223 FDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 280

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 281 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 328

Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
           C     +G+ +  FG    A  ++    P+    G  +  Y+VG+    +G   L+  QS
Sbjct: 329 CLPARSTGTGYLDFGAGSLAAARARLTTPMLTENGPTF--YYVGMTGIRVGGQLLSIPQS 386

Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
            F     +VDSG   T LP   Y+ +   F   ++++  + +   +    CY+ +    +
Sbjct: 387 VFATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQV 446

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
            +P + L+F       V      +  +              GD GI+G   +    + +D
Sbjct: 447 AIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 506


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 75/316 (23%), Positives = 126/316 (39%), Gaps = 34/316 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS S+S   VSC    C+    ++C++    C Y   Y  + + + G    + L L  
Sbjct: 208 FDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG- 265

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S+   +V IGCG    G ++  A    + G  L   S PS ++      ++FS
Sbjct: 266 ------DSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFS 311

Query: 163 ICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL------ 212
            C  + DS    ++ FGD        T+ L    +    Y+V +    +G   L      
Sbjct: 312 YCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASA 371

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                T      +VDSG + T L +  YA +   F +   S   +   + +  CY+ S  
Sbjct: 372 FAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDR 431

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             ++VP + L F    +  +    +  P  +G   +CL    T+    IIG     G R+
Sbjct: 432 TSVEVPAVSLRFEGGGALRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRV 490

Query: 328 VFDRENLKLAWSHSKC 343
            FD     + ++ +KC
Sbjct: 491 SFDTARGAVGFTPNKC 506


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 137/327 (41%), Gaps = 43/327 (13%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           LS Y+ S+SS+S   SCS PLC       SRS   S    C Y   Y  + TS   Y+ D
Sbjct: 127 LSIYNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNS---ACAYGISYQDKSTSIGAYVKD 183

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D+ ++         ++  S +  GC    TGS+      DG+MG G    +VP+ +A   
Sbjct: 184 DMHYVLQ-----GGNATTSHIFFGCAINITGSW----PADGIMGFGQISKTVPNQIATQR 234

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
            +   FS C   +++  G + FG++   T+    F P+      Y V + S  + +  L 
Sbjct: 235 NMSRVFSHCLGGEKHGGGILEFGEEPNTTEM--VFTPLLNVTTHYNVDLLSISVNSKVLP 292

Query: 213 -------------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSW 258
                         ++G   ++DSG SF  L T+    +  +   L ++K    L+G   
Sbjct: 293 IDSKEFSYVSNSTNETG--VIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEGLQC 350

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGI 316
            Y  +  + E    P++ L FS   +  ++  N++      +    +C    S DG   I
Sbjct: 351 FYLKSGLTVET-SFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSADG-LTI 408

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
            G+  +    + +D EN ++ W    C
Sbjct: 409 FGEIVLKDKLVFYDVENRRIGWKGQNC 435


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 136/318 (42%), Gaps = 39/318 (12%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           + +DP+SS+S + V C  PLC     ++C      C +   Y+  D+S    L  D L +
Sbjct: 152 APFDPASSASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAV 209

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           A  +  A           GC ++ TG+    A P G++GLG G +S   L     + + +
Sbjct: 210 AGNAVKA--------YTFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEAT 256

Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           FS C       N SG++  G  G P   ++T  L    +   Y+V +    +G   +   
Sbjct: 257 FSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP 316

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                +G   ++DSG  FT L    Y  V  +  + V +   SL G  +  C+N ++   
Sbjct: 317 AFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA--- 371

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMMGH 325
           +  P + L+F   Q  +   ++     +   T+ CL + +  DG      +I       H
Sbjct: 372 VAWPPVTLLFDGMQVTLPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNH 429

Query: 326 RIVFDRENLKLAWSHSKC 343
           R++FD  N ++ ++  +C
Sbjct: 430 RVLFDVPNGRVGFARERC 447


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 122/325 (37%), Gaps = 37/325 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           + P +S S   V CS   CK     S ++C S   PC Y   Y      + G +  D   
Sbjct: 130 FRPEASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSAT 189

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           +A       Q      V++GC     G        DGV+ LG   +S  S    A     
Sbjct: 190 IALPGGKVAQ---LQDVVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASR--AAARFGG 242

Query: 160 SFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPI----GEKYDAYFVGVESYC 206
           SFS C        N +G + FG     + PATQ      P     G K DA  V  ++  
Sbjct: 243 SFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALD 302

Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYN-- 263
           I            ++DSG + T L T  Y  VV    KL++   ++      +++CYN  
Sbjct: 303 IPAEVWDPKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP--PFEHCYNWT 360

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQN 320
           A      ++P + + F+           +      G  V C+ +   +G++    +IG  
Sbjct: 361 APRPGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPG--VKCIGLQ--EGEWPGVSVIGNI 416

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEE 345
               H   FD +N+++ +  S C  
Sbjct: 417 MQQEHLWEFDLKNMEVRFMPSTCTR 441


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 72/305 (23%), Positives = 133/305 (43%), Gaps = 34/305 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP +SS+ K+VSCS   C   ++++SC +  + C Y   Y  +++ + G +  D L L 
Sbjct: 132 FDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYG-DNSYTKGNIAVDTLTLG 190

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAG-LIQN 159
           S      Q     ++IIGCG    G++      +      +G    P SL+ + G  I  
Sbjct: 191 SSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDG 241

Query: 160 SFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSC 211
            FS C       ++ +  + FG     +     ST  +    +   Y++ ++S  +G+  
Sbjct: 242 KFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQ 301

Query: 212 L-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           +         S    ++DSG + T LPTE Y+E+       + +++     +    CY+A
Sbjct: 302 IQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA 361

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMM 323
           + +  LKVP + + F      +  ++ F    +E    F      +   YG + Q NF++
Sbjct: 362 TGD--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRGSPSFSIYGNVAQMNFLV 418

Query: 324 GHRIV 328
           G+  V
Sbjct: 419 GYDTV 423


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 76/319 (23%), Positives = 119/319 (37%), Gaps = 28/319 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + PS SS+   V C    C++R SC      D CPY   Y  + + + G+L +D L L +
Sbjct: 198 FAPSDSSTFSAVRCGARECRARQSCGGSPGDDRCPYEVVYG-DKSRTQGHLGNDTLTLGT 256

Query: 103 FS---KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
            +     A   +     + GCG   TG  L G A DG+ GLG G VS+ S    AG    
Sbjct: 257 MAPANASAENDNKLPGFVFGCGENNTG--LFGQA-DGLFGLGRGKVSLSS--QAAGKFGE 311

Query: 160 SFSICFDENDSGSVFFGDQG-----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
            FS C   + S +  +   G     PA  Q T  L        Y+V +    +    +  
Sbjct: 312 GFSYCLPSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRV 371

Query: 215 S----GFQALVDSGASFTFLPTEIYAEVVVKFDKLVS------SKRISLQGNSWKYCYNA 264
           S        +VDSG   T L    Y  +   F   +       + R+S+      Y + A
Sbjct: 372 SSPRVALPLIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTC--YDFTA 429

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
            +   + +P + L+F+   +  V      +                    GI+G      
Sbjct: 430 HANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRT 489

Query: 325 HRIVFDRENLKLAWSHSKC 343
             +V+D    K+ ++   C
Sbjct: 490 LAVVYDVARQKIGFAAKGC 508


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 82/362 (22%), Positives = 148/362 (40%), Gaps = 83/362 (22%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLC--------KSR-SSCKSLKDPC-----PYIADYSTED 86
            + ++ P  SSSSK + C +P C        +S+  +C      C     PYI  Y    
Sbjct: 130 KIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLG- 188

Query: 87  TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
            S++G L+ + ++        P  ++ S  + GC      S L    P+G+ G G    S
Sbjct: 189 -STAGLLLSETINF-------PNKTI-SDFLAGC------SLLSTRQPEGIAGFGRSQES 233

Query: 147 VPSLLAKAGLIQNSFSIC-----FDENDSGSVFFGDQGPATQQST----SFLPIGEKY-- 195
           +P  L   GL    FS C     FD++   S    D GP+T  S     S+ P  +    
Sbjct: 234 LPLQL---GL--KKFSYCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLAS 288

Query: 196 -------DAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEV 238
                  + Y+V +    +G + +          +      +VDSG++FTF+   ++  +
Sbjct: 289 QSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELL 348

Query: 239 VVKFDKLVSSKRISL---QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSF 294
             +F+K +++  ++    +    + C++ S E+ + +PD+   F       +  ++ F+F
Sbjct: 349 AKEFEKQMANYTVATNVQKLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAF 408

Query: 295 PENEGFTVFCLTVMSTD-----GDYG--------IIGQNFMMGHRIVFDRENLKLAWSHS 341
            +     V CLT++S +     GD G        I+G        I +D EN +  +   
Sbjct: 409 VD---MGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQ 465

Query: 342 KC 343
            C
Sbjct: 466 SC 467


>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
          Length = 148

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  V  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 6   KKKVAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPSRGVT-WVPMKESLFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E+V K    +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 56/242 (23%), Positives = 98/242 (40%), Gaps = 17/242 (7%)

Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 170
           S+ + GCGR   G +       G+MGLG  ++S+ S           FS C    D   S
Sbjct: 241 SNFVFGCGRNNKGLF---GGVSGIMGLGRSNLSMISQTNTT--FGGVFSYCLPTTDSGAS 295

Query: 171 GSVFFGDQGPATQQ-----STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ---ALVD 222
           GS+  G++    +       TS +   +  + Y + +    +G   +  + F     L+D
Sbjct: 296 GSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILID 355

Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 282
           SG   T L   +Y  +  +F K  S   I+   +    C+N +  E + +P + + F  N
Sbjct: 356 SGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENN 415

Query: 283 QSFVVRN-HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
               V    I   P++       L  +S + D  IIG       R+++D +  K+ ++  
Sbjct: 416 VDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFARE 475

Query: 342 KC 343
            C
Sbjct: 476 DC 477


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 104/259 (40%), Gaps = 34/259 (13%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYL 93
           D+    ++PS S+S  NVSCS   C S SS       C      Y   Y  + + S G+L
Sbjct: 141 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFL 199

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             +   L         S V   V  GCG    G +   A   G++GLG   +S PS  A 
Sbjct: 200 AKEKFTLT-------NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTAT 249

Query: 154 AGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVG 201
           A      FS C   + S  G + FG  G    +S  F PI    D          A  VG
Sbjct: 250 A--YNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVG 305

Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
            +   I ++  +  G  AL+DSG   T LP + YA +   F   +S    +   +    C
Sbjct: 306 GQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 363

Query: 262 YNASSEEMLKVPDMRLIFS 280
           ++ S  + + +P +   FS
Sbjct: 364 FDLSGFKTVTIPKVAFSFS 382


>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
 gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
          Length = 107

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 1/76 (1%)

Query: 115 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSV 173
           +V   C    TGS+LDG A +G+MGLG   VSV  +L  +GL+  +SFS+CF E+  G +
Sbjct: 13  AVAKACRCGPTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGRI 72

Query: 174 FFGDQGPATQQSTSFL 189
            FGD G   Q    F+
Sbjct: 73  NFGDAGIRGQGEMPFI 88


>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
 gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
          Length = 376

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 133/314 (42%), Gaps = 34/314 (10%)

Query: 56  VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 112
           V C  P+C+S  S    +   P   DY  E     SS G LV D  +L +F+     S +
Sbjct: 70  VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNL-NFTSEKRHSPL 128

Query: 113 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 172
            +  + G  +   GS+      DGV+GLG G  S+ S L+  GL++N    C   +  G 
Sbjct: 129 LALGLCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGF 185

Query: 173 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGASFTF 229
           +FFGD    + +  ++ P+      Y  G+            +GF+ L+   DSGAS+T+
Sbjct: 186 LFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGASYTY 241

Query: 230 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 287
           L ++ Y  ++    K +S K  R +L   +   C+    +    + D++  F K  +   
Sbjct: 242 LNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG-RKPFKSIRDVKKYF-KTFALSF 299

Query: 288 RNHI-----FSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIVFDRE 332
            N         FP  E + +       CL +++       D  +IG   M    +++D E
Sbjct: 300 TNERKSKTELEFPP-EAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVIYDNE 358

Query: 333 NLKLAWSHSKCEEV 346
             ++ W+   C  +
Sbjct: 359 KERIGWAPGNCNRL 372


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 42/333 (12%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-G 91
           R+   +  + SSS K + C   +CK       S ++C +   PC Y  DY   D S++ G
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALG 186

Query: 92  YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
           +  ++ + +    K   +  + + V+IGC     G     A  DGVMGLG    S    +
Sbjct: 187 FFANETVTVEL--KEGRKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--I 239

Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGV 202
             A      FS C       +N S  + FG     +      + + L +G     Y V +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 203 ESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISL 253
               IG + L         +     ++DSG+S TFL    Y  V+      L+  +++ +
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD 311
                +YC+N++  E   VP +   F+    F   V++++ S  +     V CL  +S  
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVA 415

Query: 312 G-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                ++G      H   FD    KL ++ S C
Sbjct: 416 WPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 42/333 (12%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-G 91
           R+   +  + SSS K + C   +CK       S ++C +   PC Y  DY   D S++ G
Sbjct: 58  RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALG 115

Query: 92  YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
           +  ++ + +    K   +  + + V+IGC     G     A  DGVMGLG    S    +
Sbjct: 116 FFANETVTVEL--KEGRKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--I 168

Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGV 202
             A      FS C       +N S  + FG     +      + + L +G     Y V +
Sbjct: 169 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 228

Query: 203 ESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISL 253
               IG + L         +     ++DSG+S TFL    Y  V+      L+  +++ +
Sbjct: 229 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 288

Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD 311
                +YC+N++  E   VP +   F+    F   V++++ S  +     V CL  +S  
Sbjct: 289 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVA 344

Query: 312 G-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                ++G      H   FD    KL ++ S C
Sbjct: 345 WPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 377


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 82/346 (23%), Positives = 145/346 (41%), Gaps = 68/346 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + +DPS SSS  ++ CSHPLCK R       +SC S    C Y   Y+ + T + G LV 
Sbjct: 123 TSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDS-NRLCHYSYFYA-DGTFAEGNLVK 180

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           +    ++     P       +I+GC ++ T          G++G+ LG +   S +++A 
Sbjct: 181 EKFTFSNSQTTPP-------LILGCAKESTDV-------KGILGMNLGRL---SFISQAK 223

Query: 156 LIQNSFSICFDEN-----DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
           + + S+ I    N      +GS + G+               P +Q+  +  P+     A
Sbjct: 224 ISKFSYCIPTRSNRPGLASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPL-----A 278

Query: 198 YFVGVESYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
           Y V +    IG   L    S F        Q +VDSG+ FT L    Y +V  +  +LV 
Sbjct: 279 YTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVG 338

Query: 248 S--KRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 303
           S  K+  + G++   C++ + + ++   + D+   F +    +V         N G  + 
Sbjct: 339 SRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFGRGVEILVEKQ--RLLVNVGGGIH 396

Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           C+ +  +        IIG        + FD  N ++ +S ++C  +
Sbjct: 397 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGFSKAECSRL 442


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 138/341 (40%), Gaps = 57/341 (16%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
           + +DPS SSS   + C+HPLCK R    +L   C      + + +  + T + G LV + 
Sbjct: 121 TSFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREK 180

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----------- 146
           +  +S     P       +I+GC    T          G++G+ LG  S           
Sbjct: 181 ITFSSSQSTPP-------LILGCAEASTDE-------KGILGMNLGRRSFASQAKISKFS 226

Query: 147 --VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPAT--QQSTSFLPIGEKYDAYFVGV 202
             VP+  A+AGL         +  +SG   + +    T  Q+S +  P+     AY + +
Sbjct: 227 YCVPTRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPL-----AYTIPM 281

Query: 203 ESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KR 250
           +   +GN+ L  S            Q ++DSG+ FT+L  E Y +V  +  +LV    K+
Sbjct: 282 QGIRMGNARLNISATLFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKK 341

Query: 251 ISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
             + G     C++ +  E+ + + +M   F K    V+    +    + G  V C+ +  
Sbjct: 342 GYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEKGVEIVIDK--WRVLADVGGGVHCIGIGR 399

Query: 310 TD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           ++       IIG        + +D  N ++    + C   +
Sbjct: 400 SEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGKADCSRSV 440


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 140/371 (37%), Gaps = 74/371 (19%)

Query: 21  PVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------------ 68
           P TT   C      S    + +  ++P  SSS K + C  P C   SS            
Sbjct: 114 PCTTHYTCT---NCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCNG 170

Query: 69  -CKSLKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
             K     CP Y   Y T   ++SG+ + + L     + H          ++GC    T 
Sbjct: 171 NSKKCSHACPQYTLQYGTG--AASGFFLLENLDFPGKTIH--------KFLVGC----TT 216

Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGP 180
           S     + D + G G    S+P  +         F+ C + +D      SG +   D   
Sbjct: 217 SADREPSSDALAGFGRTMFSLPMQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYSD 270

Query: 181 ATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQ----------ALVDSGAS 226
              Q  S+ P  +        Y++GV+   IGN  L   G             ++DSG +
Sbjct: 271 GETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFA 330

Query: 227 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YCYNASSEEMLKVPDMRLIFSKNQ 283
           ++++   ++  V  +  K +S  R SL+  +      CYN +  + +K+PD+   F+   
Sbjct: 331 YSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGA 390

Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMS---------TDGDYGIIGQNFMMGHRIVFDRE 332
           + VV   N+   F E    ++ C  V +         T G   I+G    + H + FD +
Sbjct: 391 NMVVPGMNYFLLFSEA---SLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLK 447

Query: 333 NLKLAWSHSKC 343
           N +L +    C
Sbjct: 448 NERLGFRQQTC 458


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 89/317 (28%), Positives = 146/317 (46%), Gaps = 40/317 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP SSSS   +SC+   CK   +++C S  D C Y   Y  + + ++G L  + L   +
Sbjct: 193 FDPKSSSSYSPLSCNSQQCKLLDKANCNS--DTCIYQVHYG-DGSFTTGELATETLSFGN 249

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            S   P      ++ IGCG    G +  GA    ++GLG G +S+ S L  +     SFS
Sbjct: 250 -SNSIP------NLPIGCGHDNEGLFAGGAG---LIGLGGGAISLSSQLKAS-----SFS 294

Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL------ 212
            C    D + S ++ F    P+    TS L   +++ +Y +V V    +G   L      
Sbjct: 295 YCLVNLDSDSSSTLEFNSYMPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTR 353

Query: 213 ---TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
               +SG    +VDSG   + LP+++Y  +   F KL SS   +   + +  CYN S + 
Sbjct: 354 FEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQS 413

Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
            ++VP +  + S+  S  +  RN++    +  G   +CL  + T     IIG     G R
Sbjct: 414 NVEVPTIAFVLSEGTSLRLPARNYLIML-DTAG--TYCLAFIKTKSSLSIIGSFQQQGIR 470

Query: 327 IVFDRENLKLAWSHSKC 343
           + +D  N  + +S +KC
Sbjct: 471 VSYDLTNSIVGFSTNKC 487


>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
          Length = 150

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 70/138 (50%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
           + ++  GCG KQ        +P DG++GLG+G     + L    +I +N    C      
Sbjct: 4   KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGK 63

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P T+  T ++P+ E    Y  G+ +  I    +  +  F+A+ DSG+++T+
Sbjct: 64  GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTY 122

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E+V K    +S
Sbjct: 123 VPAQIYNELVSKIRGTLS 140


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 42/333 (12%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-G 91
           R+   +  + SSS K + C   +CK       S ++C +   PC Y  DY   D S++ G
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALG 186

Query: 92  YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
           +  ++ + +    K   +  + + V+IGC     G     A  DGVMGLG    S    +
Sbjct: 187 FFANETVTVEL--KEGRKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--I 239

Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGV 202
             A      FS C       +N S  + FG     +      + + L +G     Y V +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 203 ESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISL 253
               IG + L         +     ++DSG+S TFL    Y  V+      L+  +++ +
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD 311
                +YC+N++  E   VP +   F+    F   V++++ S  +     V CL  +S  
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVA 415

Query: 312 G-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                ++G      H   FD    KL ++ S C
Sbjct: 416 WPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 81/313 (25%), Positives = 126/313 (40%), Gaps = 29/313 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DPS SS+   V C  P C +    C      C Y+  Y  + +S++G L  D L L S 
Sbjct: 189 FDPSKSSTYAAVHCGEPQCAAAGDLCSEDNTTCLYLVRYG-DGSSTTGVLSRDTLALTS- 246

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 S   +    GCG +  G +      DG++GLG G++S+PS  A +      FS 
Sbjct: 247 ------SRALTGFPFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSY 295

Query: 164 CFDENDSGSVFFG-DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
           C   ++S + +      PAT     Q T+ L   +    YFV + S  IG   L      
Sbjct: 296 CLPSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAV 355

Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            T+ G   L+DSG   T+LP + YA +  +F   +     +   +    CY+ + E  + 
Sbjct: 356 FTRGG--TLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVV 413

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFD 330
           VP +   F     F +         +E         M T G    IIG        +++D
Sbjct: 414 VPAVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYD 473

Query: 331 RENLKLAWSHSKC 343
               K+ +  + C
Sbjct: 474 VAAEKIGFVPASC 486


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 127/325 (39%), Gaps = 46/325 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS+S +  N+SC+   C S  S       C S    C Y   Y  + + + G+   D 
Sbjct: 197 FDPSTSKTYSNISCTSAACSSLKSATGNSPGCSS--SNCVYGIQYG-DSSFTIGFFAKDK 253

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L L        Q+ V    + GCG+   G +   A   G++GLG   +S+    A+    
Sbjct: 254 LTLT-------QNDVFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQTAQK--F 301

Query: 158 QNSFSICF--DENDSGSVFFGD-----QGPATQQSTSFLPI----GEKYDAYFVGVESYC 206
              FS C       +G + FG+        A +   +F P     G  Y  YF+ V    
Sbjct: 302 GKYFSYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAY--YFIDVLGIS 359

Query: 207 IGNSCLTQSG--FQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
           +G   L+ S   FQ    ++DSG   T LP+  Y  +   F + +S    +   +    C
Sbjct: 360 VGGKALSISPMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTC 419

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGIIG 318
           Y+ S+   + +P +   F+ N +  +  N I       G +  CL       D   GI G
Sbjct: 420 YDLSNYTSISIPKISFNFNGNANVELDPNGIL---ITNGASQVCLAFAGNGDDDSIGIFG 476

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                   +V+D    +L + +  C
Sbjct: 477 NIQQQTLEVVYDVAGGQLGFGYKGC 501


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 134/323 (41%), Gaps = 42/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP++S+S   V C   +C++     S C      C Y   Y  + + + G L  + L  
Sbjct: 175 FDPAASASFTAVPCDSGVCRTLPGGSSGCAD-SGACRYQVSYG-DGSYTQGVLAMETL-- 230

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +F    P   VQ  V IGCG +  G ++  A   G++GLG G +S+   L  A     +
Sbjct: 231 -TFGDSTP---VQG-VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGA 280

Query: 161 FSICFD----ENDSGSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCI------- 207
           FS C      +  +GS+ FG  D  P        L   ++   Y+VG+    +       
Sbjct: 281 FSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPL 340

Query: 208 --GNSCLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW-KYCYN 263
             G   LT+ G   +V D+G + T LP + YA +   F   +        G S    CY+
Sbjct: 341 QDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYD 400

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
            S    ++VP + L F ++ + +    RN +       G  V+CL   ++     I+G  
Sbjct: 401 LSGYASVRVPTVALYFGRDGAALTLPARNLLVEM----GGGVYCLAFAASASGLSILGNI 456

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
              G +I  D  N  + +  S C
Sbjct: 457 QQQGIQITVDSANGYVGFGPSTC 479


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 77/356 (21%), Positives = 131/356 (36%), Gaps = 73/356 (20%)

Query: 40  RNLSEYDPSS------SSSSKNVSCSHPLCK-----SRSSCKS--LKDPCPYIADYSTED 86
           RN S   P S      S++   + C  P C+       + C    L  PC Y   Y+ + 
Sbjct: 118 RNCSHRSPGSAFFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYA-DS 176

Query: 87  TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLG 143
           ++++G+   + L L + +    +    + +  GCG + +G  L GA+     GVMGLG  
Sbjct: 177 STTTGFFSKEALTLNTSTGKVKK---LNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRA 233

Query: 144 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------ 197
            +S  S L +     + FS C  +              +   TSFL IG   +       
Sbjct: 234 PISFSSQLGRR--FGSKFSYCLMDYT-----------LSPPPTSFLTIGGAQNVAVSKKG 280

Query: 198 ----------------YFVGVESYCIGNSCLTQS----------GFQALVDSGASFTFLP 231
                           Y++ ++   +    L  +              ++DSG + TF+ 
Sbjct: 281 IMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFIT 340

Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF--VVRN 289
              Y E++  F K V     +     +  C N S      +P M    +    F    RN
Sbjct: 341 EPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRN 400

Query: 290 HIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +        G  + CL V  +S DG + ++G     G  + FDR+  +L ++   C
Sbjct: 401 YFI----ETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 80/329 (24%), Positives = 128/329 (38%), Gaps = 43/329 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD-DIL 98
           + P+ S S   + C    CKS      ++C S  DPC Y  DY  +D SS+  +V  D  
Sbjct: 151 FRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSY--DYRYKDNSSARGVVGLDSA 208

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            ++       + +    V++GC     G     +  DGV+ LG  ++S  S    A    
Sbjct: 209 TVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSS--DGVLSLGNSNISFAS--RAASRFG 264

Query: 159 NSFSICF-----DENDSGSVFFGDQGPATQQSTS-------FLPIGEKYDAYFVGVESYC 206
             FS C        N +  + FG+   +    +S        L        YFV V++  
Sbjct: 265 GRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVT 324

Query: 207 IGNSCLT--------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNS 257
           +    L         +    A++DSG S T L T  Y  VV    K      R+++  + 
Sbjct: 325 VAGERLEILPDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM--DP 382

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--- 314
           ++YCYN +     ++P M L F+   +       +      G  V C+ V+  +G +   
Sbjct: 383 FEYCYNWTGVSA-EIPRMELRFAGAATLAPPGKSYVIDTAPG--VKCIGVV--EGAWPGV 437

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            +IG      H   FD  N  L +  S+C
Sbjct: 438 SVIGNILQQEHLWEFDLANRWLRFKQSRC 466


>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
          Length = 141

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 66/136 (48%), Gaps = 4/136 (2%)

Query: 116 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSV 173
           +  GCG KQ        +P DG++GLG+G     + L    +I +N    C      G +
Sbjct: 1   IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60

Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPT 232
           + GD  P ++  T ++P+ E    Y  G+    I N  +     F+A+ DSG+++T +P 
Sbjct: 61  YVGDFNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119

Query: 233 EIYAEVVVKFDKLVSS 248
           +IY E+V K    +S 
Sbjct: 120 QIYNEIVSKVRGTLSE 135


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 71/318 (22%), Positives = 116/318 (36%), Gaps = 41/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ S++  N+SCS   C            C Y   Y  + + + G+   D L LA   
Sbjct: 139 FDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--- 194

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                     +   GCG K  G +   A   G++GLG G  S+P     K G +   F+ 
Sbjct: 195 -----YDTIKNFRFGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAY 243

Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG----- 216
           C     +G+ F   G   PA     + + +      Y+VG+    +G   L   G     
Sbjct: 244 CLPATSAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFST 303

Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW---------KYCYNASSE 267
              LVDSG   T LP   YA +   F K       ++QG  +           CY+ +  
Sbjct: 304 AGTLVDSGTVITRLPPSAYAPLRSAFSK-------AMQGLGYSAAPAFSILDTCYDLTGH 356

Query: 268 E--MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
           +   + +P + L+F       V      +  +           + D D  I+G      H
Sbjct: 357 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTH 416

Query: 326 RIVFDRENLKLAWSHSKC 343
            +++D     + ++   C
Sbjct: 417 GVLYDIGKKIVGFAPGAC 434


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 128/318 (40%), Gaps = 39/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P  SSS   + C    C+   S     + C Y   Y  + +++ GY+  +        
Sbjct: 138 FNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGYG-DGSTTQGYMATETFTF---- 192

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
               ++S   ++  GCG    G      A  G++G+G G +S+PS L         FS C
Sbjct: 193 ----ETSSVPNIAFGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYC 241

Query: 165 FDENDSGS---VFFGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSG 216
                S S   +  G      P    ST+ +        Y++ ++   +G  N  +  S 
Sbjct: 242 MTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSST 301

Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE- 267
           FQ         ++DSG + T+LP + Y  V   F   ++   +    +    C+   S+ 
Sbjct: 302 FQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDG 361

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGH 325
             ++VP++ + F      +   +I   P  EG  V CL  M +    G  I G       
Sbjct: 362 STVQVPEISMQFDGGVLNLGEQNILISPA-EG--VICL-AMGSSSQLGISIFGNIQQQET 417

Query: 326 RIVFDRENLKLAWSHSKC 343
           ++++D +NL +++  ++C
Sbjct: 418 QVLYDLQNLAVSFVPTQC 435


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 132/321 (41%), Gaps = 42/321 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP+ S S  N+ C  PLC+      C + K  C Y   Y            D    +  
Sbjct: 187 FDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQICLYQVSYG-----------DGSFTVGE 235

Query: 103 FSKHAP--QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           FS      + +    V++GCG    G ++  A    ++GLG G +S PS + +     + 
Sbjct: 236 FSTETLTFRGTRVGRVVLGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQIGRR--FNSK 290

Query: 161 FSICFDENDS----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---S 210
           FS C  +  +     S+ FGD   A  ++T F P+    K D  Y+V +    +G    S
Sbjct: 291 FSYCLGDRSASSRPSSIVFGDS--AISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVS 348

Query: 211 CLTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
            ++ S F+         ++DSG S T L    Y  +   F    S+ + + + + +  C+
Sbjct: 349 GISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCF 408

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           + S +  +KVP + L F      +  ++     +N G   FC     T     IIG    
Sbjct: 409 DLSGKTEVKVPTVVLHFRGADVPLPASNYLIPVDNSG--SFCFAFAGTASGLSIIGNIQQ 466

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
            G R+V+D    ++ ++   C
Sbjct: 467 QGFRVVYDLATSRVGFAPRGC 487


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 68/307 (22%), Positives = 114/307 (37%), Gaps = 22/307 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS S++   V C    C    +C S K  C Y   Y  + + + G L  D L L    
Sbjct: 230 FDPSQSTTYSAVPCGAQECLDSGTCSSGK--CRYEVVYG-DMSQTDGNLARDTLTL---- 282

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
              P S      + GCG   TG +      DG+ GLG   VS+ S    A      FS C
Sbjct: 283 --GPSSDQLQGFVFGCGDDDTGLF---GRADGLFGLGRDRVSLAS--QAAARYGAGFSYC 335

Query: 165 FDE--NDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--LTQSGFQA 219
                   G +  G    P   Q T+ +   +    Y++ +    +      +  + F+A
Sbjct: 336 LPSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKA 395

Query: 220 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
              ++DSG   T LP+  Y+ +   F   +   + +   +    CY+ +    +++P + 
Sbjct: 396 PGTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVA 455

Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
           L+F    +  +      +  N             D   GI+G        +V+D  N K+
Sbjct: 456 LLFDGGATLNLGFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKI 515

Query: 337 AWSHSKC 343
            +    C
Sbjct: 516 GFGAKGC 522


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 75/316 (23%), Positives = 126/316 (39%), Gaps = 34/316 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS S+S   VSC    C+    ++C++    C Y   Y  + + + G    + L L  
Sbjct: 28  FDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG- 85

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S+   +V IGCG    G ++  A    + G  L   S PS ++      ++FS
Sbjct: 86  ------DSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFS 131

Query: 163 ICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL------ 212
            C  + DS    ++ FGD        T+ L    +    Y+V +    +G   L      
Sbjct: 132 YCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASA 191

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
                T      +VDSG + T L +  YA +   F +   S   +   + +  CY+ S  
Sbjct: 192 FAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDR 251

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             ++VP + L F    +  +    +  P  +G   +CL    T+    IIG     G R+
Sbjct: 252 TSVEVPAVSLRFEGGGALRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRV 310

Query: 328 VFDRENLKLAWSHSKC 343
            FD     + ++ +KC
Sbjct: 311 SFDTARGAVGFTPNKC 326


>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
          Length = 154

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 65/132 (49%), Gaps = 4/132 (3%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPSRGVT-WVPMKESLFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVK 241
           +P +IY E+V K
Sbjct: 125 VPAQIYNEIVSK 136


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 141/367 (38%), Gaps = 53/367 (14%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           + ++P  S++  +V C+   C+        +   +    C Y   Y     +++G L  +
Sbjct: 132 APFNPVRSTTVADVPCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTE 191

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
                         +    V+ GCG +  G +   +   GV+GLG G++S+ S L     
Sbjct: 192 AFTFGD--------TRIDGVVFGCGLQNVGDF---SGVSGVIGLGRGNLSLVSQLQV--- 237

Query: 157 IQNSFSICFDENDS----GSVFFGDQG-PATQQ--STSFLPIGEKYDAYFVGVESYCI-G 208
             + FS  F  +DS      + FGD   P T    ST  L        Y+V +    + G
Sbjct: 238 --DRFSYHFAPDDSVDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDG 295

Query: 209 NSCLTQSG-FQALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRISL---QGNSW-- 258
                 SG F      G+   FL       +  E   K  +   + +I L    G++   
Sbjct: 296 KDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGL 355

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM-STDGDYGI 316
             CY   S    KVP M L+F+      +   + F      G    CLT++ S+ GD  +
Sbjct: 356 DLCYTGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLA--CLTILPSSAGDGSV 413

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 376
           +G    +G  +++D    KL +     E +   +     PPP+G S      T QQ+   
Sbjct: 414 LGSLIQVGTHMMYDINGSKLVF-----ESLAQAA----APPPSGSSQQTSSKTNQQAGGR 464

Query: 377 GQAAAPP 383
             A+APP
Sbjct: 465 RSASAPP 471


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 104/259 (40%), Gaps = 34/259 (13%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYL 93
           D+    ++PS S+S  NVSCS   C S SS       C      Y   Y  + + S G+L
Sbjct: 169 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFL 227

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             +   L         S V   V  GCG    G +   A   G++GLG   +S PS  A 
Sbjct: 228 AKEKFTLT-------NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTAT 277

Query: 154 AGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVG 201
           A      FS C   + S  G + FG  G    +S  F PI    D          A  VG
Sbjct: 278 A--YNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVG 333

Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
            +   I ++  +  G  AL+DSG   T LP + YA +   F   +S    +   +    C
Sbjct: 334 GQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 391

Query: 262 YNASSEEMLKVPDMRLIFS 280
           ++ S  + + +P +   FS
Sbjct: 392 FDLSGFKTVTIPKVAFSFS 410


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 62/252 (24%), Positives = 107/252 (42%), Gaps = 32/252 (12%)

Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-------- 167
           V  GCG    G +       G+ G G G +S+PS L K G    +FS CF          
Sbjct: 175 VAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIPST 227

Query: 168 ---NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ---- 218
              +    +F   QG    Q+T  +        Y++ ++   +G++ L   +S F     
Sbjct: 228 VLLDLPADLFSNGQGAV--QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 285

Query: 219 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
               ++DSG + T LPT +Y  V   F   V    +S       +C +A       VP +
Sbjct: 286 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKL 345

Query: 276 RLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
            L F      + R N++F   E+ G ++ CL ++   G+   IG        +++D +N 
Sbjct: 346 VLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNS 403

Query: 335 KLAWSHSKCEEV 346
           KL++  ++C+++
Sbjct: 404 KLSFVPAQCDKL 415


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 87/351 (24%), Positives = 139/351 (39%), Gaps = 64/351 (18%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDIL 98
           + P  S +   + CS   C      S ++C +   PC Y  +Y  +D S++ G +  D  
Sbjct: 125 FRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAY--EYRYKDGSAARGTVGTDSA 182

Query: 99  HLASFSKHAPQSSVQSS---VIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            +A   + A +   ++    V++GC    TG S+L   A DGV+ LG  +VS  S    A
Sbjct: 183 TIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL---ASDGVLSLGYSNVSFASR--AA 237

Query: 155 GLIQNSFSICF-----DENDSGSVFFGDQ------------------GPATQQSTSFLPI 191
                 FS C        N +  + FG                     P  +Q T  L  
Sbjct: 238 ARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQ-TPLLLD 296

Query: 192 GEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF- 242
                 Y V V    +    L         Q G  A++DSG S T L +  Y  VV    
Sbjct: 297 HRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVAALG 356

Query: 243 DKLVSSKRISLQGNSWKYCYNASS----EEM-LKVPDMRLIFSKNQSFVVRNHIFSFPEN 297
            KLV   R+++  + + YCYN +S    E++ + VP + + F+ +         +     
Sbjct: 357 KKLVGLPRVAM--DPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDAA 414

Query: 298 EGFTVFCLTVMSTDGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
            G  V C+ +   +GD+    +IG      H   FD +N +L +  S+C +
Sbjct: 415 PG--VKCIGLQ--EGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCMQ 461


>gi|414888272|tpg|DAA64286.1| TPA: hypothetical protein ZEAMMB73_677781 [Zea mays]
          Length = 118

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 62/118 (52%), Gaps = 5/118 (4%)

Query: 303 FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID-KSHVHLVPPPAGQ 361
           +CL VM ++G   +IG+NFM G ++VFDRE   L W +  C  V + +S++ + P P+G 
Sbjct: 3   YCLAVMKSEG-VNLIGENFMSGLKVVFDRERKVLGWKNFDCYSVGNSRSNLPVNPNPSGV 61

Query: 362 SPNPLPTTEQQSTSNGQAAAPPSTAKTA--PSKSIAASAQQLDSVLRVACSLLVLMCL 417
            P P       +    + A+P  T      PS S +   +   +VL VA +LL L+ L
Sbjct: 62  PPKPALGPNSYTPEATKGASPNGTQVNVLQPSASFSPKLRCNRNVL-VAAALLFLVIL 118


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 127/313 (40%), Gaps = 41/313 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP  S+S + +S +   C++  RS     K   C Y   Y  + +++ G  +++ L  A
Sbjct: 180 FDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYG-DGSTTVGDFIEETLTFA 238

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 P+ S      IGCG    G  L GA   G++GLG G +S P+ +   G    +F
Sbjct: 239 G-GVRLPRIS------IGCGHDNKG--LFGAPAAGILGLGRGLMSFPNQIDHNG----TF 285

Query: 162 SICFDENDSG------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGN--- 209
           S C  +  SG      ++ FG     T    SF P     +    Y+V +    +G    
Sbjct: 286 SYCLVDFLSGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRV 345

Query: 210 SCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNS-- 257
             +T+   Q          +VDSG + T L    Y      F  + V   ++S+ G S  
Sbjct: 346 PGVTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGF 405

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           +  CY      M KVP + + F+ +    ++   +  P +   TV      + D    II
Sbjct: 406 FDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSII 465

Query: 318 GQNFMMGHRIVFD 330
           G     G RIV+D
Sbjct: 466 GNIQQQGFRIVYD 478


>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
 gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
 gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
 gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
 gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
          Length = 154

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E+V K    +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142


>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
          Length = 154

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 66/132 (50%), Gaps = 4/132 (3%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I +N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P T+  T ++P+ E    Y  G+    I    +  +  F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVK 241
           +P +IY+E+V K
Sbjct: 125 VPAQIYSEIVSK 136


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 126/320 (39%), Gaps = 33/320 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP  SS+   + C+   C +      + + C Y  DY  + + S+G    D + L S S
Sbjct: 79  FDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYG-DGSFSTGEFATDAVSLNSTS 137

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                  V + + +GCG    G ++  A   G+       +S P+ +         FS C
Sbjct: 138 GGG--QVVLNKIPLGCGHDNEGYFVGAAGLLGLGKG---PLSFPNQINSEN--GGRFSYC 190

Query: 165 F-----DENDSGSVFFGDQG--PATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT- 213
                 D  +  S+ FGD    PA      F P          Y++ +    +G S LT 
Sbjct: 191 LTGRDTDSTERSSLIFGDAAVPPA---GVRFTPQASNLRVSTFYYLKMTGISVGGSILTI 247

Query: 214 -QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
             S FQ         ++DSG S T L    YA +   F    S   ++ + + +  CYN 
Sbjct: 248 PTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNL 307

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           S    + VP + L F       +    +  P +   T FCL    T G   IIG     G
Sbjct: 308 SDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSST-FCLAFAGTTGP-SIIGNIQQQG 365

Query: 325 HRIVFDRENLKLAWSHSKCE 344
            R+++D  + ++ +  S+C+
Sbjct: 366 FRVIYDNLHNQVGFVPSQCD 385


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 71/314 (22%), Positives = 125/314 (39%), Gaps = 33/314 (10%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCK------SLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           PS S++  N+SCS P C    S        S    C Y   Y  + + S GY   + L L
Sbjct: 176 PSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYG-DQSFSVGYFAKETLTL 234

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQN 159
            S       + V  + + GCG+   G +   A   G++GLG   +S+    A K G +  
Sbjct: 235 TS-------TDVIENFLFGCGQNNRGLFGSAA---GLIGLGQDKISIVKQTAQKYGQV-- 282

Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVE---------SYCIGN 209
            FS C  +  S + +    G     +  + PI + +  A F GV+            I +
Sbjct: 283 -FSYCLPKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISS 341

Query: 210 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
           S  + SG  A++DSG   T LP + Y+ +   F+K ++    + + +    CY+ S    
Sbjct: 342 SVFSTSG--AIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYST 399

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           +++P +  +F   +   +      +  +                  IIG       ++V+
Sbjct: 400 IQIPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVY 459

Query: 330 DRENLKLAWSHSKC 343
           D    K+ + ++ C
Sbjct: 460 DVGGGKIGFGYNGC 473


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 74/332 (22%), Positives = 126/332 (37%), Gaps = 29/332 (8%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ +DP  SS++  +SC    C S      S C + +  C Y  +Y  + + + GY V D
Sbjct: 85  LNFFDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRY-CGYSFEYG-DGSGTLGYYVSD 142

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
                 +      ++  + +  GC   Q+G       A DG+ G G  D+SV S L   G
Sbjct: 143 EFDYNQYVNQYVTNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQG 202

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           L    FS C +  D G       G  T+    + PI      Y + ++   +    L   
Sbjct: 203 LAPKIFSHCLEGADPGGGILV-LGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSID 261

Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
                T +    ++D G +  +L  E Y   V      V  S++   L+GN    C+   
Sbjct: 262 PQVFATTNTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNP---CFLTV 318

Query: 266 SEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTV-----MSTD-GDYGIIG 318
                  P + L F         ++++      +   V+C+        +TD     I+G
Sbjct: 319 HSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILG 378

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
              +     V+D EN ++ W+   C   ++ S
Sbjct: 379 DLVLKDKVFVYDLENQRIGWTSFDCSSTVNVS 410


>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
 gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
          Length = 433

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 76/323 (23%), Positives = 129/323 (39%), Gaps = 32/323 (9%)

Query: 47  PSSSSSSKNVSCSHPLCKSRS--SCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLAS 102
           P    S+  V C  PLC S       + +DP  C Y  +Y+ +  SS G LV D+  L +
Sbjct: 112 PLYRPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEVEYA-DGGSSLGVLVKDVFVL-N 169

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           F+       +   + +GCG  Q     +    DG++GLG G  S+PS L+  GL+ N   
Sbjct: 170 FTN---GKRLNPLLALGCGYDQLPGRSNHPL-DGILGLGRGISSIPSQLSSQGLVSNVIG 225

Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY-DAYFVGVESYCIGNSCLTQSGFQALV 221
            C      G +FFG+         ++ P+   +   Y  G                  + 
Sbjct: 226 HCLSGRGGGFLFFGED-IYDSSGVTWTPMSRDHLKHYSPGFAELIFDGKSTGIRNLLVVF 284

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASSEEMLKVPDMRLIF 279
           DSG+S+T+L  + Y  +V    + +S K IS  L   +   C+         + D++  F
Sbjct: 285 DSGSSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCWKG-KRPFKSIRDVKKYF 343

Query: 280 S------KNQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMM 323
                  K  S       F F   E + +       CL +++       D  +IG   M+
Sbjct: 344 KPFALVFKTSSGRSSKTQFEF-SPEAYLIISSKGNACLGILNGTEVGLRDLNVIGDVSML 402

Query: 324 GHRIVFDRENLKLAWSHSKCEEV 346
              ++++ E   + W+ + C+ +
Sbjct: 403 DRLVIYNNEKQMIGWAAASCDRL 425


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 75/347 (21%), Positives = 149/347 (42%), Gaps = 69/347 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + +DPS SSS   + CSHPLCK R       +SC S +  C Y   Y+ + T + G LV 
Sbjct: 111 TSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVK 168

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           + +  ++       + +   +I+GC  + +          G++G+  G +   S +++A 
Sbjct: 169 EKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL---SFVSQAK 211

Query: 156 LIQNSFSICFDEND-----SGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
           + + S+ I    N      +GS + GD               P +Q+  +  P+   Y  
Sbjct: 212 ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA--YTV 269

Query: 198 YFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AEVVVKFDKL 245
             +G+  + +    ++ S F        Q +VDSG+ FT L    Y    AE++ +  + 
Sbjct: 270 PMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRR 328

Query: 246 VSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKN-QSFVVRNHIFSFPENEGFTVF 303
           +  K+  + G +   C++ +   + + + D+  +F++  + FV +  +     N G  + 
Sbjct: 329 L--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEIFVPKERVLV---NVGGGIH 383

Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           C+ +  +        IIG        + FD  N ++ ++ + C  V+
Sbjct: 384 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSRVV 430


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 76/338 (22%), Positives = 140/338 (41%), Gaps = 48/338 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           YDP +S+S KN++C+ P C   SS      C+S    CPY   Y     ++  + V+   
Sbjct: 202 YDPKTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFT 261

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
              + ++         +++ GCG    G +   +    ++GLG G +S  S L    L  
Sbjct: 262 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASG---LLGLGRGPLSFSSQL--QSLYG 316

Query: 159 NSFSICF-----DENDSGSVFFGDQGPATQQS----TSFLPIGEK--YDAYFVGVESYCI 207
           +SFS C      + N S  + FG+       +    TSF+   E      Y++ ++S  +
Sbjct: 317 HSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILV 376

Query: 208 GNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGN 256
           G   L          +      ++DSG + ++     Y  +  KF +K+  +  I     
Sbjct: 377 GGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFP 436

Query: 257 SWKYCYNAS--SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVMS 309
               C+N S   E  + +P++ + F       V   +++FP    F      + CL ++ 
Sbjct: 437 VLDPCFNVSGIEENNIHLPELGIAF-------VDGTVWNFPAENSFIWLSEDLVCLAILG 489

Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           T    + IIG        I++D +  +L ++ +KC ++
Sbjct: 490 TPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCADI 527


>gi|281210961|gb|EFA85127.1| hypothetical protein PPL_02125 [Polysphondylium pallidum PN500]
          Length = 601

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 60/268 (22%), Positives = 121/268 (45%), Gaps = 24/268 (8%)

Query: 93  LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLG---LGDVS 146
           LV+D + +  +S  +   +V   +++    K+  +  D   P   DG+ GL    + D +
Sbjct: 165 LVEDTVRIGGYSIDSIFGNVNKILLLAFQYKECPA-PDVYTPRSFDGIFGLSTKVIDDTA 223

Query: 147 VPSLLAKAGL---IQNSFSICFDENDSGSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVG 201
              +L +  L   + NSFS+CF E+  G  F  G   P    +   ++P+ + Y  Y + 
Sbjct: 224 GEDILTQISLKYNLSNSFSLCFGESGYGGQFKIGGYDPELIVEPMRYIPVAKPY-TYNLT 282

Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV-VKFDK--LVSSKRISLQGNSW 258
           +    IG   L  + + A +DSG++   +PT +Y  ++   ++K  L   +  +    S+
Sbjct: 283 ISQVHIGQYKLEHTTYNAWIDSGSASIVIPTPLYNNMINTMYEKFPLAGFQDGAFWNTSF 342

Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN-----EGFTVFCLTVMSTDGD 313
             C     +++   P   + F      +   H+   P+N     E    + L + + D +
Sbjct: 343 P-CAFIDEKDIPNYPKFNISFVDTDGEIF--HLSVLPQNYLVYNEEEKCYELLLRTVDNN 399

Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHS 341
           Y IIG   ++G+ I FD++N ++ ++ +
Sbjct: 400 YFIIGDLGLIGYNIHFDKQNQRIGFAKA 427


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 82/325 (25%), Positives = 136/325 (41%), Gaps = 35/325 (10%)

Query: 45  YDPSSSSSSKNVSCS-HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           Y  S S S K VSC+ H  C+     + L   C Y   Y    + +SG L ++      +
Sbjct: 134 YTSSQSKSYKPVSCNQHSFCEPNQCKEGL---CAYNVTYG-PGSYTSGNLANETFTF--Y 187

Query: 104 SKHAPQSSVQSSVIIGCG---RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-Q 158
           S H   ++++S +  GC    R    ++L    P  GV+G+G G  S    LA+ G I  
Sbjct: 188 SNHGKHTALKS-ISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRS---FLAQLGSISH 243

Query: 159 NSFSICFDENDSGSVF--FGDQGPATQ--QSTSFLPI--GEKYDAYFVGVESYCIG---- 208
             FS C   N++ + +  FG     ++  Q+T  + +     Y    +G+    +     
Sbjct: 244 GKFSYCITANNTHNTYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNIT 303

Query: 209 --NSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSS----KRISLQGNSWKYC 261
             +  + + G +  ++D+G   T L   I+  +       +SS    KR  +       C
Sbjct: 304 KTDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLC 363

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
           Y   S+   K   +     +N    V+   IF F E EG  VFCL+++S D    IIG  
Sbjct: 364 YEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDD-SKTIIGAY 422

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEE 345
             M  + V+D +   L++    CE+
Sbjct: 423 QQMKQKFVYDTKARVLSFGPEDCEK 447


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 75/320 (23%), Positives = 132/320 (41%), Gaps = 33/320 (10%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           +N   YDP +SS+   + C    C     S+  C    D C Y   Y  +++ S G L  
Sbjct: 135 QNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGD-CIYAYTYG-DNSYSYGGLSS 192

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D + L     H       S +  GCG +   +        G++GLG G +S+ S L    
Sbjct: 193 DSIRLMLLQLH-----YNSKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE- 246

Query: 156 LIQNSFSIC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS 210
            I + FS C   F  N +  + FG+            P+  K D   Y++ +E   +G  
Sbjct: 247 -IGHKFSYCLLPFSSNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAK 305

Query: 211 CLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
            +   Q+    ++DSG++ T+L    Y E V    + V+ +        + +C+    E 
Sbjct: 306 TVKTGQTDGNIIIDSGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCF-TYKEG 364

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQ-NFMM 323
           M   PD+   F+     +   +     E+    + C TV+ +  D    +G +GQ +F +
Sbjct: 365 MSTPPDVVFHFTGGDVVLKPMNTLVLIEDN---LICSTVVPSHFDGIAIFGNLGQIDFHV 421

Query: 324 GHRIVFDRENLKLAWSHSKC 343
           G    +D +  K++++ + C
Sbjct: 422 G----YDIQGGKVSFAPTDC 437


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 70/311 (22%), Positives = 120/311 (38%), Gaps = 22/311 (7%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS S++   V C    C+   S       C Y   Y  + + + G L  D L L   S
Sbjct: 180 FDPSQSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYG-DMSQTDGNLARDTLTLGPSS 238

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGLIQNSFSI 163
             +    +Q   + GCG   TG +      DG+ GLG   VS+ S   AK G     FS 
Sbjct: 239 SSSSSDQLQ-EFVFGCGDDDTGLF---GKADGLFGLGRDRVSLASQAAAKYGA---GFSY 291

Query: 164 CFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ- 218
           C   + +  G +  G   P   + T+ +   +    Y++ +    +    +  S   F+ 
Sbjct: 292 CLPSSSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRT 351

Query: 219 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG----NSWKYCYNASSEEMLKV 272
              ++DSG   T LP+  YA +   F  L+  +R S +     +    CY+ +    +++
Sbjct: 352 PGTVIDSGTVITRLPSRAYAALRSSFAGLM--RRYSYKRAPALSILDTCYDFTGRNKVQI 409

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
           P + L+F    +  +      +  N+            D    I+G        +V+D  
Sbjct: 410 PSVALLFDGGATLNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVA 469

Query: 333 NLKLAWSHSKC 343
           N K+ +    C
Sbjct: 470 NQKIGFGAKGC 480


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 124/314 (39%), Gaps = 34/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + P++SSS   ++C    C S   SSC++ +  C Y  +Y  + + + G  V + +    
Sbjct: 201 FTPAASSSYSPLTCDSQQCNSLQMSSCRNGQ--CRYQVNYG-DGSFTFGDFVTETMSFGG 257

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S   +S+ +GCG    G +        V   GL  +    L   + L   SFS
Sbjct: 258 -------SGTVNSIALGCGHDNEGLF--------VGAAGLLGLGGGPLSLTSQLKATSFS 302

Query: 163 ICFDENDSG--SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGF 217
            C    DS   S    +  P      + L    K D  Y+VG+    +G   L   Q  F
Sbjct: 303 YCLVNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVF 362

Query: 218 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
           +         +VD G + T L +E Y  +   F  +    R +     +  CY+ S +  
Sbjct: 363 KLDDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSS 422

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
           +KVP +   F   +S+ +    +  P +   T +C     T     IIG     G R+ F
Sbjct: 423 VKVPTVSFHFDGGKSWDLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVSF 481

Query: 330 DRENLKLAWSHSKC 343
           D  N ++ +S +KC
Sbjct: 482 DLANNRVGFSTNKC 495


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 75/326 (23%), Positives = 128/326 (39%), Gaps = 36/326 (11%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +N S + P++S+S   ++C   LC         +  C Y   Y  + + S+G  V D + 
Sbjct: 40  QNDSLFIPNTSTSFTKLACGTELCNGLPYPMCNQTTCVYWYSYG-DGSLSTGDFVYDTIT 98

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           +   +    Q     +   GCG    GS+   A  DG++GLG G +S PS L    +   
Sbjct: 99  MDGINGQKQQVP---NFAFGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNG 150

Query: 160 SFSICFDE-----NDSGSVFFGDQGPATQQSTSFL-----PIGEKYDAYFVGVESYCIGN 209
            FS C  +       +  + FGD    T     ++     P    Y  Y+V +    +G 
Sbjct: 151 KFSYCLVDWLAPPTQTSPLLFGDAAVPTFPGVKYISLLTNPKVPTY--YYVKLNGISVGG 208

Query: 210 SCL--TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSW 258
             L  + + F          + DSG + T L  E++ EV+   +   +   R S   +  
Sbjct: 209 KLLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGL 268

Query: 259 KYCYNASSEEML-KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
             C    +E  L  VP M   F      +  ++ F F E+     +C +++S+  D  II
Sbjct: 269 DLCLGGFAEGQLPTVPSMTFHFEGGDMELPPSNYFIFLESS--QSYCFSMVSSP-DVTII 325

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G       ++ +D    K+ +    C
Sbjct: 326 GSIQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 144/350 (41%), Gaps = 70/350 (20%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSL------KDPCPYIADYSTEDTSSSGYLVDD 96
           S +DP  SSS   + C+ P C++R+   S+      K  C  I  Y+ + +S  G L  D
Sbjct: 99  SVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYA-DASSIEGNLASD 157

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
             H+         +S   + I GC      S  D  +   G++G+  G +   S + + G
Sbjct: 158 TFHIG--------NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMG 206

Query: 156 LIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVES 204
           L    FS C    D SG + FG+            P  Q ST  LP  ++  AY V +E 
Sbjct: 207 L--QKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEG 262

Query: 205 YCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
             + NS L           T +G Q +VDSG  FTFL   +Y  +  +F        K++
Sbjct: 263 IKVANSMLQLPKSVYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVL 321

Query: 247 SSKRISLQGNSWKYCYNA--SSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPE--NEGFT 301
                  QG +   CY    +   +  +P + L+F     S      ++  P       +
Sbjct: 322 EDPNFVFQG-AMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDS 380

Query: 302 VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 344
           V+C T     G+  ++G +++++GH       + FD    ++ ++  +C+
Sbjct: 381 VYCFTF----GNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCD 426


>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 406

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 90/197 (45%), Gaps = 18/197 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           Y P+ ++ +  +  S PLC+   +     + C Y   Y+   +S   Y+ D +  +    
Sbjct: 204 YRPARTADA--LPASDPLCEG--AQHENPNQCDYEISYADGSSSMGVYVRDSMQFVGEDG 259

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
           +        + ++ GCG  Q G  L+     DGV+GL    +S+P+ LA  G+I N+F  
Sbjct: 260 ERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGH 314

Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-- 216
           C   + SG+   +F GD     +   +++PI  G   D     V+    G+  L   G  
Sbjct: 315 CMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKL 373

Query: 217 FQALVDSGASFTFLPTE 233
            Q + D+G+++T+ P E
Sbjct: 374 TQVVFDTGSTYTYFPDE 390


>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
          Length = 152

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 4   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSSKGK 63

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DSG+++T 
Sbjct: 64  GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 122

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E+V K    +S
Sbjct: 123 VPAQIYNEIVSKVRGTLS 140


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 57/253 (22%), Positives = 104/253 (41%), Gaps = 24/253 (9%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D+ +   DP++SS+   + C  P C++          C Y+  Y  + + + G +  D  
Sbjct: 122 DQGIPLLDPAASSTYAALPCGAPRCRALPFTSCGGRSCVYVYHYG-DKSVTVGKIATDRF 180

Query: 99  HLASFSKHAPQSSVQSS--VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
                 +     S+ ++  +  GCG    G +       G+ G G G  S+PS L     
Sbjct: 181 TFGDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNA--- 235

Query: 157 IQNSFSICFD---ENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESY 205
              SFS CF    ++ S  V  G    A          ++T       +   YF+ ++  
Sbjct: 236 --TSFSYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGI 293

Query: 206 CIGNSCLT--QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
            +G + L   ++ F++ ++DSGAS T LP E+Y  V  +F   V      ++G++   C+
Sbjct: 294 SVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCF 353

Query: 263 NASSEEMLKVPDM 275
                 + + P +
Sbjct: 354 ALPVSALWRRPAV 366


>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
          Length = 154

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIGGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E+V K    +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 88/358 (24%), Positives = 145/358 (40%), Gaps = 72/358 (20%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLC--------KSR-----SSCKSLKDPCP-YIADY 82
           V    +S++ P  SSS K V C +P C        KSR     S  +   D CP Y   Y
Sbjct: 174 VDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQY 233

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
            +  T+  G L+ + L L +  K  P        ++GC      S +    P G+ G G 
Sbjct: 234 GSGATA--GILLSETLDLEN--KRVPD------FLVGC------SVMSVHQPAGIAGFGR 277

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS----FLPIGEK---- 194
           G  S+PS +          S  FD++   S    D G  + +S +    + P  E     
Sbjct: 278 GPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVS 337

Query: 195 ----YDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVV 239
                + Y++ +    IG   +           T +G  A++DSG++FTFL   I+  + 
Sbjct: 338 NAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNG-GAIIDSGSTFTFLDKPIFEAIA 396

Query: 240 VKFDKLV----SSKRISLQGNSWKYCYN-ASSEEMLKVPDMRLIFS--KNQSFVVRNHIF 292
            + +K +     +K +  Q +  + C+N    EE  + PD+ L F      S    N++ 
Sbjct: 397 DELEKQLVKYPRAKDVEAQ-SGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYL- 454

Query: 293 SFPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIV-FDRENLKLAWSHSKC 343
           +   +EG  V CLT+M+ +          II   F   + +V +D    ++ +   KC
Sbjct: 455 AMVTDEG--VVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 75/328 (22%), Positives = 123/328 (37%), Gaps = 52/328 (15%)

Query: 44  EYDPSSSSSSKNVSCSHPLCKSRS--SCKS-LKDPCPYIADYSTEDTSS-----SGYLVD 95
            +DP+ SS+ + V C  P C      SC   L   C +   Y+     +     +  L D
Sbjct: 146 SFDPTRSSTYRPVRCGAPQCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHD 205

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           D+  +A+++              GC    TG  +    P G++G G G +S PS      
Sbjct: 206 DVDAVAAYT-------------FGCLHVVTGGSVP---PQGLVGFGRGPLSFPSQTKD-- 247

Query: 156 LIQNSFSICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
           +  + FS C       N SG++  G  G P   ++T  L    +   Y+V +    +G  
Sbjct: 248 VYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGR 307

Query: 211 CLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
            +            SG   +VD+G  FT L   +YA V   F   V +      G  +  
Sbjct: 308 PVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGPLGG-FDT 366

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-----TDGDYG 315
           CYN +    + VP +   F    S  +         + G  + CL + +      D    
Sbjct: 367 CYNVT----ISVPTVTFSFDGRVSVTLPEENVVIRSSSG-GIACLAMAAGPPDGVDAALN 421

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           ++       HR++FD  N ++ +S   C
Sbjct: 422 VLASMQQQNHRVLFDVANGRVGFSRELC 449


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 83/349 (23%), Positives = 141/349 (40%), Gaps = 66/349 (18%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVD 95
           R    + P+SSS+   + C+  LC+  +S    C +    C Y   Y    T+  GYL  
Sbjct: 127 RPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG--CVYYYPYGMGFTA--GYLAT 182

Query: 96  DILHL--ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           + LH+  ASF   A   S ++ V              G +  G++GLG   +S   L+++
Sbjct: 183 ETLHVGGASFPGVAFGCSTENGV--------------GNSSSGIVGLGRSPLS---LVSQ 225

Query: 154 AGLIQNSFSICF----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVES 204
            G+    FS C     D  DS  + FG     T  +    P+ E  +      Y+V +  
Sbjct: 226 VGV--GRFSYCLRSDADAGDS-PILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTG 282

Query: 205 YCIGNSCL----TQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
             +G + L    T  GF            +VDSG + T+L  E YA V   F   +++  
Sbjct: 283 ITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATAN 342

Query: 251 ISLQGNSWKY----CYNASSE---EMLKVPDMRLIFSKNQSFVVRNH----IFSFPENEG 299
           ++   N  ++    C++A++      + VP + L F+    + VR      + +      
Sbjct: 343 LTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGR 402

Query: 300 FTVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
             V CL V+  S      IIG    M   +++D +    +++ + C  V
Sbjct: 403 AAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 451


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 79/326 (24%), Positives = 126/326 (38%), Gaps = 42/326 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           + P +S S   + CS   CK     + ++C S   PC Y  DY  ++ S+       I+ 
Sbjct: 154 FRPKTSRSWAPIPCSSDTCKLDVPFTLANCSSPASPCTY--DYRYKEGSAG---ARGIVG 208

Query: 100 LASFSKHAPQSSVQ--SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
             S +   P   V     V++GC     G     A  DGV+ LG   +S  +    A   
Sbjct: 209 TESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSA--DGVLSLGNAKISFAT--QAAARF 264

Query: 158 QNSFSICF-----DENDSGSVFFGD----QGPATQQSTSFLP----IGEKYDAYFVGVES 204
             SFS C        N +G + FG     + PATQ      P     G K DA  V  ++
Sbjct: 265 GGSFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKA 324

Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN 263
             I            ++DSG + T L    Y  VV    K L    ++S     +++CYN
Sbjct: 325 LDIPAEVWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFP--PFEHCYN 382

Query: 264 ASSEEMLK---VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GII 317
            ++        +P + + F+ +         +      G  V C+ V   +G++    +I
Sbjct: 383 WTARRPGAPEIIPKLAVQFAGSARLEPPAKSYVIDVKPG--VKCIGVQ--EGEWPGLSVI 438

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G      H   FD +N+++ +  S C
Sbjct: 439 GNIMQQEHLWEFDLKNMQVRFKQSNC 464


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/341 (22%), Positives = 139/341 (40%), Gaps = 59/341 (17%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
           + +DPS SS+   + C+HP+CK R    +L   C      + + +  + T + G LV + 
Sbjct: 137 ASFDPSLSSTFSTLPCTHPVCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREK 196

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----------- 146
               +FS+    S     +I+GC  + T        P G++G+  G +S           
Sbjct: 197 F---TFSR----SLFTPPLILGCATESTD-------PRGILGMNRGRLSFASQSKITKFS 242

Query: 147 --VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGV 202
             VP+ + + G     SF +  + N +   +      A  Q    L P+     AY V +
Sbjct: 243 YCVPTRVTRPGYTPTGSFYLGHNPNSNTFRYIEMLTFARSQRMPNLDPL-----AYTVAL 297

Query: 203 ESYCIGNSCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KR 250
           +   IG   L  S            Q ++DSG+ FT+L  E Y +V  +  + V    K+
Sbjct: 298 QGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKK 357

Query: 251 ISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVM 308
             + G     C++ ++ E+ + + DM   F K    VV +  + +  E     V C+ + 
Sbjct: 358 GYVYGGVADMCFDGNAIEIGRLIGDMVFEFEKGVQIVVPKERVLATVEGG---VHCIGIA 414

Query: 309 STD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           ++D       IIG        + FD  N ++ +  + C  +
Sbjct: 415 NSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADCSRL 455


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 150/352 (42%), Gaps = 75/352 (21%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLV 94
           S ++P SSSS   + CS P+C++R+  + L +P        C  I  Y+ + +S  G L 
Sbjct: 76  SVFNPLSSSSYSPIPCSSPVCRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLA 132

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAK 153
            D   +         SS     + GC      S   + A   G+MG+  G +   S + +
Sbjct: 133 SDNFRIG--------SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQ 181

Query: 154 AGLIQNSFSICFDEND-SGSVFFGDQ----------GPATQQSTSFLPIGEKYDAYFVGV 202
            GL +  FS C    D SG + FGD            P  Q ST  LP  ++  AY V +
Sbjct: 182 LGLPK--FSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQL 237

Query: 203 ESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
           +   +GN  L           T +G Q +VDSG  FTFL   +Y  +  +F +       
Sbjct: 238 DGIRVGNKILPLPKSIFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLA 296

Query: 252 SLQGNSWKY------CYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT--- 301
            L   ++ +      CY   +  ++ ++P + L+F +    VV   +  + +  G     
Sbjct: 297 PLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMF-RGAEMVVGGEVLLY-KVPGMMKGK 354

Query: 302 --VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 344
             V+CLT  ++D    ++G + F++GH       + FD    ++ +  ++C+
Sbjct: 355 EWVYCLTFGNSD----LLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCD 402


>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
           AltName: Full=Nucellin-like protein; Flags: Precursor
          Length = 410

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/299 (20%), Positives = 123/299 (41%), Gaps = 36/299 (12%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
           K+ C Y   Y     SS G L+ D     SFS  A   +  +S+  GCG  Q  +  +  
Sbjct: 112 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 164

Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
            P +G++GLG G V++ S L   G+I ++    C      G +FFGD    T   T + P
Sbjct: 165 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 223

Query: 191 IGEKYDAY--FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
           +  ++  Y    G   +   +  ++ +  + + DSGA++T+   + Y   +      +S 
Sbjct: 224 MNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSK 283

Query: 249 K-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFS 293
           +      +  +  +   C+    +++  + +++  F           K  +  +    + 
Sbjct: 284 ECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYL 342

Query: 294 FPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
               EG    CL ++    ++       +IG   M+   +++D E   L W + +C+ +
Sbjct: 343 IISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 86/349 (24%), Positives = 143/349 (40%), Gaps = 70/349 (20%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSL------KDPCPYIADYSTEDTSSSGYLVDD 96
           S +DP  SSS   + C+ P C++R+   S+      K  C  I  Y+ + +S  G L  D
Sbjct: 92  SVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYA-DASSIEGNLASD 150

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
             H+         +S   + I GC      S  D  +   G++G+  G +   S + + G
Sbjct: 151 TFHIG--------NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMG 199

Query: 156 LIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVES 204
           L    FS C    D SG + FG+            P  Q ST  LP  ++  AY V +E 
Sbjct: 200 L--QKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEG 255

Query: 205 YCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
             + NS L           T +G Q +VDSG  FTFL   +Y  +  +F        K++
Sbjct: 256 IKVANSMLQLPKSVYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVL 314

Query: 247 SSKRISLQGNSWKYCYNA--SSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPE--NEGFT 301
                  QG +   CY    +   +  +P + L+F     S      ++  P       +
Sbjct: 315 EDPNFVFQG-AMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDS 373

Query: 302 VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKC 343
           V+C T     G+  ++G +++++GH       + FD    ++ ++  +C
Sbjct: 374 VYCFTF----GNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 86/355 (24%), Positives = 130/355 (36%), Gaps = 72/355 (20%)

Query: 40  RNLSEYDPSS------SSSSKNVSCSHPLCKSRSSCKSLKDP-CPYIADYST---EDTSS 89
           RN S + P++      SS+     C  P+C  R   K  + P C +   +ST   E   +
Sbjct: 116 RNCSHHSPATVFFPRHSSTFSPAHCYDPVC--RLVPKPDRAPICNHTRIHSTCHYEYGYA 173

Query: 90  SGYLVDDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGD 144
            G L   +      S    + + +   SV  GCG + +G  + G +    +GVMGLG G 
Sbjct: 174 DGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGP 233

Query: 145 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------- 197
           +S  S L +     N FS C  +              +   TS+L IG   D        
Sbjct: 234 ISFASQLGRR--FGNKFSYCLMDYT-----------LSPPPTSYLIIGNGGDGISKLFFT 280

Query: 198 -----------YFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYA 236
                      Y+V ++S  +  + L                 +VDSG +  FL    Y 
Sbjct: 281 PLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYR 340

Query: 237 EVVVKFDKLVSSKRISLQGNSWKYCYNASS----EEMLKVPDMRLIFSKNQSFV--VRNH 290
            V+    + V           +  C N S     E++L  P ++  FS    FV   RN+
Sbjct: 341 SVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKIL--PRLKFEFSGGAVFVPPPRNY 398

Query: 291 IFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                E     + CL + S D   G  +IG     G    FDR+  +L +S   C
Sbjct: 399 FIETEEQ----IQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449


>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
          Length = 410

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/299 (20%), Positives = 123/299 (41%), Gaps = 36/299 (12%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
           K+ C Y   Y     SS G L+ D     SFS  A   +  +S+  GCG  Q  +  +  
Sbjct: 112 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 164

Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
            P +G++GLG G V++ S L   G+I ++    C      G +FFGD    T   T + P
Sbjct: 165 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 223

Query: 191 IGEKYDAY--FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
           +  ++  Y    G   +   +  ++ +  + + DSGA++T+   + Y   +      +S 
Sbjct: 224 MNREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSK 283

Query: 249 K-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFS 293
           +      +  +  +   C+    +++  + +++  F           K  +  +    + 
Sbjct: 284 ECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYL 342

Query: 294 FPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
               EG    CL ++    ++       +IG   M+   +++D E   L W + +C+ +
Sbjct: 343 IISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 132/328 (40%), Gaps = 45/328 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPSSSS+   + CS  LC     S+C S    C Y   Y  + +S+ G L  +   LA 
Sbjct: 160 FDPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYG-DASSTQGVLAAETFTLA- 217

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                   +    V  GCG    G  +  GA   G++GLG G +   SL+++ GL    F
Sbjct: 218 -------KTKLPGVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--GKF 262

Query: 162 SIC---FDENDSGSVFFGD--------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
           S C    D+     +  G            A  Q+T  +    +   Y+V +++  +G++
Sbjct: 263 SYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGST 322

Query: 211 C--LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
              L  S F          +VDSG S T+L  + Y  +   F   +              
Sbjct: 323 RIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDL 382

Query: 261 CYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
           C+   AS  + ++VP + L F       +    +   ++      CLTVM + G   IIG
Sbjct: 383 CFKAPASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSAS-GALCLTVMGSRG-LSIIG 440

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                  + V+D +   L+++  +C ++
Sbjct: 441 NFQQQNIQFVYDVDKDTLSFAPVQCAKL 468


>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
          Length = 147

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 4/129 (3%)

Query: 116 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSV 173
           +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      G +
Sbjct: 2   IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 61

Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTFLPT 232
           + GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DSG+++T +P 
Sbjct: 62  YVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 120

Query: 233 EIYAEVVVK 241
           +IY E+V K
Sbjct: 121 QIYNEIVSK 129


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/182 (30%), Positives = 84/182 (46%), Gaps = 22/182 (12%)

Query: 118 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFG 176
            GCGR   G +  GA  DG++GLG G +S  S  A     +  FS C  E DS GS+ FG
Sbjct: 258 FGCGRNNEGDFGSGA--DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFG 313

Query: 177 DQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDS 223
           ++  +   S  F  +         E+   YFV +    +GN  L    S F +   ++DS
Sbjct: 314 EKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDS 373

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
           G   T LP   Y+ +   F K ++   +S     +G+    CYN S  + + +P++ L F
Sbjct: 374 GTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHF 433

Query: 280 SK 281
            +
Sbjct: 434 GE 435


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 84/369 (22%), Positives = 145/369 (39%), Gaps = 55/369 (14%)

Query: 7   FGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHP 61
            G+ A  Y+A+L    + L+W     CLL        D+    +DP++SS+ +++ CS P
Sbjct: 98  IGTPARFYSAILDTG-SDLIWTQCAPCLLCV------DQPTPYFDPANSSTYRSLGCSAP 150

Query: 62  LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
            C +       +  C Y   Y  +  S++G L ++     +         +      GCG
Sbjct: 151 ACNALYYPLCYQKTCVYQYFYG-DSASTAGVLANETFTFGTNDTRVTLPRIS----FGCG 205

Query: 122 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFG-- 176
               GS  +G+   G++G G G +S+ S L         FS C   F       ++FG  
Sbjct: 206 NLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVRSRLYFGAY 257

Query: 177 ----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQALV 221
                   +T QST F+        YF+ +    +G + L           T      ++
Sbjct: 258 ATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTII 317

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKYCYN--ASSEEMLKVPDMR 276
           DSG + T+L    Y  V   F   ++S    L   + +    C+       + + +P + 
Sbjct: 318 DSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLV 377

Query: 277 LIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
           L F   +    ++N++   P   G    CL  M+T  D  IIG        +++D EN  
Sbjct: 378 LHFDGADWELPLQNYMLVDPSTGG---LCL-AMATSSDGSIIGSYQHQNFNVLYDLENSL 433

Query: 336 LAWSHSKCE 344
           L++  + C 
Sbjct: 434 LSFVPAPCN 442


>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
          Length = 154

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 63/132 (47%), Gaps = 4/132 (3%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I +N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 229
           G ++ GD  P T+  T + P+ E    Y  G+    I    +     F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVK 241
           +P +IY E+V K
Sbjct: 125 VPAQIYNEIVSK 136


>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
          Length = 423

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/299 (20%), Positives = 123/299 (41%), Gaps = 36/299 (12%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
           K+ C Y   Y     SS G L+ D     SFS  A   +  +S+  GCG  Q  +  +  
Sbjct: 125 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 177

Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
            P +G++GLG G V++ S L   G+I ++    C      G +FFGD    T   T + P
Sbjct: 178 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 236

Query: 191 IGEKYDAY--FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
           +  ++  Y    G   +   +  ++ +  + + DSGA++T+   + Y   +      +S 
Sbjct: 237 MNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSK 296

Query: 249 K-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFS 293
           +      +  +  +   C+    +++  + +++  F           K  +  +    + 
Sbjct: 297 ECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYL 355

Query: 294 FPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
               EG    CL ++    ++       +IG   M+   +++D E   L W + +C+ +
Sbjct: 356 IISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 412


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 73/352 (20%), Positives = 137/352 (38%), Gaps = 63/352 (17%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADY 82
           +    +  + P  SSSSK V C +P C        +S C+S           CP Y+  Y
Sbjct: 123 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY 182

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
            +   S++G L+ + L      K  P      + ++GC      S+L    P G+ G G 
Sbjct: 183 GS--GSTAGLLLSETLDFPD--KKIP------NFVVGC------SFLSIHQPSGIAGFGR 226

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-------- 194
           G  S+PS +          S  FD++        D         ++ P  +         
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286

Query: 195 YDAYFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK 244
            + Y++ +    +GN  +                +++DSG++FTF+   +   V  +F+K
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346

Query: 245 LVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGF 300
            +++   +    +    + C++ S E+ +K P++   F     + +  N+ F+   + G 
Sbjct: 347 QLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG- 405

Query: 301 TVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            V CLTV++   + G         I+G        + +D  N +L +    C
Sbjct: 406 -VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 87/357 (24%), Positives = 147/357 (41%), Gaps = 78/357 (21%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++P+ SSS   +SCS P C +R+       SC S  + C     Y+ + +SS G L  D 
Sbjct: 107 FNPNISSSYTPISCSSPTCTTRTRDFPIPASCDS-NNLCHATLSYA-DASSSEGNLASDT 164

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAK 153
                       SS    ++ GC      SY   +  D    G+MG+ LG +S+ S L  
Sbjct: 165 FGFG--------SSFNPGIVFGC---MNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKI 213

Query: 154 AGLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGV 202
                  FS C   +D SG +  G+            P  Q ST  LP  ++  AY V +
Sbjct: 214 P-----KFSYCISGSDFSGILLLGESNFSWGGSLNYTPLVQISTP-LPYFDR-SAYTVRL 266

Query: 203 ESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
           E   I +  L  SG           Q + D G  F++L   +Y  +  +F    +    +
Sbjct: 267 EGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRA 326

Query: 253 LQGNSWKY------CYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF---- 300
           L   ++ +      CY    +  E+ ++P + L+F   +  V  + +       GF    
Sbjct: 327 LDDPNFVFQIAMDLCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLLY--RVPGFVWGN 384

Query: 301 -TVFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCEEVIDK 349
            +V+C T  ++D    ++G + F++GH       + FD    ++  +H++C+ V  K
Sbjct: 385 DSVYCFTFGNSD----LLGVEAFIIGHHHQQSMWMEFDLVEHRVGLAHARCDLVGQK 437


>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
          Length = 154

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E++ K    +S
Sbjct: 125 VPAQIYNEILSKVRGTLS 142


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 143/329 (43%), Gaps = 73/329 (22%)

Query: 43   SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLV 94
            S ++P SSSS   + CS P+C++R+  + L +P        C  I  Y+ + +S  G L 
Sbjct: 1036 SVFNPLSSSSYSPIPCSSPICRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLA 1092

Query: 95   DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAK 153
             D   +         SS     + GC      S   + A   G+MG+  G +   S + +
Sbjct: 1093 SDNFRIG--------SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQ 1141

Query: 154  AGLIQNSFSICFDEND-SGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGV 202
             GL +  FS C    D SG + FGD            P  Q ST  LP  ++  AY V +
Sbjct: 1142 LGLPK--FSYCISGRDSSGVLLFGDLHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQL 1197

Query: 203  ESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL------ 245
            +   +GN  L           T +G Q +VDSG  FTFL   +Y  +  +F +       
Sbjct: 1198 DGIRVGNKILPLPKSIFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLA 1256

Query: 246  -VSSKRISLQGNSWKYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHI--FSFPE----N 297
             +       QG +   CY+ A+  ++  +P + L+F +    VV   +  +  PE    N
Sbjct: 1257 PLGDPNFVFQG-AMDLCYSVAAGGKLPTLPSVSLMF-RGAEMVVGGEVLLYRVPEMMKGN 1314

Query: 298  EGFTVFCLTVMSTDGDYGIIG-QNFMMGH 325
            E   V+CLT  ++D    ++G + F++GH
Sbjct: 1315 E--WVYCLTFGNSD----LLGIEAFVIGH 1337


>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 354

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 65/145 (44%), Gaps = 21/145 (14%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           + +Y P  ++    V C  P+C      ++  C + K+ C Y  +Y+ +  SS G LV D
Sbjct: 94  IRQYKPKGNT----VPCLDPICLALHFPNKPQCPNPKEQCDYEVNYA-DQGSSMGALVID 148

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLA 152
              L   +     S++Q  +  GCG  Q    L  A P     GV+GLG G + V   L 
Sbjct: 149 QFPLKLLNG----SAMQPRLAFGCGYDQI---LPKAHPPPATAGVLGLGRGKIGVLPQLV 201

Query: 153 KAGLIQNSFSICFDENDSGSVFFGD 177
            AGL +N    C      G +FFGD
Sbjct: 202 AAGLTRNVVGHCLSSKGGGYLFFGD 226


>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
          Length = 149

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +     F+A+ DSG+++T 
Sbjct: 66  GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E++ K    +S
Sbjct: 125 VPAQIYNEILSKVRGTLS 142


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 139/341 (40%), Gaps = 46/341 (13%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYL 93
           +N + YDP +S+S KN++C+ P C   S       CKS    CPY   Y     ++  + 
Sbjct: 192 QNGAFYDPKASASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFA 251

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           V+      + S  + +     +++ GCG    G +   A    ++GLG G +S  S L  
Sbjct: 252 VETFTVNLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL-- 306

Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQS----TSFLPIGEKY--DAYFVGV 202
             L  +SFS C      D N S  + FG+            TSF+   E      Y+V +
Sbjct: 307 QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQI 366

Query: 203 ESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-I 251
           +S  +    L          +      ++DSG + ++     Y  +  K  +    K  +
Sbjct: 367 KSIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPV 426

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLT 306
                    C+N S  + +++P++ + F+          +++FP    F      + CL 
Sbjct: 427 YRDFPILDPCFNVSGIDSIQLPELGIAFADGA-------VWNFPTENSFIWLNEDLVCLA 479

Query: 307 VMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           ++ T    + IIG        I++D +  +L ++ +KC ++
Sbjct: 480 ILGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCADI 520


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 77/354 (21%), Positives = 138/354 (38%), Gaps = 67/354 (18%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADY 82
           +    +  + P  SSSSK V C +P C        +S C+S           CP Y+  Y
Sbjct: 123 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY 182

Query: 83  STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
            +   S++G L+ + L      K  P      + ++GC      S+L    P G+ G G 
Sbjct: 183 GS--GSTAGLLLSETLDFPD--KXIP------NFVVGC------SFLSIHQPSGIAGFGR 226

Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-------- 194
           G  S+PS +          S  FD++        D         ++ P  +         
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286

Query: 195 YDAYFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK 244
            + Y++ +    +GN  +                +++DSG++FTF+   +   V  +F+K
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346

Query: 245 -LVSSKRI----SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENE 298
            L +  R     +L G   + C++ S E+ +K P++   F     + +  N+ F+   + 
Sbjct: 347 QLANWTRATDVETLTG--LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSS 404

Query: 299 GFTVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           G  V CLTV++   + G         I+G        + +D  N +L +    C
Sbjct: 405 G--VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
 gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
          Length = 500

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/276 (24%), Positives = 114/276 (41%), Gaps = 50/276 (18%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK-HAPQ 109
           S +K+ SC       +  C    + C  I D +   +++ G L +D+L + S S  +  Q
Sbjct: 97  SLAKSDSCGDCFSSPKPGCN---NTCGLIPDNTITHSATRGDLAEDVLSIQSTSGFNTGQ 153

Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
           + V S  +  C        L G A  G+ GLG   +++PS LA A + +  F+ CF  +D
Sbjct: 154 NVVVSRFLFSCAPTSLLRGLAGGA-SGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSD 212

Query: 170 SGSVFFGDQGPAT--------------QQSTSFLPI-------------GEKYDAYFVGV 202
            G + FGD GP +               +S ++ P+             GE    YF+GV
Sbjct: 213 -GVIIFGD-GPYSFLADNPSLPNVVFDSKSLTYTPLLINHVSTASAFLQGESSVEYFIGV 270

Query: 203 ESYCI-GNSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFDKLVSSKRIS 252
           ++  I G      S   ++ + G           +T L   IY  V   F K   ++ I+
Sbjct: 271 KTIKIDGKVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNIT 330

Query: 253 LQGNS--WKYCYN----ASSEEMLKVPDMRLIFSKN 282
            + +S  +++CY+      +     VP + L+   N
Sbjct: 331 TEDSSPPFEFCYSFDNLPGTPLGASVPTIELLLQNN 366


>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
 gi|219887685|gb|ACL54217.1| unknown [Zea mays]
          Length = 292

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 117/264 (44%), Gaps = 30/264 (11%)

Query: 114 SSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 172
           + ++ GCG  Q G  L+     DGV+GL    +S+P+ LA  G+I N+F  C   + SG+
Sbjct: 21  ADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGA 80

Query: 173 ---VFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG--FQALVDSGA 225
              +F GD     +   +++PI  G   D     V+    G+  L   G   Q + D+G+
Sbjct: 81  GGYLFLGDD-YIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGS 139

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YC----YNASSEEMLK--VPDMR 276
           ++T+ P E    ++    +  S + +  Q +S K   +C    +   S E +K     + 
Sbjct: 140 TYTYFPDEALTRLISSLKEAASPRFV--QDDSDKTLPFCMKSDFPVRSVEDVKHFFKPLS 197

Query: 277 LIFSKN----QSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYG---IIGQNFMMGHRIV 328
           L F K     ++F +R   +    ++G    CL V++ T   Y    I+G   + G  + 
Sbjct: 198 LQFEKRFFFSRTFNIRPEHYLVISDKGNV--CLGVLNGTTIGYDSVVIVGDVSLRGKLVA 255

Query: 329 FDRENLKLAWSHSKCEEVIDKSHV 352
           +D +  ++ W    C     +S +
Sbjct: 256 YDNDKNEVGWVDFDCTNPRKRSRI 279


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/309 (25%), Positives = 132/309 (42%), Gaps = 41/309 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DPS S + K + CS   C+S R++  S  + C Y  DY  + + S G L  + L L S 
Sbjct: 133 FDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYG-DGSHSDGDLSVETLTLGST 191

Query: 104 ---SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
              S H P++      +IGCG    G++ +    +G   +GLG   V  +   +  I   
Sbjct: 192 DGSSVHFPKT------VIGCGHNNGGTFQE----EGSGIVGLGGGPVSLISQLSSSIGGK 241

Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSC 211
           FS C      + N S  + FGD    + + T   P+    G+ +  YF+ +E++ +G++ 
Sbjct: 242 FSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVF--YFLTLEAFSVGDNR 299

Query: 212 L----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
           +                 ++DSG + T LP E Y  +      ++  +R          C
Sbjct: 300 IEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLC 359

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIG-Q 319
           Y  +S+E L +P +   F      V  N I +F P  +G   F          +G +  Q
Sbjct: 360 YKTTSDE-LDLPVITAHFKGAD--VELNPISTFVPVEKGVVCFAFISSKIGAIFGNLAQQ 416

Query: 320 NFMMGHRIV 328
           N ++G+ +V
Sbjct: 417 NLLVGYDLV 425


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 141/367 (38%), Gaps = 75/367 (20%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY----IADYSTEDTSS-----------SG 91
           P + +SS +VSC  P C +  +  S  D C      +    T D SS            G
Sbjct: 124 PPNITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDG 183

Query: 92  YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
            LV   L+  S S  A    V  +   GC     G       P GV G G G +S+P+ L
Sbjct: 184 SLVAR-LYRDSLSMPASSPLVLHNFTFGCAHTALGE------PVGVAGFGRGVLSLPAQL 236

Query: 152 AK-AGLIQNSFSIC-----FD-----------------ENDSGSVFFGDQGPATQQSTSF 188
           A  +  + N FS C     FD                 +++       D+G      T+ 
Sbjct: 237 ASFSPHLGNQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVY--TAM 294

Query: 189 LPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEV 238
           L   +    Y VG+E   +GN  +           +     +VDSG +FT LP  +Y  +
Sbjct: 295 LDNPKHPYFYCVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESL 354

Query: 239 VVKFDKLVSS--KRISL--QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIF 292
           V +F+  +    KR +   +      CY  S +   KVP + L F  N + ++   N+ +
Sbjct: 355 VTEFNHRMGRVYKRATQIEERTGLGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYY 413

Query: 293 SF-----PENEGFTVFCLTVMS------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
            F      + +   V CL +M+      + G    +G     G  +V+D E  ++ ++  
Sbjct: 414 EFFDGRDGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARR 473

Query: 342 KCEEVID 348
           KC  + D
Sbjct: 474 KCALLWD 480


>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 260

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 61/121 (50%), Gaps = 12/121 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +   SSS+ + V+C HP C     C  L+  C Y   Y  + + S G L +DI+   + S
Sbjct: 93  FQTESSSTYQPVNC-HPSCD----CDYLRSQCSYKMHYG-DGSYSRGVLAEDIISFGNES 146

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
           + APQ      ++ GC     GS L     DG++GLG G  ++   L   G+I +SFS+C
Sbjct: 147 EFAPQR-----LVFGCELDAIGS-LYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLC 200

Query: 165 F 165
           +
Sbjct: 201 Y 201


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 128/328 (39%), Gaps = 39/328 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP  S+S + +    P C++  RS     K   C Y   Y  + +++ G  +++ L  A
Sbjct: 176 FDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFA 235

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 P  S      IGCG    G +   AA  G++GLG G +S PS +A  G    SF
Sbjct: 236 G-GVQVPHMS------IGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSF 286

Query: 162 SICFDE--------NDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGVESYCI 207
           S C  +        + S ++  GD   A     SF P  +       Y    VGV    +
Sbjct: 287 SYCLADFFLSSPGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGV 346

Query: 208 GNSCLTQSGFQ---------ALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNS 257
               +T+   +          ++DSG + T L    Y A         V   ++S+ G S
Sbjct: 347 RVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPS 406

Query: 258 --WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
             +  CY      M KVP + + F+      +    +  P +   TV      + D    
Sbjct: 407 GFFDTCYTMGGRAM-KVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVS 465

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IIG     G R+V++    ++ ++ + C
Sbjct: 466 IIGNIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/246 (27%), Positives = 109/246 (44%), Gaps = 27/246 (10%)

Query: 118 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFG 176
            GCGR   G +  G+  DG++GLG G +S  S  A        FS C  E DS GS+ FG
Sbjct: 224 FGCGRNNKGDF--GSGVDGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLFG 279

Query: 177 DQGPATQQSTSF----LPIG----EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDS 223
           ++  AT QS+S     L  G    ++   YFV +    +GN  L    S F +   ++DS
Sbjct: 280 EK--ATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDS 337

Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
               T LP   Y+ +   F K ++   +S     +G+    CYN S  + + +P++ L F
Sbjct: 338 RTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHF 397

Query: 280 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 339
                  VR +  +       +  CL    T  +  IIG    +   +++D +  ++ + 
Sbjct: 398 GGGAD--VRLNGTNIVWGSDASRLCLAFAGTS-ELTIIGNRQQLSLTVLYDIQGRRIGFG 454

Query: 340 HSKCEE 345
            + C +
Sbjct: 455 GNGCSK 460


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 80/322 (24%), Positives = 130/322 (40%), Gaps = 44/322 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S S  +++C  PLC    S  C + K  C Y   Y            D       
Sbjct: 168 FDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYG-----------DGSFTFGD 216

Query: 103 FSKHAP--QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           FS      + +  + V +GCG    G ++  A    ++GLG G +S PS   +     + 
Sbjct: 217 FSTETLTFRRTRVARVALGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQTGRR--FNHK 271

Query: 161 FSICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS--- 210
           FS C  +  + S    + FGD   A  ++  F P+    K D  Y+V +    +G +   
Sbjct: 272 FSYCLVDRSASSKPSSMVFGDS--AVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVP 329

Query: 211 CLTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
            +T S F+         ++DSG S T L    Y      F    S+ + + Q + +  C+
Sbjct: 330 GITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCF 389

Query: 263 NASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
           + S +  +KVP + L F   + S    N++     +  F   CL    T G   IIG   
Sbjct: 390 DLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVDTSGNF---CLAFAGTMGGLSIIGNIQ 446

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
             G R+V+D    ++ ++   C
Sbjct: 447 QQGFRVVYDLAGSRVGFAPHGC 468


>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 435

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/240 (27%), Positives = 98/240 (40%), Gaps = 44/240 (18%)

Query: 46  DPSSSSSSKNVSCSHPLCKSRSS------CKSLKDP------CPYIADYSTEDTSSSGYL 93
           D   SSS   V C   LCK   S      C S   P      C +I        S+SG +
Sbjct: 78  DNYVSSSYTPVRCDSALCKLADSHSCTTECYSSPKPGCYNNTCSHIPYNPVVHVSTSGDI 137

Query: 94  VDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSL 150
             D++ L S     P  +V   +V   CG   TG  L+  A    GV GLG G++S+P+ 
Sbjct: 138 GLDVVSLQSMDGKYPGRNVSVPNVPFVCG---TGFMLENLADGVLGVAGLGRGNISLPAY 194

Query: 151 LAKAGLIQNSFSICFDE--NDSGSVFFGDQ-GPATQQSTSFLPI-------------GEK 194
            + A  +Q+ F+IC     N SG ++FGD  GP +     + P+             G+ 
Sbjct: 195 FSSALGLQSKFAICLSSLTNSSGVIYFGDSIGPLSSDFLIYTPLVRNPVSTAGAYFEGQS 254

Query: 195 YDAYFVGVESYCIGNSCLTQSGFQALVDSGAS----------FTFLPTEIYAEVVVKFDK 244
              YF+ V++  +G   +  +     +D+             +T L T IY  V+  F K
Sbjct: 255 STDYFIAVKTLRVGGKEIKFNKTLLSIDNEGKGGTRISTVHPYTLLHTSIYKAVIKAFAK 314


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/236 (27%), Positives = 102/236 (43%), Gaps = 39/236 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLC-----KSRSSCKSLK-----DPCP-YIADYSTEDTSSSGYL 93
           + P +SSSS+ V C +P C     KS S+C S       D CP Y+  Y +  TS  G L
Sbjct: 141 FHPKNSSSSRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTS--GLL 198

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           + D L L+  S  +  +  ++   IGC             P G+ G G G  SVPS L  
Sbjct: 199 ISDTLRLSPSSSSSAPAPFRN-FAIGCSIVSVHQ-----PPSGLAGFGRGAPSVPSQLKV 252

Query: 154 AGLIQNSFSICFDEND--SGSVFFGD-QGPATQQSTS--FLPIGEKYDA-------YFVG 201
                   S  FD+N   SG +  GD   PA ++ T+  ++P+     +       Y++ 
Sbjct: 253 PKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLA 312

Query: 202 VESYCIG--------NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
           +    +G         + +  SG  A++DSG +FT+L   ++  V    +  V  +
Sbjct: 313 LTGISVGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGR 368


>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
          Length = 154

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
           +  +  GCG KQ        +P DG++GLG+G     + L    +I  N    C      
Sbjct: 6   KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
           G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+ + DSG+++T 
Sbjct: 66  GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEVVFDSGSTYTH 124

Query: 230 LPTEIYAEVVVKFDKLVS 247
           +P +IY E+V K    +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 122/316 (38%), Gaps = 34/316 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++P++SS+ K V C   LC +       R SC +  + C Y   Y  + + S G +  D 
Sbjct: 166 FNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYH-DYSLSVGVVSSDT 224

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L     S+           I GC     G    G    G++G+ +   S+ S +   G  
Sbjct: 225 LTYGLGSQ---------KFIFGCCNLFRGV---GGRYSGILGMSVNKFSLFSQMT-VGHR 271

Query: 158 QNSFSICFDE-NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQ 214
             + S CF    + G + FG +    +    F P+    + YFV V +  +    L    
Sbjct: 272 YRAMSYCFPHPRNQGFLQFG-RYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSLDVQS 330

Query: 215 SGFQAL---VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS---EE 268
           SG Q +    D+G  +T LP  ++  +      LV      +  ++ + C+ A     E 
Sbjct: 331 SGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGY-YRVGASTGQTCFQADGNWIEG 389

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            L +P +++ F       + +    F E     VFCL     DG   ++G   +MG   V
Sbjct: 390 DLYMPTVKIEFQNGARITLNSEDLMFMEEP--NVFCLAFKMNDGGDIVLGSRHLMGVHTV 447

Query: 329 FDRENLKLAWSHSKCE 344
            D E + +      C 
Sbjct: 448 VDLEMMTMGLRGQGCN 463


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 82/290 (28%), Positives = 117/290 (40%), Gaps = 34/290 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           YDPS S SS++ +CS P C+         SS  +    C Y   Y  + +++SG LV D 
Sbjct: 213 YDPSKSRSSESFACSSPTCRQLGPYANGCSSSSNSAGQCQYRVRYP-DGSTTSGTLVADQ 271

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
           L L      +P S V      GC     GS+   +   G+M LG G  S+ S  + K G 
Sbjct: 272 LSL------SPTSQV-PKFEFGCSHAARGSF-SRSKTAGIMALGRGVQSLVSQTSTKYGQ 323

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQST-SFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +   FS CF    S   FF    P    S  +  P+ +    Y V +E+  +    L   
Sbjct: 324 V---FSYCFPPTASHKGFFVLGVPRRSSSRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVP 380

Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            T     A +DS    T LP   Y  +   F   +S  R +        CY+ +    + 
Sbjct: 381 PTVFAAGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIM 440

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD---YGIIG 318
           +P + L+F +  + V  +     P    F   CL   ST GD    GIIG
Sbjct: 441 LPTISLVFDRTGAGVQLD-----PSGVLFGS-CLAFASTAGDDRATGIIG 484


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/308 (24%), Positives = 126/308 (40%), Gaps = 32/308 (10%)

Query: 57  SCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 115
           SC  PLC K  +   S +  C Y   Y  +++ + G L  D    A+F+ +  +    S 
Sbjct: 20  SCDSPLCHKLDTGVCSPEKRCNYTYGYG-DNSLTKGVLAQDT---ATFTSNTGKLVSLSR 75

Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSFSICF-----DEN 168
            + GCG   TG + D     G++GLG G     SL+++ G +     FS C      D  
Sbjct: 76  FLFGCGHNNTGGFNDHEM--GLIGLGGGPT---SLISQIGPLFGGKKFSQCLVPFLTDIK 130

Query: 169 DSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGV------ESYCIGNSCLTQSGFQA 219
            S  + FG             P+ ++     +YFV +      ++Y   NS + +     
Sbjct: 131 ISSRMSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKG--NM 188

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLI 278
           LVDSG     LP ++Y  V V+    V  + I+   +   + CY   +   LK P +   
Sbjct: 189 LVDSGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTN--LKGPTLTYH 246

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHRIVFDRENLKLA 337
           F      +     F  P  E   VFCL + + T+ + G+ G      + I FD +   ++
Sbjct: 247 FEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVS 306

Query: 338 WSHSKCEE 345
           +  + C +
Sbjct: 307 FKATDCTK 314


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 126/318 (39%), Gaps = 43/318 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP SS+S   + C  P CKS    +     C Y   Y  + + + G    + + L    
Sbjct: 191 FDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--- 246

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                ++   +V IGCG    G ++  A   G+ G  L   S P     A +   SFS C
Sbjct: 247 -----TAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYC 293

Query: 165 FDENDSGSVF---FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA 219
               DS +V    F    P    +       E    Y++G++   +G   L   +S F+ 
Sbjct: 294 LVNRDSDAVSTLEFNSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEV 353

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
                   ++DSG + T L +E+Y  +   F K    +  +  +SL    +  CY+ SS 
Sbjct: 354 DAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSR 409

Query: 268 EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
           E ++VP +   F + +   +  RN++      +    FC     T     I+G     G 
Sbjct: 410 ESVQVPTVSFHFPEGRELPLPARNYLIPV---DSVGTFCFAFAPTTSSLSIMGNVQQQGT 466

Query: 326 RIVFDRENLKLAWSHSKC 343
           R+ FD  N  + +S   C
Sbjct: 467 RVGFDIANSLVGFSADSC 484


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/318 (24%), Positives = 130/318 (40%), Gaps = 40/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           + P  S++ KNVSC+ P CK   +       C +   Y +  +S +  LV D + LA  +
Sbjct: 117 FAPEKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGS--SSIAANLVQDTITLA--T 172

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
              P      S   GC  K TG+    A P G++GLG G +S+ S      L Q++FS C
Sbjct: 173 DPVP------SYTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYC 221

Query: 165 FDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
                  N SGS+  G    P   + T  L    +   Y+V +E+  +G   +       
Sbjct: 222 LPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAAL 281

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                +G   + DSG  FT L   +Y  V  +F + V  K        +  CYN      
Sbjct: 282 AFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP---- 337

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQNFMMGH 325
           + VP +  IF+     + +++I     +   +  CL +     + +    +I       H
Sbjct: 338 IVVPTITFIFTGMNVTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNH 395

Query: 326 RIVFDRENLKLAWSHSKC 343
           R+++D  N ++  +   C
Sbjct: 396 RVLYDVPNSRVGVARELC 413


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/319 (22%), Positives = 131/319 (41%), Gaps = 42/319 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P  SSS   + C    C+   S +S  + C Y   Y  + +S+ GY+  +        
Sbjct: 138 FNPQDSSSFSTLPCESQYCQDLPS-ESCYNDCQYTYGYG-DGSSTQGYMATETFTF---- 191

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
               ++S   ++  GCG    G      A  G++G+G G +S+PS L         FS C
Sbjct: 192 ----ETSSVPNIAFGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYC 240

Query: 165 FDENDS--------GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQ 214
              + S        GS   G   P    ST+ +        Y++ ++   +G  N  +  
Sbjct: 241 MTSSGSSSPSTLALGSAASGV--PEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPS 298

Query: 215 SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
           S FQ         ++DSG + T+LP + Y  V   F   ++   +    +    C+   S
Sbjct: 299 STFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPS 358

Query: 267 E-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-MSTDGDYGIIGQNFMMG 324
           +   ++VP++ + F      +   ++   P  EG  V CL +  S+     I G      
Sbjct: 359 DGSTVQVPEISMQFDGGVLNLGEENVLISPA-EG--VICLAMGSSSQQGISIFGNIQQQE 415

Query: 325 HRIVFDRENLKLAWSHSKC 343
            ++++D +NL +++  ++C
Sbjct: 416 TQVLYDLQNLAVSFVPTQC 434


>gi|50878437|gb|AAT85211.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 435

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 78/324 (24%), Positives = 125/324 (38%), Gaps = 43/324 (13%)

Query: 53  SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QSS 111
           +K+ +C+   C   +S   L D C    +Y+    S+ G ++ D L L +  +  P   +
Sbjct: 102 AKSAACATG-CSGAASPGCLNDTCTGFPEYTITRVSTGGNIITDKLSLYTTCRPMPVPRA 160

Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND-S 170
                +  CG       L GAA  G+M L     ++P+ +A        F++C    + S
Sbjct: 161 TAPGFLFTCGATSLTKGL-GAAATGMMSLSRARFALPTQVASIFRFSRKFALCLAPAESS 219

Query: 171 GSVFFGDQ----GPATQQSTSFL-------PI----GEKYDAYFVGVESYCI-GNSCLTQ 214
           G V FGD      P    S S +       P+    G+K   YF+GV    + G +    
Sbjct: 220 GVVVFGDAPYEFQPVMDLSKSLIYTPLLVNPVTTTGGDKSTEYFIGVTGIKVNGRAVPLN 279

Query: 215 SGFQALVDSGAS---------FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN-- 263
           +   A+  SG           +T L T IY  V   F    +          +K CY+  
Sbjct: 280 ATLLAIAKSGVGGTKLSMLSPYTVLETSIYKAVTDAFAAETAMIPRVPAVAPFKLCYDGT 339

Query: 264 --ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG----- 315
              S+     VP + L+  SK  S+VV          +G    C  V+  DG        
Sbjct: 340 MVGSTRAGPAVPTVELVLQSKAVSWVVFGANSMVATKDG--ALCFGVV--DGGVAPETSV 395

Query: 316 IIGQNFMMGHRIVFDRENLKLAWS 339
           +IG + M  + + FD E  +L ++
Sbjct: 396 VIGGHMMEDNLLEFDLEGSRLGFT 419


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 74/347 (21%), Positives = 148/347 (42%), Gaps = 69/347 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + +DPS SSS   + CSHPLCK R       +SC S +  C Y   Y+ + T + G LV 
Sbjct: 111 TSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVK 168

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           + +  ++       + +   +I+GC  + +          G++G+  G +   S +++A 
Sbjct: 169 EKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL---SFVSQAK 211

Query: 156 LIQNSFSICFDEND-----SGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
           + + S+ I    N      +GS + GD               P +Q+  +  P+   Y  
Sbjct: 212 ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA--YTV 269

Query: 198 YFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AEVVVKFDKL 245
             +G+  + +    ++ S F        Q +VDSG+ FT L    Y    AE++ +  + 
Sbjct: 270 PMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRR 328

Query: 246 VSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVF 303
           +  K+  + G +   C++ +   + + + D+  +F++    +V +  +     N G  + 
Sbjct: 329 L--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEILVPKERVLV---NVGGGIH 383

Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           C+ +  +        IIG        + FD  N ++ ++ + C  V+
Sbjct: 384 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSRVV 430


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 125/305 (40%), Gaps = 28/305 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS S + K + CS  +C+S     SC S K  C Y   Y  + + S G L  + L L 
Sbjct: 139 FDPSKSKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYG-DGSHSQGDLSVETLTLG 197

Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           S +     SSVQ  + +IGCG    G++    +    +G G   +      +  G     
Sbjct: 198 STNG----SSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYC 253

Query: 161 FSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL---- 212
            +  F +++S S   FGD    +       P+  K  +   Y++ +E++ +G+  +    
Sbjct: 254 LAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVG 313

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                  +      ++DSG + T LP E Y+ +       + + R+S   N    CY  +
Sbjct: 314 GSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTT 373

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQ-NFMM 323
               L VP +   F      V  N I +F +  EG   F          +G + Q N ++
Sbjct: 374 PSGQLDVPVITAHFKGAD--VELNPISTFVQVAEGVVCFAFHSSEVVSIFGNLAQLNLLV 431

Query: 324 GHRIV 328
           G+ ++
Sbjct: 432 GYDLM 436


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 123/312 (39%), Gaps = 30/312 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           YDP+ SSSS   SC+ P C       + C +  + C Y   Y  + TS++G  + D+L +
Sbjct: 175 YDPTKSSSSGVFSCNSPTCTQLGPYANGCTN-NNQCQYRVRYP-DGTSTAGTYISDLLTI 232

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
                  P ++V+ S   GC     GS+  G++  G+M LG G  S+ S    A      
Sbjct: 233 ------TPATAVR-SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRV 283

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL--- 212
           FS CF    +   FF    P        L    K  A     Y V +E+  +    +   
Sbjct: 284 FSHCFPP-PTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 342

Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            T     A +DS  + T LP   Y  +   F   ++  + +        CY+ +      
Sbjct: 343 PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFA 402

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
           +P + L+F KN +  +      F   +G   F  T    D   GIIG   +    ++++ 
Sbjct: 403 LPRITLVFDKNAAVELDPSGVLF---QGCLAF--TAGPNDQVPGIIGNIQLQTLEVLYNI 457

Query: 332 ENLKLAWSHSKC 343
               + + H+ C
Sbjct: 458 PAALVGFRHAAC 469


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 142/334 (42%), Gaps = 57/334 (17%)

Query: 50  SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           SS+ K V+C  P+C+     S S+C      C Y+  Y  + + ++G++  D     +F+
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYG-DRSITAGHIFKD-----TFT 55

Query: 105 KHAPQSS--VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
             +P       S +  GCG   TG ++   +  G+ G G G  S+PS L K G     FS
Sbjct: 56  FMSPNGVPVAVSELAFGCGDYNTGLFVSNES--GIAGFGRGPQSLPSQL-KVG----RFS 108

Query: 163 ICFD---ENDSGSVFFGD-----------QGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
            C     E+ S  V  G             GP       + P+   +  Y++ +E   +G
Sbjct: 109 YCLTLVTESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTF--YYLSLEGITVG 166

Query: 209 NSCLT--QSGFQ--------ALVDSGASFTFLPTEIYA----EVVVKFDKLVSSKRISLQ 254
            + L   +S F          ++DSG S T LP  ++     E+V +F  L         
Sbjct: 167 KTRLPFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQF-PLPRYDNTPEV 225

Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
           G+  + C+      + + VP + L  +     + R++ F    + G  V CL +   +  
Sbjct: 226 GD--RLCFRRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSG--VMCLQINGAEDT 281

Query: 314 YGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
             ++  NF   +  +V+D EN KL ++ ++C+++
Sbjct: 282 TMVLIGNFQQQNMHVVYDVENNKLLFAPAQCDKL 315


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 122/312 (39%), Gaps = 30/312 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           YDP+ SSSS   SC+ P C       + C +  + C Y   Y  + TS++G  + D+L +
Sbjct: 200 YDPTKSSSSGVFSCNSPTCTQLGPYANGCTN-NNQCQYRVRYP-DGTSTAGTYISDLLTI 257

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
                  P ++V+ S   GC     GS+  G++  G+M LG G  S+ S    A      
Sbjct: 258 ------TPATAVR-SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRV 308

Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL--- 212
           FS CF        FF    P        L    K  A     Y V +E+  +    +   
Sbjct: 309 FSHCFPPPTRRG-FFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 367

Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            T     A +DS  + T LP   Y  +   F   ++  + +        CY+ +      
Sbjct: 368 PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFA 427

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
           +P + L+F KN +  +      F   +G   F  T    D   GIIG   +    ++++ 
Sbjct: 428 LPRITLVFDKNAAVELDPSGVLF---QGCLAF--TAGPNDQVPGIIGNIQLQTLEVLYNI 482

Query: 332 ENLKLAWSHSKC 343
               + + H+ C
Sbjct: 483 PAALVGFRHAAC 494


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 120/284 (42%), Gaps = 28/284 (9%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +++DPS SS+   VSC    C++  R++C    + C Y+  Y  + ++++G L  +    
Sbjct: 144 TQFDPSRSSTYGRVSCQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTF 201

Query: 101 A-SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
               S  +P+      V  GC     GS+          G     VS+ + L  A  +  
Sbjct: 202 DDGGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGGGA----VSLVTQLGGATSLGR 257

Query: 160 SFSICF---DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQ 214
            FS C      N S ++ FG     T+   +  P+  G+    Y V ++S  +GN  +  
Sbjct: 258 RFSYCLVPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVAS 317

Query: 215 SGF-QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM---L 270
           +   + +VDSG + TFL   +   +V +  + ++   +       + CYN +  E+    
Sbjct: 318 AASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGE 377

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMST 310
            +PD+ L F    +  ++      PEN    V     CL +++T
Sbjct: 378 SIPDLTLEFGGGAAVALK------PENAFVAVQEGTLCLAIVAT 415


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 129/319 (40%), Gaps = 39/319 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS+S S  NVSC  P C+   S       C S    C Y   Y  + + S G+   + 
Sbjct: 190 FDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSS--STCLYGIRYG-DGSYSIGFFAREK 246

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
           L L S       + V ++   GCG+   G +  G A  G++GL    +S+ S  A K G 
Sbjct: 247 LSLTS-------TDVFNNFQFGCGQNNRGLF-GGTA--GLLGLARNPLSLVSQTAQKYGK 296

Query: 157 IQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLP--IGEKYDAYF--------VGVES 204
           +   FS C     + +G + FG  G    ++  F P  +   Y +++        VG   
Sbjct: 297 V---FSYCLPSSSSSTGYLSFG-SGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERK 352

Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
             I  S  + +G   ++DSG   + LP  +Y+ V   F +L+S        +    CY+ 
Sbjct: 353 LPIPKSVFSTAG--TIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDL 410

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           S  + +KVP + L FS      +      +              S D +  IIG      
Sbjct: 411 SKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKT 470

Query: 325 HRIVFDRENLKLAWSHSKC 343
             +V+D    ++ ++ S C
Sbjct: 471 IHVVYDDAEGRVGFAPSGC 489


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/250 (24%), Positives = 104/250 (41%), Gaps = 27/250 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS S+S  N++C+  LC   S+       C +    C Y   Y  + + S GY   + 
Sbjct: 188 FDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYG-DSSFSVGYFSRER 246

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L + +       + +  + + GCG+   G +   A   G++GLG   +S   +   A + 
Sbjct: 247 LSVTA-------TDIVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAVY 294

Query: 158 QNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           +  FS C     S  G + FG    +  + T F  I      Y + +    +G + L   
Sbjct: 295 RKIFSYCLPATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVS 354

Query: 213 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
             T S   A++DSG   T LP   Y  +   F + +S    + + +    CY+ S  E+ 
Sbjct: 355 SSTFSTGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVF 414

Query: 271 KVPDMRLIFS 280
            +P +   F+
Sbjct: 415 SIPKIDFSFA 424


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 139/330 (42%), Gaps = 45/330 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           YD + SSS   V C+   C    S  +C +   PC Y   Y  +   S+G L  + L   
Sbjct: 135 YDTAVSSSFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYG-DGAYSAGVLGTETLTFP 193

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                AP  SV   +  GCG    G   +     G +GLG G +S   L+A+ G+    F
Sbjct: 194 G----APGVSV-GGIAFGCGVDNGGLSYNST---GTVGLGRGSLS---LVAQLGV--GKF 240

Query: 162 SIC----FDENDSGSVFFGD----QGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNS 210
           S C    F+ +    V FG       P+T    QST  +        Y+V +E   +G++
Sbjct: 241 SYCLTDFFNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDA 300

Query: 211 CLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
            L                 +VDSG +FTFL  E    VVV     V  + +    +    
Sbjct: 301 RLPIPNGTFDLRDDGSGGMIVDSGTTFTFL-VESAFRVVVDHVAGVLRQPVVNASSLDSP 359

Query: 261 CYNASS--EEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVM-STDGDYGI 316
           C+ A++  +++  +PDM L F+      + R++  SF + E  + FCL +  S   D  I
Sbjct: 360 CFPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEE--SSFCLNIAGSPSADVSI 417

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +G       +++FD    +L++  + C ++
Sbjct: 418 LGNFQQQNIQMLFDITVGQLSFMPTDCGKL 447


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 144/371 (38%), Gaps = 57/371 (15%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRS-----SC 69
           N  + L   + L  LL   A      +   + P +SS+   V C+   C+SR      +C
Sbjct: 97  NVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPAC 156

Query: 70  KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYL 129
                 C     Y+ + +SS G L  D+  + S                GC      S  
Sbjct: 157 DGASSRCSVSLSYA-DGSSSDGALATDVFAVGS--------GPPLRAAFGCMSSAFDSSP 207

Query: 130 DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSF 188
           DG A  G++G+  G +S  S  +        FS C  D +D+G +  G     T    ++
Sbjct: 208 DGVASAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLGHSDLPTFLPLNY 262

Query: 189 LPIGEK------YD--AYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTF 229
            P+ +       +D  AY V +    +G   L           T +G Q +VDSG  FTF
Sbjct: 263 TPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAG-QTMVDSGTQFTF 321

Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CYN---ASSEEMLKVPDMRLIFS 280
           L  + Y+ +  +F +       +L   S+ +      C+      S    ++P + L+F+
Sbjct: 322 LLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFN 381

Query: 281 KNQSFVVRNH-IFSFP--ENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDREN 333
             +  V  +  ++  P     G  V+CLT  + D      Y +IG +  M   + +D E 
Sbjct: 382 GAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAY-VIGHHHQMNVWVEYDLER 440

Query: 334 LKLAWSHSKCE 344
            ++  +  +C+
Sbjct: 441 GRVGLAPVRCD 451


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 69/266 (25%), Positives = 109/266 (40%), Gaps = 32/266 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP+ SSS   V C  P+C       SSC + +  C Y+  Y  + + ++G    D L L
Sbjct: 184 FDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQ--CGYVVSYG-DGSKTTGVYSSDTLTL 240

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
                 +P  +V+     GCG  Q+G   +    DG++GLG  + S+  +   AG     
Sbjct: 241 ------SPNDAVR-GFFFGCGHAQSGFTGN----DGLLGLGREEASL--VEQTAGTYGGV 287

Query: 161 FSICFDENDSGSVFFGDQGPATQ-----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
           FS C     S + +    GP+        +T  L        Y V +    +G   L+  
Sbjct: 288 FSYCLPTRPSTTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVP 347

Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEM 269
            S F    +VD+G   T LP   YA +   F   ++S     +        CYN S    
Sbjct: 348 SSVFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGT 407

Query: 270 LKVPDMRLIFSKNQSFVV-RNHIFSF 294
           + +P++ L FS   +  +  + I SF
Sbjct: 408 VTLPNVALTFSGGATVTLGADGILSF 433


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 82/354 (23%), Positives = 135/354 (38%), Gaps = 74/354 (20%)

Query: 59  SHPLCKSRSSCKSLKD-------------------PCPYIADYSTEDTSSSGYLVDDILH 99
           + P C    S  +  D                   PCP  A      T  +G +V  IL 
Sbjct: 146 ASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFA-----YTYGAGGVVTGILT 200

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
             +   +     V   +   C      +Y +   P G+ G G G +   S++++ G +Q 
Sbjct: 201 RDTLRVNGSSPGVAKEIPKFCFGCVGSAYRE---PIGIAGFGRGTL---SMVSQLGFLQK 254

Query: 160 SFSICF-------DENDSGSVFFGDQGPATQQSTSFLPI--GEKY-DAYFVGVESYCIGN 209
            FS CF       + N S  +  GD    ++    F P+     Y + Y+VG+E+  +GN
Sbjct: 255 GFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGN 314

Query: 210 SCLTQ-----SGFQAL------VDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISLQG 255
              T+       F +L      +DSG ++T LP   Y++V+      ++  R   + +Q 
Sbjct: 315 VSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQ- 373

Query: 256 NSWKYCYNA--------SSEEMLKVPDMRLIFSKNQSFVVR--NHIF--SFPENEGFTVF 303
             +  CY          +S+++L  P +   F  N S V+   NH +  S P N    V 
Sbjct: 374 TGFDLCYKVPRPNNNTLTSDDLL--PSITFHFLNNVSLVLPQGNHFYPVSAPGNPA-VVK 430

Query: 304 CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
           CL   ST    DG  G+ G        +V+D E  ++ +    C        +H
Sbjct: 431 CLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCASAASSQGLH 484


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/325 (22%), Positives = 126/325 (38%), Gaps = 34/325 (10%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +N + + P++S+S   ++C   LC         +  C Y   Y  + + ++G  V D + 
Sbjct: 50  QNDALFLPNTSTSFTKLACGSALCNGLPFPMCNQTTCVYWYSYG-DGSLTTGDFVYDTIT 108

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           +   +    Q     +   GCG    GS+   A  DG++GLG G +S  S L    +   
Sbjct: 109 MDGINGQKQQV---PNFAFGCGHDNEGSF---AGADGILGLGQGPLSFHSQLKS--VYNG 160

Query: 160 SFSICFDE-----NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC 211
            FS C  +       +  + FGD          +LPI         Y+V +    +G++ 
Sbjct: 161 KFSYCLVDWLAPPTQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNL 220

Query: 212 LTQS----------GFQALVDSGASFTFLPTEIYAEVVVKFD--KLVSSKRISLQGNSWK 259
           L  S          G   + DSG + T L    Y EV+   +   +  S++I    +   
Sbjct: 221 LNISSTVFDIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKID-DISRLD 279

Query: 260 YCYNASSEEML-KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
            C +   ++ L  VP M   F      +  ++ F + E+     +C   M++  D  IIG
Sbjct: 280 LCLSGFPKDQLPTVPAMTFHFEGGDMVLPPSNYFIYLESS--QSYCF-AMTSSPDVNIIG 336

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                  ++ +D    KL +    C
Sbjct: 337 SVQQQNFQVYYDTAGRKLGFVPKDC 361


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 75/329 (22%), Positives = 128/329 (38%), Gaps = 34/329 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDIL 98
           YD SSSSS + + C+   C        SSC S+K P P    Y   D S ++G L  + +
Sbjct: 72  YDKSSSSSYREIPCTDDECLFLPAPIGSSC-SIKSPSPCDYTYGYSDQSRTTGILAYETI 130

Query: 99  HLASFSK-------HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
            + S  +       H  ++    +V +GC R+  G+   GA+  GV+GLG G +S+ +  
Sbjct: 131 SMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQT 188

Query: 152 AKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 206
               L    FS C  +   GS    F   G    +  +  PI     A   Y+V V    
Sbjct: 189 RHTAL-GGIFSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVA 247

Query: 207 IGNSCL-----TQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
           +    +     +  G         + DSG + ++L    Y++V+   +  +   R     
Sbjct: 248 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 307

Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
             ++ CYN +  E   +P + + F       +  + +     E      L  ++T     
Sbjct: 308 EGFELCYNVTRMEK-GMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSN 366

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           I+G      H I +D    ++ +  S C 
Sbjct: 367 ILGNLLQQDHHIEYDLAKARIGFKWSPCH 395


>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
          Length = 142

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 4/131 (3%)

Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD 177
           CG KQ        +P DG++GLG+G       L    +I +N    C      G ++ GD
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSKGKGVLYVGD 60

Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
             P ++  T ++P+ E    Y  G+    I N  +     F+A+ DSG+++T +P  IY+
Sbjct: 61  FNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAHIYS 119

Query: 237 EVVVKFDKLVS 247
           E+V K    +S
Sbjct: 120 EIVSKVRGTLS 130


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/340 (22%), Positives = 141/340 (41%), Gaps = 55/340 (16%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
           + +DPS SS+   + C+HPLCK R    +L   C      + + +  + T + G LV + 
Sbjct: 111 ASFDPSLSSTFSILPCTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREK 170

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----------- 146
               +FS+    S     +I+GC  + T        P G++G+ LG +S           
Sbjct: 171 F---TFSR----SVSTPPLILGCATESTD-------PRGILGMNLGRLSFAKQSKITKFS 216

Query: 147 --VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
             VP    + G     SF +  + +  G  + G    + Q+  +F P+   Y    VG+ 
Sbjct: 217 YCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLA--YTIPMVGIR 274

Query: 204 --------SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISL 253
                   S  +  +    SG Q ++DSG+ FT+L +E Y +V  +  + V    K+  +
Sbjct: 275 IAGKKLNISPAVFRADAGGSG-QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYV 333

Query: 254 QGNSWKYCYNA--SSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMST 310
            G     C+++  + E    + +M   F +    V+ +  + +   + G  V C+ + S+
Sbjct: 334 YGGVADMCFDSVKAVEIGRLIGEMVFEFERGVEVVIPKERVLA---DVGGGVHCVGIGSS 390

Query: 311 D---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
           D       IIG        + FD    ++ +  + C  ++
Sbjct: 391 DKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKADCSRLV 430


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 79/335 (23%), Positives = 129/335 (38%), Gaps = 54/335 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-----------RSSCKSLKD---PCPYIADYSTEDTSSS 90
           +DPSSS S   V C+   C +            ++C+        C Y   Y  + + S 
Sbjct: 193 FDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYR-DGSYSR 251

Query: 91  GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPS 149
           G L  D L LA          V    + GCG    G    G +  G+MGLG   +S V  
Sbjct: 252 GVLAHDRLSLAG--------EVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQ 301

Query: 150 LLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVG 201
            + + G +   FS C    + + SGS+  GD     + ST  +      D      YFV 
Sbjct: 302 TMDQFGGV---FSYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVN 358

Query: 202 VESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKR 250
           +    +G   +          G +A++DSG   T L   IY    AE + +F +   +  
Sbjct: 359 LTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPG 418

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
            S+       C+N +    ++VP ++L+F       V +    +  +   +  CL +   
Sbjct: 419 FSI----LDTCFNMTGLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPL 474

Query: 311 DGDY--GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             +Y   IIG       R++FD    ++ ++   C
Sbjct: 475 KSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509


>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
          Length = 445

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 61/240 (25%), Positives = 100/240 (41%), Gaps = 43/240 (17%)

Query: 87  TSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAP--DGVMGLGLG 143
           TS+SG L  DI+ + S +   P   V   +VI  CG       L+G A    G+ GLG  
Sbjct: 129 TSTSGELAQDIISIQSTNGSNPSKVVSFPNVIFTCGST---FLLEGLASGVTGIAGLGRK 185

Query: 144 DVSVPSLLAKAGLIQNSFSICFDEND--SGSVFFGDQGPA-------TQQSTSFLPI--- 191
            +++PS  A A   +  F++C   +   +G VFFGD GP          Q+  + P+   
Sbjct: 186 KIALPSQFAAAFSFKRKFALCLSSSTRATGVVFFGD-GPYIMLPNKDVSQNLIYTPLILN 244

Query: 192 ----------GEKYDAYFVGVESYCI-GNSCLTQSGFQALVDSGAS---------FTFLP 231
                     GE    YF+GV+   + G      +   ++   G           +T L 
Sbjct: 245 PVSTAGASFEGEPSADYFIGVKGIKVNGEDVKLNTSLLSIAKDGTGGTKISTTQPYTSLE 304

Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK----VPDMRLIFSKNQSFVV 287
           T IY  V+  F K V+          ++ C+N++S    +    VP + L+   N+++ +
Sbjct: 305 TSIYKAVIGAFGKAVAKVPRVTAVAPFELCFNSTSFSSTRVGPGVPQIDLVLPNNKAWTI 364


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 130/315 (41%), Gaps = 43/315 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DPS SS+    SCS   C    +  + C S    C YI  Y+ + +S++G    D L L
Sbjct: 173 FDPSLSSTYSPFSCSSAACAQLGQDGNGCSS-SSQCQYIVRYA-DGSSTTGTYSSDTLAL 230

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
            S        +  S+   GC   ++G + D    DG+MGLG G    PSL ++ AG    
Sbjct: 231 GS--------NTISNFQFGCSHVESG-FND--LTDGLMGLGGG---APSLASQTAGTFGT 276

Query: 160 SFSICFDENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 216
           +FS C     S S F     G +    T  L        Y V +E+  +G + L+   S 
Sbjct: 277 AFSYCLPPTPSSSGFLTLGAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSV 336

Query: 217 FQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
           F A  ++DSG   T LP   Y+ +   F   +   R +   +    C++ S +  +++P 
Sbjct: 337 FSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPS 396

Query: 275 MRLIFSK------NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
           + L+FS       + + ++  +  +F  N           S D   GI+G        ++
Sbjct: 397 VALVFSGGAVVNLDANGIILGNCLAFAAN-----------SDDSSPGIVGNVQQRTFEVL 445

Query: 329 FDRENLKLAWSHSKC 343
           +D     + +    C
Sbjct: 446 YDVGGGAVGFKAGAC 460


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 128/317 (40%), Gaps = 37/317 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DPS SS+   V C  P C +    C      C Y+  Y  + +S++G L  D L L S 
Sbjct: 194 FDPSKSSTYAAVHCGEPQCAAAGGLCSEDNTTCLYLVHYG-DGSSTTGVLSRDTLALTS- 251

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 S   +    GCG +  G +      DG++GLG G++S+PS  A +      FS 
Sbjct: 252 ------SRALAGFPFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSY 300

Query: 164 CFDENDSGSVFFG-DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
           C   ++S + +      PAT     Q T+ L   +    YFV + S  IG   L      
Sbjct: 301 CLPSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAV 360

Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            T+ G   L+DSG   T+LP + Y  +  +F   +     +   +    CY+ + E  + 
Sbjct: 361 FTRGG--TLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVI 418

Query: 272 VPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGD---YGIIGQNFMMGHR 326
           VP +   F     F +     +    EN G    CL   + D       IIG        
Sbjct: 419 VPAVSFRFGDGAVFELDFFGVMIFLDENVG----CLAFAAMDAGGLPLSIIGNTQQRSAE 474

Query: 327 IVFDRENLKLAWSHSKC 343
           +++D    K+ +  + C
Sbjct: 475 VIYDVAAEKIGFVPASC 491


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 2/125 (1%)

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLI 278
           ++DSG S T L   +Y  V   F       R++  G S +  CY+     ++KVP + + 
Sbjct: 339 ILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVH 398

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
            +      +    +  P +   T FCL +  TDG   I+G     G R+VFD +  ++A 
Sbjct: 399 LAGGAEVALPPENYLIPVDTRGT-FCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVAL 457

Query: 339 SHSKC 343
               C
Sbjct: 458 VPKSC 462


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 147/341 (43%), Gaps = 63/341 (18%)

Query: 47  PSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           P+ SS+   + C+   C+     SR    +    C Y  +Y+     ++GYL  + L + 
Sbjct: 137 PARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVG 194

Query: 102 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
             +F K          V  GC    T + +D ++  G++GLG G +S+ S LA       
Sbjct: 195 DGTFPK----------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG----- 234

Query: 160 SFSICF--DENDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGN 209
            FS C   D  D G+  + FG     T+    QST  L  P  ++   Y+V +    + +
Sbjct: 235 RFSYCLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDS 294

Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
           + L         TQ+G     +VDSG + T+L  + YA V   F   +++   +   +  
Sbjct: 295 TELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA 354

Query: 259 KY----CYNASS---EEMLKVPDMRLIFSKNQSF--VVRNHIFSF-PENEG-FTVFCLTV 307
            Y    CY  S+    + ++VP + L F+    +   V+N+      +++G  TV CL V
Sbjct: 355 PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLV 414

Query: 308 MSTDGDY--GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +    D    IIG    M   +++D +    +++ + C ++
Sbjct: 415 LPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAKL 455


>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
          Length = 127

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 62/125 (49%), Gaps = 4/125 (3%)

Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD 177
           CG KQ        +P DG++GLG+G   + + L    +I +N    C      G ++ GD
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60

Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
             P T+  T ++P+ E    Y  G+    I    +     F+A+ DSG+++T +P +IY 
Sbjct: 61  FNPPTRGVT-WVPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119

Query: 237 EVVVK 241
           E+V K
Sbjct: 120 EIVSK 124


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 65/256 (25%), Positives = 102/256 (39%), Gaps = 43/256 (16%)

Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDS 170
           V  GCG    GS+   AA  GV+GLG G +S  S +  A    N F+ C        + S
Sbjct: 174 VAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDPTSVS 228

Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL--TQSGFQ------- 218
            S+ FGD+  +T     + PI     +   Y+V +E   +G   L  + S ++       
Sbjct: 229 SSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNG 288

Query: 219 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEMLKVPDMR 276
            ++ DSG + T+     Y+ ++  FD  V   R  S+QG     C   +  +    P   
Sbjct: 289 GSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQG--LDLCVELTGVDQPSFPSFT 346

Query: 277 LIFSKNQSFVVRNHIFSFPENEGF------TVFCLT---VMSTDGDYGIIGQNFMMGHRI 327
           + F     F         PE E +       V CL    + S  G +  IG        +
Sbjct: 347 IEFDDGAVFQ--------PEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFV 398

Query: 328 VFDRENLKLAWSHSKC 343
            +DRE   + ++ +KC
Sbjct: 399 QYDREENLIGFAPAKC 414


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 142/335 (42%), Gaps = 34/335 (10%)

Query: 24  TLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 83
           T L CL   G +   ++    +DP  SSS   VSC    C+         + C Y  +Y 
Sbjct: 21  TWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDEAGCNVNSCIYKVEYG 80

Query: 84  TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
            + + + G L  + L           S+   ++ IGCG    G ++       ++GLG G
Sbjct: 81  -DGSFTIGELATETLTFV-------HSNSIPNISIGCGHDNEGLFVGADG---LIGLGGG 129

Query: 144 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD--QGPATQQSTSFLPIGEKYDAY-FV 200
            +S+ S L  +     SFS C  + DS S    D    P +    S L   +++ ++ +V
Sbjct: 130 AISISSQLKAS-----SFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFRYV 184

Query: 201 GVESYCIGNSCL---------TQSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
            V    +G   L          +SG   + VDSG + T LP+++Y  +   F  L ++  
Sbjct: 185 KVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLP 244

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVM 308
            + + + +  CY+ SS+  ++VP +  I     S  +  +N +    + +    FCL  +
Sbjct: 245 PAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLI---QVDSAGTFCLAFV 301

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           S      IIG     G R+ +D  N  + +S +KC
Sbjct: 302 SATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|156065227|ref|XP_001598535.1| hypothetical protein SS1G_00624 [Sclerotinia sclerotiorum 1980]
 gi|154691483|gb|EDN91221.1| hypothetical protein SS1G_00624 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 482

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 133/328 (40%), Gaps = 42/328 (12%)

Query: 69  CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG---CGRKQT 125
           C   + PC     YS   +S+  YL  D          A    V  +  IG     + Q 
Sbjct: 107 CSERRSPCQTAGTYSANSSSTYAYLASDFNISYVDGSGASGDYVTDTFTIGSTTLDKLQF 166

Query: 126 GSYLDGAAPDGVMGLG--LGDVSV-----------PSLLAKAGLIQ-NSFSICFDENDS- 170
           G     ++P+G++G+G  + +V V           P+ +   GLI  N+FS+  ++ DS 
Sbjct: 167 GIGYTSSSPEGILGIGYEINEVQVGRARKSAYKNLPAQMVADGLINSNAFSLWLNDLDSS 226

Query: 171 -GSVFFGDQGPATQQST-SFLPIGEK---YDAYFVGVESYCIGNSCLTQ-SGFQALVDSG 224
            GSV FG    A        LPI ++   Y  + + +    +GN  + Q      L+DSG
Sbjct: 227 TGSVLFGGVDTARYHGQLETLPIQKESGYYAEFLITLTEVTLGNLVIAQDQSLAVLLDSG 286

Query: 225 ASFTFLP----TEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLI 278
           +S T+LP      IY +V  ++D    +  +  SL  NS    +  +S  +    D  L+
Sbjct: 287 SSLTYLPDAMAEAIYEQVDAQYDYSEGAAYVPCSLASNSSALNFTFTSPTIQVTMD-ELV 345

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLA 337
                S         F +    T  CL  ++  G+   ++G  F+    +V+D  N +++
Sbjct: 346 IPVTSS---NGQQLRFTDG---TAACLFGIAPAGESTAVLGDTFIRSAYVVYDLANNEIS 399

Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNP 365
            + +      + +  ++V    G S  P
Sbjct: 400 LAQTN----FNATATNVVEITTGTSAVP 423


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 75/314 (23%), Positives = 124/314 (39%), Gaps = 32/314 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCK------SLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +DP +SSS   VSCS P C   S+        S  + C Y A Y  + + S GYL  D +
Sbjct: 160 FDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYG-DSSFSVGYLSKDTV 218

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
              + S          +   GCG+   G +   A   G+MGL    +S+  L   A  + 
Sbjct: 219 SFGANSV--------PNFYYGCGQDNEGLFGRSA---GLMGLARNKLSL--LYQLAPTLG 265

Query: 159 NSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
            SFS C    + SG +  G   P     T  +        YF+ +    +    L  S  
Sbjct: 266 YSFSYCLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSS 325

Query: 218 Q-----ALVDSGASFTFLPTEIYAEV--VVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
           +      ++DSG   T LPT +Y  +   V      S+KR +   +    C+   + ++ 
Sbjct: 326 EYTSLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAY-SILDTCFEGQASKLR 384

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
            VP + + FS   +  +        + +G T  CL   +      IIG        +V+D
Sbjct: 385 AVPAVSMAFSGGATLKLSAGNL-LVDVDGATT-CL-AFAPARSAAIIGNTQQQTFSVVYD 441

Query: 331 RENLKLAWSHSKCE 344
            ++ ++ ++ + C 
Sbjct: 442 VKSNRIGFAAAGCS 455


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 80/325 (24%), Positives = 129/325 (39%), Gaps = 51/325 (15%)

Query: 57  SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD------ILHLASFSKHAPQS 110
           +CS  L  S S+C +   PC Y  DY  +D S++   V        +   +S SK+  + 
Sbjct: 165 TCSKSLPFSLSTCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSSSSSSSKNKVKK 222

Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----- 165
           +    +++GC    TG   +  A DGV+ LG  +VS  S    A      FS C      
Sbjct: 223 AKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFAS--HAASRFGGRFSYCLVDHLS 278

Query: 166 DENDSGSVFFGDQ-----------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
             N +  + FG             GP  +Q T  +        Y V +++  +    L  
Sbjct: 279 PRNATSYLTFGPNSALSGPCPAAAGPGARQ-TPLVLDSRMRPFYDVSIKAISVDGELLKI 337

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS 265
                    G   +VDSG S T L    Y  VV     KL    R+++  + ++YCYN +
Sbjct: 338 PRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAM--DPFEYCYNWT 395

Query: 266 S----EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIG 318
           S    +E   +P + + F+ +      +  +      G  V C+ V   +G +    +IG
Sbjct: 396 SPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPG--VKCIGVQ--EGPWPGISVIG 451

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                 H   FD +N +L +  S+C
Sbjct: 452 NILQQEHLWEFDLKNRRLRFKRSRC 476


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 147/341 (43%), Gaps = 63/341 (18%)

Query: 47  PSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           P+ SS+   + C+   C+     SR    +    C Y  +Y+     ++GYL  + L + 
Sbjct: 137 PARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVG 194

Query: 102 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
             +F K          V  GC    T + +D ++  G++GLG G +S+ S LA       
Sbjct: 195 DGTFPK----------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG----- 234

Query: 160 SFSICF--DENDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGN 209
            FS C   D  D G+  + FG     T+    QST  L  P  ++   Y+V +    + +
Sbjct: 235 RFSYCLRSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDS 294

Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
           + L         TQ+G     +VDSG + T+L  + YA V   F   +++   +   +  
Sbjct: 295 TELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA 354

Query: 259 KY----CYNASS---EEMLKVPDMRLIFSKNQSF--VVRNHIFSF-PENEG-FTVFCLTV 307
            Y    CY  S+    + ++VP + L F+    +   V+N+      +++G  TV CL V
Sbjct: 355 PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLV 414

Query: 308 MSTDGDY--GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +    D    IIG    M   +++D +    +++ + C ++
Sbjct: 415 LPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAKL 455


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 91/320 (28%), Positives = 134/320 (41%), Gaps = 42/320 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           Y+P+ SSS K V C   LC+    S C S    C Y   Y  + + + G    + L L  
Sbjct: 187 YNPALSSSYKLVGCQANLCQQLDVSGC-SRNGSCLYQVSYG-DGSYTQGNFATETLTLGG 244

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSF 161
               AP  +V     IGCG    G ++  A    ++GLG G +S PS L  + G I   F
Sbjct: 245 ----APLQNV----AIGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQLTDENGKI---F 290

Query: 162 SICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS-- 215
           S C  + DS S   + FG          + +    + D  Y+V +    +G   L+ S  
Sbjct: 291 SYCLVDRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDS 350

Query: 216 --GFQA------LVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYN 263
             G  A      +VDSG + T L T  Y  +   F      L S+  +SL    +  CY+
Sbjct: 351 VFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL----FDTCYD 406

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
            SS+E + VP +   FS   S  +    +  P +     FC     T     I+G     
Sbjct: 407 LSSKESVDVPTVVFHFSGGGSMSLPAKNYLVPVDS-MGTFCFAFAPTSSSLSIVGNIQQQ 465

Query: 324 GHRIVFDRENLKLAWSHSKC 343
           G R+ FDR N ++ ++ +KC
Sbjct: 466 GIRVSFDRANNQVGFAVNKC 485


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 129/317 (40%), Gaps = 40/317 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP SSSS  ++ C    C++   S C++ K  C Y   Y  + + + G  V + L   +
Sbjct: 197 FDPRSSSSFASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVTETLTFGN 253

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S + + V +GCG    G +        V   GL  +    L   + +  +SFS
Sbjct: 254 -------SGMINDVAVGCGHDNEGLF--------VGSAGLLGLGGGPLSLTSQMKASSFS 298

Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 213
            C  + DS S   + F    P+   +   L  G+    Y+VG+    +G   L+      
Sbjct: 299 YCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLF 358

Query: 214 ---QSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASS 266
               SG+  + VDSG + T L T+ Y  +    D  VS      + N +     CY+ SS
Sbjct: 359 QMDDSGYGGIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSS 415

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
           +  + +P +   F+  +S  +    +  P +   T FC     T     IIG     G R
Sbjct: 416 QSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTR 474

Query: 327 IVFDRENLKLAWSHSKC 343
           + +D  N  + +S  KC
Sbjct: 475 VHYDLANSVVGFSPHKC 491


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 132/333 (39%), Gaps = 52/333 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + P  S+S + + C+  LC      SC+   D C Y  +Y  + T + G    +    AS
Sbjct: 138 FAPGQSASYEPMRCAGTLCSDILHHSCER-PDTCTYRYNYG-DGTMTVGVYATERFTFAS 195

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            S     ++    +  GCG    GS  +G+   G++G G   +S+ S L+        FS
Sbjct: 196 -SGGGGLTTTTVPLGFGCGSVNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFS 246

Query: 163 ICFDENDS--------GSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
            C     S        GS+  G  G AT   Q+T  L   +    Y+V      +G   L
Sbjct: 247 YCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRL 306

Query: 213 T--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN------ 256
              +S F          +VDSG + T LP  + AEVV  F + +     +  GN      
Sbjct: 307 RIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLP-FANGGNPEDGVC 365

Query: 257 -----SWKYCYNASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
                +W+    +SS   + VP M L F   +     RN++    ++      CL +  +
Sbjct: 366 FLVPAAWR---RSSSTSQMPVPRMVLHFQGADLDLPRRNYVL---DDHRRGRLCLLLADS 419

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             D   IG       R+++D E   L+ + ++C
Sbjct: 420 GDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 80/320 (25%), Positives = 129/320 (40%), Gaps = 40/320 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPS S S   + C  PLC+   S  C    + C Y   Y     +   +  + +    +
Sbjct: 172 FDPSKSKSFAGIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETL----T 227

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           F + A        V IGCG    G ++  A    ++GLG G +S P+         N FS
Sbjct: 228 FRRAA-----VPRVAIGCGHDNEGLFVGAAG---LLGLGRGGLSFPT--QTGTRFNNKFS 277

Query: 163 ICFDENDS----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CL 212
            C  +  +     S+ FGD   A  ++  F P+    K D  Y+V +    +G +    +
Sbjct: 278 YCLTDRTASAKPSSIVFGDS--AVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGI 335

Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           + S F+         ++DSG S T L    Y  +   F    S  + + + + +  CY+ 
Sbjct: 336 SASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDL 395

Query: 265 SSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
           S    +KVP + L F   + S    N++    +N G   FC     T     IIG     
Sbjct: 396 SGLSEVKVPTVVLHFRGADVSLPAANYLVPV-DNSG--SFCFAFAGTMSGLSIIGNIQQQ 452

Query: 324 GHRIVFDRENLKLAWSHSKC 343
           G R+VFD    ++ ++   C
Sbjct: 453 GFRVVFDLAGSRVGFAPRGC 472


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 80/324 (24%), Positives = 122/324 (37%), Gaps = 54/324 (16%)

Query: 67  SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
           S C S   P  Y   Y+  D S    L  D L L + +   P +    +   GC     G
Sbjct: 165 SECSSFSCPPFY---YAYGDGSLVARLYRDSLSLPTPAPSPPINV--RNFTFGCAHTTLG 219

Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGS--------VFFGD 177
                  P GV G G G +S+PS LA  +  + N FS C   +   +        +  G 
Sbjct: 220 E------PVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGR 273

Query: 178 --QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF----------QALVDSGA 225
              G      TS L   +    Y VG+    +GN  +    F            +VDSG 
Sbjct: 274 YYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGT 333

Query: 226 SFTFLPTEIYAEVVVKFD----KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
           +FT LP  +Y  VV +F+    K+ +  R   +      CY    E  + VP + L F  
Sbjct: 334 TFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGLSPCY--YYENSVGVPRVVLHFVG 391

Query: 282 NQSFVV---RNHIFSFPE------NEGFTVFCLTVMS-------TDGDYGIIGQNFMMGH 325
            +S VV   +N+ + F +           V CL +M+         G    +G     G 
Sbjct: 392 EKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGF 451

Query: 326 RIVFDRENLKLAWSHSKCEEVIDK 349
            +V+D E  ++ ++  +C  + D 
Sbjct: 452 EVVYDLEKNRVGFARRQCSTLWDN 475


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 140/345 (40%), Gaps = 60/345 (17%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + +DP+ S+S + + CS P C +R+       SC S  + C     Y+ + +SS G L  
Sbjct: 67  TTFDPTRSTSYQTIPCSSPTCTNRTQDFPIPASCDS-NNLCHATLSYA-DASSSDGNLAS 124

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
           D+ H+         SS  S ++ GC      S  D  +   G+MG+  G +S  S L   
Sbjct: 125 DVFHIG--------SSDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP 176

Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
                 FS C    D SG +  G+            P  Q ST  LP  ++  AY V +E
Sbjct: 177 -----KFSYCISGTDFSGLLLLGESNLTWSVPLNYTPLIQISTP-LPYFDRV-AYTVQLE 229

Query: 204 SYCIGNSCL--TQSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
              + +  L   +S F        Q +VDSG  FTFL   +Y  +   F    SS    L
Sbjct: 230 GIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVL 289

Query: 254 QGNSWKY------CYNA--SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTV 302
           +   + +      CY    S   +  +P + L+F   +  V  + +      E     +V
Sbjct: 290 EDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSV 349

Query: 303 FCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
            CL+  ++D    +  +IG +      + FD E  ++  +  +C+
Sbjct: 350 HCLSFGNSDLLGVEAYVIGHHHQQNVWMEFDLEKSRIGLAQVRCD 394


>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
          Length = 142

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 4/125 (3%)

Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGD 177
           CG KQ        +P DG++GLG+G     + L    +I  N    C      G ++ GD
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGD 60

Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
             P ++  T ++P+ E    Y  G+    I N  +     F+A+ DSG+++T +P +IY 
Sbjct: 61  FNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119

Query: 237 EVVVK 241
           E+V K
Sbjct: 120 EIVSK 124


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 80/348 (22%), Positives = 141/348 (40%), Gaps = 72/348 (20%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
           + +DPS SSS   + C+HPLCK R    +L   C      + + +  + T + G LV + 
Sbjct: 124 ASFDPSLSSSFYVLPCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREK 183

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L  +        S     +I+GC  +   +        G++G+ LG +S P   AK    
Sbjct: 184 LAFSP-------SQTTPPLILGCSSESRDA-------RGILGMNLGRLSFP-FQAKV--- 225

Query: 158 QNSFSICF------DEND--SGSVFFGDQG-------------PATQQSTSFLPIGEKYD 196
              FS C       + N+  +GS + G+               P +Q+  +  P+     
Sbjct: 226 -TKFSYCVPTRQPANNNNFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPL----- 279

Query: 197 AYFVGVESYCIGNSCLT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 245
           AY V ++   IG   L             SG Q +VDSG+ FTFL    Y  V  +  ++
Sbjct: 280 AYTVPMQGIRIGGRKLNIPPSVFRPNAGGSG-QTMVDSGSEFTFLVDVAYDRVREEIIRV 338

Query: 246 VSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFT 301
           +    K+  + G     C++ ++ E+ + + D+   F K    VV +  + +   + G  
Sbjct: 339 LGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVEIVVPKERVLA---DVGGG 395

Query: 302 VFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           V C+ +  ++       IIG        + FD  N ++ +  + C  +
Sbjct: 396 VHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCSRL 443


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 73/329 (22%), Positives = 128/329 (38%), Gaps = 34/329 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-----SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           YD SSSSS + + C+   C+       SSC  +   PC Y   YS + + ++G L  + +
Sbjct: 104 YDKSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYS-DQSRTTGILAYETI 162

Query: 99  HLASFSK-------HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
            + S  +       H  +     +V +GC R+  G+   GA+  GV+GLG G +S+ +  
Sbjct: 163 SMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQT 220

Query: 152 AKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 206
               L    FS C  +   GS    F   G    +  +  PI     A   Y+V V    
Sbjct: 221 RHTAL-GGIFSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVA 279

Query: 207 IGNSCL-----TQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
           +    +     +  G         + DSG + ++L    Y++V+   +  +   R     
Sbjct: 280 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 339

Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
             ++ CYN +  E   +P + + F       +  + +     E      L  ++T     
Sbjct: 340 EGFELCYNVTRMEK-GMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSN 398

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           I+G      H I +D    ++ +  S C 
Sbjct: 399 ILGNLLQQDHHIEYDLAKARIGFKWSPCH 427


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 131/329 (39%), Gaps = 45/329 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DPSSSS+   V CS  LC     S+C S    C Y   Y  + +S+ G L  +   L  
Sbjct: 142 FDPSSSSTYATVPCSSALCSDLPTSTCTSASK-CGYTYTYG-DASSTQGVLASETFTLGK 199

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
             K  P       V  GCG    G  +  GA   G++GLG G +   SL+++ GL  + F
Sbjct: 200 EKKKLP------GVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--DKF 245

Query: 162 SICFDENDSGS----VFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGN 209
           S C    D G     +  G    A          Q+T  +    +   Y+V +    +G+
Sbjct: 246 SYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGS 305

Query: 210 SCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
           + +T   S F          +VDSG S T+L  + Y  +   F   ++   +        
Sbjct: 306 TRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLD 365

Query: 260 YCYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
            C+   A   + ++VP + L F       +    +   ++      CLTV  + G   II
Sbjct: 366 LCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSAS-GALCLTVAPSRG-LSII 423

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           G       + V+D     L+++  +C ++
Sbjct: 424 GNFQQQNFQFVYDVAGDTLSFAPVQCNKL 452


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 118/298 (39%), Gaps = 26/298 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SS+  NVSC+ P C   +        C Y   Y  + + S G+   D L L+S+ 
Sbjct: 223 FDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 280

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 281 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 328

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGF 217
           C     +G+ +      +   +++ L      D     Y+VG+    +G   L+  QS F
Sbjct: 329 CLPARSTGTGYLDFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF 388

Query: 218 Q---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKV 272
                +VDSG   T LP   Y+ +   F   ++++  + +   +    CY+ +    + +
Sbjct: 389 ATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAI 448

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           P + L+F       V      +  +              GD GI+G   +    + +D
Sbjct: 449 PTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 506


>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 413

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 79/319 (24%), Positives = 129/319 (40%), Gaps = 45/319 (14%)

Query: 56  VSCSHPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           VS   PLC + SS      K+  D C Y  +Y+ +  SS G LV D++ +    +     
Sbjct: 103 VSREDPLCAALSSLGKFIFKNPNDQCAYEVEYA-DHGSSVGVLVKDLVPM----RLTNGK 157

Query: 111 SVQSSVIIGCGRKQ-TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
            +  ++  GCG  Q  G      +  GV+GL     ++ S L+  G + N    C     
Sbjct: 158 RISPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRG 217

Query: 170 SGSVFFG-DQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQSGFQALVDSGASF 227
            G +FFG D  P++    S+ PI    +  Y  G          +   G     DSG+S+
Sbjct: 218 GGFLFFGGDVVPSS--GMSWTPILRNSEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSY 275

Query: 228 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML--------KVPDMRLIF 279
           T+  +++Y  +    +KL+ +    L+GN  K   +  + E+          V D+R  F
Sbjct: 276 TYFNSQVYRAI----EKLLKN---DLKGNPLKLASDDKTLELCWKGPKPFESVVDVRNFF 328

Query: 280 ---------SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHR 326
                    SKN  F +    +       F   CL ++       G+  IIG   M+   
Sbjct: 329 KPLAMSFKNSKNVQFQIPPEAYLIISE--FGNVCLGILDGSKEGMGNVNIIGDISMLNKI 386

Query: 327 IVFDRENLKLAWSHSKCEE 345
           +V+D E  ++ W+ S C  
Sbjct: 387 VVYDNERERIGWASSNCNR 405


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 120/317 (37%), Gaps = 34/317 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED-TSSSGYLVDDILHLASF 103
           +DP  SS+   VSC+   C S    +S    C Y  DY   D +S+SG L        S 
Sbjct: 122 FDPVKSSTYDTVSCASNFCSSLP-FQSCTTSCKY--DYMYGDGSSTSGAL--------ST 170

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
                 +    +V  GCG    GS+   A   G++GLG G +S+ S    + +    FS 
Sbjct: 171 ETVTVGTGTIPNVAFGCGHTNLGSF---AGAAGIVGLGQGPLSLIS--QASSITSKKFSY 225

Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEK----YDAYFVGVE------SYCIGNS 210
           C        +  +  GD   A   + + L         Y A   G+       +Y +G  
Sbjct: 226 CLVPLGSTKTSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTF 285

Query: 211 CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
            +  SG    + DSG + T+L T  +  +V      V             YC++ +    
Sbjct: 286 SIDASGQGGFILDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVAN 345

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
              P M   F      +   ++F   +  G    CL + ++ G + I+G      H IV 
Sbjct: 346 PTYPTMTFHFKGADYELPPENVFVALDTGG--SICLAMAASTG-FSIMGNIQQQNHLIVH 402

Query: 330 DRENLKLAWSHSKCEEV 346
           D  N ++ +  + CE +
Sbjct: 403 DLVNQRVGFKEANCETI 419


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 74/309 (23%), Positives = 126/309 (40%), Gaps = 32/309 (10%)

Query: 62  LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
           +C   + CK+    CPY   Y     ++  + ++      + S   P+     +V+ GCG
Sbjct: 60  VCLVTNPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCG 119

Query: 122 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFG 176
               G +   A    ++GLG G +S  S L    L  +SFS C      D N S  + FG
Sbjct: 120 HWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDANVSSKLIFG 174

Query: 177 -DQGPATQQSTSF--LPIGEKYDA---YFVGVESYCIGNSCL----------TQSGFQAL 220
            D+   +    +F  L  G++      Y+V ++S  +G   +          T      +
Sbjct: 175 EDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTI 234

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
           +DSG + ++     Y  +   F   V    +       + CYN +  E   +PD  ++FS
Sbjct: 235 IDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFS 294

Query: 281 KNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLA 337
                +F V N+   F E E   V CL ++ T      IIG        I++D +  +L 
Sbjct: 295 DGAVWNFPVENY---FIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLG 351

Query: 338 WSHSKCEEV 346
           ++ +KC +V
Sbjct: 352 FAPTKCADV 360


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 131/317 (41%), Gaps = 40/317 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP SSSS  ++ C    C++   S C++ K  C Y   Y  + + + G  V + L   +
Sbjct: 197 FDPRSSSSFASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVIETLTFGN 253

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S + ++V +GCG    G +        V   GL  +   SL   + +  +SFS
Sbjct: 254 -------SGMINNVAVGCGHDNEGLF--------VGSAGLLGLGGGSLSLTSQMKASSFS 298

Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 213
            C  + DS S   + F    P+   +   L  G+    Y+VG+    +G   L+      
Sbjct: 299 YCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLF 358

Query: 214 ---QSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASS 266
               SG+  + VDSG + T L T+ Y  +    D  VS      + N +     CY+ SS
Sbjct: 359 QMDDSGYGGIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSS 415

Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
           +  + +P +   F+  +S  +    +  P +   T FC     T     IIG     G R
Sbjct: 416 QSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTR 474

Query: 327 IVFDRENLKLAWSHSKC 343
           + +D  N  + +S  KC
Sbjct: 475 VHYDLANSVVGFSPHKC 491


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 77/333 (23%), Positives = 131/333 (39%), Gaps = 54/333 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + P  S+S + + C+  LC       C+ + D C Y  +Y     +   Y  +      +
Sbjct: 144 FAPGESASYEPMRCAGQLCSDILHHGCE-MPDTCTYRYNYGDGTMTMGVYATERF----T 198

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
           F+       +   +  GCG    GS  +G+   G++G G   +S+ S L+        FS
Sbjct: 199 FTSSGGDRLMTVPLGFGCGSMNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFS 250

Query: 163 ICFDENDSG---SVFFGD-----QGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
            C     SG   ++ FG       G AT   Q+T  L   +    Y+V +    +G   L
Sbjct: 251 YCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRL 310

Query: 213 T--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN------ 256
              +S F          +VDSG + T LP  + AEVV  F + +     +  GN      
Sbjct: 311 RIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLP-FANGGNPEDGVC 369

Query: 257 -----SWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMST 310
                +W+    +SS   + VP M   F   +     RN++    ++      CL +  +
Sbjct: 370 FLVPAAWR---RSSSTSQVPVPRMVFHFQDADLDLPRRNYVL---DDHRKGRLCLLLADS 423

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             D   IG       R+++D E   L+++ ++C
Sbjct: 424 GDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 71/320 (22%), Positives = 127/320 (39%), Gaps = 46/320 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DPS SS+   ++C+   C+       + C S    C Y  +Y+ + + S G   ++ L 
Sbjct: 175 FDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYA-DGSHSRGVYSNETLT 233

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           L      AP  +V+     GCGR Q G        DG++GLG   VS+  ++  + +   
Sbjct: 234 L------APGITVE-DFHFGCGRDQRGP---SDKYDGLLGLGGAPVSL--VVQTSSVYGG 281

Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY-----DAYFVGVESYCIGNSCL-- 212
           +FS C    +S + F     P +   ++F+    ++       Y V +    +G   L  
Sbjct: 282 AFSYCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHI 341

Query: 213 TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
            QS F+   ++DSG   T LP   Y  +     K + +  + +  + +  CYN +    +
Sbjct: 342 PQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPL-VPSDDFDTCYNFTGYSNI 400

Query: 271 KVPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
            VP +   FS   +        ++ N   +F E+             D   GIIG     
Sbjct: 401 TVPRVAFTFSGGATIDLDVPNGILVNDCLAFQES-----------GPDDGLGIIGNVNQR 449

Query: 324 GHRIVFDRENLKLAWSHSKC 343
              +++D     + +    C
Sbjct: 450 TLEVLYDAGRGNVGFRAGAC 469


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 77/320 (24%), Positives = 134/320 (41%), Gaps = 40/320 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P+ S S  N+ C  PLC+   S  C + K  C Y   Y  + + + G    + L    
Sbjct: 189 FNPTKSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYG-DGSFTYGEFSTETLTF-- 245

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                 + +    V +GCG    G ++  A    ++GLG G +S PS + +       FS
Sbjct: 246 ------RGTRVGRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFS 294

Query: 163 ICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CL 212
            C  +  + S    + FGD   A  ++  F P+    K D  Y+V +    +G +    +
Sbjct: 295 YCLVDRSASSKPSYMVFGDS--AISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGI 352

Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           T S F+         ++DSG S T L    Y  +   F    S+ + + + + +  C++ 
Sbjct: 353 TASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDL 412

Query: 265 SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
           S +  +KVP + L F   + S    N++    +N G   FC     T     I+G     
Sbjct: 413 SGKTEVKVPTVVLHFRGADVSLPASNYLIPV-DNSG--SFCFAFAGTMSGLSIVGNIQQQ 469

Query: 324 GHRIVFDRENLKLAWSHSKC 343
           G R+V+D    ++ ++   C
Sbjct: 470 GFRVVYDLAASRVGFAPRGC 489


>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 547

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 73/161 (45%), Gaps = 11/161 (6%)

Query: 45  YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           + P  SS+S    CS   C    +SC    + C Y   Y  E +S+SG+L +D+L +   
Sbjct: 123 FKPELSSTSSTFGCSDARCFCGANSCSCNNEQCGYSIRY-LEGSSTSGFLAEDMLAVGDG 181

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
              A       + + GC + ++G  L     DGV G+G    S+   L + G+I ++FS+
Sbjct: 182 GPAA-------NFVFGCAQSESG-LLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSM 233

Query: 164 CFDENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVE 203
           CF     G +  G+   PA   +    P+    + + + +E
Sbjct: 234 CFGAPREGVLLLGNVALPADAPAPVVTPVVGNTNKFNIQIE 274


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 133/337 (39%), Gaps = 50/337 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLK-DPCPYIADYSTED-----TSSSGYLVDD 96
           +DP  S+S   ++   P C++  RS     K   C Y   Y   D     ++S G LV++
Sbjct: 183 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEE 242

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L  A   +       Q+ + IGCG    G  L GA   G++GL  G +S+P  +A  G 
Sbjct: 243 TLTFAGGVR-------QAYLSIGCGHDNKG--LFGAPAAGILGLSRGQISIPHQIAFLGY 293

Query: 157 IQNSFSICFDENDSG------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVES 204
              SFS C  +  SG      ++ FG     T    SF P      +   Y    +GV  
Sbjct: 294 -NASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSV 352

Query: 205 YCIGNSCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQ 254
             +    +T+   Q          ++DSG + T L    Y      F    +   ++S  
Sbjct: 353 GGVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTG 412

Query: 255 GNS--WKYCYNASSEEML----KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLT 306
           G S  +  CY       L    KVP + + F+     S   +N++ +  ++ G   F   
Sbjct: 413 GPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITV-DSRGTVCFAF- 470

Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             + D    +IG     G R+V+D    ++ ++ + C
Sbjct: 471 AGTGDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 129/318 (40%), Gaps = 41/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DP +SS+  +V CS   C        + S+C S  + C Y A Y  + + S GYL  D 
Sbjct: 177 FDPRASSTYTSVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGYLSTDT 234

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +   S S          S   GCG+   G +   A   G++GL    +S+   LA +  +
Sbjct: 235 VSFGSTSY--------PSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--L 281

Query: 158 QNSFSICFDENDSGSVFFGDQGP-ATQQSTSFLPIGEK-YDA--YFVGVESYCIGNSCLT 213
             SFS C     + S  +   GP  T    S+ P+     DA  YF+ +    +G S L 
Sbjct: 282 GYSFSYCLPT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLA 339

Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                 S    ++DSG   T LPT ++  +     + ++  + +   +    C+   + +
Sbjct: 340 VSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ 399

Query: 269 MLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
            L+VP + + F+   S     RN +    ++      CL    TD    IIG        
Sbjct: 400 -LRVPTVVMAFAGGASMKLTTRNVLIDVDDS----TTCLAFAPTDST-AIIGNTQQQTFS 453

Query: 327 IVFDRENLKLAWSHSKCE 344
           +++D    ++ +S   C 
Sbjct: 454 VIYDVAQSRIGFSAGGCS 471


>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
          Length = 411

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 62/300 (20%), Positives = 123/300 (41%), Gaps = 37/300 (12%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
           K+ C Y   Y     SS G L+ D     SFS  A   +  +S+  GCG  Q  +  +  
Sbjct: 112 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 164

Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
            P +G++GLG G V++ S L   G+I ++    C      G +FFGD    T   T + P
Sbjct: 165 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 223

Query: 191 IGEKYDAYFVGVESYCIGN---SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
           +  ++  Y     +    +   S ++ +  + + DSGA++T+   + Y   +      +S
Sbjct: 224 MNREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLS 283

Query: 248 SK-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIF 292
            +      +  +  +   C+    +++  + +++  F           K  +  +    +
Sbjct: 284 KECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHY 342

Query: 293 SFPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
                EG    CL ++    ++       +IG   M+   +++D E   L W + +C+ +
Sbjct: 343 LIISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 400


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 77/321 (23%), Positives = 132/321 (41%), Gaps = 42/321 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P  S S   + CS PLC+    S C + +  C Y   Y  + + ++G    + L    
Sbjct: 152 FNPYKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYG-DGSFTTGDFATETLTF-- 208

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSF 161
                 + +  + V +GCG    G ++  A    ++GLG G +S PS   + G+   + F
Sbjct: 209 ------RGNKIAKVALGCGHHNEGLFVGAAG---LLGLGRGRLSFPS---QTGIRFNHKF 256

Query: 162 SICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SC 211
           S C  +  + S    + FGD   A  +   F P+    K D  Y+VG+    +G      
Sbjct: 257 SYCLVDRSASSKPSSMVFGDA--AISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRG 314

Query: 212 LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
           ++ S F+         ++DSG S T L    Y  +   F       +   + + +  CY+
Sbjct: 315 VSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYD 374

Query: 264 ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
            S +  +KVP + L F   + +    N++    EN  F   C     T     IIG    
Sbjct: 375 LSGQSSVKVPTVVLHFRGADMALPATNYLIPVDENGSF---CFAFAGTISGLSIIGNIQQ 431

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
            G R+V+D    ++ ++   C
Sbjct: 432 QGFRVVYDLAGSRIGFAPRGC 452


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 111/292 (38%), Gaps = 59/292 (20%)

Query: 108 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-- 164
           P + + ++   GC           A P GV G G G +S+P+ LA  +  + N FS C  
Sbjct: 208 PTNLIVNNFTFGCAHTAL------AEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLV 261

Query: 165 -------------------FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
                              +D ++      G   P     TS L   E    Y VG+E  
Sbjct: 262 SHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVGLEGI 320

Query: 206 CIGNSCLTQSGFQA----------LVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRI 251
            IG   +   GF            +VDSG +FT LP  +Y  VV +F+  V       R+
Sbjct: 321 SIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARV 380

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF-----PENEGFTVF 303
             +      CY   +  +     + L F  N S VV   RN+ + F      + +   V 
Sbjct: 381 IEEDTGLSPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVG 439

Query: 304 CLTVMS-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
           CL +M+       + G    +G     G  +V+D EN ++ ++  +C  + +
Sbjct: 440 CLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASLWE 491


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 130/345 (37%), Gaps = 39/345 (11%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCP 77
           L W    + A    +R +   D   S S K V C    CK       S ++C +   PC 
Sbjct: 107 LTWVNCRYRARGKDNRRVFRAD--ESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCS 164

Query: 78  YIADYSTEDTSSS-GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDG 136
           Y  DY   D S++ G    + + +   +    +       +IGC    TG    GA  DG
Sbjct: 165 Y--DYRYADGSAAQGVFAKETITVGLTNGRMARLPGH---LIGCSSSFTGQSFQGA--DG 217

Query: 137 VMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQ--QSTSFL 189
           V+GL   D S  S      L    FS C      ++N S  + FG         + T+ L
Sbjct: 218 VLGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPL 275

Query: 190 PIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVK 241
            +      Y + V    +G   L          SG   ++DSG S T L    Y +VV  
Sbjct: 276 DLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTG 335

Query: 242 FDK-LVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG 299
             + LV  KR+  +G   +YC++ +S   + K+P +         F    H  S+  +  
Sbjct: 336 LARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARF--EPHRKSYLVDAA 393

Query: 300 FTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             V CL  +S       +IG      +   FD     L+++ S C
Sbjct: 394 PGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 438


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 79/352 (22%), Positives = 141/352 (40%), Gaps = 71/352 (20%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKD-------PCP-YIADYSTEDTSS 89
           +  + P +SSSS+ + C +P C+    +   C+           PCP YI  Y     S+
Sbjct: 137 IPRFIPKNSSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGL--GST 194

Query: 90  SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
           +G L+ + L         P  +V    ++GC      S +    P G+ G G G  S+PS
Sbjct: 195 AGILISEKLDF-------PDLTVPD-FVVGC------SVISTRTPAGIAGFGRGPESLPS 240

Query: 150 LLAKAGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEK------- 194
            +          S  FD+         D+GS   G +  +     S+ P  +        
Sbjct: 241 QMKLKSFSHCLVSRRFDDTNVTTDLGLDTGS---GHKSGSKTPGLSYTPFRKNPNVSNTA 297

Query: 195 -YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
             + Y++ +    +G+  +          T     ++VDSG++FTF+   ++  V  +F 
Sbjct: 298 FLEYYYLNLRRIYVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFA 357

Query: 244 KLVS--SKRISLQGNSW-KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEG 299
             +S  ++   L+  S    C+N S +  + VP++   F       +  ++ FSF  N  
Sbjct: 358 TQMSNYTREKDLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNA- 416

Query: 300 FTVFCLTVMSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
               CLTV+S +        G   I+G      + + +D EN +  ++  KC
Sbjct: 417 -DTVCLTVVSDNTVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 74/319 (23%), Positives = 126/319 (39%), Gaps = 35/319 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +DPS SSS  N++C+  LC        +S C S    C Y   Y  + T S G+L  + L
Sbjct: 179 FDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKST-SVGFLSQERL 237

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            + +       + +    + GCG+   G +   A   G++GLG   +S   +   + +  
Sbjct: 238 TITA-------TDIVDDFLFGCGQDNEGLFSGSA---GLIGLGRHPISF--VQQTSSIYN 285

Query: 159 NSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL- 212
             FS C     S  G + FG    AT  +  + P+     D  F G++     +G + L 
Sbjct: 286 KIFSYCLPSTSSSLGHLTFG-ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLP 344

Query: 213 --TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
             + S F A   ++DSG   T L    YA +   F + +    ++ +   +  CY+ S  
Sbjct: 345 AVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGY 404

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQNFMMGH 325
           + + VP +   F+      V   +            CL   +   D D  I G       
Sbjct: 405 KEISVPKIDFEFAGG--VTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTL 462

Query: 326 RIVFDRENLKLAWSHSKCE 344
            +V+D E  ++ +  + C 
Sbjct: 463 EVVYDVEGGRIGFGAAGCN 481


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 74/315 (23%), Positives = 121/315 (38%), Gaps = 30/315 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS SSS  N+ C+  LC    S  C S  D  C Y   Y  +++ S G+L  + L + 
Sbjct: 183 FDPSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYG-DNSISRGFLSQERLTIT 241

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
           +       + +    + GCG+   G +   A   G+MGL    +S   +   + +    F
Sbjct: 242 A-------TDIVHDFLFGCGQDNEGLFRGTA---GLMGLSRHPISF--VQQTSSIYNKIF 289

Query: 162 SICFDENDS--GSVFFGDQGP--ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
           S C     S  G + FG      A  + T F  I  +   Y + +    +G + L     
Sbjct: 290 SYCLPSTPSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSS 349

Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            T S   +++DSG   T LP   YA +   F + +    ++        CY+ S  + + 
Sbjct: 350 STFSAGGSIIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEIS 409

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQNFMMGHRIVF 329
           VP  R+ F       V   +      E     CL   +     D  I G        +V+
Sbjct: 410 VP--RIDFEFAGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVY 467

Query: 330 DRENLKLAWSHSKCE 344
           D E  ++ +  + C 
Sbjct: 468 DVEGGRIGFGAAGCN 482


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 111/292 (38%), Gaps = 59/292 (20%)

Query: 108 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-- 164
           P + + ++   GC           A P GV G G G +S+P+ LA  +  + N FS C  
Sbjct: 208 PTNLIVNNFTFGCAHTAL------AEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLV 261

Query: 165 -------------------FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
                              +D ++      G   P     TS L   E    Y VG+E  
Sbjct: 262 SHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVGLEGI 320

Query: 206 CIGNSCLTQSGFQA----------LVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRI 251
            IG   +   GF            +VDSG +FT LP  +Y  VV +F+  V       R+
Sbjct: 321 SIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARV 380

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF-----PENEGFTVF 303
             +      CY   +  +     + L F  N S VV   RN+ + F      + +   V 
Sbjct: 381 IEEDTGLSPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVG 439

Query: 304 CLTVMS-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
           CL +M+       + G    +G     G  +V+D EN ++ ++  +C  + +
Sbjct: 440 CLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASLWE 491


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 122/313 (38%), Gaps = 35/313 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP+ S++    SC    C       + C  LK  C YI  Y  + ++++G    D L L
Sbjct: 173 FDPAMSATYSAFSCGSAQCAQLGDEGNGC--LKSQCQYIVKYG-DGSNTAGTYGSDTLSL 229

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
            S       S    S   GC  +  G   +    DG+MGLG GD    SL+++ A     
Sbjct: 230 TS-------SDAVKSFQFGCSHRAAGFVGE---LDGLMGLG-GDTE--SLVSQTAATYGK 276

Query: 160 SFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV--ESYCIGNSCLT- 213
           +FS C      +  G +  G  G A+    S  P+       F GV  +   +  + L  
Sbjct: 277 AFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNV 336

Query: 214 -QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
             S F   ++VDSG   T LP   Y  +   F K + +   +    S   C++ S    +
Sbjct: 337 PASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTI 396

Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
            VP + L FS+  +  +      +     F     T  + DGD GI+G        ++FD
Sbjct: 397 TVPTVTLTFSRGAAMDLDISGILYAGCLAF-----TATAHDGDTGILGNVQQRTFEMLFD 451

Query: 331 RENLKLAWSHSKC 343
                + +    C
Sbjct: 452 VGGRTIGFRSGAC 464


>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
          Length = 136

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 65/135 (48%), Gaps = 10/135 (7%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDE 167
           +  +  GCG KQ        +P    DG++GLG+G     + L    +I  N    C   
Sbjct: 6   KKKIAFGCGYKQEEP---ADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGAS 226
              G ++ GD  P ++   +++P+ E    Y  G+    I N  +  +  F+A+ DSG++
Sbjct: 63  KGKGVLYVGDFNPPSR-GVTWVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121

Query: 227 FTFLPTEIYAEVVVK 241
           +T +P +IY E+V K
Sbjct: 122 YTHVPAQIYNEIVSK 136


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 121/319 (37%), Gaps = 50/319 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           YDPS SS+   V C+  +CK        S C S K  C +   Y+ + TS+ G    D L
Sbjct: 157 YDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQDKL 214

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            L      AP + VQ +   GCG    G +      DGV+GLG       SL A+ G + 
Sbjct: 215 TL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGR---LRESLGARYGGV- 260

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSC--LT 213
             FS C     S   F             F P+G    +     V +    +G     L 
Sbjct: 261 --FSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLR 318

Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            S F    +VDSG   T L +  Y  +   F K + + R+   G+    CYN +  + + 
Sbjct: 319 PSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD-LDTCYNLTGYKNVV 377

Query: 272 VPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           VP + L F+   +        ++ N   +F E+             DG  G++G      
Sbjct: 378 VPKIALTFTGGATINLDVPNGILVNGCLAFAES-----------GPDGSAGVLGNVNQRA 426

Query: 325 HRIVFDRENLKLAWSHSKC 343
             ++FD    K  +    C
Sbjct: 427 FEVLFDTSTSKFGFRAKAC 445


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 78/334 (23%), Positives = 128/334 (38%), Gaps = 48/334 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCK--------SRSSCKSL---KDPCPYIADYSTEDTSSSGYL 93
           +DP+ S++   V C+   C         +  SC S     + C Y   Y  + + S G L
Sbjct: 190 FDPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYG-DGSFSRGVL 248

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
             D + L   S            + GCG    G +       G+MGLG  ++S+ S  A 
Sbjct: 249 ATDTVALGGAS--------LGGFVFGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAS 297

Query: 154 AGLIQNSFSICFDENDSG------SVFFGDQGPATQQSTSFLPIG--------EKYDAYF 199
                  FS C     SG      S+  GD   ++ ++T+  P+          +   YF
Sbjct: 298 --RYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNTT--PVAYTRMIADPAQPPFYF 353

Query: 200 VGVESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQG 255
           + V    +G + L   G  A   L+DSG   T L   +Y  V  +F  +  ++   +  G
Sbjct: 354 LNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPG 413

Query: 256 NS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDG 312
            S    CY+ +  + +KVP + L         V      F   +  +  CL +  +S + 
Sbjct: 414 FSILDTCYDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYED 473

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +  IIG       R+V+D    +L ++   C  V
Sbjct: 474 ETPIIGNYQQKNKRVVYDTLGSRLGFADEDCNYV 507


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 82/319 (25%), Positives = 121/319 (37%), Gaps = 50/319 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           YDPS SS+   V C+  +CK        S C S K  C +   Y+ + TS+ G    D L
Sbjct: 123 YDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQDKL 180

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            L      AP + VQ +   GCG    G +      DGV+GLG       SL A+ G + 
Sbjct: 181 TL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGR---LRESLGARYGGV- 226

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSC--LT 213
             FS C     S   F             F P+G    +     V +    +G     L 
Sbjct: 227 --FSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLR 284

Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            S F    +VDSG   T L +  Y  +   F K + + R+   G+    CYN +  + + 
Sbjct: 285 PSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD-LDTCYNLTGYKNVV 343

Query: 272 VPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           VP + L F+   +        ++ N   +F E+             DG  G++G      
Sbjct: 344 VPKIALTFTGGATINLDVPNGILVNGCLAFAES-----------GPDGSAGVLGNVNQRA 392

Query: 325 HRIVFDRENLKLAWSHSKC 343
             ++FD    K  +    C
Sbjct: 393 FEVLFDTSTSKFGFRAKAC 411


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 61/260 (23%), Positives = 110/260 (42%), Gaps = 46/260 (17%)

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFG---DQGP 180
           A P GV G G G +S+P+ LA +  +   FS C   +         S  +  G   D   
Sbjct: 231 AEPVGVAGFGRGPLSLPAQLAPS--LSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAA 288

Query: 181 ATQQSTSFL--PI--GEKYDAYF-VGVESYCIGNSCL----------TQSGFQALVDSGA 225
                T F+  P+    K+  ++ V +E+  +G   +                 +VDSG 
Sbjct: 289 IGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGT 348

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-----CYNASSEEMLKVPDMRLIFS 280
           +FT LP++ +A V  +F + +++ R +    +        CY+ S  +   VP + L F 
Sbjct: 349 TFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFR 407

Query: 281 KNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD----------YGIIGQNFMMGHRIV 328
            N +  +  RN+   F   EG +V CL +M+  G+           G +G     G  +V
Sbjct: 408 GNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVV 467

Query: 329 FDRENLKLAWSHSKCEEVID 348
           +D +  ++ ++  +C ++ D
Sbjct: 468 YDVDAGRVGFARRRCTDLWD 487


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/270 (22%), Positives = 113/270 (41%), Gaps = 46/270 (17%)

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFG---DQGP 180
           A P GV G G G +S+P+ LA +  +   FS C   +         S  +  G   D   
Sbjct: 231 AEPVGVAGFGRGPLSLPAQLAPS--LSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAA 288

Query: 181 ATQQSTSFL--PI--GEKYDAYF-VGVESYCIGNSCL----------TQSGFQALVDSGA 225
                T F+  P+    K+  ++ V +E+  +G   +                 +VDSG 
Sbjct: 289 IGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGT 348

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-----CYNASSEEMLKVPDMRLIFS 280
           +FT LP++ +A V  +F + +++ R +    +        CY+ S  +   VP + L F 
Sbjct: 349 TFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFR 407

Query: 281 KNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD----------YGIIGQNFMMGHRIV 328
            N +  +  RN+   F   EG +V CL +M+  G+           G +G     G  +V
Sbjct: 408 GNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVV 467

Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
           +D +  ++ ++  +C ++ D     ++  P
Sbjct: 468 YDVDAGRVGFARRRCTDLWDTLSRRIIDQP 497


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 81/179 (45%), Gaps = 22/179 (12%)

Query: 119 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFFGD 177
           GCGR   G +  G+  DG++GLG G +S  S  A     +  FS C  +EN  GS+ FG+
Sbjct: 224 GCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLFGE 279

Query: 178 QGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDSG 224
           +  +   S  F  +         E+   YFV +    +GN  L    S F +   ++DSG
Sbjct: 280 KATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSG 339

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
              T LP   Y+ +   F K ++   +S     + +    CYN S  + + +P+  L F
Sbjct: 340 TVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHF 398


>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 76/317 (23%), Positives = 122/317 (38%), Gaps = 40/317 (12%)

Query: 56  VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           V C  PLC +  S     C    + C Y  +Y+ +  SS G L+ D + L    K    S
Sbjct: 114 VKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNIPL----KFTNGS 168

Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
             +  +  GCG  QT     G  P     GV+GLG G  S+ S L   GLI+N    C  
Sbjct: 169 LARPMLAFGCGYDQTHH---GQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLS 225

Query: 167 ENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA 225
               G +FFGDQ  P +    + L        Y  G           +  G + + DSG+
Sbjct: 226 GRGGGFLFFGDQLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLELIFDSGS 285

Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WK--YCYNASSEEMLKVPDMRL 277
           S+T+  ++ +  +V      +  K +S           WK    + +  +       + L
Sbjct: 286 SYTYFNSQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFKPLLL 345

Query: 278 IFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIV 328
            F+K+     +N     P      V      CL ++       G+  IIG   +    ++
Sbjct: 346 SFTKS-----KNSPLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVI 400

Query: 329 FDRENLKLAWSHSKCEE 345
           +D E  ++ W+ + C+ 
Sbjct: 401 YDNEKQQIGWASANCDR 417


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 130/352 (36%), Gaps = 53/352 (15%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCP 77
           L W    + A    +R +   D   S S K V C    CK       S ++C +   PC 
Sbjct: 129 LTWVNCRYRARGKDNRRVFRAD--ESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCS 186

Query: 78  YIADYSTEDTSSS-GYLVDDIL-------HLASFSKHAPQSSVQSSVIIGCGRKQTGSYL 129
           Y  DY   D S++ G    + +        +A    H          +IGC    TG   
Sbjct: 187 Y--DYRYADGSAAQGVFAKETITVGLTNGRMARLPGH----------LIGCSSSFTGQSF 234

Query: 130 DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQ- 183
            GA  DGV+GL   D S  S      L    FS C      ++N S  + FG        
Sbjct: 235 QGA--DGVLGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTA 290

Query: 184 -QSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEI 234
            + T+ L +      Y + V    +G   L          SG   ++DSG S T L    
Sbjct: 291 FRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAA 350

Query: 235 YAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIF 292
           Y +VV    + LV  KR+  +G   +YC++ +S   + K+P +         F    H  
Sbjct: 351 YKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARF--EPHRK 408

Query: 293 SFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           S+  +    V CL  +S       +IG      +   FD     L+++ S C
Sbjct: 409 SYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460


>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
          Length = 157

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/126 (23%), Positives = 58/126 (46%), Gaps = 1/126 (0%)

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLI 278
           ++DSG   T LP  +Y  +   F +++S K     G S    C+  + +EM +VP++++I
Sbjct: 32  IIDSGTVITRLPMPVYTALKNSFVRIMSKKYAQAPGISILDTCFKGNVKEMSEVPEIQMI 91

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F       ++ H      ++G T   +   S +    IIG       ++ +D  N K+ +
Sbjct: 92  FGGGADLPLKAHNTLIELDKGVTCLAIAGSSENNPIAIIGNYQQQTFKVAYDVANSKIGF 151

Query: 339 SHSKCE 344
           +   C+
Sbjct: 152 AAGGCQ 157


>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
          Length = 142

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 64/131 (48%), Gaps = 4/131 (3%)

Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGD 177
           CG KQ        +P DG++GLG+G     + L    +I  N    C      G ++ G+
Sbjct: 1   CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGN 60

Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
             P ++  T ++P+ E    Y  G+    I N  +     F+A+ DSG+++T +P++IY 
Sbjct: 61  FNPPSRGVT-WVPMRESSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYN 119

Query: 237 EVVVKFDKLVS 247
           E+V K    +S
Sbjct: 120 EIVSKVRGTLS 130


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 133/316 (42%), Gaps = 38/316 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++PS SSS   ++C    CKS    +   D C Y   Y         Y V D    A+ +
Sbjct: 197 FEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSY-----GDGSYTVGD---FATET 248

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                S+  ++V IGCG    G ++  A    ++GLG G +S PS +  +     SFS C
Sbjct: 249 ITLDGSASLNNVAIGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQINAS-----SFSYC 300

Query: 165 F---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 219
               D + + ++ F    P+   +   L   +    Y++G+    +G   L+  +S F+ 
Sbjct: 301 LVNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEV 360

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
                   +VDSG + T L +++Y  +   F +    L S+  ++L    +  CY+ SS 
Sbjct: 361 DESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL----FDTCYDLSSR 416

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
             ++VP +   F   +   +    +  P +   T FC     T     IIG     G R+
Sbjct: 417 SSVEVPTVSFHFPDGKYLALPAKNYLIPVDSAGT-FCFAFAPTTSALSIIGNVQQQGTRV 475

Query: 328 VFDRENLKLAWSHSKC 343
            +D  N  + +S + C
Sbjct: 476 SYDLSNSLVGFSPNGC 491


>gi|154311375|ref|XP_001555017.1| hypothetical protein BC1G_06540 [Botryotinia fuckeliana B05.10]
 gi|114149215|gb|AAR87747.3| aspartic proteinase precursor [Botryotinia fuckeliana]
 gi|347829155|emb|CCD44852.1| similar to aspartic-type endopeptidase opsB [Botryotinia
           fuckeliana]
          Length = 482

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 143/341 (41%), Gaps = 50/341 (14%)

Query: 65  SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG--- 121
           S + C    +PC     Y+   +S+  Y+  D          A    V  +  IG     
Sbjct: 103 SSTLCSRKTNPCQTAGTYTANSSSTYAYVASDFNISYVDGSGASGDYVTDTFTIGSATLD 162

Query: 122 RKQTGSYLDGAAPDGVMGLG--LGDVSV-----------PSLLAKAGLIQ-NSFSICFDE 167
           + Q G     ++P+G++G+G  + +V V           P+ +   GLI  N+FS+  ++
Sbjct: 163 KLQFGIGYTSSSPEGILGIGYEINEVQVGRAGKKAYNNLPAQMVADGLINSNAFSLWLND 222

Query: 168 ND--SGSVFFGDQGPATQQ---STSFLPIGEK---YDAYFVGVESYCIGNSCLTQ-SGFQ 218
            D  +GS+ FG  G  T Q       LPI ++   Y  + + +    +G++ + Q     
Sbjct: 223 LDASTGSILFG--GVDTAQFHGQLETLPIEKESGYYAEFLITLTEVMLGDTVIAQDQALA 280

Query: 219 ALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV-- 272
            L+DSG+S T+LP      IY +V  ++D        + +G ++  C  A++   L    
Sbjct: 281 VLLDSGSSLTYLPDAMAEAIYEQVEAQYD--------ASEGAAYVPCSLATNTSALNFTF 332

Query: 273 --PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVF 329
             P +++  ++    V           +G T  CL  ++  GD   ++G  F+    IV+
Sbjct: 333 TSPTIQVTMNELVIPVTSTTGQQLQFTDG-TAACLFGIAPAGDSTSVLGDTFIRSAYIVY 391

Query: 330 DRENLKLAWSHSK----CEEVIDKSHVHLVPPPAGQSPNPL 366
           D +N +++ + +        V++ +      P A    NP+
Sbjct: 392 DLDNNEISLAQTNFNATSTSVVEITTGTTAVPSATLVANPV 432


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/326 (23%), Positives = 125/326 (38%), Gaps = 50/326 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           +DP+ SSS   V C  P C++       ++  +    C Y   Y         Y V D  
Sbjct: 238 FDPALSSSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYG-----DGSYTVGDFA 292

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
              + +     S+    V IGCG    G ++  A    + G  L   S PS ++      
Sbjct: 293 -TETLTLGGDGSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA----- 343

Query: 159 NSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL 212
             FS C  + DS S   + FG    A+  ST   P+     +   Y+V +    +G   L
Sbjct: 344 TEFSYCLVDRDSPSASTLQFG----ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETL 399

Query: 213 T-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNS 257
           +           Q     +VDSG + T L +  Y+ +   F +    L  +  +SL    
Sbjct: 400 SDIPPAAFAMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL---- 455

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
           +  CY+ +    ++VP + L F       +    +  P  +G   +CL   +T G   I+
Sbjct: 456 FDTCYDLAGRSSVQVPAVSLRFEGGGELKLPAKNYLIPV-DGAGTYCLAFAATGGAVSIV 514

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G     G R+ FD     + +S +KC
Sbjct: 515 GNVQQQGIRVSFDTAKNTVGFSPNKC 540


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 71/301 (23%), Positives = 121/301 (40%), Gaps = 58/301 (19%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D+    +DPS SS+ K   C+ P              CPY   Y  + + + G L  + +
Sbjct: 101 DQKAPIFDPSKSSTFKETRCNTP-----------DHSCPYKLVYD-DKSYTQGTLATETV 148

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGL 156
            + S S       V    IIGC R  +GS   G  P   G++GL  G +S+ S +     
Sbjct: 149 TIHSTSG---VPFVMPETIIGCSRNNSGS---GFRPSSSGIVGLSRGSLSLISQM----- 197

Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
                         G  + GD       ST+      K   Y++ +++  +G++ +   G
Sbjct: 198 --------------GGAYPGDG----VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVG 239

Query: 217 --FQAL-----VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
             F AL     +DSG   T+ P      V    +++V++ R+     +   CY +++ E+
Sbjct: 240 TPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIEI 299

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-GDYGIIG----QNFMMG 324
              P + + FS     V+  +      N G  VFCL ++  +     I G     NF++G
Sbjct: 300 F--PVITVHFSGGADLVLDKYNMYMELNRG-GVFCLAIICNNPTQVAIFGNRAQNNFLVG 356

Query: 325 H 325
           +
Sbjct: 357 Y 357


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 82/324 (25%), Positives = 135/324 (41%), Gaps = 48/324 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASF 103
           ++P  S +   + C    C       S +  C Y   YS  D+S + G L  + +   +F
Sbjct: 124 FEPLRSKTYSPIPCESEQCSFFGYSCSPQKMCAY--SYSYADSSVTKGVLAREAI---TF 178

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNS-- 160
           S       V   +I GCG   +G++ +       M         P SL+++ G +  S  
Sbjct: 179 SSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGM------GGGPLSLVSQIGTLYGSKR 232

Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIG--EKYDAYFVGVESYCIG----- 208
           FS C      D + SG++ FG++   + +     P+   E   +Y V +E   +G     
Sbjct: 233 FSQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVR 292

Query: 209 -NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNA 264
            NS  T S    ++DSG   T++P E Y  +V +    V S  + ++ +     + CY  
Sbjct: 293 FNSSETLSKGNIMIDSGTPATYIPQEFYERLVEELK--VQSSLLPIEDDPDLGTQLCYR- 349

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQ---- 319
            SE  L+ P +   F      ++    F  P  +G  VFC  +  STDGDY I G     
Sbjct: 350 -SETNLEGPILTAHFEGADVQLLPIQTF-IPPKDG--VFCFAMAGSTDGDY-IFGNFAQS 404

Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
           N +MG    FD +   +++  + C
Sbjct: 405 NILMG----FDLDRKTISFKPTDC 424


>gi|196003874|ref|XP_002111804.1| hypothetical protein TRIADDRAFT_55203 [Trichoplax adhaerens]
 gi|190585703|gb|EDV25771.1| hypothetical protein TRIADDRAFT_55203 [Trichoplax adhaerens]
          Length = 428

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 83/347 (23%), Positives = 128/347 (36%), Gaps = 82/347 (23%)

Query: 67  SSCKSLKDPCPYIADYSTEDTSS------------------SGYLVDDILHLASFSKHAP 108
           S C  +  P P +  Y   + SS                  SG LV D+LHL     H  
Sbjct: 81  SFCGIMAAPSPVVKHYFHMNRSSTLEETNLRIDSSYVKGYWSGQLVSDMLHLG-IGLHK- 138

Query: 109 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP----------SLLAKAGLIQ 158
           Q  +Q + I      Q   + +    DG++GL    ++V            ++ +AG I+
Sbjct: 139 QVRIQFAAIT----NQKEFFTETTRFDGILGLAYPSLAVQGNFYQKPVFNEIVQQAG-IR 193

Query: 159 NSFSICFDENDSGSVFFGDQ-------------------GPATQQSTSFLPIGEKYDAYF 199
           + F++ +  +      FG+Q                   GP       + PI EKY   F
Sbjct: 194 DIFTLTYCASKMRKDLFGNQYITGGGFMTLGGIDNNLLAGPVF-----YTPIVEKYYYQF 248

Query: 200 ----VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
               V V+   IG S      + ALVDSG S    P  +Y  ++  F + +  + +   G
Sbjct: 249 QLTNVLVDGQSIGFSPYDYMHYPALVDSGTSILRFPPFMYKRLMPIFLRSIQDRSVFSHG 308

Query: 256 NSWK---YCYNASSEEMLKVPDMRLI----------FSKNQSFVV-----RNHIFSFPEN 297
             ++    C   S     + P +RL           F   + F +     +  I S  E 
Sbjct: 309 FFYRGHAVCMEESQLLQHRFPTIRLSIRLASFEKTNFKTPRQFTLVLSPMQYFILSGKER 368

Query: 298 EGFTVFCLTVMSTDGDYGII-GQNFMMGHRIVFDRENLKLAWSHSKC 343
            G   +   +  T G +GII G   M G  + FDR N  L ++ SKC
Sbjct: 369 HGKPCYHFGIAGTSGAFGIILGDVVMKGFSVTFDRVNSMLGFAVSKC 415


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 129/323 (39%), Gaps = 44/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           + P++SS+  ++ CS P C      SC +      +       D+S S  L  D L LA 
Sbjct: 138 FSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLA- 196

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSF 161
                       S   GC    +GS L    P G++GLG G +   SLL+++G L    F
Sbjct: 197 -------VDTLPSYSFGCVNAVSGSTL---PPQGLLGLGRGPM---SLLSQSGSLYSGVF 243

Query: 162 SICFDEND----SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
           S CF        SGS+  G  G P   ++T  L    +   Y+V +    +G   +    
Sbjct: 244 SYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAP 303

Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
                   +G   ++DSG   T     +YA +  +F K V     ++   ++  C+ A++
Sbjct: 304 ELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATI--GAFDTCFAATN 361

Query: 267 EEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNF 321
           E++   P +   F+  +    + N +     +   ++ CL + +   +      +I    
Sbjct: 362 EDI--APPVTFHFTGMDLKLPLENTLI---HSSAGSLACLAMAAAPNNVNSVLNVIANLQ 416

Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
               RI+FD  N +L  +   C 
Sbjct: 417 QQNLRIMFDVTNSRLGIARELCN 439


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 106/262 (40%), Gaps = 51/262 (19%)

Query: 45  YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP  SS+    SCS   C   + R +  SL   C Y   Y  + ++++G    D L L 
Sbjct: 165 FDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYG-DGSNTTGTYGSDTLALN 223

Query: 102 SFSKHAPQSSVQSSVIIGCGRK-QTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
           S  K         +   GC      G  LD    DG+MGLG G    PSL+++ A    +
Sbjct: 224 STEK-------VENFQFGCSETSDPGEGLDEDQTDGLMGLGGG---APSLVSQTAATYGS 273

Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----------------YFVGV 202
           +FS C               PAT +S+ FL +G                      YFV +
Sbjct: 274 AFSYCL--------------PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVIL 319

Query: 203 ESYCIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
           +   +G     ++ + F A  ++DSG   T LP   Y+ +   F   +     +   +  
Sbjct: 320 QGINVGGDPVAISPTVFAAGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSIL 379

Query: 259 KYCYNASSEEMLKVPDMRLIFS 280
             C++ + ++ + +P + L+FS
Sbjct: 380 DTCFDFTGQDNVSIPAVELVFS 401


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 145/338 (42%), Gaps = 59/338 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           YDP  SS+   + CS  LC+    S  +C S K+ C Y      ED   S   V  +L  
Sbjct: 137 YDPGESSTFAFLPCSDRLCQEGQFSFKNCTS-KNRCVY------EDVYGSAAAVG-VLAS 188

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +F+  A + +V   +  GCG    GS +      G++GL    +S+ + L     IQ  
Sbjct: 189 ETFTFGA-RRAVSLRLGFGCGALSAGSLIGAT---GILGLSPESLSLITQLK----IQR- 239

Query: 161 FSIC---FDENDSGSVFFGDQGPATQ-------QSTSFLPIGEKYDAYFVGVESYCIGNS 210
           FS C   F +  +  + FG     ++       Q+T+ +    K   Y+V +    +G+ 
Sbjct: 240 FSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHK 299

Query: 211 CLT----------QSGFQALVDSGASFTFLPT---EIYAEVVVKFDKLVSSKRISLQGNS 257
            L             G   +VDSG++  +L     E   E V+   +L  + R       
Sbjct: 300 RLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTV---ED 356

Query: 258 WKYCY------NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTV-MS 309
           ++ C+       A++ E ++VP + L F    + V+ R++ F  P      + CL V  +
Sbjct: 357 YELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAG---LMCLAVGKT 413

Query: 310 TDGD-YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           TDG    IIG        ++FD ++ K +++ ++C+++
Sbjct: 414 TDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 451


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 122/311 (39%), Gaps = 41/311 (13%)

Query: 49  SSSSSKNVSCSHPLCKSRSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
           +SSS K + C+   C   SS       ++ C Y  +Y  + + +SG +  D +   S   
Sbjct: 53  ASSSYKKLPCNSTHCSGMSSAGIGPRCEETCKYKYEYG-DGSRTSGDVGSDRISFRSHGA 111

Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
                S     + GCGRK  G   D     G++GLG    S+   L     +   FS C 
Sbjct: 112 GEDHRSFFDGFLFGCGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCL 166

Query: 166 DENDS-----GSVFFGDQGPATQQSTSFLPI--GEKYDA--YFVGVESYCIGNSCLT--- 213
              DS       +F G             PI  G+  D   Y+V ++S  +G   +    
Sbjct: 167 VSYDSPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYD 226

Query: 214 -QSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 260
            +SG           + ++DSG ++T L   +Y  +    ++ V    +   GNS     
Sbjct: 227 KESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTL---GNSAGLDL 283

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
           C+N+S +     P +   F+     V+   +IF     +   V CL++ S+ GD  IIG 
Sbjct: 284 CFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRD---VVCLSMDSSGGDLSIIGN 340

Query: 320 NFMMGHRIVFD 330
                  I++D
Sbjct: 341 MQQQNFHILYD 351


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/312 (23%), Positives = 126/312 (40%), Gaps = 31/312 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P+SS+S   +SC    C+S    +   + C Y   Y  + + + G  V + + L S S
Sbjct: 186 FEPASSTSYSPLSCDTKQCQSLDVSECRNNTCLYEVSYG-DGSYTVGDFVTETITLGSAS 244

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                     +V IGCG    G ++  A   G+ G  L   S PS +  +     SFS C
Sbjct: 245 V--------DNVAIGCGHNNEGLFIGAAGLLGLGGGKL---SFPSQINAS-----SFSYC 288

Query: 165 F--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQA 219
               ++DS S    +        T+ L    + D  Y+VG+    +G   L+  +S F+ 
Sbjct: 289 LVDRDSDSASTLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEM 348

Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
                   ++DSG + T L T  Y  +   F K      ++ +   +  CY+ S +  ++
Sbjct: 349 DESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVE 408

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
           VP +    +  +   +    +  P +   T FC     T     IIG     G R+ FD 
Sbjct: 409 VPTVTFHLAGGKVLPLPATNYLIPVDSDGT-FCFAFAPTSSALSIIGNVQQQGTRVGFDL 467

Query: 332 ENLKLAWSHSKC 343
            N  + +   +C
Sbjct: 468 ANSLVGFEPRQC 479


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 131/320 (40%), Gaps = 41/320 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P  S S   V C  PLC+   S  C   +  C Y   Y  + + ++G  V + L    
Sbjct: 171 FNPVKSGSFAKVLCRTPLCRRLESPGCNQ-RQTCLYQVSYG-DGSYTTGEFVTETLTF-- 226

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                 + +    V +GCG    G ++  A    ++GLG G +S PS   +       FS
Sbjct: 227 ------RRTKVEQVALGCGHDNEGLFVGAAG---LLGLGRGGLSFPSQAGRT--FNQKFS 275

Query: 163 ICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCL 212
            C  +  + S    V FG+   A  ++  F P+    + D  Y+V +    +G    S +
Sbjct: 276 YCLVDRSASSKPSSVVFGNS--AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI 333

Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           T S F+         ++D G S T L    Y  +   F    SS + + + + +  CY+ 
Sbjct: 334 TASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDL 393

Query: 265 SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
           S +  +KVP + L F   + S    N++      +G   FC     T     IIG     
Sbjct: 394 SGKTTVKVPTVVLHFRGADVSLPASNYLIPV---DGSGRFCFAFAGTTSGLSIIGNIQQQ 450

Query: 324 GHRIVFDRENLKLAWSHSKC 343
           G R+V+D  + ++ +S   C
Sbjct: 451 GFRVVYDLASSRVGFSPRGC 470


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 73/339 (21%), Positives = 132/339 (38%), Gaps = 53/339 (15%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           S ++P  SSS     C+  +C +R+       SC      C  I  Y+ + +S+ G L  
Sbjct: 95  STFNPLLSSSYTPTPCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYA-DASSAEGTLAA 153

Query: 96  DILHLASFSKHAPQSSVQSSVIIGC--GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           +   LA         + Q   + GC      T    + A   G+MG+  G +S+ +    
Sbjct: 154 ETFSLAG--------AAQPGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVT---- 201

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI------GEKYD--AYFVGVESY 205
             ++   FS C    D+  V     GP+      + P+         +D  AY V +E  
Sbjct: 202 -QMVLPKFSYCISGEDAFGVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGI 260

Query: 206 CIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
            +    L           T +G Q +VDSG  FTFL   +Y  +  +F +        ++
Sbjct: 261 KVSEKLLQLPKSVFVPDHTGAG-QTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIE 319

Query: 255 GNSWKY------CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
             ++ +      CY+A +  +  VP + L+FS  +  V    +          V+C T  
Sbjct: 320 DPNFVFEGAMDLCYHAPA-SLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFG 378

Query: 309 STD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           ++D    +  +IG +      + FD    ++ ++ + C+
Sbjct: 379 NSDLLGIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTCD 417


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 84/380 (22%), Positives = 148/380 (38%), Gaps = 74/380 (19%)

Query: 15  NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCK 70
           ++L+CLP T+   C      S +    +  + P +SSSSK + C  P C+        C+
Sbjct: 111 SSLVCLPCTSRYLCSGC-DFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCR 169

Query: 71  SLKDP--------CP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
              DP        CP YI  Y     S++G L+ + L     +            ++GC 
Sbjct: 170 GC-DPNTRNCTVGCPPYILQYGLG--STAGVLITEKLDFPDLT--------VPDFVVGC- 217

Query: 122 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN--------DSGSV 173
                S +    P G+ G G G VS+PS +          S  FD+         D+GS 
Sbjct: 218 -----SIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGS- 271

Query: 174 FFGDQGPATQQSTSFLPIGEK--------YDAYFVGVESYCIGNSCL----------TQS 215
             G    +     ++ P  +          + Y++ +    +G   +          T  
Sbjct: 272 --GHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNG 329

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNS-WKYCYNASSEEMLKV 272
              ++VDSG++FTF+   ++  V  +F   +S  ++   L+  +    C+N S +  + V
Sbjct: 330 DGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISGKGDVTV 389

Query: 273 PDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTD--------GDYGIIGQNFMM 323
           P++   F       +  ++ F+F  N      CLTV+S          G   I+G     
Sbjct: 390 PELIFEFKGGAKLELPLSNYFTFVGNT--DTVCLTVVSDKTVNPSGGTGPAIILGSFQQQ 447

Query: 324 GHRIVFDRENLKLAWSHSKC 343
            + + +D EN +  ++  KC
Sbjct: 448 NYLVEYDLENDRFGFAKKKC 467


>gi|328768800|gb|EGF78845.1| hypothetical protein BATDEDRAFT_12639 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 355

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 75/301 (24%), Positives = 122/301 (40%), Gaps = 43/301 (14%)

Query: 58  CSHPLCKSRSSCKSLKDPC------PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 111
           CS P C   S    L           +   Y T   S SG +  D  ++A  +      S
Sbjct: 82  CSDPACVKHSQFNRLLSSTWTSLTQTFSIQYGTG--SLSGVMSSDTFYMAGLT--VTNQS 137

Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSIC 164
              SV       Q G+       DGV+GLG+ ++S+ ++      +   GLI    F + 
Sbjct: 138 FAESV------SQPGTTFINTKYDGVLGLGMREISINNVATPMENMHAQGLIPAGVFGLY 191

Query: 165 FDENDS-GSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
             +N + GSV   G   P+    S ++LP+  K   + VG+ S     + L Q+  QA+ 
Sbjct: 192 LTKNSAPGSVLTIGGYDPSHVDGSITWLPL-SKRQFWQVGLTSVTFNGTTLIQNA-QAVF 249

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
           DSG S   +PT             VS+  I  Q  +  Y           +P +  +   
Sbjct: 250 DSGTSLIAIPT-------------VSATLIHQQLGAIPYQNGLQLIPCTGLPSVTFML-N 295

Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
           N SF +RN  +  P   G+ V     +   G + I+G +FM  +  +FD +N ++  ++S
Sbjct: 296 NVSFTLRNEDYVIPFGFGYCVSAFVGLDMHG-FWILGDSFMKLYYTIFDSDNNRIGIANS 354

Query: 342 K 342
           +
Sbjct: 355 R 355


>gi|328768784|gb|EGF78829.1| hypothetical protein BATDEDRAFT_12559 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 355

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 75/301 (24%), Positives = 122/301 (40%), Gaps = 43/301 (14%)

Query: 58  CSHPLCKSRSSCKSLKDPC------PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 111
           CS P C   S    L           +   Y T   S SG +  D  ++A  +      S
Sbjct: 82  CSDPACVKHSQFNRLLSSTWTSLTQTFSIQYGTG--SLSGVMSSDTFYMAGLT--VTNQS 137

Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSIC 164
              SV       Q G+       DGV+GLG+ ++S+ ++      +   GLI    F + 
Sbjct: 138 FAESV------SQPGTTFINTKYDGVLGLGMREISINNVATPMENMHAQGLIPAGVFGLY 191

Query: 165 FDENDS-GSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
             +N + GSV   G   P+    S ++LP+  K   + VG+ S     + L Q+  QA+ 
Sbjct: 192 LTKNSAPGSVLTIGGYDPSHVDGSITWLPL-SKRQFWQVGLTSVTFNGTTLIQNA-QAVF 249

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
           DSG S   +PT             VS+  I  Q  +  Y           +P +  +   
Sbjct: 250 DSGTSLIAIPT-------------VSATLIHQQLGAIPYQNGLQLIPCTGLPSVTFML-N 295

Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
           N SF +RN  +  P   G+ V     +   G + I+G +FM  +  +FD +N ++  ++S
Sbjct: 296 NVSFTLRNEDYVIPFGFGYCVSAFVGLDMHG-FWILGDSFMKLYYTIFDSDNNRIGIANS 354

Query: 342 K 342
           +
Sbjct: 355 R 355


>gi|32487305|emb|CAE05796.1| OSJNBb0046K02.6 [Oryza sativa Japonica Group]
 gi|38344664|emb|CAE02326.2| OSJNBb0112E13.8 [Oryza sativa Japonica Group]
 gi|125547764|gb|EAY93586.1| hypothetical protein OsI_15371 [Oryza sativa Indica Group]
 gi|125589862|gb|EAZ30212.1| hypothetical protein OsJ_14269 [Oryza sativa Japonica Group]
          Length = 174

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 43/88 (48%), Gaps = 2/88 (2%)

Query: 32  FGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSG 91
           FG   V  R L+ YDP SS SSK V C   +C SR  C ++   CPYIA YS +   + G
Sbjct: 89  FGHVCVCLRKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYIAAYS-DGGLTMG 146

Query: 92  YLVDDILHLASFSKHAPQSSVQSSVIIG 119
            L  D+LH      +       +SV  G
Sbjct: 147 ILFTDLLHYHQLYGNGQTQPTSTSVTFG 174


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 70/267 (26%), Positives = 112/267 (41%), Gaps = 31/267 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
           +DP+ SSS   V C  P+C      ++       C Y+  Y  + ++++G    D L L+
Sbjct: 185 FDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYG-DGSNTTGVYSSDTLTLS 243

Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNS 160
           +       SS       GCG  Q+G + +G   DG++GLG      PSL+ + AG     
Sbjct: 244 A-------SSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGR---EQPSLVEQTAGTYGGV 290

Query: 161 FSICFDENDS--GSVFFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
           FS C     S  G +  G  GP+       +T  LP       Y V +    +G   L+ 
Sbjct: 291 FSYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSV 350

Query: 214 -QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSEE 268
             S F    +VD+G   T LP   YA +   F   ++S       ++     CYN +   
Sbjct: 351 PASAFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYG 410

Query: 269 MLKVPDMRLIFSKNQSFVV-RNHIFSF 294
            + +P++ L F    + ++  + I SF
Sbjct: 411 TVTLPNVALTFGSGATVMLGADGILSF 437


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 87/340 (25%), Positives = 149/340 (43%), Gaps = 63/340 (18%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           YDP  SS+   + CS  LC+    S  +C S K+ C Y      ED   S   V  +L  
Sbjct: 59  YDPGESSTFAFLPCSDRLCQEGQFSFKNCTS-KNRCVY------EDVYGSAAAVG-VLAS 110

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
            +F+  A + +V   +  GCG    GS L GA   G++GL    +S+ + L     IQ  
Sbjct: 111 ETFTFGA-RRAVSLRLGFGCGALSAGS-LIGAT--GILGLSPESLSLITQLK----IQR- 161

Query: 161 FSIC---FDENDSGSVFFGDQGPATQ-------QSTSFL--PIGEKYDAYFVGVESYCIG 208
           FS C   F +  +  + FG     ++       Q+T+ +  P+   Y  Y+V +    +G
Sbjct: 162 FSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVY--YYVPLVGISLG 219

Query: 209 NSCLT----------QSGFQALVDSGASFTFLPT---EIYAEVVVKFDKLVSSKRISLQG 255
           +  L             G   +VDSG++  +L     E   E V+   +L  + R     
Sbjct: 220 HKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTV--- 276

Query: 256 NSWKYCY------NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTV- 307
             ++ C+       A++ E ++VP + L F    + V+ R++ F  P      + CL V 
Sbjct: 277 EDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA---GLMCLAVG 333

Query: 308 MSTDGD-YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            +TDG    IIG        ++FD ++ K +++ ++C+++
Sbjct: 334 KTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 373


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 72/319 (22%), Positives = 125/319 (39%), Gaps = 34/319 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           + PS+SSS ++VSC+   C+S         +C S    C Y+ +Y  + + ++G L  + 
Sbjct: 105 FKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSNPSTCNYVVNYG-DGSYTNGELGVEQ 163

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L     S         S  + GCGR   G +       G+MGLG   +S+ S        
Sbjct: 164 LSFGGVSV--------SDFVFGCGRNNKGLF---GGVSGLMGLGRSYLSLVS--QTNATF 210

Query: 158 QNSFSICF---DENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN 209
              FS C    +   SGS+  G++    +       T  LP  +  + Y + +    +  
Sbjct: 211 GGVFSYCLPTTESGASGSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDG 270

Query: 210 SCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
             L    F     L+DSG   T LP+ +Y  +   F K  +    +   +    C+N + 
Sbjct: 271 VALQVPSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTG 330

Query: 267 EEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
            + + +P + + F  N    V      +   E+       L  +S   D  IIG      
Sbjct: 331 YDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRN 390

Query: 325 HRIVFDRENLKLAWSHSKC 343
            R+++D +  K+ ++   C
Sbjct: 391 QRVIYDTKQSKVGFAEESC 409


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 124/320 (38%), Gaps = 43/320 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++   S++ K + C  P CK   +       C +   Y +    S+  L  D + L+   
Sbjct: 74  FNTVKSTTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSSTILSN--LTRDTIALSM-- 129

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
              P  +       GC +K TGS +    P G++G G G +S   L     L +++FS C
Sbjct: 130 DPVPYYA------FGCIQKATGSSVP---PQGLLGFGRGPLSF--LSQTQNLYKSTFSYC 178

Query: 165 FDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
                  N SGS+  G  G P   ++T  L    +   Y+V +    +G   +       
Sbjct: 179 LPSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSAL 238

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                +G   + DSG  FT L    Y  V  +F K V +  +S  G  +  CY+      
Sbjct: 239 AFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGG-FDTCYSVP---- 293

Query: 270 LKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMG 324
           +  P +  +FS  N +    N +       G T  CL + +   +      +I       
Sbjct: 294 IVPPTITFMFSGMNVTMPPENLLIH--STAGVTS-CLAMAAAPDNVNSVLNVIASMQQQN 350

Query: 325 HRIVFDRENLKLAWSHSKCE 344
           HRI+FD  N +L  +  +C 
Sbjct: 351 HRILFDVPNSRLGVAREQCS 370


>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 435

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/291 (26%), Positives = 117/291 (40%), Gaps = 54/291 (18%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDD 96
           S Y P+   S++   CS     S  +C S   P      C    D +   T++SG L  D
Sbjct: 80  STYRPARCGSAQ---CSLARSDSCGNCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQD 136

Query: 97  ILHLASFSKHAP-QSSVQSSVIIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAK 153
           ++ L S +   P Q++  S  +  C        L G A    G+ GLG   +++PS LA 
Sbjct: 137 VVSLQSTNGFNPIQNATVSRFLFSCAPT---FLLQGLATGVSGMAGLGRTRIALPSQLAS 193

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPAT-------QQSTSFLPI-------------GE 193
           A   +  F++C   ++ G  FFGD GP          Q  +F P+             GE
Sbjct: 194 AFSFRRKFAVCLSSSN-GVAFFGD-GPYVLLPNVDASQLLTFTPLLINPVSTASAFSQGE 251

Query: 194 KYDAYFVGVESYCIG------NSCLTQSGFQAL----VDSGASFTFLPTEIYAEVVVKFD 243
               YF+GV+S  I       N+ L     + +    + S   +T L   I+  V   F 
Sbjct: 252 PSAEYFIGVKSIKIDEKTVPLNTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVTEAFV 311

Query: 244 KLVSSKRISLQGNSWKYCYNASSEEML------KVPDMRLIFSKNQSFVVR 288
           K  S++ I+   +   +    S E +L       VP + L+  +NQ  V R
Sbjct: 312 KASSARNITRVASVAPFEVCFSRENVLATRLGAAVPTIELVL-QNQKTVWR 361


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/321 (23%), Positives = 129/321 (40%), Gaps = 35/321 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS SSS ++V C  P C   S   S    C +    ST     +G +V D L L+   
Sbjct: 185 FDPSMSSSFRSVLCGSPDCGGHSC--SAGGSCTFTLQNSTF-VFGNGTIVMDTLTLSP-- 239

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGLIQNSFSI 163
                S+   +  +GC +     + DG A  G + L L   S+ + +L  +     +FS 
Sbjct: 240 -----SATFENFAVGCMQLDNDLFTDGVA-VGNIDLSLSRHSLATRVLNSSPPGMAAFSY 293

Query: 164 CFDENDSGSVF---------FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
           C   +     F         + D            P G  +  Y+V + +  I    L  
Sbjct: 294 CLPADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPNF--YYVDLVAIAINGEDLPI 351

Query: 215 -----SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
                +G   ++DS ++FT+L   IYA +  +F K +   +          CYN +  E 
Sbjct: 352 PPALFTGNGTMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGLDTCYNFTLAEN 411

Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPEN--EGFTVFCLTVMST---DGDYGIIGQNFM 322
           + +PD+ L FS  ++  +  R  ++ F E+  +GF   CL   +    +  +  +G    
Sbjct: 412 IYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNFPWNYLGSQVQ 471

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
               IV+D     +A+  S+C
Sbjct: 472 RTKEIVYDVRGGMVAFVPSRC 492


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/353 (22%), Positives = 142/353 (40%), Gaps = 55/353 (15%)

Query: 35  SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSS 89
           ++  D+ +  +  S S +   V CS PLC        S C +    C Y   Y  + + +
Sbjct: 126 TVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGY-MDHSIT 184

Query: 90  SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
           +G + +D     +    A  ++   ++  GCG    G +    +  G+ G G G +S+PS
Sbjct: 185 TGKMAEDTFTFKA-PDRADTAAAVPNIRFGCGMMNYGLFTPNQS--GIAGFGTGPLSLPS 241

Query: 150 LLAKAGLIQNSFSICF---DENDSGSVFFGDQ---------GPATQQSTSFLP------I 191
            L         FS CF   +E+    V  G +         GP   QST F P      +
Sbjct: 242 QLKV-----RRFSYCFTAMEESRVSPVILGGEPENIEAHATGPI--QSTPFAPGPAGAPV 294

Query: 192 GEKYDAYFVGVESYCIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIYA---EV 238
           G +   YF+ +    +G + L    S F           +DSG + TF P  ++    E 
Sbjct: 295 GSQ-PFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREA 353

Query: 239 VVKFDKLVSSKRISLQGNSWKYCYNASSEEML-KVPDMRLIFSKNQSFVVRNHIFSFPEN 297
            V    L  +K  +   N    C++  +++    VP + L        + R +     ++
Sbjct: 354 FVAQVPLPVAKGYTDPDN--LLCFSVPAKKKAPAVPKLILHLEGADWELPRENYVLDNDD 411

Query: 298 EGFTV---FCLTVMSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
           +G       C+ ++S     G I  NF   +  IV+D E+ K+ ++ ++C+++
Sbjct: 412 DGSGAGRKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPARCDKL 464


>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
          Length = 440

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 133/350 (38%), Gaps = 63/350 (18%)

Query: 50  SSSSKNVSCSHPLCKSR------SSCKSLKDP------CPYIADYSTEDTSSSGYLVDDI 97
           SSS K V C    CK         SC     P      C +I       TS+ G L  D+
Sbjct: 79  SSSYKPVPCGSIPCKRSLSGACVESCVGPPSPGCNNNTCSHIPYNHFIRTSTGGELAQDV 138

Query: 98  LHLASFSKHAPQSSVQSS-VIIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAKA 154
           + L S     P+  + ++ V+  C      S L+G A    G++GLG G V  P+ LA A
Sbjct: 139 VSLQSTDGSNPRKYLSTNGVVFDCAPH---SLLEGLAKGVKGILGLGNGYVGFPTQLANA 195

Query: 155 GLIQNSFSICFDENDS--GSVFFGDQ------GPATQQSTSFLPI-------------GE 193
             +   F+IC   + +  G +FFGD       G    +   + P+             GE
Sbjct: 196 FSVPRKFAICLTSSTTSRGVIFFGDSPYVFLPGMDVSKRLVYTPLLKNPVSTSGSYFEGE 255

Query: 194 KYDAYFVGVESYCI-GNSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFD 243
               YF+GV S  I GN     +    +   G           +T L T IY  +   F 
Sbjct: 256 PSTDYFIGVTSIKINGNVVPINTTLLNITKDGKGGTKISTVDPYTKLETSIYNALTKAFV 315

Query: 244 K-LVSSKRISLQGNSWKYCYNASSEEMLK----VPDMRLIFSKNQSFVVRNHIFSFPENE 298
           K L    R+      +K CYN +S    +    VP + L+   N++      I+      
Sbjct: 316 KSLAKVPRVKPVA-PFKVCYNRTSLGSTRVGRGVPPIELVLG-NKNATTSWTIWGVNSMV 373

Query: 299 GFT--VFCLTVMSTDGDYG-----IIGQNFMMGHRIVFDRENLKLAWSHS 341
                V CL  +    ++      +IG + +  + + FD  N +L ++ S
Sbjct: 374 AMNNDVLCLGFLDGGVEFEPTTSIVIGAHQIEDNLLQFDIANKRLGFTSS 423


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 137/318 (43%), Gaps = 33/318 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +D S S + K + C    C+S     C S K  C Y   Y  + + S G L  + L L S
Sbjct: 131 FDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKH-CLYSIHY-VDGSQSLGDLSVETLTLGS 188

Query: 103 FSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
            +     S VQ    +IGCGR       +  +  G++GLG G +S+ + L+ +      F
Sbjct: 189 TNG----SPVQFPGTVIGCGRYNAIGIEEKNS--GIVGLGRGPMSLITQLSPS--TGGKF 240

Query: 162 SICFD---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLT--- 213
           S C        S  + FG+    + + T   P+  K     YF+ +E++ +G + +    
Sbjct: 241 SYCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGS 300

Query: 214 -QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM- 269
             SG +   ++DSG + T LP  +Y+++     K V  +R+         CY  + +++ 
Sbjct: 301 PGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLD 360

Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIG-QNFMMGHRI 327
             VP +   FS     V  N I +F +  +    F      T   +G +  QN ++G   
Sbjct: 361 ASVPVITAHFSGAD--VTLNAINTFVQVADDVVCFAFQPTETGAVFGNLAQQNLLVG--- 415

Query: 328 VFDRENLKLAWSHSKCEE 345
            +D +   +++ H+ C +
Sbjct: 416 -YDLQMNTVSFKHTDCTK 432


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 81/340 (23%), Positives = 129/340 (37%), Gaps = 60/340 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILH 99
           ++P+SS++ + V C  P C      SC SL   K+ C +   Y   D+S    L  D L 
Sbjct: 135 FNPASSATFRPVPCGAPPCSQAPNPSCTSLAKSKNSCGFSLSYG--DSSLDATLSQDNLA 192

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA-GLIQ 158
           + +         V      GC  K  GS    AAP    GL          +A+  G+ +
Sbjct: 193 VTA------NGGVIKGYTFGCLTKSNGS----AAP--AQGLLGLGRGPLGFVAQTKGIYE 240

Query: 159 NSFSICFDE------NDSGSVFFGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
            +FS C         N SGS+  G +G   P   ++T  L    +   Y+V +    IG 
Sbjct: 241 GTFSYCLPSYYRSAANFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGK 300

Query: 210 SCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS-----------S 248
             +            +G   ++DSG  F  L    YA V  +  + V+           S
Sbjct: 301 KSVPIPPSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGAS 360

Query: 249 KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
             +S  G  +  CYN S+   +  P + L+F       +           G T  CL + 
Sbjct: 361 VSVSSLGG-FDTCYNVST---VAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTS-CLAMA 415

Query: 309 STDGD-----YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           ++  D       +IG      HR++FD  N ++ ++  +C
Sbjct: 416 ASPADGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERC 455


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/264 (23%), Positives = 105/264 (39%), Gaps = 47/264 (17%)

Query: 132 AAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDSGS--------VFFGDQGPAT 182
           + P GV G G G +S+P+ LA  +  + N FS C   +   S        +  G      
Sbjct: 214 SEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFRSERIRKPSPLILGRYNDEK 273

Query: 183 QQS---------TSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 223
           Q +         TS L   +    Y VG++   +G   +           +     +VDS
Sbjct: 274 QSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKTVPAPKILRRVNKKGDGGVVVDS 333

Query: 224 GASFTFLPTEIYAEVVVKFDKLV--SSKRIS--LQGNSWKYCYNASSEEMLKVPDMRLIF 279
           G +FT LP + Y  VV  FD+    S++R     Q      CY  ++  ++    +R + 
Sbjct: 334 GTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTGLSPCYYLNTAAIVPAVTLRFV- 392

Query: 280 SKNQSFVV--RNHIFSFPE-----NEGFTVFCLTVMS-------TDGDYGIIGQNFMMGH 325
             N S V+  +N+ + F +          V CL  M+       + G  G++G     G 
Sbjct: 393 GMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGF 452

Query: 326 RIVFDRENLKLAWSHSKCEEVIDK 349
            + +D E  ++ ++  KC  + D+
Sbjct: 453 EVEYDLEKKRVGFARRKCASLWDR 476


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 81/321 (25%), Positives = 126/321 (39%), Gaps = 42/321 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P++SS+ + V C+ PLCK    S C++ K  C Y   Y            D    +  
Sbjct: 195 FNPAASSTYRKVPCATPLCKKLDISGCRN-KRYCEYQVSYG-----------DGSFTVGD 242

Query: 103 FSKHAP--QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           FS      +  V   V +GCG    G ++  A   G+     G +S PS           
Sbjct: 243 FSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLG---RGSLSFPS--QTGAQFSKR 297

Query: 161 FSICF-DENDSG---SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT 213
           FS C  D + SG   S+ FG    A  +S  F P+    K D  Y+V +    +G   LT
Sbjct: 298 FSYCLVDRSASGTASSLIFGK--AAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLT 355

Query: 214 Q---SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
               S F+         ++DSG S T L    Y+ +   F     + + +   + +  CY
Sbjct: 356 SIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCY 415

Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
           + S  + +KVP +   F       +    +  P +   T FC       G   IIG    
Sbjct: 416 DLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSAT-FCFAFAGNTGGLSIIGNIQQ 474

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
            G+R+VFD    ++ +    C
Sbjct: 475 QGYRVVFDSLANRVGFKAGSC 495


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 106/268 (39%), Gaps = 51/268 (19%)

Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----N 168
           S++ +GC          GA+  G++G+    +S PS L+        FS CF +     N
Sbjct: 254 SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCFPDKIAHLN 309

Query: 169 DSGSVFFGD--------------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            SG VFFG+              Q PA   ++         D Y+VG+    +  S L  
Sbjct: 310 SSGLVFFGESDIISPYLRYTPLVQNPAVPSAS--------LDYYYVGLVGISVDESRLPL 361

Query: 213 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
                     T SG   ++DSG +FT+L    +  +  +F    S        + +  CY
Sbjct: 362 SHKNFDIDKVTGSG-GTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 420

Query: 263 NASSE----EMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLT-VMSTDGDYG 315
           N +S     E   +P + L F      V+  +    P   +E  T  CL  +MS D  + 
Sbjct: 421 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFN 480

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IIG        + +D E L+L  + ++C
Sbjct: 481 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 508


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 127/320 (39%), Gaps = 41/320 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P  S S   V C  PLC+   S  C   +  C Y   Y  + + ++G  V + L    
Sbjct: 84  FNPVKSGSFAKVLCRTPLCRRLESPGCNQ-RQTCLYQVSYG-DGSYTTGEFVTETLTF-- 139

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                 + +    V +GCG    G ++  A   G+       +S PS   +       FS
Sbjct: 140 ------RRTKVEQVALGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQAGRT--FNQKFS 188

Query: 163 ICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCL 212
            C  +  + S    V FG+   A  ++  F P+    + D  Y+V +    +G    S +
Sbjct: 189 YCLVDRSASSKPSSVVFGNS--AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI 246

Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
           T S F+         ++D G S T L    Y  +   F    SS + + + + +  CY+ 
Sbjct: 247 TASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDL 306

Query: 265 SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
           S +  +KVP + L F   + S    N++      +G   FC     T     IIG     
Sbjct: 307 SGKTTVKVPTVVLHFRGADVSLPASNYLIPV---DGSGRFCFAFAGTTSGLSIIGNIQQQ 363

Query: 324 GHRIVFDRENLKLAWSHSKC 343
           G R+V+D  + ++ +S   C
Sbjct: 364 GFRVVYDLASSRVGFSPRGC 383


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 130/317 (41%), Gaps = 37/317 (11%)

Query: 43  SEYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           S +DPS+SS+    SCS   C      +  + C S +  C YI  Y  + +S++G    D
Sbjct: 171 SLFDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQ--CQYIVSY-VDGSSTTGTYSSD 227

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AG 155
            L L S +    Q         GC + ++G + D    DG+MGLG GD    SL+++ AG
Sbjct: 228 TLTLGSNAIKGFQ--------FGCSQSESGGFSD--QTDGLMGLG-GDAQ--SLVSQTAG 274

Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQS---TSFLPIGEKYDAYFVGVESYCIGNSCL 212
               +FS C      GS  F   G A++     T  L   +    Y V +E+  +G   L
Sbjct: 275 TFGKAFSYCLPPTP-GSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQL 333

Query: 213 T--QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
               S F A  ++DSG   T LP   Y+ +   F   +     +        C++ S + 
Sbjct: 334 NIPTSVFSAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQS 393

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHR 326
            + +P + L+FS      V N  F+    E    +CL     S D   G IG        
Sbjct: 394 SVSIPSVALVFSGG---AVVNLDFNGIMLE-LDNWCLAFAANSDDSSLGFIGNVQQRTFE 449

Query: 327 IVFDRENLKLAWSHSKC 343
           +++D     + +    C
Sbjct: 450 VLYDVGGGAVGFRAGAC 466


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/298 (22%), Positives = 117/298 (39%), Gaps = 26/298 (8%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP  SS+  NVSC+ P C   +        C Y   Y  + + S G+   D L L+S+ 
Sbjct: 221 FDPVRSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 278

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
                         GCG +  G + + A   G++GLG G  S+P     K G +   F+ 
Sbjct: 279 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 326

Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGF 217
           C     +G+ +      +   +++ L      D     Y++G+    +G   L+  QS F
Sbjct: 327 CLPARSTGTGYLDFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVF 386

Query: 218 Q---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKV 272
                +VDSG   T LP   Y+ +   F   ++++  + +   +    CY+ +    + +
Sbjct: 387 ATAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAI 446

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
           P + L+F       V      +  +              GD GI+G   +    + +D
Sbjct: 447 PTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 504


>gi|219120652|ref|XP_002181060.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407776|gb|EEC47712.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 453

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 71/327 (21%), Positives = 131/327 (40%), Gaps = 41/327 (12%)

Query: 46  DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
           DP  SS+ +   C   L      C + +  C     Y TE +S +   V D   L     
Sbjct: 128 DPQRSSTLRYTQCGSCLLSGIQECAA-EQKCGINQRY-TEGSSWTAVEVSDTFVLGGPEI 185

Query: 106 HAPQSSVQSSVII--GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFS 162
            + +  V  ++I   GC +K  G +    A +G++GL   D+S+   L K  +I + SFS
Sbjct: 186 SSLEQYVSFTIIFAFGCQQKVRGLFRTQYA-NGILGLERSDLSLIKRLWKENVIPRESFS 244

Query: 163 ICFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLTQS---- 215
           +C    +    + G  GP   + T    + P       Y V V    +G+ CLT +    
Sbjct: 245 LCMTPFEG---YIGLGGPLRDKHTESMKYTPFTSTQSWYAVHVVRVFVGDECLTSNDQHD 301

Query: 216 -------------GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
                        G   ++DSG + T+LP  +   +   + +L ++     Q +S    Y
Sbjct: 302 TVVEHALVEAFAEGKGTILDSGTTDTYLPKAVAGRMREIWARLSNTP---FQPSS---TY 355

Query: 263 NASSEEMLKVPDMRLIFSKNQSF--VVRNHIFSFPEN----EGFTVFCLTVMSTDGDYGI 316
             + +E   +P +    + N +   + +N +   PE      G       + + +    +
Sbjct: 356 AYTYDEFRSLPIVTFELANNVTLQALPKNFMEDLPEPLRPWTGRRKLMNRLYADEVQGAV 415

Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +G N M+G+ ++FD +  +   + + C
Sbjct: 416 VGLNTMVGYDLLFDVQGNRFGVAPALC 442


>gi|453087366|gb|EMF15407.1| candidapepsin-4 precursor [Mycosphaerella populorum SO2202]
          Length = 471

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/303 (25%), Positives = 129/303 (42%), Gaps = 42/303 (13%)

Query: 69  CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG---CGRKQT 125
           C++  DPC     Y+  D+S+  YL  D          +    V  +V IG      +Q 
Sbjct: 107 CQARGDPCSISGTYNANDSSTYTYLNSDFNISYVDGSGSAGDYVSDTVKIGDTTLTGQQF 166

Query: 126 GSYLDGAAPDGVMGLG--LGDVS-----------VPSLLAKAGLIQ-NSFSICFDEND-- 169
           G   + ++ +G++G+G  + +V+           VP  L KAG I  N++S+  ++ D  
Sbjct: 167 GIGYESSSQEGILGIGYPINEVAVQYNGGKTYSNVPQSLVKAGAINTNAYSLWLNDLDAS 226

Query: 170 SGSVFFGDQGPATQQSTSFL---PIGEKYDAY------FVGVESYCIGNSCLTQSGFQAL 220
           +GS+ FG  G  T++ T  L   PI E    Y         V +     S + +    AL
Sbjct: 227 TGSILFG--GVNTEKYTGSLETIPIVETQGVYAEFIIALTAVGANGTAGSIVNKQAIPAL 284

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
           +DSG+S  +LP +I   +   +D  V +   S QG ++  C  A+S+  L      L FS
Sbjct: 285 LDSGSSLMYLPNDITQSI---YDS-VGASYDSEQGAAFVDCDLANSDGSLD-----LTFS 335

Query: 281 KNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD--RENLKLA 337
                V  N +      + G  V  L +        ++G  F+    +V+D  +  + LA
Sbjct: 336 SPTIKVPMNELVIVAGIDRGKEVCILGIGPAGSSTPVLGDTFLRSAYVVYDLAKNEISLA 395

Query: 338 WSH 340
            ++
Sbjct: 396 QTN 398


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 164/371 (44%), Gaps = 64/371 (17%)

Query: 5   ICFGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCS 59
           +  G+ A  Y+A++    + L+W     C + F      D+    +DP  SSS   + CS
Sbjct: 101 LAIGTPAETYSAIMDTG-SDLIWTQCKPCKVCF------DQPTPIFDPEKSSSFSKLPCS 153

Query: 60  HPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 117
             LC +   SSC    D C Y   Y  + +S+ G L  +       S         S + 
Sbjct: 154 SDLCVALPISSC---SDGCEYRYSYG-DHSSTQGVLATETFTFGDAS--------VSKIG 201

Query: 118 IGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVF 174
            GCG    G +Y  GA   G++GLG G +   SL+++ G+ + S+ +   ++  G  ++ 
Sbjct: 202 FGCGEDNRGRAYSQGA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLL 255

Query: 175 FGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LV 221
            G +  AT +S    P+ +   +   Y++ +E   +G++ L   +S F          ++
Sbjct: 256 VGSE--ATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLII 313

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNASSE-EMLKVPDMRL 277
           DSG + T+L    +A +  +F   +S  ++ +  +     + C+    +   ++VP +  
Sbjct: 314 DSGTTITYLKDNAFAALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVF 370

Query: 278 IFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DRENLK 335
            F   +      N+I    E+    V CLT+ S+ G   I G NF   + +V  D E   
Sbjct: 371 HFEGVDLKLPKENYII---EDSALRVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEKET 425

Query: 336 LAWSHSKCEEV 346
           ++++ ++C ++
Sbjct: 426 ISFAPAQCNQL 436


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 134/318 (42%), Gaps = 39/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++PS SSS K ++C+  +C K +    S K+ C Y   Y     +   +  + +    SF
Sbjct: 123 FNPSLSSSFKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETL----SF 178

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
            +HA +     SV +GCGR   G +   A    ++GLG G +S PS    +    + FS 
Sbjct: 179 GEHAVR-----SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSY 228

Query: 164 CFDENDS---GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
           C    +S    S+ FG    P   + T  LP       Y+VG+    +  S +       
Sbjct: 229 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 288

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASS 266
              ++     +VDSG + + L T  Y  +   F  LV   S+  ISL    +  CY+ SS
Sbjct: 289 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSS 344

Query: 267 EEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
            +   +P + L F    S  +  + I    ++EG   +CL     +  + IIG       
Sbjct: 345 MKTATLPAVVLDFDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIGNVQQQTF 402

Query: 326 RIVFDRENLKLAWSHSKC 343
           RI  D +  ++  +  +C
Sbjct: 403 RISIDNQKEQMGIAPDQC 420


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 65/348 (18%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVD 95
           R    + P+SSS+   + C+  LC+  +S    C +    C Y   Y    T+  GYL  
Sbjct: 91  RPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG--CVYYYPYGMGFTA--GYLAT 146

Query: 96  DILHL--ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
           + LH+  ASF   A   S ++ V              G +  G++GLG   +S   L+++
Sbjct: 147 ETLHVGGASFPGVAFGCSTENGV--------------GNSSSGIVGLGRSPLS---LVSQ 189

Query: 154 AGLIQNSFSICF----DENDSGSVFFGDQGPAT--QQSTSFL--PIGEKYDAYFVGVESY 205
            G+    FS C     D  DS  + FG     T  + S + L  P       Y+V +   
Sbjct: 190 VGV--GRFSYCLRSDADAGDS-PILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGI 246

Query: 206 CIGNSCL----TQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
            +G + L    T  GF            +VDSG + T+L  E YA V   F   +++  +
Sbjct: 247 TVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANL 306

Query: 252 SLQGNSWKY----CYNASSE---EMLKVPDMRLIFSKNQSFVVRNH----IFSFPENEGF 300
           +   N  ++    C++A++      + VP + L F+    + VR      +         
Sbjct: 307 TTTVNGTRFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRA 366

Query: 301 TVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            V CL V+  S      IIG    M   +++D +    +++ + C  V
Sbjct: 367 AVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 414


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/331 (23%), Positives = 125/331 (37%), Gaps = 42/331 (12%)

Query: 44  EYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDIL 98
            +DP+ SS+ + V C  P C     +  SC +     C +   Y++    +   L  D L
Sbjct: 142 SFDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA--VLGQDAL 199

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            L+  +  A           GC R  TGS      P G++G G G +   S L++     
Sbjct: 200 SLSDSNGAA---VPDDHYTFGCLRVVTGSG-GSVPPQGLVGFGRGPL---SFLSQTKATY 252

Query: 159 NS-FSICF----DENDSGSVFFGDQG-PATQQSTSFLP------------IGEKYDAYFV 200
            S FS C       N SG++  G  G P   ++T  L             +G + +   V
Sbjct: 253 GSIFSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAV 312

Query: 201 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
            + +  +     T  G   +VD+G  FT L    YA +   F + VS+      G  +  
Sbjct: 313 PIPASALALDAATGRG-GTIVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPALGG-FDT 370

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
           CY  +  +   VP +  +F+      +           G  V CL + +   D    G N
Sbjct: 371 CYYVNGTK--SVPAVAFVFAGGARVTLPEENVVISSTSG-GVACLAMAAGPSDGVNAGLN 427

Query: 321 FM-----MGHRIVFDRENLKLAWSHSKCEEV 346
            +       HR+VFD  N ++ +S   C  V
Sbjct: 428 VLASMQQQNHRVVFDVGNGRVGFSRELCTAV 458


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 85/322 (26%), Positives = 126/322 (39%), Gaps = 47/322 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++P  SSS K++SC    C   ++    +   C Y  +Y  + + S G    + L L S 
Sbjct: 180 FEPQQSSSYKHLSCLSSACTELTTMNHCRLGGCVYEINYG-DGSRSQGDFSQETLTLGSD 238

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL-AKAGLIQNSFS 162
           S          S   GCG   TG +   A   G++GLG   +S PS   +K G     FS
Sbjct: 239 SF--------PSFAFGCGHTNTGLFKGSA---GLLGLGRTALSFPSQTKSKYG---GQFS 284

Query: 163 IC---FDENDSGSVFFGDQG--PATQQSTSFLPI--GEKYDA-YFVGVESYCIGN----- 209
            C   F  + S   F   QG  PAT    +F+P+     Y + YFVG+    +G      
Sbjct: 285 YCLPDFVSSTSTGSFSVGQGSIPAT---ATFVPLVSNSNYPSFYFVGLNGISVGGERLSI 341

Query: 210 --SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYN 263
             + L + G   +VDSG   T L  + Y  +   F      L S+K  S+       CY+
Sbjct: 342 PPAVLGRGG--TIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSI----LDTCYD 395

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQNF 321
            SS   +++P +   F  N    V      F      +  CL   S        IIG   
Sbjct: 396 LSSYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQ 455

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
               R+ FD    ++ ++   C
Sbjct: 456 QQRMRVAFDTGAGRIGFAPGSC 477


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 82/360 (22%), Positives = 143/360 (39%), Gaps = 80/360 (22%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLC--------------KSRSSCKSLKDPCP-YIADYSTE 85
           N  ++ P +SSSSK V C++P C              + +++  +    CP Y   Y   
Sbjct: 128 NTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLG 187

Query: 86  DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
             S++G+L+ + L+  +           S  ++GC      S +    P G+ G G G+ 
Sbjct: 188 --STAGFLLSENLNFPT--------KKYSDFLLGC------SVVSVYQPAGIAGFGRGEE 231

Query: 146 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS----------TSFL--PIGE 193
           S+PS   +  L + S+ +   + D  +    +    T  S          T FL  P  +
Sbjct: 232 SLPS---QMNLTRFSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTK 288

Query: 194 KYDA----YFVGVESYCIGNSCLT------------QSGFQALVDSGASFTFLPTEIYAE 237
           K  A    Y++ ++   +G   +               GF  +VDSG++FTF+   I+  
Sbjct: 289 KNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGF--IVDSGSTFTFMERPIFDL 346

Query: 238 VVVKFDKLVSSKRISLQGNSWKY--CYN-ASSEEMLKVPDMRLIFS--KNQSFVVRNHIF 292
           V  +F K VS  R       +    C+  A   E    P++R  F         V N+  
Sbjct: 347 VAQEFAKQVSYTRAREAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFS 406

Query: 293 SFPENEGFTVFCLTVMSTD--GDYGIIGQNFMMGH------RIVFDRENLKLAWSHSKCE 344
              + +   V CLT++S D  G  G +G   ++G+       + +D EN +  +    C+
Sbjct: 407 LVGKGD---VACLTIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQ 463


>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 78/173 (45%), Gaps = 7/173 (4%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +N++ +DP +SSS+  ++CS   C S    KS   P  Y  +YS + + +SGY + D++ 
Sbjct: 119 QNVTFFDPGASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVEYS-DGSFTSGYYISDLIS 177

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
             +           +  + GC     G   L   +  G++GLG G + V S L+   L  
Sbjct: 178 FETVMSSNLTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAP 237

Query: 159 NSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
             FS+C    +   G +  G+       +T + P+      Y V ++++ + +
Sbjct: 238 EVFSLCLSGGQEGGGVIILGEN---RLPNTVYTPLVRSQTHYNVNLKTFAVND 287


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 81/325 (24%), Positives = 126/325 (38%), Gaps = 43/325 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------------SRSSCKSLKDPCPYIADYSTEDTSSSG 91
           +DP++S +   V C  P C              +RS+  S +  C Y   Y  + + S G
Sbjct: 225 FDPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNS-EQRCYYALSYG-DGSFSRG 282

Query: 92  YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
            L  D L L + +K           + GCG    G +   A   G+MGLG  D+S+ S  
Sbjct: 283 VLAQDTLGLGTTTK-------LDGFVFGCGLSNRGLFGGTA---GLMGLGRTDLSLVS-- 330

Query: 152 AKAGLIQNSFSICF--DENDSGSVFFGDQGPAT----QQSTSFLPIGEKYDAYFVGVE-S 204
             A      FS C       +GS+  G  GP++       T  +    +   YF+ +  +
Sbjct: 331 QTAARFGGVFSYCLPATTTSTGSLSLG-PGPSSSFPNMAYTRMIADPTQPPFYFINITGA 389

Query: 205 YCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKY 260
              G + LT  GF A   LVDSG   T L   +Y  V  +F +    +  +  G S    
Sbjct: 390 AVGGGAALTAPGFGAGNVLVDSGTVITRLAPSVYKAVRAEFARRF--EYPAAPGFSILDA 447

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIG 318
           CY+ +  + + VP + L         V      F   +  +  CL + S   +    IIG
Sbjct: 448 CYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIG 507

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                  R+V+D    +L ++   C
Sbjct: 508 NYQQRNKRVVYDTVGSRLGFADEDC 532


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/244 (22%), Positives = 105/244 (43%), Gaps = 40/244 (16%)

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
           A P GV G G G +S+P+ LA + L  ++ +     +++  V+           T  L  
Sbjct: 231 AEPVGVAGFGRGPLSLPAQLAPS-LSGSTDAAAIGASETDFVY-----------TPLLHN 278

Query: 192 GEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVK 241
            +    Y V +E+  +G   +                 +VDSG +FT LP++ +A V  +
Sbjct: 279 PKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADE 338

Query: 242 FDKLVSSKRISLQGNSWKY-----CYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF 294
           F + +++ R +    +        CY+ S  +   VP + L F  N +  +  RN+   F
Sbjct: 339 FARAMAAARFTRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFRGNATVALPRRNYFMGF 397

Query: 295 PENEGFTVFCLTVMSTDGD----------YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
              EG +V CL +M+  G+           G +G     G  +V+D +  ++ ++  +C 
Sbjct: 398 KSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 457

Query: 345 EVID 348
           ++ D
Sbjct: 458 DLWD 461


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 81/346 (23%), Positives = 152/346 (43%), Gaps = 62/346 (17%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK---DP---CPYIADYSTEDTSSSGYLVDD 96
           S ++P +S +   + CS P C++R+    L    DP   C +I  Y+ + +S  G L  +
Sbjct: 103 SIFNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISYA-DASSVEGNLAFE 161

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
              + S +  A         + GC      S   + A   G+MG+  G +S    + + G
Sbjct: 162 TFRVGSVTGPA--------TVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLS---FVNQMG 210

Query: 156 LIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEK------YD--AYFVGVESYC 206
                FS C  D + SG +  G+   +  +  ++ P+ E       +D  AY V +E   
Sbjct: 211 F--RKFSYCISDRDSSGVLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIR 268

Query: 207 IGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKF-------DKLVSSK 249
           + +  L+  +S F        Q +VDSG  FTFL   +Y+ +  +F        ++++  
Sbjct: 269 VSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEP 328

Query: 250 RISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFP-ENEGF-TVFC 304
           R   QG +   CY    +   +  +P + L+F   +  V  +  ++  P E  G  +V+C
Sbjct: 329 RYVFQG-AMDLCYLIEPTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWC 387

Query: 305 LTVMSTDGDYGIIGQNFMMGHR------IVFDRENLKLAWSHSKCE 344
            T  ++D   GI  ++F++GH       + +D E  ++ ++  +C+
Sbjct: 388 FTFGNSD-SLGI--ESFVIGHHQQQNVWMEYDLEKSRIGFAEVRCD 430


>gi|213998796|gb|ACJ60765.1| nucellin [Hordeum marinum subsp. gussoneanum]
          Length = 133

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 3/120 (2%)

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE 193
           DG++GLG+G     + L    +I  N    C      G ++ G+  P ++  T ++P+ E
Sbjct: 10  DGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGNFNPPSRGVT-WVPMRE 68

Query: 194 KYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
               Y  G+    I N  +     F+A+ DSG+++T +P++IY E+V K    +S   ++
Sbjct: 69  SSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYNEIVPKVRGTLSESSLA 128


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 81/322 (25%), Positives = 121/322 (37%), Gaps = 43/322 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPC-----PYIADYSTEDTSSSGYLVDDILH 99
           ++P  SSS K + C    C    + +S   PC      Y  +Y  + +SS G    + L 
Sbjct: 179 FEPKQSSSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYG-DGSSSQGDFSQETLT 237

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL-LAKAGLIQ 158
           L S S          +   GCG   TG +       G++GLG   +S PS   +K G   
Sbjct: 238 LGSDSFQ--------NFAFGCGHTNTGLF---KGSSGLLGLGQNSLSFPSQSKSKYG--- 283

Query: 159 NSFSICF-DENDSGSVFFGDQGPAT-QQSTSFLPIGEKY---DAYFVGVESYCIGNSCLT 213
             F+ C  D   S S      G  +   S  F P+   +     YFVG+    +G   L+
Sbjct: 284 GQFAYCLPDFGSSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLS 343

Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNA 264
                      +VDSG   T L  + Y  +   F      L S+K  S+       CY+ 
Sbjct: 344 IPPAVLGRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI----LDTCYDL 399

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST---DGDYGIIGQNF 321
           S    +++P +   F  N    V +     P   G +  CL   S    DG + IIG   
Sbjct: 400 SRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDG-FNIIGNFQ 458

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
               R+ FD    ++ ++   C
Sbjct: 459 QQRMRVAFDTGAGRIGFASGSC 480


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 88/373 (23%), Positives = 152/373 (40%), Gaps = 60/373 (16%)

Query: 1   MLGAICFGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKN 55
            L  +  G+ A  Y+A++    + L+W     C   F      D+    +DP  SSS   
Sbjct: 97  FLMKLAIGTPAETYSAIMDTG-SDLIWTQCKPCKDCF------DQPTPIFDPKKSSSFSK 149

Query: 56  VSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
           + CS  LC +   SSC    D C Y+  Y  + +S+ G L  +       S         
Sbjct: 150 LPCSSDLCAALPISSC---SDGCEYLYSYG-DYSSTQGVLATETFAFGDASV-------- 197

Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 169
           S +  GCG    GS     A  G++GLG G +S+ S L      +  FS C     D   
Sbjct: 198 SKIGFGCGEDNDGSGFSQGA--GLVGLGRGPLSLISQLG-----EPKFSYCLTSMDDSKG 250

Query: 170 SGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA----- 219
             S+  G +  AT ++    P+ +   +   Y++ +E   +G++ L   +S F       
Sbjct: 251 ISSLLVGSE--ATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGS 308

Query: 220 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDM 275
              ++DSG + T+L    +A +  +F   +              C+    +   + VP +
Sbjct: 309 GGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQL 368

Query: 276 RLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DREN 333
              F   +      N+I +   + G  V CLT+ S+ G   I G NF   + +V  D E 
Sbjct: 369 VFHFEGADLKLPAENYIIA---DSGLGVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEK 423

Query: 334 LKLAWSHSKCEEV 346
             ++++ ++C ++
Sbjct: 424 ETISFAPAQCNQL 436


>gi|147821119|emb|CAN68736.1| hypothetical protein VITISV_030193 [Vitis vinifera]
          Length = 441

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 67/147 (45%), Gaps = 13/147 (8%)

Query: 43  SEYDPSSSSSSKNV------SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           S Y P+   SS+        SC H L + R  C +    C   ++       S+G L +D
Sbjct: 81  SSYRPARCHSSQCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSED 138

Query: 97  ILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           +L L S     P+S+V     +  C  +     L G A +G+ GLG G + +P+LL+ A 
Sbjct: 139 VLSLQSTDGLNPRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSAL 197

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGP 180
                F++C       SG +FFGD GP
Sbjct: 198 NFTRKFAVCLPPTTTSSGVIFFGD-GP 223


>gi|225451013|ref|XP_002284868.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
          Length = 441

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 67/147 (45%), Gaps = 13/147 (8%)

Query: 43  SEYDPSSSSSSKNV------SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           S Y P+   SS+        SC H L + R  C +    C   ++       S+G L +D
Sbjct: 81  SSYRPAQCHSSQCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSED 138

Query: 97  ILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           +L L S     P+S+V     +  C  +     L G A +G+ GLG G + +P+LL+ A 
Sbjct: 139 VLSLQSTDGLNPRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSAL 197

Query: 156 LIQNSFSICF--DENDSGSVFFGDQGP 180
                F++C       SG +FFGD GP
Sbjct: 198 NFTRKFAVCLPPTTTSSGVIFFGD-GP 223


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 134/318 (42%), Gaps = 39/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           ++PS SSS K ++C+  +C K +    S K+ C Y   Y     +   +  + +    SF
Sbjct: 56  FNPSLSSSFKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETL----SF 111

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
            +HA +     SV +GCGR   G +   A    ++GLG G +S PS    +    + FS 
Sbjct: 112 GEHAVR-----SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSY 161

Query: 164 CFDENDS---GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
           C    +S    S+ FG    P   + T  LP       Y+VG+    +  S +       
Sbjct: 162 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 221

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASS 266
              ++     +VDSG + + L T  Y  +   F  LV   S+  ISL    +  CY+ SS
Sbjct: 222 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSS 277

Query: 267 EEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
            +   +P + L F    S  +  + I    ++EG   +CL     +  + IIG       
Sbjct: 278 MKTATLPAVVLDFDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIGNVQQQTF 335

Query: 326 RIVFDRENLKLAWSHSKC 343
           RI  D +  ++  +  +C
Sbjct: 336 RISIDNQKEQMGIAPDQC 353


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/376 (20%), Positives = 146/376 (38%), Gaps = 79/376 (21%)

Query: 23  TTLLW--CLLVFGASIVQDRNLSEYDPSS--------SSSSKNVSCSHPLCK----SRSS 68
           ++L+W  C +       Q+   S  DP+         SS+ +++ C  P C     S  +
Sbjct: 95  SSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFGSDLN 154

Query: 69  CKSLKDPCPYIA-DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
           C + K  CPY   +Y     S++G LV D+L L+  ++           + GC      S
Sbjct: 155 CSTTKR-CPYYGLEYGLG--STTGQLVSDVLGLSKLNRIP-------DFLFGC------S 198

Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQG----P 180
            +    P+G+ G G G  S+P   A+ GL + S+ +    FD+          +G     
Sbjct: 199 LVSNRQPEGIAGFGRGLASIP---AQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHAD 255

Query: 181 ATQQSTSFLPIGEK------YDAYFVGVESYCIGNSCL----------TQSGFQALVDSG 224
           A     ++ P  +        + Y++ +    +G   +           +     +VDSG
Sbjct: 256 AAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSG 315

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYCYNASSEEMLKVPDMRLIFSK 281
           ++FTF+   I+  V  + +K ++  + + +    +    CYN + +  + VP +   F  
Sbjct: 316 STFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKLTFSFKG 375

Query: 282 NQSFVVRNHIFSFPENEGFT-----VFCLTVM-------STDGDYGIIGQNFMMGHRIVF 329
             +          P  + F+     V C+TV+       ST G   I+G        I +
Sbjct: 376 GAN-------MDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEY 428

Query: 330 DRENLKLAWSHSKCEE 345
           D +  +  +   +C+ 
Sbjct: 429 DLKKQRFGFKPQQCDR 444


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 72/336 (21%), Positives = 127/336 (37%), Gaps = 65/336 (19%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 84
           L+W   +   S  + +N   +DPS S+S K VSC    C+                    
Sbjct: 47  LMWTQCLPCLSCYKQKN-PMFDPSKSTSFKEVSCESQQCR-------------------L 86

Query: 85  EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 144
            DT +S      IL+                ++ GCG   +G++ +     G+ G G   
Sbjct: 87  LDTPTS------ILN----------------IVFGCGHNNSGTFNENEM--GLFGTGGRP 122

Query: 145 VSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-- 197
           +S+ S +         FS C      D + +  + FG +   +       P+  K D   
Sbjct: 123 LSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTY 182

Query: 198 YFVGVESYCIGN--------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
           YFV ++   +G+        S +   G    +D+G   T LP + Y  +V    + +  +
Sbjct: 183 YFVTLDGISVGDKLFPFSSSSPMATKG-NVFIDAGTPPTLLPRDFYNRLVQGVKEAIPME 241

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
            +       + CY +++  ++  P +   F      +   + F  P+ EG  V+C  +  
Sbjct: 242 PVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFISPK-EG--VYCFAMQP 296

Query: 310 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
            DGD GI G    M   I FD +  K+++    C +
Sbjct: 297 IDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 332


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 143/324 (44%), Gaps = 40/324 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILH 99
           +DPS SSS +++ C    C     S  +C    + C Y   YS  D S ++G L  +   
Sbjct: 136 FDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEY--HYSYGDKSYTNGNLATEKFT 193

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQ 158
           + S S         S ++ GCG    G++      +   G+        SL+++ + +I+
Sbjct: 194 IGSTSSRPVH---LSPIVFGCGTGNGGTF-----DELGSGIVGLGGGALSLVSQLSSIIK 245

Query: 159 NSFSICF-----DENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSC 211
             FS C        N +  + FG D   +  Q  S   + ++ D Y+ V +E+  +GN  
Sbjct: 246 GKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKR 305

Query: 212 L----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
           L           + G   ++DSG + TFL +E + E+    ++ V ++R+S     +  C
Sbjct: 306 LPYTNGLLNGNVEKG-NVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVC 364

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
           + ++ +  + +P + + F  N + V    + +F + +   + C T++S++   GI G   
Sbjct: 365 FRSAGD--IDLPVIAVHF--NDADVKLQPLNTFVKADE-DLLCFTMISSN-QIGIFGNLA 418

Query: 322 MMGHRIVFDRENLKLAWSHSKCEE 345
            M   + +D E   +++  + C +
Sbjct: 419 QMDFLVGYDLEKRTVSFKPTDCTK 442


>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
 gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
          Length = 472

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 65/257 (25%), Positives = 110/257 (42%), Gaps = 33/257 (12%)

Query: 39  DRNLSEYDP----SSSSSSKNVSCSHPLCK-----SRSSCKSL---KDPCPYIADYSTED 86
           D N+S  DP    +SS+S   + C+ P C      S ++C S       C Y   YST D
Sbjct: 121 DCNVSTNDPLFSSASSTSYTRIPCTSPFCSTSPGFSTNACGSSAVGSTTCLYNFSYST-D 179

Query: 87  TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
            SS+G +  D++ + +  K     S++ S  +GCGR+ T + L      G++G    D S
Sbjct: 180 YSSAGEMASDVVAMKTPRKTRGNKSLRMS--LGCGREST-TLLGILNTSGLVGFAKTDKS 236

Query: 147 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESY 205
               LA+             +  SG +  G+   ++  S S+ P+     A Y++G+ S 
Sbjct: 237 FIGQLAEMDYTSKFIYCVPSDTFSGKIVLGNYKISSHSSLSYTPMIVNSTALYYIGLRSI 296

Query: 206 CIGNS-------CLTQSGFQALVDSGASFTFLPTEIYAEVV-------VKFDKLVSSKRI 251
            I ++        L       ++DS  +F++   + Y  +V           K+ S++  
Sbjct: 297 SITDTLTFPVQGILADGTGGTIIDSTFAFSYFTPDSYTPLVQAIQNLNSNLTKVSSNETA 356

Query: 252 SLQGNSWKYCYNASSEE 268
           +L GN    CYN S  +
Sbjct: 357 ALLGN--DICYNVSVND 371


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/326 (23%), Positives = 130/326 (39%), Gaps = 46/326 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDP--CPYIADYSTEDTSSSGYLVD 95
           + PS+SSS ++VSC+   C+S         +C S  +P  C Y+ +Y  + + ++G L  
Sbjct: 105 FKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGS-SNPSTCNYVVNYG-DGSYTNGELGV 162

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
           + L     S         S  + GCGR   G +       G+MGLG   +S+ S      
Sbjct: 163 EALSFGGVSV--------SDFVFGCGRNNKGLF---GGVSGLMGLGRSYLSLVS--QTNA 209

Query: 156 LIQNSFSICF---DENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCI 207
                FS C    +   SGS+  G++    + +     T  L   +  + Y + +    +
Sbjct: 210 TFGGVFSYCLPTTEAGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDV 269

Query: 208 GNSCLTQ----SGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWK 259
           G   L           L+DSG   T LP+ +Y    AE + KF    S+   S+      
Sbjct: 270 GGVALKAPLSFGNGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSI----LD 325

Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
            C+N +  + + +P + L F  N    V      +   E+       L  +S   D  II
Sbjct: 326 TCFNLTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAII 385

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G       R+++D +  K+ ++   C
Sbjct: 386 GNYQQRNQRVIYDTKQSKVGFAEEPC 411


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 80/342 (23%), Positives = 141/342 (41%), Gaps = 63/342 (18%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS SSS   + C+HPLCK R       +SC  L   C Y   Y+ + T + G LV + 
Sbjct: 119 FDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSC-DLNRLCHYSYFYA-DGTLAEGNLVREK 176

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +  ++     P       +I+GC         D +   G++G+ LG +S  S   +A + 
Sbjct: 177 ITFSTSQSTPP-------LILGCAE-------DASDDKGILGMNLGRLSFAS---QAKIT 219

Query: 158 QNSFSICFDEND-------SGSVFFGDQ-GPATQQSTSFLPIGEKYD-------AYFVGV 202
           +  FS C            +GS + G+    A  Q  S L   +          A+ V +
Sbjct: 220 K--FSYCVPTRQVRPGFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVAL 277

Query: 203 ESYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KR 250
           +   IGN  L    S F        Q+++DSG+ FT+L    Y +V  +  +L     K+
Sbjct: 278 QGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKK 337

Query: 251 ISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVM 308
             +       C++ ++ E+ + + +M   F K    V+ +  + +   + G  V C+ + 
Sbjct: 338 GYVYSGVSDMCFDGNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLA---DVGGGVHCVGIG 394

Query: 309 STD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
            ++       IIG        + FD  N ++ +  + C   +
Sbjct: 395 RSEMLGAASNIIGNFHQQNLWVEFDIANRRVGFGKADCSRSV 436


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 105/268 (39%), Gaps = 51/268 (19%)

Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----N 168
           S++ +GC          GA+  G++G+    +S PS L+        FS CF +     N
Sbjct: 253 SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCFPDKIAHLN 308

Query: 169 DSGSVFFGD--------------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            SG VFFG+              Q PA   ++         D Y+VG+    +  S L  
Sbjct: 309 SSGLVFFGESDIISPYLRYTPLVQNPAVPSAS--------LDYYYVGLVGISVDESRLPL 360

Query: 213 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
                     T SG   ++DSG +FT+L    +  +  +F    S        + +  CY
Sbjct: 361 SHKNFDIDKVTGSG-GTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 419

Query: 263 NASSE----EMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTV-MSTDGDYG 315
           N +S     E   +P + L F      V+  +    P   +E  T  CL   MS D  + 
Sbjct: 420 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFN 479

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IIG        + +D E L+L  + ++C
Sbjct: 480 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 507


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 135/328 (41%), Gaps = 39/328 (11%)

Query: 45  YDPSSSSSSKNVSCS--------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           +DPS S+S K + C+        H  C+  SS  S K  C Y   Y  + + +SG L  +
Sbjct: 129 FDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALE 186

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L + S S H P S     ++IGCG    G +        ++GLG G +S PS L ++  
Sbjct: 187 SLSV-SLSDH-PSSLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSP 240

Query: 157 IQNSFSICFDEND-----SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESY 205
           I  SFS C  +       S ++ FG     ++      F P     ++    Y++G++  
Sbjct: 241 IGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGI 300

Query: 206 CIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
            I    L          T      ++DSG + T+L  + Y  V   F   +S  R     
Sbjct: 301 KIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PF 359

Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
           +    CYNA+    +  P + ++F       +    +    +      CL ++ TDG   
Sbjct: 360 DILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MS 418

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IIG         ++D ++ +L ++++ C
Sbjct: 419 IIGNFQQQNIHFLYDVQHARLGFANTDC 446


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 75/336 (22%), Positives = 135/336 (40%), Gaps = 50/336 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDDIL 98
           +DPS S + + +SC  P+C+    C ++ D       C +   Y  +  + SG LV D+ 
Sbjct: 169 HDPSKSRTFRRLSCFDPMCEL---CTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 224

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           H  + +       ++  V  GC   +    + G +  G++ LG+G    PS + + G+  
Sbjct: 225 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 277

Query: 159 NSFSICF-------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
           + FS C              +E  +  + FG     T +   F   G  Y      V  Y
Sbjct: 278 DRFSYCIPASEITDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV-VY 336

Query: 206 CIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRI 251
             G     Q       +G +A      LVDSG +  +LP  ++  +  + ++ +S ++R 
Sbjct: 337 QHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRY 396

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMST 310
            L   S  YCY  +  ++  V  + L F       +    +F   EN      CL V + 
Sbjct: 397 DLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA- 453

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            G+  I+G        + +D   +++A+   +C+ V
Sbjct: 454 -GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 488


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 77/342 (22%), Positives = 142/342 (41%), Gaps = 48/342 (14%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYL 93
           +N + YDP +S+S KN++C+   C   SS      CKS    CPY   Y     ++  + 
Sbjct: 207 QNGAFYDPKASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFA 266

Query: 94  VDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
           V+   ++L +    +   +V+ +++ GCG    G +   A    ++GLG G +S  S L 
Sbjct: 267 VETFTVNLTTNGGSSELYNVE-NMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL- 321

Query: 153 KAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQS----TSFLPIGEKY--DAYFVG 201
              L  +SFS C      D N S  + FG+            TSF+   E      Y+V 
Sbjct: 322 -QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQ 380

Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR- 250
           ++S  +    L          +      ++DSG + ++     Y  +  K  +    K  
Sbjct: 381 IKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYP 440

Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCL 305
           +         C+N S    +++P++ + F+          +++FP    F      + CL
Sbjct: 441 VYRDFPILDPCFNVSGIHNVQLPELGIAFADGA-------VWNFPTENSFIWLNEDLVCL 493

Query: 306 TVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            ++ T    + IIG        I++D +  +L ++ +KC ++
Sbjct: 494 AMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCADI 535


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 75/336 (22%), Positives = 135/336 (40%), Gaps = 50/336 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDDIL 98
           +DPS S + + +SC  P+C+    C ++ D       C +   Y  +  + SG LV D+ 
Sbjct: 148 HDPSKSRTFRRLSCFDPMCEL---CTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 203

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           H  + +       ++  V  GC   +    + G +  G++ LG+G    PS + + G+  
Sbjct: 204 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 256

Query: 159 NSFSICF-------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
           + FS C              +E  +  + FG     T +   F   G  Y      V  Y
Sbjct: 257 DRFSYCIPASEITDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV-VY 315

Query: 206 CIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRI 251
             G     Q       +G +A      LVDSG +  +LP  ++  +  + ++ +S ++R 
Sbjct: 316 QHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRY 375

Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMST 310
            L   S  YCY  +  ++  V  + L F       +    +F   EN      CL V + 
Sbjct: 376 DLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA- 432

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
            G+  I+G        + +D   +++A+   +C+ V
Sbjct: 433 -GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 467


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 90/351 (25%), Positives = 141/351 (40%), Gaps = 66/351 (18%)

Query: 28  CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTE 85
           C L FG    QD  +  YD ++SSS   + CS   C     S C +    C Y       
Sbjct: 114 CKLCFG----QDTPI--YDTTTSSSFSPLPCSSATCLPIWSSRCSTPSATCRYR------ 161

Query: 86  DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
                 Y  DD     ++S      SV   +  GCG    G   +     G +GLG G +
Sbjct: 162 ------YAYDD----GAYSPECAGISV-GGIAFGCGVDNGGLSYNST---GTVGLGRGSL 207

Query: 146 SVPSLLAKAGLIQNSFSIC----FDENDSGSVFFGDQGPATQ----------QSTSFLPI 191
           S   L+A+ G+    FS C    F+ + S  VFFG                 QST  +  
Sbjct: 208 S---LVAQLGV--GKFSYCLTDFFNTSLSSPVFFGSLAELAASSASADAAVVQSTPLVQS 262

Query: 192 GEKYDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVV 240
                 Y+V +E   +G++ L                  +VDSG  FT L  E    VVV
Sbjct: 263 PYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIFTIL-VETGFRVVV 321

Query: 241 KFDKLVSSKRISLQGNSWKYCYNASS---EEMLKVPDMRLIFSKNQSFVV-RNHIFSFPE 296
                V  + +    +  + C+ A +   +E+  +PDM L F+      + R++  SF E
Sbjct: 322 DHVAGVLGQPVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNE 381

Query: 297 NEGFTVFCLTVMSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
            E  + FCL ++ T+   G +  NF   + +++FD    +L++  + C ++
Sbjct: 382 EE--SSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSFMPTDCSKL 430


>gi|328865865|gb|EGG14251.1| hypothetical protein DFA_12021 [Dictyostelium fasciculatum]
          Length = 698

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 68/281 (24%), Positives = 117/281 (41%), Gaps = 36/281 (12%)

Query: 132 AAP---DGVMGLGL-------GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPA 181
           AAP   DG+MGL         GD  + SLL K   I NSFS+C  + + G +  G   P 
Sbjct: 246 AAPRKRDGIMGLSYQSLDPNNGD-DIFSLLVKTHEIHNSFSMCLSD-EGGMLVLGGVDPK 303

Query: 182 TQQS-TSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQ--ALVDSGASFTFLPTEIYAE 237
              +   + PI  E+Y  Y V      I  + L    FQ  ++VDSG +  FL  +I+ +
Sbjct: 304 MNSTLMKYTPITNERY--YSVNCTGLRIDGNNLNSKSFQSISIVDSGTTIMFLKLDIFND 361

Query: 238 VVVKFDKLVSS-KRISLQGNS-WKY-CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
           ++    +  S    I+ Q  S W + C+  S  ++ K P + ++F   +  +      + 
Sbjct: 362 LIYYLVQHYSHLPGITTQSESLWNHQCFTLSDRQLEKYPTISMVFPNTEGGLFE---VAI 418

Query: 295 PEN------EGFTVFCLTVMSTDGDYGI-IGQNFMMGHRIVFDRENLKLAWSH--SKCEE 345
           P N      +    F    +     Y + IG   + G+ + ++RE+  + ++     C  
Sbjct: 419 PPNLYMIKIDDMYCFGFEKLPIKSPYSVLIGDVALQGYNVHYNREDGSIGFAKVTDNCGM 478

Query: 346 VIDKS--HVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
             D +  HV ++     Q  + L        +NG+    P+
Sbjct: 479 GQDNNQYHVEMISEEV-QENDSLVVKIHAIDANGRDGGAPN 518


>gi|66817422|ref|XP_642564.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
 gi|60470632|gb|EAL68608.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
          Length = 492

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 129/328 (39%), Gaps = 43/328 (13%)

Query: 41  NLSEYDPSSSSSSKNVSCSHPLC----KSRSSCK---SLKDPCPYIADYSTEDTSSSGYL 93
           N   YDP+ SSSS+ + CS   C     +  SCK   + K  C +I  Y   D S     
Sbjct: 133 NRPVYDPALSSSSQLIPCSSDKCLGSGSASPSCKLHQNAKSTCDFIILYG--DGSK---- 186

Query: 94  VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG-------LGDVS 146
               +    FS     S V S++  G   ++ G++ +    DG+MGLG       L    
Sbjct: 187 ----IKGKVFSDEITVSGVSSTIYFGANVEEVGAF-EYPRADGIMGLGRTSNNKNLVPTI 241

Query: 147 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESY 205
             S++     I+N F I  D +  G +  G         S  + PI      Y +   S+
Sbjct: 242 FDSMVRSNSSIKNIFGIYLDYHGQGYLSLGKINHHYYIGSIQYTPIQPAGPFYAIKPTSF 301

Query: 206 CIGNSCL-TQSGFQALVDSGASFTFLPTEIYAEVVVKF-------DKLVSSKRISLQGNS 257
            + N+     S  Q +VDSG S   L + +Y  ++  F       D + S   I     S
Sbjct: 302 RVDNTSFPANSMGQVIVDSGTSDLILTSRVYDHLIQYFRKHYCHIDMVCSYPSIF----S 357

Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPEN-EGFTVFCLTVMSTDGDY 314
            + C+    E+    P +   F       +  +N++     N +G   +C  +   D D 
Sbjct: 358 SRVCF-EKEEDFATFPWLHFGFEGGVRIAIPPKNYMIKTESNQQGVYGYCWGIDRGD-DM 415

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSK 342
            I+G  FM G+  +FD    ++ ++  K
Sbjct: 416 TILGDVFMRGYYTIFDNIENRVGFAIGK 443


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 78/326 (23%), Positives = 144/326 (44%), Gaps = 56/326 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLA 101
           +DPS+S++   + C+   C +   S +S  DP  C Y   Y  + + ++GYL  D + + 
Sbjct: 122 FDPSNSTTFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYG-DHSYTTGYLASDTVTVG 180

Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +       +SVQ  +V  GCG +  G++ +  +  G++GLG G++S  S L     I   
Sbjct: 181 N-------ASVQIRNVAFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IGKK 229

Query: 161 FSICF------------DENDSGSVFFGDQGPATQQSTSFL-----PIGEKYDA--YFVG 201
           FS C             D   +  + FGD    +  ST+ +     P+  K  +  Y++ 
Sbjct: 230 FSYCLLPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLT 289

Query: 202 VESYCIGNSCLT-----------QSGFQA-------LVDSGASFTFLPTEIYAEVVVKFD 243
           +E+  +G   L             SG ++       ++DSG + TFL  E Y  +     
Sbjct: 290 IEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALV 349

Query: 244 KLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV 302
           + +  +R++   NS +  C+ +  EE +++P M++ F       ++         EG   
Sbjct: 350 EEIKMERVNDVKNSMFSLCFKSGKEE-VELPLMKVHFRGGADVELKPVNTFVRAEEGLVC 408

Query: 303 FCLTVMSTDGDYGIIGQ-NFMMGHRI 327
           F +   +  G YG + Q NF++G+ +
Sbjct: 409 FTMLPTNDVGIYGNLAQMNFVVGYDL 434


>gi|407926291|gb|EKG19258.1| Peptidase A1 [Macrophomina phaseolina MS6]
          Length = 477

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 53/225 (23%), Positives = 99/225 (44%), Gaps = 49/225 (21%)

Query: 146 SVPSLLAKAGLIQ-NSFSICFDENDS--GSVFFGDQGPATQQST-SFLPIGEKYDAYFVG 201
           ++P L+   G+IQ N++S+  ++ D+  GS+ FG         T + LPI ++Y +Y   
Sbjct: 200 NLPQLMVDKGIIQSNAYSLWLNDLDASRGSILFGGVDTEKYHGTLATLPIIQEYGSY--- 256

Query: 202 VESYCIGNSCLTQSGFQA---------------LVDSGASFTFLPTEIYAEVVVKFDKLV 246
              + I  + L  +G                  L+DSG+S T+LP  + A +   FD   
Sbjct: 257 -REFIIALTGLGANGNNGSYFSSNDSSSNVVPVLLDSGSSLTYLPDSVVANIYSDFDATY 315

Query: 247 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE-----GFT 301
            S+    QG ++  C  A+S++ L+             F   +   S P NE     G++
Sbjct: 316 DSE----QGAAFVDCDKANSDDTLE-------------FTFSSPTISVPMNELVLLAGYS 358

Query: 302 ---VFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
                C+  ++  GD   ++G  F+    +V+D  N +++ + + 
Sbjct: 359 RGQAICILGIAPAGDSTSVLGDTFLRSAYVVYDLANNEISLAQTN 403


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 78/322 (24%), Positives = 133/322 (41%), Gaps = 43/322 (13%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRS---SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           + ++P++S S + V C  P C SR+   SC      C +   Y+  D+S    L  D L 
Sbjct: 146 TPFNPAASKSYRAVPCGSPAC-SRAPNPSCSLNTKSCGFSLTYA--DSSLEAALSQDSLA 202

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           +A        + V  S   GC +K TG+      P G++GLG G +S   L     + + 
Sbjct: 203 VA--------NDVVKSYTFGCLQKATGT---ATPPQGLLGLGRGPLSF--LSQTKDMYEG 249

Query: 160 SFSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           +FS C       N SG++  G +G P   ++T  L    +   Y+V +    +G   +  
Sbjct: 250 TFSYCLPSFKSLNFSGTLRLGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPI 309

Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
                     +G   ++DSG  FT L    Y  V  +  + +    +S  G  +  CYN 
Sbjct: 310 PPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLGG-FDTCYNT 368

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNF 321
           +    +K P +  +F+  Q  +  +++       G T       + DG      +I    
Sbjct: 369 T----VKWPPVTFMFTGMQVTLPADNLV-IHSTYGTTSCLAMAAAPDGVNTVLNVIASMQ 423

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
              HRI+FD  N ++ ++  +C
Sbjct: 424 QQNHRILFDVPNGRVGFAREQC 445


>gi|403343737|gb|EJY71200.1| Aspartic protease PM5 [Oxytricha trifallax]
          Length = 518

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 70/298 (23%), Positives = 123/298 (41%), Gaps = 33/298 (11%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
           +D C +   Y  E +S SG+LV D ++      H    +   +   GC  ++T  +    
Sbjct: 64  QDKCMFNQRYG-EGSSYSGFLVKDQVYFGD-KYHDKDDAF--NFTFGCVAEETHLFYSQE 119

Query: 133 APDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQS 185
           A DG++G+     S PS+      + +  LI +  FS+C  +N       G  G +    
Sbjct: 120 A-DGILGM-TRRTSNPSMKPIYESMYENNLIDKKMFSLCLGKNGGYFQLGGFDGQSHLDD 177

Query: 186 TSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
             +LP+ +K   Y + ++   + N  ++  +S  Q  +DSG +FT++P ++   +   FD
Sbjct: 178 VLWLPLIDK-STYIIKLQGISMNNHMMSGIESITQGFIDSGTTFTYIPQKLIDTLKQHFD 236

Query: 244 KL--------VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR---NHIF 292
                        KRI  Q    + C+  + E+    P          +F V    N + 
Sbjct: 237 WFCKVDPENNCKGKRIDPQ-QEQQICFEYNEEQNPDGPKKFFQSYPLLTFKVDDNGNTLD 295

Query: 293 SFPENEGFT----VFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
            +P    +      +CL +  T   D  I+G  FM     +FD EN K+  + + C E
Sbjct: 296 WYPSEYLYRDQKHKYCLAIEVTQRPDQIILGGTFMRQKNFIFDVENNKVGIARASCNE 353


>gi|24647679|ref|NP_650621.1| CG17283 [Drosophila melanogaster]
 gi|7300253|gb|AAF55416.1| CG17283 [Drosophila melanogaster]
          Length = 465

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 74/318 (23%), Positives = 127/318 (39%), Gaps = 56/318 (17%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS----------SGYLVDDILHL 100
           + S N+    P CKS++ CK  K   P  +    ++  S          +G L  D + +
Sbjct: 169 TGSSNIWVPGPHCKSKA-CKKHKQYHPAKSSTYVKNGKSFAITYGSGSVAGVLAKDTVRI 227

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN- 159
           A          V ++       K+ G+    +  DG++GLG   ++V ++     L+QN 
Sbjct: 228 AGL--------VVTNQTFAMTTKEPGTTFVTSNFDGILGLGYRSIAVDNVKT---LVQNM 276

Query: 160 ---------SFSICFDENDSG----SVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
                     F+IC     S     ++ FG    +      S ++ P+ +K    F   +
Sbjct: 277 CSEDVITSCKFAICMKGGGSSSRGGAIIFGSSNTSAYSGSNSYTYTPVTKKGYWQFTLQD 336

Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
            Y +G + ++ S  QA+VDSG S    PT IY     K +K++   R +  G  W  C  
Sbjct: 337 IY-VGGTKVSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGC-RATSSGECWMKCAK 389

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMSTDGDYGIIGQNF 321
                  K+PD   + +  + FVV+ +        N G TV    V     +  I+G  F
Sbjct: 390 -------KIPDFTFVIA-GKKFVVKGNKMKLKVRTNRGRTVCISAVTEVPDEPVILGDAF 441

Query: 322 MMGHRIVFDRENLKLAWS 339
           +      FD  N ++ ++
Sbjct: 442 IRHFCTEFDLANNRIGFA 459


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 69/318 (21%), Positives = 129/318 (40%), Gaps = 31/318 (9%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           ++P  SS+  N+SC    C S +   C  + + C Y   Y  + +S+ G L  + +H  S
Sbjct: 132 FEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYG-DGSSTKGVLCTESIHFGS 190

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
            +   P++      I GCG      +       G++GLG G +S+ S L     I + FS
Sbjct: 191 QTVTFPKT------IFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFS 242

Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLP--IGEKYDA-YFVGVESYCIGNSCLT--- 213
            C   F    +  + FG+    T       P  I   Y + YF+ +    IG   L    
Sbjct: 243 YCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRT 302

Query: 214 --QSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 270
              +    ++D G   T+L    Y   V    + L  S+        + +C+   ++  +
Sbjct: 303 TDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCF--PNQANI 360

Query: 271 KVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRI 327
             P +   F+  + F+  +N  F F   +   + CL V+       + + G    +  ++
Sbjct: 361 TFPKIVFQFTGAKVFLSPKNLFFRF---DDLNMICLAVLPDFYAKGFSVFGNLAQVDFQV 417

Query: 328 VFDRENLKLAWSHSKCEE 345
            +DR+  K++++ + C +
Sbjct: 418 EYDRKGKKVSFAPADCSK 435


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 141/338 (41%), Gaps = 57/338 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           Y+P  SSS   + CS  LC+    S  +C +  + C Y   Y + +  + G L  +    
Sbjct: 133 YEPRRSSSFAYLPCSDRLCQEGQFSYKNC-ARNNRCMYDELYGSAE--AGGVLASETFTF 189

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
              +K      V   +  GCG    G  L GA+  G+MGL  G +S+ S L+        
Sbjct: 190 GVNAK------VSLPLGFGCGALSAGD-LVGAS--GLMGLSPGIMSLVSQLSVP-----R 235

Query: 161 FSIC---FDENDSGSVFFGD-------QGPATQQSTSFL--PIGEKYDAYFVGVESYCIG 208
           FS C   F E  +  + FG        +   T Q+TS L  P  E    Y+V +    +G
Sbjct: 236 FSYCLTPFAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETA-YYYVPLVGLSLG 294

Query: 209 NSCL----TQSGF-------QALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQ 254
              L    T  G          +VDSG++ ++L    +  V   VV+  +L  +      
Sbjct: 295 TKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDED 354

Query: 255 GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMST 310
            + ++ C+      + E +K P + L F    +  + R++ F  P      + CL V ++
Sbjct: 355 YDDYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRA---GLMCLAVGTS 411

Query: 311 DGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              +G  IIG        ++FD  N K +++ +KC+++
Sbjct: 412 PDGFGVSIIGNVQQQNMHVLFDVRNQKFSFAPTKCDDI 449


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 121/311 (38%), Gaps = 41/311 (13%)

Query: 49  SSSSSKNVSCSHPLCKSRSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
           +SSS K + C+   C   SS       ++ C Y  +Y  + + +SG +  D +   S   
Sbjct: 53  ASSSYKKLPCNSTHCSGMSSAGIGPRCEETCKYKYEYG-DGSRTSGDVGSDRISFRSHGA 111

Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
                S     + GC RK  G   D     G++GLG    S+   L     +   FS C 
Sbjct: 112 GEDHRSFFDGFLFGCARKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCL 166

Query: 166 DENDS-----GSVFFGDQGPATQQSTSFLPI--GEKYDA--YFVGVESYCIGNSCLT--- 213
              DS       +F G             PI  G+  D   Y+V ++S  IG   +    
Sbjct: 167 VSYDSPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYD 226

Query: 214 -QSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 260
            +SG           + ++DSG ++T L   +Y  +    ++ V    +   GNS     
Sbjct: 227 KESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTL---GNSAGLDL 283

Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
           C+N+S +     P +   F+     V+   +IF     +   V CL++ S+ GD  IIG 
Sbjct: 284 CFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRD---VVCLSMDSSGGDLSIIGN 340

Query: 320 NFMMGHRIVFD 330
                  I++D
Sbjct: 341 MQQQNFHILYD 351


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 126/313 (40%), Gaps = 34/313 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP+ SS+ + VSC+   C    +  + C +    C Y   Y  + ++++G    D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +  S              GC   ++G + D    DG+MGLG G  S+ S  A A    NS
Sbjct: 230 SGASDAV------KGFQFGCSHLESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278

Query: 161 FSICFDENDSGSVFFGDQGPATQQ----STSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 214
           FS C     SGS  F   G         +T  L   +    Y   ++   +G     L+ 
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSP 337

Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
           S F A  +VDSG   T LP   Y+ +   F   +   R +   +    C++ + +  + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFD 330
           P + L+FS   +  +  +   +         CL   +T  DG  GIIG        +++D
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYGN-------CLAFAATGDDGTTGIIGNVQQRTFEVLYD 450

Query: 331 RENLKLAWSHSKC 343
             +  L +    C
Sbjct: 451 VGSSTLGFRSGAC 463


>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
          Length = 154

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 64/135 (47%), Gaps = 10/135 (7%)

Query: 113 QSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDE 167
           +  +  GCG KQ        +P    DG++GLG+G     + L    +I  N    C   
Sbjct: 6   KKKIAFGCGYKQEEP---ADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62

Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGAS 226
              G ++ GD  P ++  T ++P+ E    Y  G+    I N  +  +  F+A+ DS ++
Sbjct: 63  KGKGVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSDST 121

Query: 227 FTFLPTEIYAEVVVK 241
           +T +P +IY E+V K
Sbjct: 122 YTHVPAQIYNEIVSK 136


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 79/356 (22%), Positives = 137/356 (38%), Gaps = 70/356 (19%)

Query: 35  SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR---------SSCKSLKDPCPYIADYSTE 85
           S    + +  +DP  SSSSK + C +P C S            C      C Y   YST+
Sbjct: 118 SAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQ 177

Query: 86  --DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
               +SSGY + + L         P+ +++ + ++GC    T S     + D + G G  
Sbjct: 178 YGTGASSGYFLLENLKF-------PRKTIR-NFLLGC----TTSAARELSSDALAGFGRS 225

Query: 144 DVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA 197
             S+P  +         F+ C + +D      SG +   D      +  S+ P  +   A
Sbjct: 226 MFSLPIQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYRDGKTKGLSYTPFLKSPPA 279

Query: 198 ----YFVGVESYCIGNSCLT------------QSGFQALVDSG-ASFTFLPTEIYAEVVV 240
               Y +GV+   IGN  L             +SG   ++DSG     ++   ++  V  
Sbjct: 280 SAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSG--VIIDSGYGGAGYMTGPVFKIVTN 337

Query: 241 KFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPE 296
           +  K +S  R SL+  +      CYN +  + +K+P +   F    + VV   + F    
Sbjct: 338 ELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISP 397

Query: 297 NEGFTVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
            E    F   +M T+G            I+G +  + + + +D +N +  +    C
Sbjct: 398 QESLACF---LMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 450


>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 435

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 64/252 (25%), Positives = 104/252 (41%), Gaps = 40/252 (15%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QSSVQSSVIIGCGRKQTGSYLDG 131
            + C    D S   T++SG L +D+L + S +   P Q+ V S  +  C        L  
Sbjct: 115 NNTCGVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVSRFLFSCAPTFLLKGLAT 174

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPA--------TQ 183
            A  G+ GLG   +++PS LA A      F+IC   +  G V FGD GP           
Sbjct: 175 GA-SGMAGLGRTKIALPSQLASAFSFARKFAICLSSSK-GVVLFGD-GPYGFLPNVVFDS 231

Query: 184 QSTSFLPI-------------GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA----- 225
            S ++ P+             G+    YF+GV++  I    ++ +     +D+       
Sbjct: 232 DSLTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTK 291

Query: 226 -----SFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNASSEEML--KVPDMR 276
                 +T L   IY  V   F K  +++ I   G+   +++CY   +   L   VP + 
Sbjct: 292 ISTVDPYTVLEASIYKAVTDAFVKASAARNIKRVGSVAPFEFCYTNLTGTRLGAAVPTIE 351

Query: 277 LIFSKNQSFVVR 288
           L F +N++ V R
Sbjct: 352 L-FLQNENVVWR 362


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 76/324 (23%), Positives = 134/324 (41%), Gaps = 54/324 (16%)

Query: 56  VSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           + C+HPLCK R    SL   C      + + +  + T + G LV + +  +      P  
Sbjct: 138 LPCNHPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPP-- 195

Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND- 169
                +I+GC  +   +        G++G+ LG +  PS   +A + + S+ +   +   
Sbjct: 196 -----IILGCATQSDDA-------RGILGMNLGRLGFPS---QAKITKFSYCVPTKQAQP 240

Query: 170 -SGSVFFGDQGPATQ--QSTSFLPIGEKYD-------AYFVGVESYCIGNSCLT------ 213
            SGS + G+  PA+   +  + L  G+          AY + ++   IG   L       
Sbjct: 241 ASGSFYLGNN-PASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVF 299

Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASS 266
                 SG Q ++DSG+ FT+L  E Y  +  +  K V    K+  + G     C++  +
Sbjct: 300 KPNAGGSG-QTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDA 358

Query: 267 EEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD--GDYGIIGQNFMM 323
            E+ + V DM   F K    V+         + G  V CL +  ++  G  G I  NF  
Sbjct: 359 IEIGRLVGDMVFEFEKGVQIVIPKERVLATVDGG--VHCLGMGRSERLGAGGNIIGNFHQ 416

Query: 324 GHRIV-FDRENLKLAWSHSKCEEV 346
            +  V FD  N ++ +  + C ++
Sbjct: 417 QNLWVEFDLANRRVGFGEADCSKL 440


>gi|304361786|gb|ADM26243.1| MIP25078p [Drosophila melanogaster]
          Length = 467

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 74/318 (23%), Positives = 127/318 (39%), Gaps = 56/318 (17%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS----------SGYLVDDILHL 100
           + S N+    P CKS++ CK  K   P  +    ++  S          +G L  D + +
Sbjct: 171 TGSSNIWVPGPHCKSKA-CKKHKQYHPAKSSTYVKNGKSFAITYGSGSVAGVLAKDTVRI 229

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN- 159
           A          V ++       K+ G+    +  DG++GLG   ++V ++     L+QN 
Sbjct: 230 AGL--------VVTNQTFAMTTKEPGTTFVTSNFDGILGLGYRSIAVDNVKT---LVQNM 278

Query: 160 ---------SFSICFDENDSG----SVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
                     F+IC     S     ++ FG    +      S ++ P+ +K    F   +
Sbjct: 279 CSEDVITSCKFAICMKGGGSSSRGGAIIFGSSNTSAYSGSNSYTYTPVTKKGYWQFTLQD 338

Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
            Y +G + ++ S  QA+VDSG S    PT IY     K +K++   R +  G  W  C  
Sbjct: 339 IY-VGGTKVSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGC-RATSSGECWMKCAK 391

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMSTDGDYGIIGQNF 321
                  K+PD   + +  + FVV+ +        N G TV    V     +  I+G  F
Sbjct: 392 -------KIPDFTFVIA-GKKFVVKGNKMKLKVRTNRGRTVCISAVTEVPDEPVILGDAF 443

Query: 322 MMGHRIVFDRENLKLAWS 339
           +      FD  N ++ ++
Sbjct: 444 IRHFCTEFDLANNRIGFA 461


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 82/352 (23%), Positives = 146/352 (41%), Gaps = 67/352 (19%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
           ++ +DP+ SSS   + CS P C++R+       SC S K  C     Y+ + +SS G L 
Sbjct: 110 VNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDK-LCHATLSYA-DASSSEGNLA 167

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAK 153
            +I H  +       S+  S++I GC    +GS   +     G++G+  G +   S +++
Sbjct: 168 AEIFHFGN-------STNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSL---SFISQ 217

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
            G  + S+ I   ++  G +  GD            P  + ST  LP  ++  AY V + 
Sbjct: 218 MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTP-LPYFDRV-AYTVQLT 275

Query: 204 SYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-------L 245
              +    L           T +G Q +VDSG  FTFL   +Y  +   F         +
Sbjct: 276 GIKVNGKLLPIPKSVLLPDHTGAG-QTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTV 334

Query: 246 VSSKRISLQGNSWKYCYNASSEEML-----KVPDMRLIFSKNQSFVV-RNHIFSFPE--- 296
                   QG +   CY  S   +      ++P + L+F   +  V  +  ++  P    
Sbjct: 335 YEDPEFVFQG-TMDLCYRISPFRIRTGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTA 393

Query: 297 -NEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
            N+  +V+C T  ++D    +  +IG +      I FD +  ++  +  +C+
Sbjct: 394 GND--SVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVQCD 443


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 74/317 (23%), Positives = 127/317 (40%), Gaps = 37/317 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
           +DP+ SSS   V C    C +    C      C Y  +Y  + +S++G L  + L  +S 
Sbjct: 155 FDPAKSSSYAVVPCGTTECAAAGGECNGTT--CVYGVEYG-DGSSTTGVLARETLTFSS- 210

Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 SS  +  I GCG    G +  +DG    G   L L   + P+     G+    F
Sbjct: 211 ------SSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAF---GGI----F 257

Query: 162 SICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL---- 212
           S C    ++  G +  G      Q    +  +  K D    YF+ + S  IG   L    
Sbjct: 258 SYCLPSYNTTPGYLSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPP 317

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T++G   L+DSG   T+LP   Y  +  +F   +   + +   +    CY+ + +  
Sbjct: 318 SEFTKTG--TLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSG 375

Query: 270 LKVPDMRLIFSKNQSFVVRNH-IFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHR 326
           + +P +   FS    F +    I +FP++    V CL  +S   D  + ++G        
Sbjct: 376 ILIPGVSFNFSDGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAE 435

Query: 327 IVFDRENLKLAWSHSKC 343
           +++D    K+ +  + C
Sbjct: 436 VIYDVPAQKIGFIPASC 452


>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 84/346 (24%), Positives = 127/346 (36%), Gaps = 89/346 (25%)

Query: 69  CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 128
           C +   PCP    Y+  D S    L  D L L S S         ++   GC        
Sbjct: 172 CNTSSYPCPPFY-YAYGDGSLVAKLFSDSLSLPSVS--------VANFTFGCAHTTL--- 219

Query: 129 LDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDEN--DSGSVFFGDQGPATQQS 185
              A P GV G G G +S+P+ L+  +  + NSFS C   +  DS  V    + P+    
Sbjct: 220 ---AEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRV----RRPSPLIL 272

Query: 186 TSFLPIGEKYDA--------------------------------YFVGVESYCIGNSCLT 213
             F+   EK  A                                Y V ++   IG   + 
Sbjct: 273 GRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKHPYFYSVSLQGISIGKRNIP 332

Query: 214 QSGFQALVD----------SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---- 259
                  +D          SG +FT LP + Y  VV +FD  V   R+  + +  +    
Sbjct: 333 APAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVG--RVHERADRVEPSSG 390

Query: 260 --YCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPE-----NEGFTVFCLTVMS 309
              CY  +  + +KVP + L F+ N S V    RN+ + F +      E   V CL +M+
Sbjct: 391 MSPCYYLN--QTVKVPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEEKRKVGCLMLMN 448

Query: 310 -------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
                    G   I+G     G  +V+D  N ++ ++  KC  + D
Sbjct: 449 GGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWD 494


>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
          Length = 435

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 64/252 (25%), Positives = 104/252 (41%), Gaps = 40/252 (15%)

Query: 73  KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QSSVQSSVIIGCGRKQTGSYLDG 131
            + C    D S   T++SG L +D+L + S +   P Q+ V S  +  C        L  
Sbjct: 115 NNTCGVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVSRFLFSCAPTFLLKGLAT 174

Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPA--------TQ 183
            A  G+ GLG   +++PS LA A      F+IC   +  G V FGD GP           
Sbjct: 175 GA-SGMAGLGRTKIALPSQLASAFSFARKFAICLSSSK-GVVLFGD-GPYGFLPNVVFDS 231

Query: 184 QSTSFLPI-------------GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA----- 225
            S ++ P+             G+    YF+GV++  I    ++ +     +D+       
Sbjct: 232 DSLTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTK 291

Query: 226 -----SFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNASSEEML--KVPDMR 276
                 +T L   IY  V   F K  +++ I   G+   +++CY   +   L   VP + 
Sbjct: 292 ISTVDPYTVLEASIYKAVTDAFVKAPAARNIKRVGSVAPFEFCYTNLTGTRLGAAVPTIE 351

Query: 277 LIFSKNQSFVVR 288
           L F +N++ V R
Sbjct: 352 L-FLQNENVVWR 362


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 84/205 (40%), Gaps = 27/205 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS+S + K++SC+   C S          C++  + C Y A Y  + + S GYL  D+
Sbjct: 161 FDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDL 219

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L LA        S      + GCG+   G +   A   G++GLG   +S+  L   +   
Sbjct: 220 LTLA-------PSQTLPGFVYGCGQDSDGLFGRAA---GILGLGRNKLSM--LGQVSSKF 267

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQ 214
             +FS C      G      +      +  F P+         YF+ + +  +G   L  
Sbjct: 268 GYAFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV 327

Query: 215 SGFQ----ALVDSGASFTFLPTEIY 235
           +  Q     ++DSG   T LP  +Y
Sbjct: 328 AAAQYRVPTIIDSGTVITRLPMSVY 352


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 163/371 (43%), Gaps = 64/371 (17%)

Query: 5   ICFGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCS 59
           +  G+ A  Y+A++    + L+W     C + F      D+    +DP  SSS   + CS
Sbjct: 101 LAIGTPAETYSAIMDTG-SDLIWTQCKPCKVCF------DQPTPIFDPEKSSSFSKLPCS 153

Query: 60  HPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 117
             LC +   SSC    D C Y   Y  + +S+ G L  +       S         S + 
Sbjct: 154 SDLCVALPISSC---SDGCEYRYSYG-DHSSTQGVLATETFTFGDAS--------VSKIG 201

Query: 118 IGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVF 174
            GCG    G +Y  GA   G++GLG G +   SL+++ G+ + S+ +   ++  G  ++ 
Sbjct: 202 FGCGEDNRGRAYSQGA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLL 255

Query: 175 FGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LV 221
            G +  AT +S    P+ +   +   Y++ +E   +G++ L   +S F          ++
Sbjct: 256 VGSE--ATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLII 313

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNASSE-EMLKVPDMRL 277
           DSG + T+L    +A +  +F   +S  ++ +  +     + C+    +   + VP +  
Sbjct: 314 DSGTTITYLKDSAFAALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVF 370

Query: 278 IFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DRENLK 335
            F   +      N+I    E+    V CLT+ S+ G   I G NF   + +V  D E   
Sbjct: 371 HFEGVDLKLPKENYII---EDSALRVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEKET 425

Query: 336 LAWSHSKCEEV 346
           ++++ ++C ++
Sbjct: 426 ISFAPAQCNQL 436


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/338 (22%), Positives = 135/338 (39%), Gaps = 52/338 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD------PCPYIADYSTEDTSSSGYLVDDIL 98
           +DPS S + + +SC  P+C+    C ++ D       C +   Y  +  + SG LV D+ 
Sbjct: 168 HDPSKSRTFRRLSCFDPMCE---LCTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 223

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           H  + +       ++  V  GC   +    + G +  G++ LG+G    PS + + G+  
Sbjct: 224 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 276

Query: 159 NSFSICF---------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
           + FS C                +E  +  + FG     T +   F   G  Y      V 
Sbjct: 277 DRFSYCIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV- 335

Query: 204 SYCIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SK 249
            Y  G     Q       +G +A      LVDSG +  +LP  ++  +  + ++ +S ++
Sbjct: 336 VYQHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTR 395

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVM 308
           R  L   S  YCY  +  ++  V  + L F       +    +F   EN      CL V 
Sbjct: 396 RYDLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA 453

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +  G+  I+G        + +D   +++A+   +C+ V
Sbjct: 454 A--GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 489


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 83/352 (23%), Positives = 147/352 (41%), Gaps = 67/352 (19%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
           ++ +DP+ SSS   + CS P C++R+       SC S K  C     Y+ + +SS G L 
Sbjct: 110 VNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKL-CHATLSYA-DASSSEGNLA 167

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAK 153
            +I H  +       S+  S++I GC    +GS   +     G++G+  G +   S +++
Sbjct: 168 AEIFHFGN-------STNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSL---SFISQ 217

Query: 154 AGLIQNSFSICFDENDSGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
            G  + S+ I   ++  G +  GD            P  + ST  LP  ++  AY V + 
Sbjct: 218 MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTP-LPYFDRV-AYTVQLT 275

Query: 204 SYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-------L 245
              +    L           T +G Q +VDSG  FTFL   +Y  +   F         +
Sbjct: 276 GIKVNGKLLPIPKSVLVPDHTGAG-QTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTV 334

Query: 246 VSSKRISLQGNSWKYCYNAS-----SEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPE--- 296
                   QG +   CY  S     S  + ++P + L+F   +  V  +  ++  P    
Sbjct: 335 YEDPDFVFQG-TMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTV 393

Query: 297 -NEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
            N+  +V+C T  ++D    +  +IG +      I FD +  ++  +  +C+
Sbjct: 394 GND--SVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECD 443


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 96/227 (42%), Gaps = 23/227 (10%)

Query: 25  LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 84
           L+W   +       + +L  +DP  SS+ KNV C    C+  ++       C Y  D   
Sbjct: 121 LVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRH 180

Query: 85  EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 144
           +D+   G L  D L L S +    +S +  +    CG +  G Y       G++GLG G 
Sbjct: 181 QDSCPDGDLAMDTLTLNSTTG---KSFMLPNTGFICGNRIGGDY----PGVGILGLGHGS 233

Query: 145 VSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAY 198
           +S+ + ++   LI   FS C   +  N +  + FGD+   +     ST     G  Y +Y
Sbjct: 234 LSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPY-SY 290

Query: 199 FVGVESYCIGNSCLTQSGFQAL-------VDSGASFTFLPTEIYAEV 238
            +      +GN  ++  G  +        +DSG  FT+ P   Y+++
Sbjct: 291 TLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYFPEYFYSQL 337


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 83/350 (23%), Positives = 141/350 (40%), Gaps = 56/350 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           + P +S++   V C    C SR      SC +    C     Y+ + ++S G L  D+  
Sbjct: 102 FRPRASATFAAVPCGSARCSSRDLPAPPSCDAASRRCRVSLSYA-DGSASDGALATDVFA 160

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
           +      AP   ++S+   GC      S  D  A  G++G+  G +S    + +A     
Sbjct: 161 VGD----AP--PLRSA--FGCMSAAYDSSPDAVATAGLLGMNRGALS---FVTQAS--TR 207

Query: 160 SFSICF-DENDSGSVFFGDQG--------PATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
            FS C  D +D+G +  G               Q T  LP  ++  AY V +    +G  
Sbjct: 208 RFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRV-AYSVQLLGIRVGGK 266

Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
            L           T +G Q +VDSG  FTFL  + Y+ V  +F K       +L+  S+ 
Sbjct: 267 PLPIPPSVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFA 325

Query: 260 Y------CYN---ASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGFT-VFCLTV 307
           +      C+           ++P + L+F+  Q  V  +  ++  P E  G   V+CLT 
Sbjct: 326 FQEAFDTCFRVPKGRPPPSARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTF 385

Query: 308 MSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHL 354
            + D       +IG +  M   + +D E  ++  +  KC+   ++  + L
Sbjct: 386 GNADMVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKCDVASERLGLML 435


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/338 (22%), Positives = 135/338 (39%), Gaps = 52/338 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD------PCPYIADYSTEDTSSSGYLVDDIL 98
           +DPS S + + +SC  P+C+    C ++ D       C +   Y  +  + SG LV D+ 
Sbjct: 147 HDPSKSRTFRRLSCFDPMCE---LCTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 202

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           H  + +       ++  V  GC   +    + G +  G++ LG+G    PS + + G+  
Sbjct: 203 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 255

Query: 159 NSFSICF---------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
           + FS C                +E  +  + FG     T +   F   G  Y      V 
Sbjct: 256 DRFSYCIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV- 314

Query: 204 SYCIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SK 249
            Y  G     Q       +G +A      LVDSG +  +LP  ++  +  + ++ +S ++
Sbjct: 315 VYQHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTR 374

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVM 308
           R  L   S  YCY  +  ++  V  + L F       +    +F   EN      CL V 
Sbjct: 375 RYDLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA 432

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +  G+  I+G        + +D   +++A+   +C+ V
Sbjct: 433 A--GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 468


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 89/350 (25%), Positives = 147/350 (42%), Gaps = 70/350 (20%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           S ++P SS +   V C  P CK+R+       SC + K  C  I  Y+ + TS  G L  
Sbjct: 105 SVFNPLSSKTYSKVPCLSPTCKTRTRDLTIPVSCDATK-LCHVIVSYA-DATSIEGNLAF 162

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
           +   L S +K A         I GC      S   + +   G++G+  G +S  + +   
Sbjct: 163 ETFRLGSLTKPA--------TIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP 214

Query: 155 GLIQNSFSICFDENDS-GSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
                 FS C    DS G +  G+            P  Q ST  LP  ++  AY V +E
Sbjct: 215 -----KFSYCISGFDSAGVLLLGNASFPWLKPLSYTPLVQISTP-LPYFDRV-AYTVQLE 267

Query: 204 SYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKF-------DKLV 246
              + N  L+  +S F        Q +VDSG  FTFL   +Y  +  +F        K++
Sbjct: 268 GIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKVL 327

Query: 247 SSKRISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGF-T 301
           +      QG +   CY  ++S   +  +P + L+F   +  V     ++  P E  G  +
Sbjct: 328 NDDNFVFQG-AMDLCYLLDSSRPNLQNLPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDS 386

Query: 302 VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 344
           V+C T  ++D    ++G + F++GH       + FD E  ++  +  +C+
Sbjct: 387 VWCFTFGNSD----LLGVEAFVIGHHHQQNVWMEFDLEKSRIGLADVRCD 432


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 85/365 (23%), Positives = 135/365 (36%), Gaps = 82/365 (22%)

Query: 37  VQDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDP--------CP-YIAD 81
           +    +  + P +SS++K + C +P C         S C   K P        CP YI  
Sbjct: 130 IDPTKIPTFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQ 189

Query: 82  YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 141
           Y    T+  G+L+ D L+     K  PQ       ++GC      S L    P G+ G G
Sbjct: 190 YGLGATA--GFLLLDNLNFPG--KTVPQ------FLVGC------SILSIRQPSGIAGFG 233

Query: 142 LGDVSVPSLLAKAGLIQNSFSIC-----FDENDSGS---VFFGDQGPATQQSTSFLPIGE 193
            G  S+PS +         FS C     FD+    S   +     G       S+ P   
Sbjct: 234 RGQESLPSQMN-----LKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRS 288

Query: 194 K-------YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYA 236
                    + Y+V +    +G   +          +      +VDSG++FTF+   +Y 
Sbjct: 289 NPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYN 348

Query: 237 EVVVKFDKLVSSKRISLQGN-----SWKYCYNASSEEMLKVPDMRLIFS--KNQSFVVRN 289
            V  +F + +  K+ S + N         C+N S  + +  P+    F      S  + N
Sbjct: 349 LVAQEFLRQL-GKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLN 407

Query: 290 HIFSFPENEGFTVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSH 340
           + FSF  +    V C TV+S DG  G         I+G        + +D EN +  +  
Sbjct: 408 Y-FSFVGDA--EVLCFTVVS-DGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGP 463

Query: 341 SKCEE 345
             C+ 
Sbjct: 464 RNCKR 468


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 75/338 (22%), Positives = 135/338 (39%), Gaps = 52/338 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD------PCPYIADYSTEDTSSSGYLVDDIL 98
           +DPS S + + +SC  P+C+    C ++ D       C +   Y  +  + SG LV D+ 
Sbjct: 150 HDPSKSRTFRRLSCFDPMCE---LCTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 205

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
           H  + +       ++  V  GC   +    + G +  G++ LG+G    PS + + G+  
Sbjct: 206 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 258

Query: 159 NSFSICF---------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
           + FS C                +E  +  + FG     T +   F   G  Y      V 
Sbjct: 259 DRFSYCIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV- 317

Query: 204 SYCIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SK 249
            Y  G     Q       +G +A      LVDSG +  +LP  ++  +  + ++ +S ++
Sbjct: 318 VYQHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTR 377

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVM 308
           R  L   S  YCY  +  ++  V  + L F       +    +F   EN      CL V 
Sbjct: 378 RYDLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA 435

Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
           +  G+  I+G        + +D   +++A+   +C+ V
Sbjct: 436 A--GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 471


>gi|342871686|gb|EGU74178.1| hypothetical protein FOXB_15313 [Fusarium oxysporum Fo5176]
          Length = 656

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 67/285 (23%), Positives = 123/285 (43%), Gaps = 29/285 (10%)

Query: 146 SVPSLLAKAGLI-QNSFSICFD--ENDSGSVFFGDQGPATQQSTS---FLPIGE---KYD 196
           ++P+ LA  GLI  N++S+  +  E+ +G++ FG  G   +Q T     LPI +   ++ 
Sbjct: 199 NLPAKLASKGLIASNAYSLYLNDLESATGTILFG--GVDQEQYTGDLVTLPINKINGEFA 256

Query: 197 AYFVGVESYCIGNSCLTQS-GFQALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRI 251
              + ++S    +  +  +     ++DSG++ ++LP    ++IY  V  ++++  S   +
Sbjct: 257 ELSITLQSVSADSETIADNLDLAVILDSGSTLSYLPATLTSDIYDIVGAQYEEGESVAYV 316

Query: 252 --SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
              L  +S    +       + VP   L+        V     SF   +    F   +  
Sbjct: 317 PCDLGNDSGNLTFKFKDPAEISVPLSELVLDFTD---VTGRQLSFDNGQAACTFG--IAP 371

Query: 310 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTT 369
           T GD  I+G  F+    +VFD EN +++ + S      D +  H++    G+ P P  T 
Sbjct: 372 TTGDISILGDTFLRSAYVVFDLENNEISLAQSN----FDATKSHILEIGTGKHPVPTATG 427

Query: 370 EQQSTSNGQAAA--PPSTAKTAPSKSIAASAQQLDSVLRVACSLL 412
              S +   AAA   P     A S    A A  + + + +A S L
Sbjct: 428 SGSSDNKENAAASLAPLGGDAAISMVAGAFALGMTAYIELAASWL 472


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 126/313 (40%), Gaps = 34/313 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +DP+ SS+ + VSC+   C    +  + C +    C Y   Y  + ++++G    D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           +  S              GC   ++G + D    DG+MGLG G  S+ S  A A    NS
Sbjct: 230 SGASDAV------KGFQFGCSHVESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278

Query: 161 FSICFDENDSGSVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 214
           FS C     SGS  F   G     S    T  L   +    Y   ++   +G     L+ 
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSP 337

Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
           S F A  +VDSG   T LP   Y+ +   F   +   R +   +    C++ + +  + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFD 330
           P + L+FS   +  +  +   +         CL   +T  DG  GIIG        +++D
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYGN-------CLAFAATGDDGTTGIIGNVQQRTFEVLYD 450

Query: 331 RENLKLAWSHSKC 343
             +  L +    C
Sbjct: 451 VGSSTLGFRSGAC 463


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 80/345 (23%), Positives = 137/345 (39%), Gaps = 56/345 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           + P +S++   V C    C SR      SC      C     Y+ + ++S G L  D+  
Sbjct: 111 FRPRASATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYA-DGSASDGALATDVF- 168

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
             +  +  P  S       GC      S  DG A  G++G+  G +S    + +A     
Sbjct: 169 --AVGEAPPLRSA-----FGCMSTAYDSSPDGVATAGLLGMNRGTLS---FVTQAS--TR 216

Query: 160 SFSICF-DENDSGSVFFGDQG--------PATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
            FS C  D +D+G +  G               Q T  LP  ++  AY V +    +G  
Sbjct: 217 RFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRV-AYSVQLLGIRVGGK 275

Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
            L           T +G Q +VDSG  FTFL  + Y+ +  +F K       +L   S+ 
Sbjct: 276 ALPIPASVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFA 334

Query: 260 Y------CYNASS---EEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGFT-VFCLTV 307
           +      C+   +       ++P + L+F+  +  V  +  ++  P E+ G   V+CLT 
Sbjct: 335 FQEALDTCFRVPAGRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF 394

Query: 308 MSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
            + D       +IG +  M   + +D E  ++  +  KC+   ++
Sbjct: 395 GNADMVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKCDVASER 439


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 132/327 (40%), Gaps = 65/327 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           S ++P SSS+   V CS P+C++R+       SC      C ++A    + TS  G L  
Sbjct: 97  SVFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHFC-HVAISYADATSIEGNLAH 155

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKA 154
           D   + S ++           + GC      S   + A   G+MG+  G +S  + L  +
Sbjct: 156 DTFVIGSVTRPG--------TLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS 207

Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
                 FS C   +D SG +  GD            P   Q+T  LP  ++  AY V +E
Sbjct: 208 -----KFSYCISGSDSSGILLLGDASYSWLGPIQYTPLVLQTTP-LPYFDRV-AYTVQLE 260

Query: 204 SYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
              +G+  L+  +S F        Q +VDSG  FTFL   +Y  +  +F        ++V
Sbjct: 261 GIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIV 320

Query: 247 SSKRISLQGNSWKYCYNASSE---EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF--- 300
                  QG +   CY   S        +P + L+F   +  V    +       G    
Sbjct: 321 DDPNFVFQG-TMDLCYRVGSSTRPNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGK 379

Query: 301 -TVFCLTVMSTDGDYGIIG-QNFMMGH 325
             V+C T  ++D    ++G + F++GH
Sbjct: 380 EEVYCFTFGNSD----LLGIEAFVIGH 402


>gi|225719388|gb|ACO15540.1| Cathepsin D precursor [Caligus clemensi]
          Length = 362

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 75/318 (23%), Positives = 132/318 (41%), Gaps = 43/318 (13%)

Query: 47  PSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
           PSS+  + NV C +H    S +S   +KD   +   Y      +SG+L  D        K
Sbjct: 69  PSSTCGAPNVPCKTHNQYDSGNSSTHVKDGSKFNVKYKI--GKASGFLSQD--------K 118

Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSIC 164
                        G    ++         DGV+GLG G  S  + L   G I++  FS+ 
Sbjct: 119 VCVDGVCMEEQTFGEATSESMDPFANVYHDGVLGLGFGKDSFLNSLLDQGRIESPLFSLW 178

Query: 165 FD------ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-----GNSCLT 213
            +      +N+S  V  G        + S++P+    D + VG++S  I     G   +T
Sbjct: 179 VNRQPFRSKNNSRLVLGGIDTGHYSGNISYIPLNSD-DVWRVGMKSISIKGVHRGCGFIT 237

Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
           + G   + D+G+ FT+ P  + A+ +   ++ + + +I+    S+ Y Y     E+L +P
Sbjct: 238 RPGCDVVFDAGSRFTYGPI-LEAKTI---NRWIGATQIA---PSYGY-YKVRCNEILTLP 289

Query: 274 DMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
           ++ L+F       K + ++V   I         T     V  T  +   +G NF   +  
Sbjct: 290 NVELVFEDLTLVLKPKDYIVETKILGMK-----TCMSGFVGLTKQESWTLGANFFGAYFS 344

Query: 328 VFDRENLKLAWSHSKCEE 345
           V+D EN ++  + S+  E
Sbjct: 345 VYDIENKRIGLATSRRAE 362


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 72/319 (22%), Positives = 117/319 (36%), Gaps = 41/319 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ SSS   VSC   +C    +       C Y   Y  + + + G L  + L +    
Sbjct: 185 FDPADSSSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYG-DGSYTKGTLALETLTVGQV- 242

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                  +   V IGCG    G ++  A   G+        S+  +    G    +FS C
Sbjct: 243 -------MIRDVAIGCGHTNQGMFIGAAGLLGLG-----GGSMSFIGQLGGQTGGAFSYC 290

Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSC--- 211
                +GS        A +     LP+G  + +          Y++G+    +G      
Sbjct: 291 LVSRGTGST------GALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSV 344

Query: 212 ------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
                 LT+ G   +V D+G + T  PT  Y      F    S+   +   + +  CY+ 
Sbjct: 345 PEETFQLTEYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDL 404

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
           +  E ++VP +   FS      +    F  P + G T FCL    +     IIG     G
Sbjct: 405 NGFESVRVPTVSFYFSDGPVLTLPARNFLIPVDGGGT-FCLAFAPSPSGLSIIGNIQQEG 463

Query: 325 HRIVFDRENLKLAWSHSKC 343
            +I FD  N  + +  + C
Sbjct: 464 IQISFDGANGFVGFGPNIC 482


>gi|301103993|ref|XP_002901082.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
 gi|262101420|gb|EEY59472.1| aspartyl protease family A01B, putative [Phytophthora infestans
           T30-4]
          Length = 446

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 62/297 (20%), Positives = 127/297 (42%), Gaps = 29/297 (9%)

Query: 76  CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 135
           C Y   Y  E    + Y   D++ L+S        S ++ +  GC  +Q+G +LD  + D
Sbjct: 119 CKYGQTY-IEGDHWTAYKASDVMQLSS--------SFEARIEFGCIYEQSGVFLDQPS-D 168

Query: 136 GVMGLGLGDVSVPSLLAKAGLIQNS-FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 194
           G+MG      S+     +  +  +  FS C  E        G       +   + P+   
Sbjct: 169 GIMGFSRHPDSIFEQFYRQKVTHSRIFSQCLAEGGGLLTIGGVDLARHTEPVRYTPLRNT 228

Query: 195 -YDAYFVGVESYCIGNSCLT----QSGFQA----LVDSGASFTFLPTEIYAEVVVKFDKL 245
            Y  + V + S  +G++  T    +  F A    ++DSG +F ++P        + + + 
Sbjct: 229 GYQYWTVTLLSVSVGDANNTVQVDRKEFNADRGCVLDSGTTFLYMPESTKQPFRLAWSRA 288

Query: 246 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFC 304
           V S     + N++   Y  +S+++  +PD+   F  +    +  +  F+   N    ++ 
Sbjct: 289 VGSFSFVPESNTF---YFMTSKQVAALPDICFWFKNDVHICLPSSRYFALVGN---GIYT 342

Query: 305 LTVMSTDGDYG-IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAG 360
            T+  T G    I+G + + GH +++D +N ++  + + C++ + ++ V L   P G
Sbjct: 343 GTIFFTAGPKATILGASVLEGHDVIYDVDNHRVGIAEAMCDQPL-QAEVELSLDPGG 398


>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
          Length = 312

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 97/247 (39%), Gaps = 14/247 (5%)

Query: 114 SSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 172
           +S++ GC   Q+G       A DG+ G G   +SV S L   G+    FS C   +D+G 
Sbjct: 17  ASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGG 76

Query: 173 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSG 224
                 G   +    + P+      Y + +ES         I +S  T S  Q  +VDSG
Sbjct: 77  GIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSG 135

Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 284
            +  +L    Y   V      VS    SL     + C+  SS      P + L F    +
Sbjct: 136 TTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPTVTLYFMGGVA 194

Query: 285 FVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
             V+  N++      +   ++C+      G +  I+G   +     V+D  N+++ W+  
Sbjct: 195 MSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADY 254

Query: 342 KCEEVID 348
            C   ++
Sbjct: 255 DCSMSVN 261


>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
          Length = 431

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 62/270 (22%), Positives = 109/270 (40%), Gaps = 25/270 (9%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           L+ YD   S + K VSC    C +      S C +    C Y   Y+ + +SS GY V  
Sbjct: 117 LTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVKG 174

Query: 97  ILHLASFSK--HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
               + ++   H   + +   V + C   Q+G      A DG++G G  + S+ S LA +
Sbjct: 175 YCTASKYNSIPHLNNNPLLE-VPLRCSATQSGDLSSEEALDGILGFGKSNTSMISQLASS 233

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
           G ++  F+ C D  + G +F    G   Q   +  P+      Y V +++  +G   L  
Sbjct: 234 GKVRKMFAHCLDGLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNL 291

Query: 214 ---------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
                    + G   ++DSG +  +LP  +Y +++ K     S  ++    + +  C+  
Sbjct: 292 PTDVFDVGDKKG--TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQY 348

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
           S       P +   F  +    V  H + F
Sbjct: 349 SESLDDGFPAVTFHFENSLYLKVHPHEYLF 378


>gi|213998828|gb|ACJ60781.1| nucellin [Hordeum brachyantherum subsp. californicum]
          Length = 133

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 3/115 (2%)

Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE 193
           DG++GLG+G       L    +I  N    C      G ++ GD  P ++  T ++P+ E
Sbjct: 10  DGILGLGMGKAGFAVQLKGQKMITGNVIGHCLSSQGKGVLYVGDFNPPSRGVT-WVPMKE 68

Query: 194 KYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
               Y  G+    I N  +     F+A+ DSG+++T +P ++Y E+V K    +S
Sbjct: 69  SLFYYSPGLAEPLIDNQPIRGNPTFEAVFDSGSTYTHVPAQVYNEIVSKVRGTLS 123


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 76/325 (23%), Positives = 121/325 (37%), Gaps = 46/325 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           YDPS SSSS    CS P C++     + C    D C Y   Y  + ++S+G  + D+L L
Sbjct: 187 YDPSKSSSSAAFPCSSPACRNLGPYANGCTPAGDQCQYRVQYP-DGSASAGTYISDVLTL 245

Query: 101 ASFSKHAPQSSVQSSVIIGCGRK--QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
                 A  +S  S    GC     Q GS+ +  +  G+M LG G  S+P+         
Sbjct: 246 ----NPAKPASAISEFRFGCSHALLQPGSFSNKTS--GIMALGRGAQSLPT--QTKATYG 297

Query: 159 NSFSICFDENDSGSVFFGDQGPATQQST-SFLPIGEKYDA---YFVGVESYCIGNSCLTQ 214
           + FS C       S FF    P    S  +  P+     A   Y V + +  +    L  
Sbjct: 298 DVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPV 357

Query: 215 S----GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS----- 265
                   A++DS    T LP   Y  +   F   + + R +        CY+ S     
Sbjct: 358 PPAVFAAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPG 417

Query: 266 SEEMLKVPDMRLIFSK-------NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
               +K+P + L+F         + S V+ +   +F  N           + D   GIIG
Sbjct: 418 GGGGVKLPKITLVFDGPNGAVELDPSGVLLDGCLAFAPN-----------TDDQMTGIIG 466

Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
                   ++++ +   + +    C
Sbjct: 467 NVQQQALEVLYNVDGATVGFRRGAC 491


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 71/321 (22%), Positives = 125/321 (38%), Gaps = 42/321 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S +   + CS P C+   S  C + +  C Y   Y     +   +  + +    +
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL----T 239

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSF 161
           F ++  +      V +GCG    G ++  A   G+           S   + G      F
Sbjct: 240 FRRNRVKG-----VALGCGHDNEGLFVGAAGLLGLG------KGKLSFPGQTGHRFNQKF 288

Query: 162 SICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---C 211
           S C  +  + S    V FG+   A  +   F P+    K D  Y+VG+    +G +    
Sbjct: 289 SYCLVDRSASSKPSSVVFGNA--AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPG 346

Query: 212 LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
           +T S F+         ++DSG S T L    Y  +   F     + + +   + +  C++
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFD 406

Query: 264 ASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
            S+   +KVP + L F + + S    N++     N  F   C     T G   IIG    
Sbjct: 407 LSNMNEVKVPTVVLHFRRADVSLPATNYLIPVDTNGKF---CFAFAGTMGGLSIIGNIQQ 463

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
            G R+V+D  + ++ ++   C
Sbjct: 464 QGFRVVYDLASSRVGFAPGGC 484


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 81/343 (23%), Positives = 128/343 (37%), Gaps = 57/343 (16%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D+    + P+ S++ + V C  PLC +       +        Y  ++ S++G L  +  
Sbjct: 128 DQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASE-- 185

Query: 99  HLASFSKHAPQSS--VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
              +F+  A  SS  + S V  GCG   +G   + +   G++GLG G +S+ S L  +  
Sbjct: 186 ---TFTFGAANSSKVMVSDVAFGCGNINSGQLANSS---GMVGLGRGPLSLVSQLGPS-- 237

Query: 157 IQNSFSIC---FDENDSGSVFFG----------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
               FS C   F   +   + FG              +  QST  +        YF+ ++
Sbjct: 238 ---RFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLK 294

Query: 204 SYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
              +G   L                  +DSG S T+L  + Y  V     +LVS  R   
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVR---HELVSVLRPLP 351

Query: 254 QGNSWK------YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVF 303
             N  +      + +       + VPDM L F    +  V       PEN    +G T F
Sbjct: 352 PTNDTEIGLETCFPWPPPPSVAVTVPDMELHFDGGANMTVP------PENYMLIDGATGF 405

Query: 304 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
               M   GD  IIG        I++D  N  L++  + C  V
Sbjct: 406 LCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSFVPAPCNIV 448


>gi|302757745|ref|XP_002962296.1| hypothetical protein SELMODRAFT_27319 [Selaginella moellendorffii]
 gi|300170955|gb|EFJ37556.1| hypothetical protein SELMODRAFT_27319 [Selaginella moellendorffii]
          Length = 163

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 10/134 (7%)

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE-------MLKV 272
           + DSG + TFLP  +Y +V+  F + ++   ++        CYN S +         L  
Sbjct: 32  IFDSGTTLTFLPLGVYIQVISVFSRRINLPLVNGTSVGLDLCYNISLQRDYTFPSLALHF 91

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDR 331
           PD  +   ++   VV +   +   NE  +V CL +MS+      IIG     G+ I+FD 
Sbjct: 92  PDAWMNLHQDNYIVVPSRADAEAWNE--SVACLAIMSSASIGINIIGNVMQQGYHIMFDN 149

Query: 332 ENLKLAWSHSKCEE 345
           E   + ++ + C E
Sbjct: 150 EKSTVTFAPASCSE 163


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 75/321 (23%), Positives = 128/321 (39%), Gaps = 46/321 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           +DPS SS+   ++C    C       R+ C S    C Y  +Y  + +S+ G   ++ + 
Sbjct: 169 FDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYG-DGSSTRGVYSNETIT 227

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
                  AP  +V+     GCG  Q G        DG++GLG    S+  ++  A +   
Sbjct: 228 F------APGITVK-DFHFGCGHDQRGP---SDKFDGLLGLGGAPESL--VVQTASVYGG 275

Query: 160 SFSICFD--ENDSGSVFFGDQGPATQQSTSF-------LPIGEKYDAYFVGVESYCIGNS 210
           +FS C     +++G +  G +  A   +++F       LP+     +Y V +    +G  
Sbjct: 276 AFSYCLPALNSEAGFLALGVRPSAATNTSAFVFTPMWHLPMDAT--SYMVNMTGISVGGK 333

Query: 211 CLT--QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
            L   +S F+   L+DSG   T LP   Y  +     K  ++  + +    +  CYN + 
Sbjct: 334 PLDIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPM-VASEDFDTCYNFTG 392

Query: 267 EEMLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFM 322
              + VP + L FS   +    V N I            CL    +  D   GIIG    
Sbjct: 393 YSNVTVPRVALTFSGGATIDLDVPNGI--------LVKDCLAFRESGPDVGLGIIGNVNQ 444

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
               +++D  + K+ +    C
Sbjct: 445 RTLEVLYDAGHGKVGFRAGAC 465


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 133/333 (39%), Gaps = 58/333 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS SS+  N+SCS   C   + C  +   CPY  +Y     SS G    + L L +  
Sbjct: 135 FDPSKSSTYSNLSCSE--C---NKCDVVNGECPYSVEY-VGSGSSQGIYAREQLTLETID 188

Query: 105 KHAPQSSVQSSVIIGCGRK----QTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQN 159
           +   +     S+I GCGRK      G    G   +GV GLG G  S +PS   K      
Sbjct: 189 ESIIKV---PSLIFGCGRKFSISSNGYPYQGI--NGVFGLGSGRFSLLPSFGKK------ 237

Query: 160 SFSICFDENDSGSVFF-----GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
            FS C     + +  F     GD+      ST+   I      Y+V +E+  IG   L  
Sbjct: 238 -FSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVIN---GLYYVNLEAISIGGRKLDI 293

Query: 213 ---------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY 260
                    T +    ++DSGA  T+L    +  +  + + L+    +  Q    N +  
Sbjct: 294 DPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTL 353

Query: 261 CYNA-SSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTD--GD-- 313
           CY+   S+++   P +   F++       V +      ENE    FC+ ++  +  GD  
Sbjct: 354 CYSGVVSQDLSGFPLVTFHFAEGAVLDLDVTSMFIQTTENE----FCMAMLPGNYFGDDY 409

Query: 314 --YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
             +  IG      + + +D   +++ +    CE
Sbjct: 410 ESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDCE 442


>gi|255552237|ref|XP_002517163.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543798|gb|EEF45326.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 469

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 97/246 (39%), Gaps = 50/246 (20%)

Query: 50  SSSSKNVSCSHPLCKSRSS------CKSLKDP------CPYIADYSTEDTSSSGYLVDDI 97
           SSS   VSC   LCK  +S      C S   P      C +  +       +SG +  D+
Sbjct: 114 SSSYTPVSCDSLLCKLANSLACATECNSTPKPGCHNNTCAHSPENPVIRLGTSGQIGQDV 173

Query: 98  LHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGD--VSVPSLLAK 153
           + L SF+   P   V   +    CG     ++L     DGV GL GLG+  +S+P+  + 
Sbjct: 174 VSLQSFNGKTPDRIVSVPNFPFVCGP----TFLLENLADGVTGLAGLGNSNISLPAQFSS 229

Query: 154 AGLIQNSFSICFDE--NDSGSVFFGD----------------QGPATQQSTSFLPIGEKY 195
           A      F++C       +G +FFGD                  P +    S+L  GE  
Sbjct: 230 AFGFPKKFAVCLSNSTKSNGLIFFGDGPYSNLPNDLTYTPLIHNPVSTAGGSYL--GEAS 287

Query: 196 DAYFVGVESYCIGNSCLTQSGFQALVDSGAS----------FTFLPTEIYAEVVVKFDKL 245
             YF+GV+S  IG   +  +     +DS             +T L T IY  VV  F K 
Sbjct: 288 VEYFIGVKSIRIGGKDVKFNKTLLSIDSEGKGGTKISTVDPYTVLHTSIYKAVVKAFVKE 347

Query: 246 VSSKRI 251
           +  K I
Sbjct: 348 MDKKFI 353


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 135/328 (41%), Gaps = 39/328 (11%)

Query: 45  YDPSSSSSSKNVSCS--------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           +DPS S+S K + C+        H  C+  SS  S K  C Y   Y  + + +SG L  +
Sbjct: 213 FDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALE 270

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L + S S H P S     ++IGCG    G +        ++GLG G +S PS L ++  
Sbjct: 271 SLSV-SLSDH-PSSLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSP 324

Query: 157 IQNSFSICFDEND-----SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESY 205
           I  SFS C  +       S ++ FG     ++      F P     ++    Y++G++  
Sbjct: 325 IGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGI 384

Query: 206 CIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
            I    L     +           ++DSG + T+L  + Y  V   F   +S  R     
Sbjct: 385 KIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PF 443

Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
           +    CYNA+    +  P + ++F       +    +    +      CL ++ TDG   
Sbjct: 444 DILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MS 502

Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           IIG         ++D ++ +L ++++ C
Sbjct: 503 IIGNFQQQNIHFLYDVQHARLGFANTDC 530


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 77/324 (23%), Positives = 139/324 (42%), Gaps = 42/324 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDP-CPYIADYSTEDTS-SSGYLVDDILH 99
           +DP SSS+ +++SCS   C   K  +SC    +  C Y   YS  D S +SG +  D + 
Sbjct: 134 FDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHY--SYSYGDRSFTSGNVAADTIT 191

Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAG-LI 157
           L S S    +  +    IIGCG    GS+ +  +    +         P SL+++ G  I
Sbjct: 192 LGSTSG---RPVLLPKAIIGCGHNNGGSFTEKGSGIVGL------GGGPISLISQLGSTI 242

Query: 158 QNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLP-IGEKYDA-YFVGVESYCIGN- 209
              FS C      +  +S  + FG  G  +       P I +  D  YF+ +E+  +G+ 
Sbjct: 243 DGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSE 302

Query: 210 ------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
                 S    S    ++DSG + T  P + ++E+       V+   +         CY+
Sbjct: 303 RIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYS 362

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQ-NF 321
             ++  LK P +   F  + + V  N + +F + ++    F    +++   +G + Q NF
Sbjct: 363 IDAD--LKFPSITAHF--DGADVKLNPLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNF 418

Query: 322 MMGHRIVFDRENLKLAWSHSKCEE 345
           ++G    +D E   +++  + C +
Sbjct: 419 LVG----YDLEGKTVSFKPTDCTQ 438


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/298 (25%), Positives = 123/298 (41%), Gaps = 35/298 (11%)

Query: 67  SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
           + CK+      Y   Y  + TS   Y  D +            S V      G GR   G
Sbjct: 154 TQCKACTVENNYNMTYGDDSTSVGNYGCDTMT--------LEPSDVFQKFQFGRGRNNKG 205

Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFGDQGPATQQS 185
            +  G+  DG++GLG G +S  S  A        FS C  E DS GS+ FG++  AT QS
Sbjct: 206 DF--GSGVDGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLFGEK--ATSQS 259

Query: 186 TSF----LPIG----EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDSGASFTFLPT 232
           +S     L  G    ++   YFV +    +GN  L    S F +   ++DS    T LP 
Sbjct: 260 SSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQ 319

Query: 233 EIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 288
             Y+ +   F K ++   +S     +G+    CYN S  + + +P++ L F       + 
Sbjct: 320 RAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLN 379

Query: 289 --NHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
             N ++   E+     F     ST + +  IIG    +   +++D +  ++ +  + C
Sbjct: 380 GTNIVWGSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 81/343 (23%), Positives = 128/343 (37%), Gaps = 57/343 (16%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           D+    + P+ S++ + V C  PLC +       +        Y  ++ S++G L  +  
Sbjct: 128 DQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASE-- 185

Query: 99  HLASFSKHAPQSS--VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
              +F+  A  SS  + S V  GCG   +G   + +   G++GLG G +S+ S L  +  
Sbjct: 186 ---TFTFGAANSSKVMVSDVAFGCGNINSGQLANSS---GMVGLGRGPLSLVSQLGPS-- 237

Query: 157 IQNSFSIC---FDENDSGSVFFG----------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
               FS C   F   +   + FG              +  QST  +        YF+ ++
Sbjct: 238 ---RFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLK 294

Query: 204 SYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
              +G   L                  +DSG S T+L  + Y  V     +LVS  R   
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVR---RELVSVLRPLP 351

Query: 254 QGNSWK------YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVF 303
             N  +      + +       + VPDM L F    +  V       PEN    +G T F
Sbjct: 352 PTNDTEIGLETCFPWPPPPSVAVTVPDMELHFDGGANMTVP------PENYMLIDGATGF 405

Query: 304 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
               M   GD  IIG        I++D  N  L++  + C  V
Sbjct: 406 LCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSFVPAPCNIV 448


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 73/332 (21%), Positives = 131/332 (39%), Gaps = 36/332 (10%)

Query: 47  PSSSSSSKNVSCSHPLCKSRSSCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           PS+SS+   V C    C++ SS  S      C Y+  Y  + + +SG L  +    ++ +
Sbjct: 157 PSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYG-DGSRASGQLSTETFTFSTIA 215

Query: 105 KHAPQSSVQ--------------SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
             +  +S                + +  GC    TG++      DG++GLG G VS+ S 
Sbjct: 216 DSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTF----RADGLVGLGGGPVSLASQ 271

Query: 151 LAKAGLIQNSFSICF----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVES 204
           L     +   FS C     + N S ++ FG +   ++   +  P+  GE    Y + ++S
Sbjct: 272 LGATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDS 331

Query: 205 YCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
             +  +    +  QA  +VDSG + T+L + +   +V    + +   R          CY
Sbjct: 332 INVAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCY 391

Query: 263 NAS---SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
           + S    E+ L +PD+ L+        ++         EG     L   S      I+G 
Sbjct: 392 DISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILGN 451

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
                  + +D E   + ++ + C     KSH
Sbjct: 452 IAQQNLHVGYDLEKGTVTFAAADCA----KSH 479


>gi|348685429|gb|EGZ25244.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
          Length = 467

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/328 (20%), Positives = 136/328 (41%), Gaps = 44/328 (13%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
           S +   S   P C +   C++ K  C Y   Y  E    S Y   D++ L+         
Sbjct: 117 SMTLQTSWGEPACMA---CENGK--CKYGQTY-VEGDHWSAYKASDMMQLSP-------- 162

Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSICFDEND 169
           S ++ +  GC  +Q+G +LD  + DG+MG      S+     +  +  +  FS C  E  
Sbjct: 163 SFEARIEFGCIYEQSGVFLDQPS-DGIMGFSRHPDSIFEQFYRQKVTHSRIFSQCLTEGG 221

Query: 170 SGSVFFGDQGPATQQSTSFLPIGEK-YDAYFVGVESYCIGNSCLT--------QSGFQAL 220
                 G       +   + P+    Y  + V ++S  +GN   T         +    +
Sbjct: 222 GMLTIGGVDLTRHTEPVRYTPLRSTGYQYWTVTLQSVSVGNQSNTLQVDTYEYNADRGCV 281

Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
           +DSG +F ++P        + + + V S     Q +++   Y+ + +++  +PD+     
Sbjct: 282 LDSGTTFLYMPERTKEPFRLAWSRAVGSFSYIPQSDTF---YSMTPDQVAALPDI----- 333

Query: 281 KNQSFVVRNHI-FSFPENEGFT-----VFCLTVMSTDGDYG-IIGQNFMMGHRIVFDREN 333
               F ++N +    P +  F      V+  T+  + G    I+G + + GH I++D +N
Sbjct: 334 ---CFWLKNDVHICLPPSRYFAQVGDGVYTGTIFFSPGPRATILGASVLEGHDIIYDVDN 390

Query: 334 LKLAWSHSKCEEVIDKSHVHLVPPPAGQ 361
            ++  + + C++ + ++ V L   P G+
Sbjct: 391 NRVGIAEAMCDQPM-QAAVELSLDPGGE 417


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 74/323 (22%), Positives = 135/323 (41%), Gaps = 42/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           ++ SSSS+ + V CS  +C          S C   +D C Y   Y++ +  S+GYL  D 
Sbjct: 69  FNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEY-SAGYLSQDR 127

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L LA+        S+Q   I GCG   + +  +G +  G++G G    S  + +A+    
Sbjct: 128 LTLAN------SYSIQ-KFIFGCG---SDNRYNGHSA-GIIGFGNKSYSFFNQIAQL-TN 175

Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
            ++FS CF  N     F    GP  + S   + + + +D Y   +  Y +    +  +G 
Sbjct: 176 YSAFSYCFPSNQENEGFL-SIGPYVRDSNKLI-LTQLFD-YGAHLPVYALQQFDMMVNGM 232

Query: 218 Q------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY--N 263
           +             +VDSG   TF+ + ++  +     K + ++      +S + C+  N
Sbjct: 233 RLQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSN 292

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQN 320
             S +  K+P + + FS++   +   ++F +  ++G    C T    D       I+G  
Sbjct: 293 GDSVDWSKLPVVEIKFSRSILKLPAENVFYYETSDG--SICSTFQPDDAGVPGVQILGNR 350

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
                R+VFD +     +    C
Sbjct: 351 ATRSFRVVFDIQQRNFGFEAGAC 373


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 80/345 (23%), Positives = 118/345 (34%), Gaps = 55/345 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLV---DDILHLA 101
           Y P+ SSS + + CS   C             PY    S     S  Y     D  + + 
Sbjct: 190 YRPAKSSSWRRIRCSQKECAV----------LPYNTCQSPSKAESCSYFQKTQDGTVTIG 239

Query: 102 SFSKHAPQSSVQSS-------VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            + K     +V          +I+GC   + G  +D  A DGV+ LG GD+S     AK 
Sbjct: 240 IYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR 297

Query: 155 GLIQNSFSICF-----DENDSGSVFFGDQ----GPATQQSTSFLPI------GEKYDAYF 199
                 FS C        + S  + FG      GP T ++     +      G K     
Sbjct: 298 --FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVL 355

Query: 200 VGVESYCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
           VG E   I +       F     ++D+  S T L  E YA V    D+ +S      +  
Sbjct: 356 VGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELE 415

Query: 257 SWKYCYN-------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
            ++YCY              + +P   +  +              PE E   V CL    
Sbjct: 416 GFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEP-GVACLAFRK 474

Query: 310 -TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
              G  GI+G  FM  +    D  + K+ +   KC    +  H+H
Sbjct: 475 LLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC----NTHHLH 515


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 77/331 (23%), Positives = 136/331 (41%), Gaps = 53/331 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SSS   V CS  LC +  RS+C   KD C Y+  Y  + +S+ G L  +      
Sbjct: 149 FDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSSTRGLLATETFTFED 207

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 ++S+ S +  GCG +  G   DG +   G++GLG G +S+ S L      +  F
Sbjct: 208 ------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKF 252

Query: 162 SICF----DENDSGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVES 204
           S C     D   S S+F G               G  T ++ S L   ++   Y++ ++ 
Sbjct: 253 SYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLRNPDQPSFYYLELQG 311

Query: 205 YCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
             +G   L+  +S F+         ++DSG + T+L    +  +  +F   +S       
Sbjct: 312 ITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG 371

Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
                 C+    + + + VP M   F   +      N++ +   +    V CL + S++G
Sbjct: 372 STGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVA---DSSTGVLCLAMGSSNG 428

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              I G        ++ D E   +++  ++C
Sbjct: 429 -MSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 77/317 (24%), Positives = 129/317 (40%), Gaps = 49/317 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA----DYSTEDTSSSGYLVDDILHL 100
           + P  S++ KNVSC+ P       CK + +P   ++    + +   +S +  LV D + L
Sbjct: 132 FAPEKSTTFKNVSCAAP------ECKQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITL 185

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
           A        +    S   GC  K TG+    A P G++GLG G +S+ S      L Q++
Sbjct: 186 A--------TDPVPSYTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQST 232

Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           FS C       N SGS+  G    P   + T  L    +   Y+V +E+  +G   +   
Sbjct: 233 FSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIP 292

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                    +G   + DSG  FT L   +Y  V  +F + V  K        +  CYN  
Sbjct: 293 PAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP 352

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQNF 321
               + VP +  IF+     + +++I     +   +  CL +     + +    +I    
Sbjct: 353 ----IVVPTITFIFTGMNVTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 406

Query: 322 MMGHRIVFDRENLKLAW 338
              HR+++D  N +  W
Sbjct: 407 QQNHRVLYDVPNSR-GW 422


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 78/323 (24%), Positives = 121/323 (37%), Gaps = 43/323 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS----------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
           +DPS SS+   + C+   CK            ++   +   C Y  +Y      + G   
Sbjct: 169 FDPSKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYG-NGAITEGVYS 227

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            + L L S       S+V  S   GCG  Q G Y      DG++GLG    S+ S  A  
Sbjct: 228 TETLALGS-------SAVVKSFRFGCGSDQHGPY---DKFDGLLGLGGAPESLVSQTAS- 276

Query: 155 GLIQNSFSICFDENDSGSVFFGDQGP-ATQQSTS---FLPIG----EKYDAYFVGVESYC 206
            +   +FS C    +SG+ F     P +T  S S   F P+     +    Y V +    
Sbjct: 277 -VYGGAFSYCLPPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGIS 335

Query: 207 IGNSCL--TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYC 261
           +G   L    + F    +VDSG   T +PT  Y  +   F   ++   +    +S    C
Sbjct: 336 VGGKALDIPPAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTC 395

Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQN 320
           YN +    + VP + L F    +  +        E+      CL    + DG +GIIG  
Sbjct: 396 YNFTGHGTVTVPKVALTFVGGATVDLDVPSGVLVED------CLAFADAGDGSFGIIGNV 449

Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
                 +++D     L +    C
Sbjct: 450 NTRTIEVLYDSGKGHLGFRAGAC 472


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 75/318 (23%), Positives = 128/318 (40%), Gaps = 41/318 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DP +SS+  +V CS   C        + S+C S  + C Y A Y  + + S G L  D 
Sbjct: 177 FDPRASSTYASVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGSLSTDT 234

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           +   S        +   S   GCG+   G +   A   G++GL    +S+   LA +  +
Sbjct: 235 VSFGS--------TRYPSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--L 281

Query: 158 QNSFSICFDENDSGSVFFGDQGP-ATQQSTSFLPIGEK-YDA--YFVGVESYCIGNSCLT 213
             SFS C     + S  +   GP  T    S+ P+     DA  YF+ +    +G S L 
Sbjct: 282 GYSFSYCLPT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLA 339

Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
                 S    ++DSG   T LPT ++  +     + ++  + +   +    C+   + +
Sbjct: 340 VSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ 399

Query: 269 MLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
            L+VP + + F+   S     RN +    ++      CL    TD    IIG        
Sbjct: 400 -LRVPTVAMAFAGGASMKLTTRNVLIDVDDS----TTCLAFAPTDST-AIIGNTQQQTFS 453

Query: 327 IVFDRENLKLAWSHSKCE 344
           +++D    ++ +S   C 
Sbjct: 454 VIYDVAQSRIGFSAGGCS 471


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 81/339 (23%), Positives = 131/339 (38%), Gaps = 54/339 (15%)

Query: 39  DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
           D++   +DP +S S   V C+ PLC+   S  C   +  C Y   Y  + + ++G    +
Sbjct: 183 DQSGQMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATE 241

Query: 97  ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
            L  AS ++  P+      V +GCG    G ++  A    ++GLG G +S PS +++   
Sbjct: 242 TLTFASGAR-VPR------VALGCGHDNEGLFVAAAG---LLGLGRGSLSFPSQISR--R 289

Query: 157 IQNSFSICFDE---------NDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVES 204
              SFS C  +         + S +V FG             P GE+    D        
Sbjct: 290 FGRSFSYCLVDRTSSSASATSRSSTVTFGSGARGALGRRVLHPDGEEPQDGDVLLRAAHG 349

Query: 205 YCIGNSCL-------------TQSGFQALVDSGASFTFLPTEIYAEV------VVKFDKL 245
           +                    T  G   +VDSG      P+  +A          +    
Sbjct: 350 HQRRRRARPGRGRVRPPPDPSTGRG-GVIVDSG-----RPSPAWARAGRTPPCATRSRAA 403

Query: 246 VSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 304
            +  R+S  G S +  CY+ S  +++KVP + + F+      +    +  P +   T FC
Sbjct: 404 AAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FC 462

Query: 305 LTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                TDG   IIG     G R+VFD +  +L +    C
Sbjct: 463 FAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
 gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
          Length = 484

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 62/131 (47%), Gaps = 3/131 (2%)

Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
           +G   +++   +FT+L  ++YA +  +F K +S   ++    S   CYN ++     VP 
Sbjct: 355 AGGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSVPA 414

Query: 275 MRLIFSKNQSF-VVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
           + L F     F +  + +  FPE    F+V CL  ++ DG   +IG    M   +V+D  
Sbjct: 415 VTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFVAQDGG-AVIGSMAQMSTEVVYDVR 473

Query: 333 NLKLAWSHSKC 343
             K+ +   +C
Sbjct: 474 GGKVGFVPYRC 484


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 77/331 (23%), Positives = 136/331 (41%), Gaps = 53/331 (16%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SSS   V CS  LC +  RS+C   KD C Y+  Y  + +S+ G L  +      
Sbjct: 41  FDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSSTRGLLATETFTFED 99

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 ++S+ S +  GCG +  G   DG +   G++GLG G +S+ S L      +  F
Sbjct: 100 ------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKF 144

Query: 162 SICF----DENDSGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVES 204
           S C     D   S S+F G               G  T ++ S L   ++   Y++ ++ 
Sbjct: 145 SYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLRNPDQPSFYYLELQG 203

Query: 205 YCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
             +G   L+  +S F+         ++DSG + T+L    +  +  +F   +S       
Sbjct: 204 ITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG 263

Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
                 C+    + + + VP M   F   +      N++ +   +    V CL + S++G
Sbjct: 264 STGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVA---DSSTGVLCLAMGSSNG 320

Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
              I G        ++ D E   +++  ++C
Sbjct: 321 -MSIFGNVQQQNFNVLHDLEKETVSFVPTEC 350


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 124/314 (39%), Gaps = 36/314 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP+ S++    SCS   C       +  L   C YI  Y  + ++++G    D L L +
Sbjct: 174 FDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKY-VDHSNTTGTYGSDTLGLTT 232

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSF 161
                  S    +   GC  +  G        DG+MGLG GD    SL+++ A     +F
Sbjct: 233 -------SDAVKNFQFGCSHRANGFV---GQLDGLMGLG-GDTE--SLVSQTAATYGKAF 279

Query: 162 SICFDENDSGSVFFGDQGPATQQSTS----FLPIGEKYDAYFVGV--ESYCIGNSCLT-- 213
           S C   + S +  F   G A   ++S      P+       F GV  ++  +  + L   
Sbjct: 280 SYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVP 339

Query: 214 QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
            S F   ++VDSG   T LP   Y  +   F K + +   +        C++ S  + ++
Sbjct: 340 ASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVR 399

Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL--TVMSTDGDYGIIGQNFMMGHRIVF 329
           VP + L FS       R  +     +  F   CL  T  + DGD GI+G        ++F
Sbjct: 400 VPVVTLTFS-------RGAVMDLDVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLF 452

Query: 330 DRENLKLAWSHSKC 343
           D     L +    C
Sbjct: 453 DVGGSTLGFRPGAC 466


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/351 (21%), Positives = 134/351 (38%), Gaps = 63/351 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +D  +S ++  V CS P+C S     S C    + C Y+ DY+ + + +SG +V+D    
Sbjct: 142 FDALASQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYA-DKSITSGRIVED---- 196

Query: 101 ASFSKHAPQSSVQS---------SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
            +F+  +PQ +  S         +V  GCG+   G +    +  G+ G   G +S+PS L
Sbjct: 197 -TFTFRSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNES--GIAGFSRGPMSLPSQL 253

Query: 152 AKAGLIQNSFSICFDENDSGSVFFGDQGP--------ATQQSTSFLPIGEKYDAYFVGVE 203
            K     + F+   D   S     G  GP           QST F         Y++ ++
Sbjct: 254 -KVARFSHCFTAIADARTSPVFLGGAPGPDNLGAHATGPVQSTPF--ANSNGSLYYLTLK 310

Query: 204 SYCIGNSCLTQSGFQ------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
              +G + L  +                ++DSG     LP  +Y  +   F   V+  ++
Sbjct: 311 GITVGKTRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAF---VARVKL 367

Query: 252 SLQGNSW-----KYCYNASSEEMLKVPDMRLIFSK--------NQSFVVRNHIFSFPENE 298
            +   S        C+ A+    L          K        +      +++    E+E
Sbjct: 368 PVANESAADAESTLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDE 427

Query: 299 --GFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
               +  CL + S  D D  IIG        + +D E  KL +  ++C+++
Sbjct: 428 DGSGSGLCLVMNSAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARCDKM 478


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 134/334 (40%), Gaps = 58/334 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           + P+SSS+   + C+   C+    S  +C +    C Y   Y +  T+  GYL  + L +
Sbjct: 128 FQPASSSTFSKLPCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYTA--GYLATETLKV 183

Query: 101 --ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
             ASF           SV  GC  +       G +  G+ GLG G +S   L+ + G+  
Sbjct: 184 GDASFP----------SVAFGCSTENG----VGNSTSGIAGLGRGALS---LIPQLGV-- 224

Query: 159 NSFSICFDENDSGS---VFFGDQGPATQ---QSTSFL---PIGEKYDAYFVGVESYCIGN 209
             FS C     +     + FG     T    QST F+    +   Y  Y+V +    +G 
Sbjct: 225 GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY--YYVNLTGITVGE 282

Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
           + L         TQ+G     +VDSG + T+L  + Y  V   F    ++          
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGL 342

Query: 259 KYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTVFCLTVMSTDGD- 313
             C+ ++     + VP + L F     + V  + F+  E +     TV CL ++   GD 
Sbjct: 343 DLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTY-FAGVETDSQGSVTVACLMMLPAKGDQ 401

Query: 314 -YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
              +IG    M   +++D +    ++S + C +V
Sbjct: 402 PMSVIGNVMQMDMHLLYDLDGGIFSFSPADCAKV 435


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 66/269 (24%), Positives = 110/269 (40%), Gaps = 58/269 (21%)

Query: 132 AAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVFFGDQ------------ 178
           A P GV G G G +S+P+ L+  +  + N FS C   +     F GD+            
Sbjct: 212 AEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHS----FDGDRLRRPSPLILGRH 267

Query: 179 -----GPATQQSTSFLPI----GEKYDAYF-VGVESYCIGNSCL----------TQSGFQ 218
                G    +S  F+        K+  Y+ VG+    +G   +           +    
Sbjct: 268 NDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKRTVPAPEILKRVDEKGNGG 327

Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRIS--LQGNSWKYCYNASSEEMLKVPD 274
            +VDSG +FT LP   Y  VV +FDK V+   KR S          CY  +   + ++P 
Sbjct: 328 MVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTGLGPCYYLNG--LSQIPV 385

Query: 275 MRLIFSKNQSFVV---RNHIFSFPE-NEGF----TVFCLTVMSTD-------GDYGIIGQ 319
           ++L F  N S VV   +N+ + F +  +G      V C+ +M+ +       G    +G 
Sbjct: 386 LKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLMNGEDETELDGGPGATLGN 445

Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVID 348
               G  +V+D E  ++ ++  +C  + D
Sbjct: 446 YQQQGFEVVYDLEKERVGFAKKECALLWD 474


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 77/353 (21%), Positives = 136/353 (38%), Gaps = 73/353 (20%)

Query: 42  LSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDP--------CP-YIADYSTEDTS 88
           +  + P +SSSSK + C  P C+        C+   DP        CP YI  Y     S
Sbjct: 137 IPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGC-DPNTRNCTVGCPPYILQYGLG--S 193

Query: 89  SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
           ++G L+ + L     +            ++GC      S +    P G+ G G G VS+P
Sbjct: 194 TAGVLITEKLDFPDLT--------VPDFVVGC------SIISTRQPAGIAGFGRGPVSLP 239

Query: 149 SLLAKAGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEK------ 194
           S +          S  FD+         D+GS   G    +     ++ P  +       
Sbjct: 240 SQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGS---GHNSGSKTPGLTYTPFRKNPNVSNK 296

Query: 195 --YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
              + Y++ +    +G   +          T     ++VDSG++FTF+   ++  V  +F
Sbjct: 297 AFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEF 356

Query: 243 DKLVS--SKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENE 298
              +S  ++   L+  +    C+N S +  + VP++   F       +  ++ F+F  N 
Sbjct: 357 ASQMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNT 416

Query: 299 GFTVFCLTVMSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
                CLTV+S          G   I+G      + + +D EN +  ++  KC
Sbjct: 417 --DTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 132/320 (41%), Gaps = 45/320 (14%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DPS SS+ K   C               + CPY   Y+ E + S+G L  + + + S S
Sbjct: 103 FDPSKSSTFKEKRCH-------------GNSCPYEIIYADE-SYSTGILATETVTIQSTS 148

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSL--LAKAGLIQNS 160
               +  V +   IGCG   +     G  A+  G++GL +G  S+ S   L   GLI   
Sbjct: 149 G---EPFVMAETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLI--- 202

Query: 161 FSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
            S CF    +  + FG      G  T  +  F+   + +  Y++ +++  +G+  +   G
Sbjct: 203 -SYCFSSQGTSKINFGTNAVVAGDGTVAADMFIKKDQPF--YYLNLDAVSVGDKRIETLG 259

Query: 217 --FQA-----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK--YCYNASSE 267
             F A      +DSG ++T+LPT  Y  +V +           +   S +   CYN  + 
Sbjct: 260 TPFHAQDGNIFIDSGTTYTYLPTS-YCNLVREAVAASVVAANQVPDPSSENLLCYNWDTM 318

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
           E+   P + L F+     V+  +        G T FCL +   D     I  N    + +
Sbjct: 319 EIF--PVITLHFAGGADLVLDKYNMYVETITGGT-FCLAIGCVDPSMPAIFGNRAHNNLL 375

Query: 328 V-FDRENLKLAWSHSKCEEV 346
           V +D   L +++S + C  +
Sbjct: 376 VGYDSSTLVISFSPTNCSAL 395


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 64/317 (20%), Positives = 127/317 (40%), Gaps = 37/317 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P  SSS   + CS  LC++  S     + C Y   Y  + + + G +  + L   S S
Sbjct: 137 FNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVS 195

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                     ++  GCG    G      A  G++G+G G +S+PS L         FS C
Sbjct: 196 I--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYC 240

Query: 165 F---DENDSGSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
                 ++S ++  G   +   A   +T+ +   +    Y++ +    +G++ L      
Sbjct: 241 MTPIGSSNSSTLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSV 300

Query: 213 ----TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
               + +G    ++DSG + T+     Y  V   F   ++   ++   + +  C+   S+
Sbjct: 301 FKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSD 360

Query: 268 EM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
           +  L++P   + F      +   + F  P N    + CL + S+     I G        
Sbjct: 361 QSNLQIPTFVMHFDGGDLVLPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNLL 417

Query: 327 IVFDRENLKLAWSHSKC 343
           +V+D  N  +++  ++C
Sbjct: 418 VVYDTGNSVVSFLSAQC 434


>gi|452820752|gb|EME27790.1| aspartyl protease [Galdieria sulphuraria]
          Length = 559

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 69/339 (20%), Positives = 131/339 (38%), Gaps = 58/339 (17%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKS--------LKDPCPYIADYSTEDT 87
           S+Y     S S  V C+ PLC S        S C S        +   C +   Y     
Sbjct: 161 SKYSSHLQSKSSIVGCNDPLCSSNICEALGCSECSSSGACCANKMPQACGFFLRYGDGSG 220

Query: 88  SSSGYLVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG---LG 143
           +    LVD + +  ASF  H              G  +  +  + ++ DG++G+G   LG
Sbjct: 221 AEGALLVDQVQVGNASFVAHFG------------GILEDTTNFEQSSVDGILGMGYPALG 268

Query: 144 ------DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 197
                 +  + S+  ++ + QN FS+C        V  G        + +F+P+      
Sbjct: 269 CTPSCIEPLIDSMFRQSKIEQNMFSLCISVRGGHLVLGGYDSNMAASNITFVPMILSSPP 328

Query: 198 YFVGVE---SYCIGNSCLTQSGF-QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
            F  V    S  + N  L+  GF + +VDSG +   +  + +    ++    + +    +
Sbjct: 329 TFYAVSLGGSIRVDNEELSLDGFDKGIVDSGTTLLVISEQAF----IQLKNYLQTHYCQV 384

Query: 254 QG-----NSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFC 304
            G     +SW     C       +  +P + +  +     ++  + +    +  GF+++C
Sbjct: 385 PGLCDYQHSWFDSASCVILEESHLQHLPTLTIHVANRVDLILTPYDYMLQVQRNGFSLYC 444

Query: 305 LTVM---STDGD-YGIIGQNFMMGHRIVFDRENLKLAWS 339
           L +    S DG  + I+G   M  +  +FDR N ++ ++
Sbjct: 445 LGIQSLPSKDGSPFVILGNTVMTKYLTIFDRRNHRIGFA 483


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/251 (23%), Positives = 102/251 (40%), Gaps = 28/251 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
           +DPS S+S  N++C+  LC   S+       C +    C Y   Y  + + S GY   + 
Sbjct: 189 FDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYG-DSSFSVGYFSRER 247

Query: 98  LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
           L + +       + V  + + GCG+   G +   A   G++GLG   +S   +   A   
Sbjct: 248 LTVTA-------TDVVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAKY 295

Query: 158 QNSFSICFDENDS--GSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
           +  FS C     S  G + FG        + T F  I      Y + + +  +G   L  
Sbjct: 296 RKIFSYCLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPV 355

Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
              T S   A++DSG   T LP   Y  +   F + +S    + + +    CY+ S  ++
Sbjct: 356 SSSTFSTGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKV 415

Query: 270 LKVPDMRLIFS 280
             +P +   F+
Sbjct: 416 FSIPTIEFSFA 426


>gi|50552716|ref|XP_503768.1| YALI0E10175p [Yarrowia lipolytica]
 gi|49649637|emb|CAG79359.1| YALI0E10175p [Yarrowia lipolytica CLIB122]
          Length = 534

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 134/312 (42%), Gaps = 63/312 (20%)

Query: 135 DGVMGLGLGDV------------------SVPSLLAKAGLIQ-NSFSICFDE--NDSGSV 173
           +GVMG+GL  +                  ++P  +   GLI+ N++S+  +   +DSG+V
Sbjct: 192 NGVMGIGLAGLESTITYRGNDQISGNPYENLPMKMKAEGLIKANAYSLWLNNLSSDSGNV 251

Query: 174 FFGDQGPAT-------------QQSTSFLPIGEKYDAYFVGVESYCIGN-----SCLTQS 215
            FG    A              Q+S S  PI     A++VG++S  I +       +T+ 
Sbjct: 252 LFGGVDYAKIDGDLFTVKLVNPQRSVSSKPI-----AFYVGLDSVSITDVKGVSGFITKQ 306

Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL----QGNSWKYCYNASSEEMLK 271
              AL+DSG + T+LP + +  VV         +   +     G S    YN S    + 
Sbjct: 307 PVPALLDSGTTLTYLPQDAFNYVVRAMGATYDPQNGYVCPCKNGYSGHLDYNFSGAN-IS 365

Query: 272 VPDMRLIFS---KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
           VP  +L +    ++QS  V N  F   ++      CL +M    D+ I+G +F+    +V
Sbjct: 366 VPLYQLTYPIQLQSQSGRVVNAQFRNGDDA-----CLLLMQASQDHVILGDSFLRAAYVV 420

Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTS-NGQAA---APPS 384
           ++ ++ +++   +K    +  +++  +     ++ NP P      T+ N +       P 
Sbjct: 421 YNLDSYEVSMGQTKYG--VTDTNIVEIDSNGVKNANPAPEYSSSFTNVNSETTILRGAPG 478

Query: 385 TAKTAPSKSIAA 396
           +A + PS +++ 
Sbjct: 479 SADSNPSTTLSG 490


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 66/317 (20%), Positives = 126/317 (39%), Gaps = 37/317 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P  SSS   + CS  LC++ SS     + C Y   Y  + + + G +  + L   S S
Sbjct: 137 FNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYG-DGSETQGSMGTETLTFGSVS 195

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                     ++  GCG    G      A  G++G+G G +S+PS L         FS C
Sbjct: 196 I--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYC 240

Query: 165 FDENDSGS---VFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSG 216
                S +   +  G   +   A   +T+ +   +    Y++ +    +G++ L    S 
Sbjct: 241 MTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSA 300

Query: 217 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
           F           ++DSG + T+     Y  V  +F   ++   ++   + +  C+   S+
Sbjct: 301 FALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSD 360

Query: 268 -EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
              L++P   + F      +   + F  P N    + CL + S+     I G        
Sbjct: 361 PSNLQIPTFVMHFDGGDLELPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNML 417

Query: 327 IVFDRENLKLAWSHSKC 343
           +V+D  N  ++++ ++C
Sbjct: 418 VVYDTGNSVVSFASAQC 434


>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 68/148 (45%), Gaps = 25/148 (16%)

Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK------YCYNASSEEMLKVPDM 275
           DSG +FT LP + Y  VV +FD  V   R+  + +  +       CY  +  + +KVP +
Sbjct: 353 DSGTTFTMLPAKFYNSVVEEFDSRVG--RVHERADRVEPSSGMSPCYYLN--QTVKVPAL 408

Query: 276 RLIFSKNQSFVV---RNHIFSFPE-----NEGFTVFCLTVMS-------TDGDYGIIGQN 320
            L F+ N+S V    RN+ + F +      E   + CL +M+         G   I+G  
Sbjct: 409 VLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNY 468

Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID 348
              G  +V+D  N ++ ++  KC  + D
Sbjct: 469 QQQGFEVVYDLLNRRVGFAKRKCASLWD 496


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 83/328 (25%), Positives = 132/328 (40%), Gaps = 65/328 (19%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           S ++P SSS+   V CS P+C++R+       SC      C ++A    + TS  G L  
Sbjct: 101 SVFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLC-HVAISYADATSIEGNLAH 159

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
           +   + S ++           + GC      S   + A   G+MG+  G +S  + L  +
Sbjct: 160 ETFVIGSVTRPG--------TLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS 211

Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
                 FS C   +D SG +  GD            P   QST  LP  ++  AY V +E
Sbjct: 212 -----KFSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTP-LPYFDRV-AYTVQLE 264

Query: 204 SYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
              +G+  L+  +S F        Q +VDSG  FTFL   +Y  +  +F        +LV
Sbjct: 265 GIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLV 324

Query: 247 SSKRISLQGNSWKYCYNASSE---EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF--- 300
                  QG +   CY   S        +P + L+F   +  V    +       G    
Sbjct: 325 DDPDFVFQG-TMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGK 383

Query: 301 -TVFCLTVMSTDGDYGIIG-QNFMMGHR 326
             V+C T  ++D    ++G + F++GH 
Sbjct: 384 EEVYCFTFGNSD----LLGIEAFVIGHH 407


>gi|194745306|ref|XP_001955129.1| GF16404 [Drosophila ananassae]
 gi|190628166|gb|EDV43690.1| GF16404 [Drosophila ananassae]
          Length = 463

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/307 (21%), Positives = 127/307 (41%), Gaps = 52/307 (16%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCPYIA----------DYSTEDTSSSGYLVDDILHL 100
           + S N+    P CKS++ C+S K   P  +          + +    S  G L +D + +
Sbjct: 170 TGSSNIWVPGPKCKSKA-CRSHKKFHPAKSSTYKKKTKAFEITYGSGSVKGRLAEDTVSI 228

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
              +      ++ SS        + G   + +  DG++GLG   +SV ++     L+QN 
Sbjct: 229 GGLTVDNQTFAMTSS--------EPGEAFEESKFDGILGLGYQAISVDNVKT---LMQNM 277

Query: 161 ----------FSICFDENDS----GSVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
                     F+IC     +    GS+F G++         S  + P+ +K   + + ++
Sbjct: 278 CSQNVITSCIFAICLRGGGTSAKGGSLFIGNKNTTAYTGSNSYVYTPVTKK-GYWQMKLD 336

Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
            + +G++ ++ +  QA+VDSG S    P   Y E V +     +S      G  W  C  
Sbjct: 337 GFYVGSTKVSGTA-QAIVDSGTSLIAAPLHAYKEFVKETGCTPTS-----SGECWVKCSK 390

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
              + +  + D +++   +++ +           +G TV  L V   + ++ I+G  F+ 
Sbjct: 391 TIPDIVFVIADKKIVIKGDKAKM------KVKTQKGHTVCLLVVTYEETNFWILGDPFLR 444

Query: 324 GHRIVFD 330
            +  VFD
Sbjct: 445 NNCAVFD 451


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 76/362 (20%), Positives = 136/362 (37%), Gaps = 66/362 (18%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           +++DPS SS+   VSC    C++  R++C    + C Y+  Y  + ++++G L  +    
Sbjct: 144 TQFDPSRSSTYGRVSCQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTF 201

Query: 101 A-SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
               +  +P+      V  GC     GS+          G     VS+ + L  A  +  
Sbjct: 202 DDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGA----VSLVTQLGGATSLGR 257

Query: 160 SFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
            FS C      N S ++ FG     T+   +  P+               +GN  +  + 
Sbjct: 258 RFSYCLVPHSVNASSALNFGALADVTEPGAASTPL---------------VGNKTVASAA 302

Query: 217 F-QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM---LKV 272
             + +VDSG + TFL   +   +V +  + ++   +       + CYN +  E+     +
Sbjct: 303 SSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESI 362

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMST------------------ 310
           PD+ L F    +  ++      PEN    V     CL +++T                  
Sbjct: 363 PDLTLEFGGGAAVALK------PENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIH 416

Query: 311 ---DGDYGIIGQNFM---MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPN 364
              D D G +G   +      RI+ D          S    ++D+    +  PP  QSP+
Sbjct: 417 VGYDLDAGTVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPV-QSPD 475

Query: 365 PL 366
            L
Sbjct: 476 GL 477


>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
          Length = 492

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 76/326 (23%), Positives = 127/326 (38%), Gaps = 51/326 (15%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +D S S++  +V C  P C S ++C S    CP+   +        G    D+L +    
Sbjct: 191 FDTSQSTTFTHVPCDSPDCPSTANC-SAGSVCPFNLFF------VEGTFSQDVLTV---- 239

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV-----SVPSLLAKAGLIQN 159
             AP  +VQ    + C        LD  A DG+  +G  D+     S+PS L  AG    
Sbjct: 240 --APSVAVQDFTFV-C--------LDAGASDGMPEVGTLDLSRDRNSLPSRL--AGSASA 286

Query: 160 SFSICFDE--NDSGSVFFGDQGPATQQS-TSFLPIGEKYDA-----YFVGVESYCIGNSC 211
           +FS C  +  +  G +  GD       + T+  P+    D      YF+ V    +G+  
Sbjct: 287 AFSYCMPQYPDSPGFLSLGDDATVRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVD 346

Query: 212 LT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNA 264
           L        +    +V++G +FT L  + Y  +   F + ++    S+ G   +  CYN 
Sbjct: 347 LPIPSGTFGNNASTIVEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNF 406

Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNH---IFSFPENEGFTVFCLTVMS----TDGDYGII 317
           +  + L VP +   F    S ++       +  P    FTV CL   +     D    +I
Sbjct: 407 TGLQELTVPLVEFKFGNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVI 466

Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
           G   +    +V+D     + +    C
Sbjct: 467 GAYSLATTEVVYDVAGGTVGFIPESC 492


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 129/320 (40%), Gaps = 39/320 (12%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++PS SS+ ++++C   LC+        ++ C Y   Y            D    +  FS
Sbjct: 123 FNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYG-----------DGSFTVGEFS 171

Query: 105 KHAPQ--SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
                  S+  +SV IGCG    G +   A    ++GLG G +S PS + +  L  + FS
Sbjct: 172 TETLSFGSNAVNSVAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFS 226

Query: 163 ICFDENDS-GSV--FFGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSC 211
            C    +S GSV   FG+Q  A+    + L    K D ++        VG  S  I    
Sbjct: 227 YCLPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGS 286

Query: 212 L-----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNAS 265
           L     T +G   ++DSG + T L T  Y  +   F   + S      G S +  CY+ S
Sbjct: 287 LSLDSSTGNG-GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLS 345

Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
               + +P +  +F+   +  +       P +N G   +CL       ++ IIG      
Sbjct: 346 GRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIGNIQQQS 403

Query: 325 HRIVFDRENLKLAWSHSKCE 344
            R+ FD    ++    ++C 
Sbjct: 404 FRMSFDSTGNRVGIGANQCN 423


>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
          Length = 394

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 130/316 (41%), Gaps = 43/316 (13%)

Query: 47  PSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
           PS      N++C  H    +  S    K+   +   Y +   S SGYL  D +++A    
Sbjct: 102 PSKQCYFTNIACLMHNKYDANKSSSYKKNGTEFAIHYGS--GSLSGYLSTDTVNIAGLGI 159

Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-Q 158
              Q+  ++         + G    GA  DG++GLG   ++V  +      + + GLI Q
Sbjct: 160 EG-QTFAEA-------LSEPGLVFIGAKFDGILGLGYSSIAVDGVKPPFYQMYEQGLISQ 211

Query: 159 NSFSICFDEN----DSGSVFFGDQGPATQQST-SFLPIGEKYDAYF-VGVESYCIGNSCL 212
             FS   + +    + G + FG   P   +   ++LP+  K  AY+ + ++S  +GN  L
Sbjct: 212 PVFSFYLNRDPKAPEGGEIIFGGSDPNHYKGEFTYLPVTRK--AYWQIKMDSASMGNLNL 269

Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
            Q G Q + D+G S   LP           +K +    I + G      Y  + E + K+
Sbjct: 270 CQGGCQVIADTGTSLIALP----PSEATSINKAIGGTPI-MGGQ-----YMVACENIPKL 319

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHR 326
           P +R +    ++F +    +     +     CL+  M  D     G   I+G  F+  + 
Sbjct: 320 PVIRFVLG-GKTFELEGKDYILRIAQMGKTICLSGFMGIDIPPPNGPIWILGDVFIGKYY 378

Query: 327 IVFDRENLKLAWSHSK 342
             FD  N ++ ++ +K
Sbjct: 379 TEFDMGNDRVGFAEAK 394


>gi|408397130|gb|EKJ76280.1| hypothetical protein FPSE_03535 [Fusarium pseudograminearum CS3096]
          Length = 467

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/303 (20%), Positives = 131/303 (43%), Gaps = 36/303 (11%)

Query: 117 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFD--ENDSGSV 173
           +IG G     + +D   P+    +       P+ LA  G+I  N++S+  D  E+ +G +
Sbjct: 176 VIGIGYTSNEAVVDQPDPEFYKNM-------PARLASDGVIASNAYSLYLDDLESATGKI 228

Query: 174 FFGDQGPATQQ------STSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGAS 226
            FG  G   Q       +   + I ++Y  ++V ++S   G+  + +      ++DSG++
Sbjct: 229 LFG--GVDEQHFIGDLVTVPIMKINDEYSEFYVKLQSINSGSEIVGEDLDLGVVLDSGST 286

Query: 227 FTFLPTE----IYAEVVVKFDKLVSSKRI----SLQGNSWKYCYNASSEEMLKVPDMRLI 278
            T+LP      IY  V   +++  ++  +    + QG +  + + + +E  + + ++ L 
Sbjct: 287 LTYLPASVTDSIYQLVGADYEEGQTTAYVPCDLANQGGNLTFKFTSPAEITVPLSELILD 346

Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
           F+      +     SF   +    F +   ++     I+G  F+    +VFD +N +++ 
Sbjct: 347 FTD-----ITGRQMSFTNGQAACSFGIAPSTSQ--VSILGDTFLRSAYVVFDLDNNEISL 399

Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
           + S  E     SH+  +       P+   +   QS+ +  AA   S  ++  + SI A A
Sbjct: 400 AQSNSEAT--GSHILEISKGKNAVPSATGSEGPQSSGSENAAGSLSPLESTGAVSILAGA 457

Query: 399 QQL 401
             L
Sbjct: 458 MAL 460


>gi|302763589|ref|XP_002965216.1| hypothetical protein SELMODRAFT_27315 [Selaginella moellendorffii]
 gi|300167449|gb|EFJ34054.1| hypothetical protein SELMODRAFT_27315 [Selaginella moellendorffii]
          Length = 163

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 61/134 (45%), Gaps = 10/134 (7%)

Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE-------MLKV 272
           + DSG + TFLP  +Y +V+  F + ++   ++        CYN S +         L  
Sbjct: 32  IFDSGTTLTFLPLGVYIQVISVFSRRINLPLVNGTSVGLDLCYNISLQRDYTFPSLALHF 91

Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDR 331
           PD  +   ++   +V +   +   NE  +V CL +MS+      IIG     G+ I+FD 
Sbjct: 92  PDAWMNLHQDNYIIVPSRADAEAWNE--SVACLAIMSSASIGINIIGNVMQEGYHIMFDN 149

Query: 332 ENLKLAWSHSKCEE 345
           E   + ++ + C E
Sbjct: 150 EKSTVTFAPASCSE 163


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/263 (23%), Positives = 105/263 (39%), Gaps = 52/263 (19%)

Query: 134 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFGDQGPATQQS 185
           P GV G G G +S+P  LA    +   FS C   +            +  G    A  ++
Sbjct: 245 PVGVAGFGRGPLSLPGQLAPQ--LSGRFSYCLVSHSFRADRLIRPSPLILGRSPDAAAET 302

Query: 186 TSFL--PI--GEKYDAYF-VGVESYCIGNSCLT---------QSGFQAL-VDSGASFTFL 230
             F+  P+    K+  ++ V +E+  +G + +          ++G   + VDSG +FT L
Sbjct: 303 GGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTML 362

Query: 231 PTEIYAEV------VVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 284
           P E YA V       +       ++R   Q      CY+ ++ +   VP + L F  N +
Sbjct: 363 PNETYARVAEAFARAMAAAGFARAERAEEQ-TGLTPCYHYAASDR-GVPPLALHFRGNAT 420

Query: 285 FVV--RNHIFSFPENEGF-------TVFCLTVMS----------TDGDYGIIGQNFMMGH 325
             +  RN+   F   E          V CL +M+           DG  G +G     G 
Sbjct: 421 VALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGF 480

Query: 326 RIVFDRENLKLAWSHSKCEEVID 348
            +V+D +  ++ ++  +C E+ D
Sbjct: 481 EVVYDVDAGRVGFARRRCTELWD 503


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 76/318 (23%), Positives = 132/318 (41%), Gaps = 35/318 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++PS SS+ ++++C   LC+        ++ C Y   Y  + + + G    + L      
Sbjct: 123 FNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYG-DGSFTVGEFSTETLSFG--- 178

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                S+  +SV IGCG    G +   A    ++GLG G +S PS + +  L  + FS C
Sbjct: 179 -----SNAVNSVAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYC 228

Query: 165 FDENDS-GSV--FFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL-------- 212
               +S GSV   FG+Q  A+    + L    K D  Y+V +    +G + +        
Sbjct: 229 LPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLS 288

Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSE 267
               T +G   ++DSG + T L T  Y  +   F   + S      G S +  CY+ S  
Sbjct: 289 LDSSTGNG-GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGR 347

Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
             + +P +  +F+   +  +       P +N G   +CL       ++ IIG       R
Sbjct: 348 SSIMLPAVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIGNIQQQSFR 405

Query: 327 IVFDRENLKLAWSHSKCE 344
           + FD    ++    ++C 
Sbjct: 406 MSFDSTGNRVGIGANQCN 423


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 72/315 (22%), Positives = 123/315 (39%), Gaps = 33/315 (10%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           +DP+ S+S   VSCS  +C    +       C Y   Y  + + + G L    L   +F 
Sbjct: 182 FDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYG-DGSYTKGTLA---LETLTFG 237

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
           +     ++  SV IGCG +  G ++  A   G+        S+  +    G    +FS C
Sbjct: 238 R-----TMVRSVAIGCGHRNRGMFVGAAGLLGLG-----GGSMSFVGQLGGQTGGAFSYC 287

Query: 165 F---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS-------- 210
                 + SGS+ FG +  A     +++P+     A   Y++G+    +G          
Sbjct: 288 LVSRGTDSSGSLVFGRE--ALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEV 345

Query: 211 -CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
             LT+ G   +V D+G + T LPT  Y      F    ++   +     +  CY+     
Sbjct: 346 FRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFV 405

Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
            ++VP +   FS      +    F  P ++  T FC     +     I+G     G +I 
Sbjct: 406 SVRVPTVSFYFSGGPILTLPARNFLIPMDDAGT-FCFAFAPSTSGLSILGNIQQEGIQIS 464

Query: 329 FDRENLKLAWSHSKC 343
           FD  N  + +  + C
Sbjct: 465 FDGANGYVGFGPNIC 479


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 64/317 (20%), Positives = 126/317 (39%), Gaps = 37/317 (11%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
           ++P  SSS   + CS  LC++  S     + C Y   Y  + + + G +  + L   S S
Sbjct: 137 FNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVS 195

Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
                     ++  GCG    G      A  G++G+G G +S+PS L         FS C
Sbjct: 196 I--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYC 240

Query: 165 F---DENDSGSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
                 + S ++  G   +   A   +T+ +   +    Y++ +    +G++ L      
Sbjct: 241 MTPIGSSTSSTLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSV 300

Query: 213 ----TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
               + +G    ++DSG + T+     Y  V   F   ++   ++   + +  C+   S+
Sbjct: 301 FKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSD 360

Query: 268 EM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
           +  L++P   + F      +   + F  P N    + CL + S+     I G        
Sbjct: 361 QSNLQIPTFVMHFDGGDLVLPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNLL 417

Query: 327 IVFDRENLKLAWSHSKC 343
           +V+D  N  +++  ++C
Sbjct: 418 VVYDTGNSVVSFLFAQC 434


>gi|413950927|gb|AFW83576.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 316

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 70/271 (25%), Positives = 106/271 (39%), Gaps = 53/271 (19%)

Query: 116 VIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEND 169
           V++GC    TG S+L   A DGV+ LG  +VS  S    A      FS C        N 
Sbjct: 58  VVLGCTTSYTGESFL---ASDGVLSLGYSNVSFAS--RAAARFGGRFSYCLVDHLAPRNA 112

Query: 170 SGSVFFGDQ------------------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
           +  + FG                     P  +Q T  L        Y V V    +    
Sbjct: 113 TSYLTFGPNPAVSSASASRTACAGSAAAPGARQ-TPLLLDHRMRPFYAVAVNGVSVDGEL 171

Query: 212 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCY 262
           L         Q G  A++DSG S T L +  Y  VV     KLV   R+++  + + YCY
Sbjct: 172 LRIPRLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVAALGKKLVGLPRVAM--DPFDYCY 229

Query: 263 NASS----EEM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--- 314
           N +S    E++ + VP + + F+ +         +      G  V C+ +   +GD+   
Sbjct: 230 NWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDAAPG--VKCIGLQ--EGDWPGV 285

Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
            +IG      H   FD +N +L +  S+C +
Sbjct: 286 SVIGNILQQEHLWEFDLKNRRLRFKRSRCMQ 316


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 80/360 (22%), Positives = 124/360 (34%), Gaps = 55/360 (15%)

Query: 30  LVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS 89
           +  G    ++ + + Y P+ SSS + + CS   C             PY    S     S
Sbjct: 172 MSMGGEGAKEASKNWYRPAKSSSWRRIRCSQKECAV----------LPYNTCQSPSKAES 221

Query: 90  SGYLV---DDILHLASFSKHAPQSSVQSS-------VIIGCGRKQTGSYLDGAAPDGVMG 139
             Y     D  + +  + K     +V          +I+GC   + G  +D  A DGV+ 
Sbjct: 222 CSYFQKTQDGTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVD--AHDGVLS 279

Query: 140 LGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQ----GPATQQSTSFLP 190
           LG GD+S     AK       FS C        + S  + FG      GP T ++     
Sbjct: 280 LGNGDMSFAVHAAKR--FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYN 337

Query: 191 I------GEKYDAYFVGVESYCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVK 241
           +      G +     VG E   I +       F     ++D+  S T L  E YA V   
Sbjct: 338 VDVKPAYGAQVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAA 397

Query: 242 FDKLVSSKRISLQGNSWKYCYN-------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
            D+ +S      +   ++YCY              + +P   +  +              
Sbjct: 398 LDRHLSHLPRVYELEGFEYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVM 457

Query: 295 PENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
           PE E   V CL       G  GI+G  FM  +    D  + K+ +   KC    +  H+H
Sbjct: 458 PEVEP-GVACLAFRKLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC----NTHHLH 512


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 79/335 (23%), Positives = 133/335 (39%), Gaps = 59/335 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
           + P+SSS+   + C+   C+    S  +C +    C Y   Y +  T+  GYL  + L +
Sbjct: 128 FQPASSSTFSKLPCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYTA--GYLATETLKV 183

Query: 101 --ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
             ASF           SV  GC  +       G +  G+ GLG G +S   L+ + G+  
Sbjct: 184 GDASFP----------SVAFGCSTENG----VGNSTSGIAGLGRGALS---LIPQLGV-- 224

Query: 159 NSFSICFDENDSGS---VFFGDQGPATQ---QSTSFL---PIGEKYDAYFVGVESYCIGN 209
             FS C     +     + FG     T    QST F+    +   Y  Y+V +    +G 
Sbjct: 225 GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY--YYVNLTGITVGE 282

Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
           + L         TQ+G     +VDSG + T+L  + Y  V   F    +           
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGL 342

Query: 259 KYCYNAS--SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTVFCLTVMSTDGD 313
             C+ ++      + VP + L F     + V  + F+  E +     TV CL ++   GD
Sbjct: 343 DLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTY-FAGVETDSQGSVTVACLMMLPAKGD 401

Query: 314 --YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
               +IG    M   +++D +    +++ + C +V
Sbjct: 402 QPMSVIGNVMQMDMHLLYDLDGGIFSFAPADCAKV 436


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 78/333 (23%), Positives = 138/333 (41%), Gaps = 57/333 (17%)

Query: 45  YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  SSS   V CS  LC +  RS+C   KD C Y+  Y  + +S+ G L  +      
Sbjct: 150 FDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYG-DYSSTRGLLATETFTFED 208

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
                 ++S+ S +  GCG +  G   DG +   G++GLG G +S+ S L      +  F
Sbjct: 209 ------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKF 253

Query: 162 SICF----DENDSGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVES 204
           S C     D   S S+F G               G  T ++ S L   ++   Y++ ++ 
Sbjct: 254 SYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVT-KTMSLLRNPDQPSFYYLELQG 312

Query: 205 YCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
             +G   L+  +S F+         ++DSG + T+L    +  +  +F   +S       
Sbjct: 313 ITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG 372

Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
                 C+   ++ + + VP  +LIF     +      N++ +   +    V CL + S+
Sbjct: 373 STGLDLCFKLPNAAKNIAVP--KLIFHFKGADLELPGENYMVA---DSSTGVLCLAMGSS 427

Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
           +G   I G        ++ D E   + +  ++C
Sbjct: 428 NG-MSIFGNVQQQNFNVLHDLEKETVTFVPTEC 459


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 71/321 (22%), Positives = 124/321 (38%), Gaps = 42/321 (13%)

Query: 45  YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
           +DP  S +   + CS P C+   S  C + +  C Y   Y     +   +  + +    +
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL----T 239

Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSF 161
           F ++  +      V +GCG    G ++  A   G+           S   + G      F
Sbjct: 240 FRRNRVKG-----VALGCGHDNEGLFVGAAGLLGLG------KGKLSFPGQTGHRFNQKF 288

Query: 162 SICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---C 211
           S C  +  + S    V FG+   A  +   F P+    K D  Y+VG+    +G +    
Sbjct: 289 SYCLVDRSASSKPSSVVFGNA--AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPG 346

Query: 212 LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
           +T S F+         ++DSG S T L    Y  +   F     + + +   + +  C++
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 406

Query: 264 ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
            S+   +KVP + L F   + S    N++     N  F   C     T G   IIG    
Sbjct: 407 LSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKF---CFAFAGTMGGLSIIGNIQQ 463

Query: 323 MGHRIVFDRENLKLAWSHSKC 343
            G R+V+D  + ++ ++   C
Sbjct: 464 QGFRVVYDLASSRVGFAPGGC 484


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 85/346 (24%), Positives = 143/346 (41%), Gaps = 62/346 (17%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
           + +DP+ SSS   V CS   C  R+       SC S    C  I  Y+ + +SS G L  
Sbjct: 121 TTFDPNRSSSYSPVPCSSLTCTDRTRDFPIPASCDS-NQLCHAILSYA-DASSSEGNLAS 178

Query: 96  DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKA 154
           D  ++ +        S     I GC      +  +  + + G+MG+  G +S  S +   
Sbjct: 179 DTFYIGN--------SDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFP 230

Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
                 FS C  ++D SG +  GD            P  Q ST  LP  ++  AY V +E
Sbjct: 231 -----KFSYCISDSDFSGVLLLGDANFSWLMPLNYTPLIQISTP-LPYFDRV-AYTVQLE 283

Query: 204 SYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
              + +  L           T +G Q +VDSG  FTFL   +Y+ +  +F    S     
Sbjct: 284 GIKVSSKLLPLPKSVFVPDHTGAG-QTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRV 342

Query: 253 LQGNSWKY------CYNA--SSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGF-T 301
           L+  ++ +      CY    S   +  +P + L+F   +  V  +  ++  P E  G  +
Sbjct: 343 LEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDS 402

Query: 302 VFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
           V+C T  ++D    +  +IG +      + FD E  ++ ++  +C+
Sbjct: 403 VYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQCD 448


>gi|195501954|ref|XP_002098017.1| GE10127 [Drosophila yakuba]
 gi|194184118|gb|EDW97729.1| GE10127 [Drosophila yakuba]
          Length = 465

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 77/319 (24%), Positives = 134/319 (42%), Gaps = 58/319 (18%)

Query: 51  SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS----------SGYLVDDILHL 100
           + S N+    P CKS++ C+  K   P  +    ++  S          +G L  D + +
Sbjct: 169 TGSSNIWVPGPHCKSKA-CQKHKKYHPAKSSTYVKNGKSFAITYGSGSVAGVLAKDTVRI 227

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN- 159
           A  +  A Q+   ++       K+ G+    +  DG++GLG   +SV ++     L++N 
Sbjct: 228 AGLT-VANQTFAMTT-------KEPGTTFVTSNFDGILGLGYRSISVDNVKT---LVENM 276

Query: 160 ---------SFSICFDENDSG----SVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
                     F+IC     S     ++ FG    +      S ++ P+  K    F   +
Sbjct: 277 CSEDVITSCKFAICMKGGGSSSRGGALIFGSSNTSAYSGSNSYTYTPVTTKGYWQFTLQD 336

Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
            Y +G++ ++ S  QA+VDSG S    PT IY     K +K++     S  G  W  C  
Sbjct: 337 IY-VGSTKVSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGCTATS-SGECWMKCAK 389

Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIF--SFPENEGFTVFCLTVMSTDGDYGII-GQN 320
                  K+PD   + +  + FVV+ +        N G TV C++ +S   D  +I G  
Sbjct: 390 -------KIPDFTFVIA-GKKFVVKGNKMKVKVKTNRGKTV-CISAVSEVPDEPVILGDA 440

Query: 321 FMMGHRIVFDRENLKLAWS 339
           F+     VFD  N ++ ++
Sbjct: 441 FIRHFCTVFDLANNRIGFA 459


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 69/322 (21%), Positives = 123/322 (38%), Gaps = 42/322 (13%)

Query: 44  EYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
            + P+ SS+ + V C  P C    S S    +   C +   Y+   ++    L  D L L
Sbjct: 142 SFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL 199

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
                   +++V  S   GC R  +G   +   P G++G G G +S   L        + 
Sbjct: 200 --------ENNVVVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSV 246

Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           FS C       N SG++  G  G P   ++T  L    +   Y+V +    +G+  +   
Sbjct: 247 FSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVP 306

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                    +G   ++D+G  FT L   +YA V   F   V +      G  +  CYN +
Sbjct: 307 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT 365

Query: 266 SEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNF 321
               + VP +  +F+   +  +   ++     + G     +    +DG      ++    
Sbjct: 366 ----VSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQ 421

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
               R++FD  N ++ +S   C
Sbjct: 422 QQNQRVLFDVANGRVGFSRELC 443


>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
          Length = 178

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 2/80 (2%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
           R L+ YDP SS SSK V C   +C SR  C ++   CPYI  Y+ +   + G L  D+LH
Sbjct: 101 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 158

Query: 100 LASFSKHAPQSSVQSSVIIG 119
                 +       +SV  G
Sbjct: 159 YHQLYGNGQTQPTSTSVTFG 178


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 69/322 (21%), Positives = 123/322 (38%), Gaps = 42/322 (13%)

Query: 44  EYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
            + P+ SS+ + V C  P C    S S    +   C +   Y+   ++    L  D L L
Sbjct: 123 SFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL 180

Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
                   +++V  S   GC R  +G   +   P G++G G G +S   L        + 
Sbjct: 181 --------ENNVVVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSV 227

Query: 161 FSICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
           FS C       N SG++  G  G P   ++T  L    +   Y+V +    +G+  +   
Sbjct: 228 FSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVP 287

Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
                    +G   ++D+G  FT L   +YA V   F   V +      G  +  CYN +
Sbjct: 288 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT 346

Query: 266 SEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNF 321
               + VP +  +F+   +  +   ++     + G     +    +DG      ++    
Sbjct: 347 ----VSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQ 402

Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
               R++FD  N ++ +S   C
Sbjct: 403 QQNQRVLFDVANGRVGFSRELC 424


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 78/361 (21%), Positives = 138/361 (38%), Gaps = 69/361 (19%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLV 94
           R+   + P +S +  +V C    C+SR      +C      C     Y+ + +SS G L 
Sbjct: 104 RSALSFRPRASLTFASVPCGSAQCRSRDLPSPPACDGASKQCRVSLSYA-DGSSSDGALA 162

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            ++    +  +  P  +       GC      +  DG A  G++G+  G +S  S  +  
Sbjct: 163 TEVF---TVGQGPPLRAA-----FGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST- 213

Query: 155 GLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIG-----------EKYD--AYFV 200
                 FS C  D +D+G +  G           FLP+              +D  AY V
Sbjct: 214 ----RRFSYCISDRDDAGVLLLG------HSDLPFLPLNYTPLYQPAMPLPYFDRVAYSV 263

Query: 201 GVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
            +    +G   L           T +G Q +VDSG  FTFL  + Y+ +  +F +     
Sbjct: 264 QLLGIRVGGKPLPIPASVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSALKAEFSRQTKPW 322

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMR----------LIFSKNQSFVVRNH-IFSFP--E 296
             +L  N   + +  + +   +VP  R          L+F+  Q  V  +  ++  P   
Sbjct: 323 LPAL--NDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGER 380

Query: 297 NEGFTVFCLTVMSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
             G  V+CLT  + D       +IG +  M   + +D E  ++  +  +C+   ++  + 
Sbjct: 381 RGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPIRCDVASERLGLM 440

Query: 354 L 354
           L
Sbjct: 441 L 441


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 131/327 (40%), Gaps = 47/327 (14%)

Query: 43  SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCP---------YIADYSTEDTSSSG 91
           + + P+ S++   + CS  +C    R +C                 Y   Y     ++SG
Sbjct: 132 TAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSG 191

Query: 92  YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
           YL  D     +        +    V+ GC      SY D A   GV+G+G G++S   L+
Sbjct: 192 YLATDTFTFGA--------TAVPGVVFGC---SDASYGDFAGASGVIGIGRGNLS---LI 237

Query: 152 AKAGLIQNSFSICFDE-NDSGS----VFFGDQG-PATQ--QSTSFLPIGEKYDAYFVGVE 203
           ++    + S+ +   E  D GS    + FGD   P T+  QST  L      D Y+V + 
Sbjct: 238 SQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLT 297

Query: 204 SYCIGNSCLTQ--SGFQALVDSGASFTFL----PTEIYAEVVVKFDKLVSSKRISL---Q 254
              +  + L    +G   L  +G     L    P     +      +   + RI L    
Sbjct: 298 GVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVN 357

Query: 255 GNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
           G++      CYNASS   +KVP + L+F       +    + + +N+   + CLT++ + 
Sbjct: 358 GSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDND-TGLECLTMLPSQ 416

Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAW 338
           G   ++G     G  +++D +  +L +
Sbjct: 417 GG-SVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 78/361 (21%), Positives = 138/361 (38%), Gaps = 69/361 (19%)

Query: 40  RNLSEYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLV 94
           R+   + P +S +  +V C    C+SR      +C      C     Y+ + +SS G L 
Sbjct: 105 RSALSFRPRASLTFASVPCDSAQCRSRDLPSPPACDGASKQCRVSLSYA-DGSSSDGALA 163

Query: 95  DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
            ++    +  +  P  +       GC      +  DG A  G++G+  G +S  S  +  
Sbjct: 164 TEVF---TVGQGPPLRAA-----FGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST- 214

Query: 155 GLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIG-----------EKYD--AYFV 200
                 FS C  D +D+G +  G           FLP+              +D  AY V
Sbjct: 215 ----RRFSYCISDRDDAGVLLLG------HSDLPFLPLNYTPLYQPAMPLPYFDRVAYSV 264

Query: 201 GVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
            +    +G   L           T +G Q +VDSG  FTFL  + Y+ +  +F +     
Sbjct: 265 QLLGIRVGGKPLPIPASVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSALKAEFSRQTKPW 323

Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMR----------LIFSKNQSFVVRNH-IFSFP--E 296
             +L  N   + +  + +   +VP  R          L+F+  Q  V  +  ++  P   
Sbjct: 324 LPAL--NDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGER 381

Query: 297 NEGFTVFCLTVMSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
             G  V+CLT  + D       +IG +  M   + +D E  ++  +  +C+   ++  + 
Sbjct: 382 RGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPIRCDVASERLGLM 441

Query: 354 L 354
           L
Sbjct: 442 L 442


>gi|431910128|gb|ELK13201.1| Cathepsin D [Pteropus alecto]
          Length = 375

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 68/275 (24%), Positives = 120/275 (43%), Gaps = 34/275 (12%)

Query: 88  SSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
           S SGYL  D + +   S  +P SSV+    I G   KQ G     A  DG++G+    +S
Sbjct: 111 SLSGYLSQDTVSVPCKSAPSPPSSVKVERQIFGEATKQPGITFIAAKFDGILGMAYPRIS 170

Query: 147 V-------PSLLAKAGLIQNSFSICFDENDSGS-----VFFGDQGPATQQSTSFLPIGEK 194
           V        +L+ +  + +N FS   + + +       +  G        S S+L +  K
Sbjct: 171 VNNVLPVFDNLMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDSKYYTGSLSYLNVTRK 230

Query: 195 YDAYF-VGVESYCIGNS-CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
             AY+ V +E   +GNS  L ++G +A+VD+G S    P     E V    K + +  + 
Sbjct: 231 --AYWQVHMEQVDVGNSLTLCKAGCEAIVDTGTSLVVGPV----EEVRALQKAIGAVPL- 283

Query: 253 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD 311
           +QG      Y    E++  +P++ L     + + +    ++   ++G    CL+  M  D
Sbjct: 284 IQGE-----YMIPCEKVSSLPEVTLKLG-GKGYKLGAEDYTLKVSQGGKTICLSGFMGMD 337

Query: 312 -----GDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
                G   I+G  F+  +  VFDR+  ++  + +
Sbjct: 338 IPPPGGPLWILGDVFIGRYYTVFDRDENRVGLAEA 372


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 76/310 (24%), Positives = 118/310 (38%), Gaps = 22/310 (7%)

Query: 43  SEYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
           S +DPSSSS+    SCS   C    +S+     +   C YI +Y    +++      D L
Sbjct: 162 SLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTG-TYSSDTL 220

Query: 99  HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
            L         SS  +    GC + ++G + D    DG+MGLG G  S+ S    AG   
Sbjct: 221 TLG--------SSAMTDFQFGCSQSESGGFND--QTDGLMGLGGGAQSLAS--QTAGTFG 268

Query: 159 NSFSICFDENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QS 215
            +FS C       S F     G +    T  L   +    Y V +ES  +G+  L    S
Sbjct: 269 TAFSYCLPPTSGSSGFLTLGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTS 328

Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
            F A  L+DSG   T LP   Y+ +   F   +     +        C++ S +  + +P
Sbjct: 329 VFSAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIP 388

Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 333
            + L+FS   +  +         +        T    D   GIIG        +++D   
Sbjct: 389 TVTLVFSGGAAVDLAFDGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGG 448

Query: 334 LKLAWSHSKC 343
             + +    C
Sbjct: 449 GAVGFKAGAC 458


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.132    0.393 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,666,950,588
Number of Sequences: 23463169
Number of extensions: 281859798
Number of successful extensions: 1093697
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 229
Number of HSP's successfully gapped in prelim test: 1890
Number of HSP's that attempted gapping in prelim test: 1090535
Number of HSP's gapped (non-prelim): 2435
length of query: 422
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 277
effective length of database: 8,957,035,862
effective search space: 2481098933774
effective search space used: 2481098933774
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)