BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011566
         (483 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  584 bits (1505), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 292/462 (63%), Positives = 356/462 (77%), Gaps = 23/462 (4%)

Query: 28  SAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY 87
           S +T+T+PL+P  TK       SDP + L+ LA++S+SRA HLK+   PKT       N+
Sbjct: 24  SPSTITIPLSPTITKR----PSSDPWEYLNHLATTSISRAHHLKS---PKT-------NF 69

Query: 88  SNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
           S  LIKTPL   SYGGYS+SLS GTP Q +   I DTGSSLVWFPCTSRY C  CNFPN 
Sbjct: 70  S--LIKTPLFSRSYGGYSMSLSLGTPSQ-TVKLIMDTGSSLVWFPCTSRYVCASCNFPNT 126

Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
           D ++IP F+P+ SSSS+LIGC+NPKC+W+FG +V+S+C  C+P+ + C  ACP Y++QYG
Sbjct: 127 DITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYG 186

Query: 208 LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
           LG TAGLLLSET+ FP+KT+ +FLAGCS+LS RQP GIAGFGRS ESLP QLGLKKFSYC
Sbjct: 187 LGSTAGLLLSETINFPNKTISDFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYC 246

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS-AFGEFYYVGLRQII 326
           L+SR+FDD+PVSS+L+LD GP + DSKT GLSYTPF KN    S+ AF E+YYV LR+II
Sbjct: 247 LVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKII 306

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           VG  HVK+PYS+LVPGSDGNGG IVDSGSTFTF+EG +FE +AKEF +QM NY+ A +V+
Sbjct: 307 VGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQ 366

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA-- 444
           K +GLRPCFDISG+KSV +P+L  +FKGGAKM LP  NYFA V   V+CL + +DNAA  
Sbjct: 367 KLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSDNAAAL 426

Query: 445 ---GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              G     GPAIILG+FQ QNFY+E+DL NDRFGF +Q CA
Sbjct: 427 GGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score =  563 bits (1450), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 281/447 (62%), Positives = 344/447 (76%), Gaps = 19/447 (4%)

Query: 38  PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS 97
           P STK  L  S  +P   L+ LAS SLSRA H+K+   PKTK          SL+KTPL 
Sbjct: 40  PSSTK--LIVSSKNPWGALNHLASLSLSRAHHIKS---PKTK---------FSLLKTPLF 85

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
             SYGGYSISL+FGTPPQ +T F+ DTGSSLVWFPCTSRY C  C+FPN++ + IP FIP
Sbjct: 86  PRSYGGYSISLNFGTPPQ-TTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIP 144

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
           K+SSSS LIGC+N KCSW+FGP V+S+C+ C P  + C  +CP Y++QYGLG TAGLLLS
Sbjct: 145 KQSSSSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLS 204

Query: 218 ETLRFP-SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
           ETL FP  KT+P FL GCS+ S RQP GIAGFGRS ESLPSQLGLKKFSYCL+S  FDD 
Sbjct: 205 ETLDFPHKKTIPGFLVGCSLFSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDT 264

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
           P SS+LVLDTG GS D+KTPGLSYTPF KNP   ++AF ++YYV LR I++G  HVK+PY
Sbjct: 265 PASSDLVLDTGSGSDDTKTPGLSYTPFQKNP---TAAFRDYYYVLLRNIVIGDTHVKVPY 321

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
            +LVPGSDGNGG IVDSG+TFTFME P++E VAKEF +Q+ +Y+ A +V+ ++GLRPCF+
Sbjct: 322 KFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFN 381

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
           ISG+KSV +PE I  FKGGAKMALP  NYF+ V + V+CL + +DN +G  +G GPAIIL
Sbjct: 382 ISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIIL 441

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G++Q +NF++EFDL N+RFGF +Q C 
Sbjct: 442 GNYQQRNFHVEFDLKNERFGFKQQNCV 468


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  554 bits (1428), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 280/478 (58%), Positives = 347/478 (72%), Gaps = 17/478 (3%)

Query: 7   SLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR 66
           S I    LL+ L +  A   S+  T+T+PL+PL  K   H SDSDP   L   AS+SL+R
Sbjct: 8   SYIITVFLLLSLLSHIAFTSSNPNTITLPLSPLLIKP--HSSDSDPFHSLKFAASASLTR 65

Query: 67  ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
           A HLK +           +N S S+  TP    SYGGYSI L+ GTPPQ S PF+ DTGS
Sbjct: 66  AHHLKHR-----------NNNSPSVATTPAYPKSYGGYSIDLNLGTPPQTS-PFVLDTGS 113

Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
           SLVWFPCTSRY C  CNFPN+D ++IP FIPK SS+++L+GC+NPKC +IFG +V+ RC 
Sbjct: 114 SLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCP 173

Query: 187 GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIA 246
            C P ++ C L CP+Y++QYGLG TAG LL + L FP KTVP FL GCSILS RQP+GIA
Sbjct: 174 QCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIA 233

Query: 247 GFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKN 306
           GFGR  ESLPSQ+ LK+FSYCL+S +FDD P SS+LVL     +GD+KT GLSYTPF  N
Sbjct: 234 GFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQIS-STGDTKTNGLSYTPFRSN 292

Query: 307 PVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFE 366
           P  ++ AF E+YY+ LR++IVG K VKIPY++L PGSDGNGG IVDSGSTFTFME P++ 
Sbjct: 293 PSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYN 352

Query: 367 AVAKEFIRQM-GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY 425
            VA+EF++Q+  NYSRA D E +SGL PCF+ISG K+V  PEL  KFKGGAKM  P +NY
Sbjct: 353 LVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNY 412

Query: 426 FALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           F+LVG+ EV+CL + +D  AGP    GPAIILG++Q QNFY+E+DL N+RFGF  + C
Sbjct: 413 FSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  552 bits (1422), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 276/459 (60%), Positives = 343/459 (74%), Gaps = 10/459 (2%)

Query: 29  AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK--TKTKPKTKDSNIGSN 86
            + V +PL+P S   +   S  DP   L  LA SS++RA  LK  T  KP  +  +  + 
Sbjct: 16  VSAVKLPLSPFS---HSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEEALSSTAT 72

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
            S +++K+ LS  SYGGYS+SLSFGTP Q + PF+FDTGSSLVWFPCTSRY C DCNF  
Sbjct: 73  ASATVVKSHLSPKSYGGYSVSLSFGTPSQ-TIPFVFDTGSSLVWFPCTSRYLCSDCNFSG 131

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
           +DP++IP FIPK SSSS++IGCQNPKC ++FG NV+  C+GC P  + C + CP Y+LQY
Sbjct: 132 LDPTQIPRFIPKNSSSSRVIGCQNPKCQFLFGANVQ--CRGCDPNTRNCTVPCPPYILQY 189

Query: 207 GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
           GLG TAG+L+SE L FP  TVP+F+ GCS++S R PAGIAGFGR  ESLPSQ+ LK FS+
Sbjct: 190 GLGSTAGILISEKLDFPDLTVPDFVVGCSVISTRTPAGIAGFGRGPESLPSQMKLKSFSH 249

Query: 267 CLLSRKFDDAPVSSNLVLDTGPG-SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           CL+SR+FDD  V+++L LDTG G    SKTPGLSYTPF KNP  S++AF E+YY+ LR+I
Sbjct: 250 CLVSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRI 309

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VGSKHVKIPY +L PG++GNGG IVDSGSTFTFME P+FE VA+EF  QM NY+R  D+
Sbjct: 310 YVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDL 369

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAA 444
           EK SG+ PCF+ISGK  V +PELI +FKGGAKM LP  NYF+ VGN + +CL + +DN  
Sbjct: 370 EKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTV 429

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            P  G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 430 NPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  551 bits (1420), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 281/474 (59%), Positives = 345/474 (72%), Gaps = 19/474 (4%)

Query: 10  CLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
           C F+L  LL   ++    + AT+T+PLTPL TK+      SDP ++L  L S+SL+RA H
Sbjct: 13  CGFTLFSLLLLANSSPDKNPATITLPLTPLFTKN----PSSDPWQLLSHLTSASLTRAHH 68

Query: 70  LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
           LK +              + S + TPL  HSYGGYS+SLSFGTP Q +  F+ DTGSSLV
Sbjct: 69  LKHRK-------------NTSSVNTPLFAHSYGGYSVSLSFGTPSQ-TLSFVMDTGSSLV 114

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
           WFPCTSRY C  C+FPN+DP++IP FIPK SSS++++GC NPKC ++    V +RC GC 
Sbjct: 115 WFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMDSEVRTRCPGCD 174

Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
             +  C  ACP+Y +QYGLG T GLLL E+L F  +T P+F+ GCSILS RQP+GIAGFG
Sbjct: 175 QNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAGFG 234

Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
           R   SLP Q+GLKKFSYCLLS +FDD+P SS + L  GP S D KT GLSYTPF KNPV 
Sbjct: 235 RGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVS 294

Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
           S+SAF E+YYV LR IIVG K VK+PYS++V GSDGNGG IVDSGSTFTFME P+FEAVA
Sbjct: 295 SNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVA 354

Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            EF RQM NY+RAADVE  SGL+PCF++SG  SV LP L+ +FKGGAKM LP  NYF+LV
Sbjct: 355 TEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLV 414

Query: 430 GN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           G+  VLCL + ++ A G  L  GP+IILG++Q QNFY E+DL N+RFGF +Q+C
Sbjct: 415 GDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 281/474 (59%), Positives = 344/474 (72%), Gaps = 19/474 (4%)

Query: 10  CLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
           C F+L  LL   ++    + AT+T+PLTPL TK+      SDP ++L  L S+SL+RA H
Sbjct: 13  CGFTLFSLLLLANSSPDKNPATITLPLTPLFTKN----PSSDPWQLLSHLTSASLTRAHH 68

Query: 70  LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
           LK +              + S + TPL  HSYGGYS+SLSFGTP Q +  F+ DTGSSLV
Sbjct: 69  LKHRK-------------NTSSVNTPLFAHSYGGYSVSLSFGTPSQ-TLSFVMDTGSSLV 114

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
           WFPCTSRY C  C+FPN+DP++IP FIPK SSS++++GC NPKC ++    V +RC GC 
Sbjct: 115 WFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMDSEVRTRCPGCD 174

Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
             +  C  ACP+Y +QYGLG T GLLL E+L F  +T P+F+ GCSILS RQP+GIAGFG
Sbjct: 175 QNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAGFG 234

Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
           R   SLP Q+GLKKFSYCLLS +FDD+P SS + L  GP S D KT GLSYTPF KNPV 
Sbjct: 235 RGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVS 294

Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
           S+SAF E+YYV LR IIVG K VK PYS++V GSDGNGG IVDSGSTFTFME P+FEAVA
Sbjct: 295 SNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVA 354

Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            EF RQM NY+RAADVE  SGL+PCF++SG  SV LP L+ +FKGGAKM LP  NYF+LV
Sbjct: 355 TEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLV 414

Query: 430 GN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           G+  VLCL + ++ A G  L  GP+IILG++Q QNFY E+DL N+RFGF +Q+C
Sbjct: 415 GDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  543 bits (1398), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 269/459 (58%), Positives = 340/459 (74%), Gaps = 10/459 (2%)

Query: 29  AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK--TKTKPKTKDSNIGSN 86
            + V +PL+P S   +   S  DP   L  LA SS++RA  LK  T  KP     +  + 
Sbjct: 16  VSAVKLPLSPFS---HSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTT 72

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
            S +++K+PLS  SYGGYS+SLSFGTP Q + PF+FDTGSSLVW PCTSRY C  C+F  
Sbjct: 73  ASATVVKSPLSAKSYGGYSVSLSFGTPSQ-TIPFVFDTGSSLVWLPCTSRYLCSGCDFSG 131

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
           +DP+ IP FIPK SSSS++IGCQ+PKC +++GPNV+  C+GC P  + C + CP Y+LQY
Sbjct: 132 LDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQ--CRGCDPNTRNCTVGCPPYILQY 189

Query: 207 GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
           GLG TAG+L++E L FP  TVP+F+ GCSI+S RQPAGIAGFGR   SLPSQ+ LK+FS+
Sbjct: 190 GLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSH 249

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           CL+SR+FDD  V+++L LDTG G    SKTPGL+YTPF KNP  S+ AF E+YY+ LR+I
Sbjct: 250 CLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRI 309

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG KHVKIPY YL PG++G+GG IVDSGSTFTFME P+FE VA+EF  QM NY+R  D+
Sbjct: 310 YVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDL 369

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAA 444
           EK++GL PCF+ISGK  V +PELI +FKGGAK+ LP  NYF  VGN + +CL + +D   
Sbjct: 370 EKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            P+ G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  541 bits (1394), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 280/458 (61%), Positives = 352/458 (76%), Gaps = 18/458 (3%)

Query: 27  SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSN 86
           S + T+T+PL+  S    L  S   P   L+ LAS SLSRA H+K+   PKT       N
Sbjct: 19  SKSTTITIPLSAPSFNK-LIVSSKKPWGSLNHLASLSLSRAHHIKS---PKT-------N 67

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
           +S  LIKTPL   SYGGYSISL+FGTPPQ +T F+ DTGSSLVWFPCTSRY C +CNFPN
Sbjct: 68  FS--LIKTPLFPRSYGGYSISLNFGTPPQ-TTKFVMDTGSSLVWFPCTSRYLCSECNFPN 124

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
           +  + IP F+PK SSSS+LIGC+NP+CS IFGP ++S+C+ C    + C   CP Y++QY
Sbjct: 125 IKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQY 184

Query: 207 GLGFTAGLLLSETLRFPSK-TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFS 265
           G G TAGLLLSETL FP+K T+P+FL GCSI S +QP GIAGFGRS ESLPSQLGLKKFS
Sbjct: 185 GSGSTAGLLLSETLDFPNKKTIPDFLVGCSIFSIKQPEGIAGFGRSPESLPSQLGLKKFS 244

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL+S  FDD P SS+LVLDTG GSG +KT GLS+TPF KNP   ++AF ++YYV LR I
Sbjct: 245 YCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNP---TTAFRDYYYVLLRNI 301

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
           ++G  HVK+PY +LVPG+DGNGG IVDSG+TFTFME P++E VAKEF +QM +Y+ A ++
Sbjct: 302 VIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEI 361

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
           +  +GLRPC++ISG+KS+ +P+LI +FKGGAKMALP  NYF++V + V+CL + +DN AG
Sbjct: 362 QNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAG 421

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           P LG GPAIILG++Q +NFY+EFDL N++FGF +Q CA
Sbjct: 422 PGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSCA 459


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  537 bits (1384), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 268/459 (58%), Positives = 339/459 (73%), Gaps = 10/459 (2%)

Query: 29  AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK--TKTKPKTKDSNIGSN 86
            + V +PL+P S   +   S  DP   L  LA SS++RA  LK  T  KP     +  + 
Sbjct: 16  VSAVKLPLSPFS---HSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTT 72

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
            S +++K+PLS  SYGGYS+SLSFGTP Q + PF+FDTGSSLV  PCTSRY C  C+F  
Sbjct: 73  ASATVVKSPLSAKSYGGYSVSLSFGTPSQ-TIPFVFDTGSSLVCLPCTSRYLCSGCDFSG 131

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
           +DP+ IP FIPK SSSS++IGCQ+PKC +++GPNV+  C+GC P  + C + CP Y+LQY
Sbjct: 132 LDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQ--CRGCDPNTRNCTVGCPPYILQY 189

Query: 207 GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
           GLG TAG+L++E L FP  TVP+F+ GCSI+S RQPAGIAGFGR   SLPSQ+ LK+FS+
Sbjct: 190 GLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSH 249

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           CL+SR+FDD  V+++L LDTG G    SKTPGL+YTPF KNP  S+ AF E+YY+ LR+I
Sbjct: 250 CLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRI 309

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG KHVKIPY YL PG++G+GG IVDSGSTFTFME P+FE VA+EF  QM NY+R  D+
Sbjct: 310 YVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDL 369

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAA 444
           EK++GL PCF+ISGK  V +PELI +FKGGAK+ LP  NYF  VGN + +CL + +D   
Sbjct: 370 EKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            P+ G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  533 bits (1374), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 271/455 (59%), Positives = 334/455 (73%), Gaps = 19/455 (4%)

Query: 31  TVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNS 90
           ++T+PL+PL TK   H SDSDP   +   ASSSL+RA HLK +           +N S S
Sbjct: 28  SITLPLSPLLTKP--HSSDSDPFHSVKLAASSSLTRAHHLKHR-----------NNNSPS 74

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
           +  TP    SYGGYSI L+ GTPPQ S PF+ DTGSSLVWFPCTS Y C  CNFPN+DP+
Sbjct: 75  VATTPAYPKSYGGYSIDLNLGTPPQTS-PFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPT 133

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS-PRNKTCPLACPSYLLQYGLG 209
           +IP FIPK SS+++L+GC+NPKC ++FGP+VESRC  C  P ++ C L CPSY++QYGLG
Sbjct: 134 KIPTFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLG 193

Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
            TAG LL + L FP KTVP FL GCSILS RQP+GIAGFGR  ESLPSQ+ LK+FSYCL+
Sbjct: 194 ATAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLV 253

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
           S +FDD P SS+LVL     +GD+KT GLSYTPF  NP  ++S F E+YYV LR++IVG 
Sbjct: 254 SHRFDDTPQSSDLVLQIS-STGDTKTNGLSYTPFRSNP-SNNSVFREYYYVTLRKLIVGG 311

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKK 388
             VKIPY +L PGSDGNGG IVDSGSTFTFME P++  VA+EF+RQ+G  YSR  +VE +
Sbjct: 312 VDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQ 371

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPA 447
           SGL PCF+ISG K++  PE   +FKGGAKM+ P  NYF+ VG+ EVLC  + +D  AG  
Sbjct: 372 SGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQP 431

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              GPAIILG++Q QNFY+E+DL N+RFGF  + C
Sbjct: 432 KTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 271/474 (57%), Positives = 340/474 (71%), Gaps = 25/474 (5%)

Query: 12  FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
            S   LL  +   A + +  +T+PL       + H S  DPL+ L  LASSS +RA  +K
Sbjct: 7   LSFFYLLLFSSLSAIAHSNPITLPL-----NSFPHLSSPDPLQALTFLASSSQTRAHQIK 61

Query: 72  TKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
           T   PK          SNS+ K+PLS HSYG YS  LSFGTP Q +   IFDTGSSLVWF
Sbjct: 62  T---PK----------SNSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWF 107

Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR 191
           PCTSRY C +C+FP +DP+ IP F+PK SSSS+L+GCQNPKCSWIFGP+V+S+C+ C+P+
Sbjct: 108 PCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK 167

Query: 192 NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRS 251
            + C   CP+Y++QYG G TAGLLLSETL FP K +PNF+ GCS LS  QP+GIAGFGR 
Sbjct: 168 TENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRG 227

Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
           SESLPSQ+GLKKF+YCL SRKFDD+P S  L+LD    S   K+ GL+YTPF +NP  S+
Sbjct: 228 SESLPSQMGLKKFAYCLASRKFDDSPHSGQLILD----STGVKSSGLTYTPFRQNPSVSN 283

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
           +A+ E+YY+ +R+IIVG++ VK+PY +LVPG DGNGG I+DSGSTFTFM+ P+ E VA+E
Sbjct: 284 NAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVARE 343

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
           F +Q+ N++RA DVE  +GLRPCFDIS +KSV  PELI +FKGGAK ALP  NYFALV +
Sbjct: 344 FEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSS 403

Query: 432 E-VLCLILFTDNAAG-PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             V CL + T         G GP++ILG FQ QNFY+E+DL N R GF +Q C+
Sbjct: 404 SGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 271/474 (57%), Positives = 340/474 (71%), Gaps = 25/474 (5%)

Query: 12  FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
            S   LL  +   A + +  +T+PL       + H S  DPL+ L  LASSS +RA  +K
Sbjct: 7   LSFFYLLLFSSLSAIAHSNPITLPL-----NSFPHLSSPDPLQALTFLASSSQTRAHQIK 61

Query: 72  TKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
           T   PK          SNS+ K+PLS HSYG YS  LSFGTP Q +   IFDTGSSLVWF
Sbjct: 62  T---PK----------SNSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWF 107

Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR 191
           PCTSRY C +C+FP +DP+ IP F+PK SSSS+L+GCQNPKCSWIFGP+V+S+C+ C+P+
Sbjct: 108 PCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK 167

Query: 192 NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRS 251
            + C   CP+Y++QYG G TAGLLLSETL FP K +PNF+ GCS LS  QP+GIAGFGR 
Sbjct: 168 TENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFLSIHQPSGIAGFGRG 227

Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
           SESLPSQ+GLKKF+YCL SRKFDD+P S  L+LD    S   K+ GL+YTPF +NP  S+
Sbjct: 228 SESLPSQMGLKKFAYCLASRKFDDSPHSGQLILD----STGVKSSGLTYTPFRQNPSVSN 283

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
           +A+ E+YY+ +R+IIVG++ VK+PY +LVPG DGNGG I+DSGSTFTFM+ P+ E VA+E
Sbjct: 284 NAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVARE 343

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
           F +Q+ N++RA DVE  +GLRPCFDIS +KSV  PELI +FKGGAK ALP  NYFALV +
Sbjct: 344 FEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSS 403

Query: 432 E-VLCLILFTDNAAG-PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             V CL + T         G GP++ILG FQ QNFY+E+DL N R GF +Q C+
Sbjct: 404 SGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 260/474 (54%), Positives = 338/474 (71%), Gaps = 24/474 (5%)

Query: 11  LFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
           LFS+ +LL        SS++T  +PLT   +  +     +DP K ++ L S+SL+RA+HL
Sbjct: 59  LFSIFLLL-----PTSSSSSTTVLPLTTFPSVSF-----TDPFKTINLLLSASLNRAQHL 108

Query: 71  KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
           KT   P++K +    N S       L   SYG YS+SL+FGTPPQ +  FIFDTGSSLVW
Sbjct: 109 KT---PQSKSNTSIQNVS-------LFPRSYGAYSVSLAFGTPPQ-NLSFIFDTGSSLVW 157

Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSP 190
           FPCT+ YRC  C+FP VDP+ I  F+PK SSS +++GC+NPKC+WIFGPN++SRC+ C+ 
Sbjct: 158 FPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNS 217

Query: 191 RNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGR 250
           +++ C  +CP Y LQYG G TAG+LLSETL   +K VP+FL GCS++S  QPAGIAGFGR
Sbjct: 218 KSRKCSDSCPGYGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGR 277

Query: 251 SSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
             ESLPSQ+ LK+FS+CL+SR FDD+PVSS LVLD+G  S +SKT    Y PF +NP  S
Sbjct: 278 GPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVS 337

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
           ++AF E+YY+ LR+I++G K VK PY YLVP S GNGG I+DSGSTFTF++ P+FEA+A 
Sbjct: 338 NAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIAD 397

Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALV 429
           E  +Q+  Y RA DVE +SGLRPCF+I   ++S   P+++LKFKGG K++L  ENY A+V
Sbjct: 398 ELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMV 457

Query: 430 GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +E V+CL + TD A     G GPAIILG FQ QN  +E+DLA  R GF KQKC
Sbjct: 458 TDEGVVCLTMMTDEAVV-GGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score =  480 bits (1236), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 254/436 (58%), Positives = 310/436 (71%), Gaps = 29/436 (6%)

Query: 10  CLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
           C F+L  LL   ++    + AT+T+PLTPL TK+      SDP ++L  L S+SL+RA H
Sbjct: 29  CGFTLFSLLLLANSSPDKNPATITLPLTPLFTKN----PSSDPWQLLSHLTSASLTRAHH 84

Query: 70  LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
           LK +              + S + TPL  HSYGGYS+SLSFGTP Q +  F+ DTGSSLV
Sbjct: 85  LKHRK-------------NTSSVNTPLFAHSYGGYSVSLSFGTPSQ-TLSFVMDTGSSLV 130

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
           WFPCTSRY C  C+FPN+DP++IP FIPK SSS++++GC NPKC ++            S
Sbjct: 131 WFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMD----------S 180

Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
             +  C  ACP+Y +QYGLG T GLLL E+L F  +T P+F+ GCSILS RQP+GIAGFG
Sbjct: 181 ENSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAGFG 240

Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
           R   SLP Q+GLKKFSYCLLS +FDD+P SS + L  GP S D KT GLSYTPF KNPV 
Sbjct: 241 RGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVS 300

Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
           S+SAF E+YYV LR IIVG K VK+PYS++V GSDGNGG IVDSGSTFTFME P+FEAVA
Sbjct: 301 SNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVA 360

Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            EF RQM NY+RAADVE  SGL+PCF++SG  SV LP L+ +FKGGAKM LP  NYF+LV
Sbjct: 361 TEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLV 420

Query: 430 GN-EVLCLILFTDNAA 444
           G+  VLCL + ++ A 
Sbjct: 421 GDLSVLCLTIVSNEAV 436


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  476 bits (1224), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 248/434 (57%), Positives = 301/434 (69%), Gaps = 19/434 (4%)

Query: 51  DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
           DP + L  L S+SL RARHLK      T  +              L  HSYG YSI LSF
Sbjct: 50  DPYRNLRHLVSASLIRARHLKNPKTTPTSTTP-------------LFTHSYGAYSIPLSF 96

Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           GTPPQ + P I DTGS LVWFPCT RY C +C+F   +PS    FIPK SSSS+++GC N
Sbjct: 97  GTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-NIFIPKSSSSSKVLGCVN 154

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
           PKC WI G  V+SRC+ C P +  C   CP YL+ YG G T G++LSETL  P K VPNF
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGKGVPNF 214

Query: 231 LAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
           + GCS+LS  QPAGI+GFGR   SLPSQLGLKKFSYCLLSR++DD   SS+LVLD    S
Sbjct: 215 IVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESDS 274

Query: 291 GDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
           G+ KT GLSYTPF +NP V    AF  +YY+GLR I VG KHVKIPY YL+PG+DG+GG 
Sbjct: 275 GE-KTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGT 333

Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
           I+DSG+TFT+M+G +FE VA EF +Q+ +  RA +VE  +GLRPCF+ISG  +   PEL 
Sbjct: 334 IIDSGTTFTYMKGEIFELVAAEFEKQVQS-KRATEVEGITGLRPCFNISGLNTPSFPELT 392

Query: 410 LKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           LKF+GGA+M LP  NY A + G++V+CL + TD AAG     GPAIILG+FQ QNFY+E+
Sbjct: 393 LKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEY 452

Query: 469 DLANDRFGFAKQKC 482
           DL N+R GF +Q C
Sbjct: 453 DLRNERLGFRQQSC 466


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 246/482 (51%), Positives = 321/482 (66%), Gaps = 32/482 (6%)

Query: 6   FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
           FSL+   S++I  F++     S+  T+T+ L+PL T H    S S P   L    S+S++
Sbjct: 8   FSLLSFLSIIITTFSS-----STPNTITLHLSPLFTNH--PSSSSHPFHTLKLAVSTSIT 60

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           RA HLK   KP            N  ++TP+   +YGGYSI L FGTP Q + PF+ DTG
Sbjct: 61  RAHHLKNH-KP------------NKSLETPVHPKTYGGYSIDLEFGTPSQ-TFPFVLDTG 106

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR- 184
           S+LVW PC+S Y C  CN      S  P FIPK SSSS+ +GC NPKC+W+FGP+V+S  
Sbjct: 107 STLVWLPCSSHYLCSKCN----SFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHC 162

Query: 185 CKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAG 244
           C+        C   CP+Y +QYGLG TAG LLSE L FP+K   +FL GCS++S  QPAG
Sbjct: 163 CRQDKAAFNNCSQTCPAYTVQYGLGSTAGFLLSENLNFPTKKYSDFLLGCSVVSVYQPAG 222

Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPF 303
           IAGFGR  ESLPSQ+ L +FSYCLLS +FDD A ++SNLVL+T   S D KT G+SYTPF
Sbjct: 223 IAGFGRGEESLPSQMNLTRFSYCLLSHQFDDSATITSNLVLETA-SSRDGKTNGVSYTPF 281

Query: 304 YKNPVGSSS-AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
            KNP    + AFG +YY+ L++I+VG K V++P   L P  DG+GG IVDSGSTFTFME 
Sbjct: 282 LKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMER 341

Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS-GKKSVYLPELILKFKGGAKMALP 421
           P+F+ VA+EF +Q+ +Y+RA + EK+ GL PCF ++ G ++   PEL  +F+GGAKM LP
Sbjct: 342 PIFDLVAQEFAKQV-SYTRAREAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLP 400

Query: 422 PENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
             NYF+LVG  +V CL + +D+ AG     GPA+ILG++Q QNFY+E+DL N+RFGF  Q
Sbjct: 401 VANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQ 460

Query: 481 KC 482
            C
Sbjct: 461 SC 462


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 243/447 (54%), Positives = 295/447 (65%), Gaps = 31/447 (6%)

Query: 38  PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS 97
           PLS  +   +   D L+ L+ L S+SL+RA HLK    P+T               TP+ 
Sbjct: 29  PLSHSYTNQNPSQDHLQKLNYLVSTSLARAHHLK---NPQT---------------TPVF 70

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
            HSYGGYSISLSFGTPPQ +  F+ DTGSS VWFPCT RY C +C+F     SRI  F+P
Sbjct: 71  SHSYGGYSISLSFGTPPQ-TLSFVMDTGSSFVWFPCTLRYLCNNCSFT----SRISPFLP 125

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
           K SSSS++IGC+NPKCSWI   ++  RC  C   ++ C   CP YL+ YG G T G+ LS
Sbjct: 126 KHSSSSKIIGCKNPKCSWIHQTDL--RCTDCDNNSRNCSQICPPYLILYGSGTTGGVALS 183

Query: 218 ETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
           ETL      VPNFL GCS+ S RQPAGIAGFGR   SLPSQLGL KFSYCLLS KFDD  
Sbjct: 184 ETLHLHGLIVPNFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQ 243

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
            SS+LVLD+   S D KT  L YTP  KNP V    AF  +YYV LR+I +G + VKIPY
Sbjct: 244 ESSSLVLDSQSDS-DKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPY 302

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
            YL P  DGNGG I+DSG+TFT+M    FE ++ EFI Q+ NY RA  VE  SGL+PCF+
Sbjct: 303 KYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFN 362

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAII 455
           +SG K + LP+L L FKGGA + LP ENYFA +G+ EV C  + TD A   +   GP +I
Sbjct: 363 VSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKAS---GPGMI 419

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG+FQ+QNFY+E+DL N+R GF K+ C
Sbjct: 420 LGNFQMQNFYVEYDLQNERLGFKKESC 446


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 249/459 (54%), Positives = 306/459 (66%), Gaps = 24/459 (5%)

Query: 29  AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYS 88
           ++++T+PL    T         D  + L+ L ++SL+RARHLK    P+T          
Sbjct: 6   SSSITIPLQHPQTNQIPFQ---DQYQKLNHLVTTSLARARHLKN---PQTT--------P 51

Query: 89  NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
            +    PL  HSYGGYS+SLSFGTPPQ +  FI DTGS +VWFPCTS Y C  C+F +  
Sbjct: 52  ATTTTAPLFSHSYGGYSVSLSFGTPPQ-TLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSS 110

Query: 149 PS-RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV--ESRCKGCSPRNKTCPLACPSYLLQ 205
           PS RI  FIPK SSSS+L+GC+NPKCSWI   N+  +  C   S  N+TCP     Y++ 
Sbjct: 111 PSSRIQPFIPKESSSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCP----PYMIF 166

Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFS 265
           YG G T G+ LSETL   S + PNFL GCS+ S  QPAGIAGFGR   SLPSQLGL KFS
Sbjct: 167 YGSGTTGGVALSETLHLHSLSKPNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGKFS 226

Query: 266 YCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLR 323
           YCLLS +FDD    SS+LVLD      D KT  L YTPF KNP V + S+F  +YY+GLR
Sbjct: 227 YCLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLR 286

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           +I VG  HVK+PY YL PG DGNGGVI+DSG+TFTFM    FE ++ EFIRQ+ +Y R  
Sbjct: 287 RITVGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVK 346

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
           ++E   GLRPCF++S  K+V  PEL L FKGGA +ALP ENYFA VG EV CL + TD  
Sbjct: 347 EIEDAIGLRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGV 406

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           AGP    GP +ILG+FQ+QNFY+E+DL N+R GF ++KC
Sbjct: 407 AGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 226/449 (50%), Positives = 295/449 (65%), Gaps = 28/449 (6%)

Query: 44  YLHH---SDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHS 100
           + HH   S+S P   L    S+S++RA HLK    P             S +KT +   +
Sbjct: 166 FTHHPSSSNSHPFHTLQLAVSTSITRAHHLKNHNNP-------------SSLKTLVHPKT 212

Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN-FPNVDPSRIPAFIPKR 159
           YGGYSI L FGTPPQ + PF+ DTGSSLVW PC S Y C  CN F N   +  P FIPK 
Sbjct: 213 YGGYSIDLKFGTPPQ-TFPFVLDTGSSLVWLPCYSHYLCSKCNSFSN---NNTPKFIPKD 268

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRC----KGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
           S SS+ +GC+NPKC+W+FG +V S C    K     N  C   CP+Y +QYGLG TAG L
Sbjct: 269 SFSSKFVGCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGSTAGFL 328

Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
           LSE L FP+K V +FL GCS++S  QP GIAGFGR  ESLP+Q+ L +FSYCLLS +FD+
Sbjct: 329 LSENLNFPAKNVSDFLVGCSVVSVYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDE 388

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           +P +S+LV++        KT G+SYT F KNP     AFG +YY+ LR+I+VG K V++P
Sbjct: 389 SPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVP 448

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
              L P  +G+GG IVDSGST TFME P+F+ VA+EF++Q+ NY+RA ++EK+ GL PCF
Sbjct: 449 RRMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEFVKQV-NYTRARELEKQFGLSPCF 507

Query: 396 DIS-GKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPA 453
            ++ G ++   PE+  +F+GGAKM LP  NYF+ VG  +V CL + +D+ AG     GPA
Sbjct: 508 VLAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPA 567

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +ILG++Q QNFY+E DL N+RFGF  Q C
Sbjct: 568 VILGNYQQQNFYVECDLENERFGFRSQSC 596


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 229/435 (52%), Positives = 277/435 (63%), Gaps = 34/435 (7%)

Query: 51  DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
           DP + L  L S+SL RARHLK                +     TPL  HSYG YSI LSF
Sbjct: 50  DPYRNLRHLVSASLIRARHLKNPK-------------TTPTSTTPLFTHSYGAYSIPLSF 96

Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           GTPPQ + P I DTGS LVWFPCT RY C +C+F   +PS    FIPK SSSS+++GC N
Sbjct: 97  GTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-NIFIPKSSSSSKVLGCVN 154

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
           PKC WI G  V+SRC+ C P +  C   CP YL                LRF       F
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYL--------------NFLRFWDHRRSQF 200

Query: 231 LAGCSI-LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
                  L       I+GFGR   SLPSQLGLKKFSYCLLSR++DD   SS+LVLD    
Sbjct: 201 HRRMLCPLHQSTRREISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESD 260

Query: 290 SGDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
           SG+ KT GLSYTPF +NP V    AF  +YY+GLR I VG KHVKIPY YL+PG+DG+GG
Sbjct: 261 SGE-KTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGG 319

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
            I+DSG+TFT+M+G +FE VA EF +Q+ +  RA +VE  +GLRPCF+ISG  +   PEL
Sbjct: 320 TIIDSGTTFTYMKGEIFELVAAEFEKQVQS-KRATEVEGITGLRPCFNISGLNTPSFPEL 378

Query: 409 ILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
            LKF+GGA+M LP  NY A + G++V+CL + TD AAG     GPAIILG+FQ QNFY+E
Sbjct: 379 TLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVE 438

Query: 468 FDLANDRFGFAKQKC 482
           +DL N+R GF +Q C
Sbjct: 439 YDLRNERLGFRQQSC 453


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 207/461 (44%), Positives = 289/461 (62%), Gaps = 32/461 (6%)

Query: 27  SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKT-KTKPKTKDSNIGS 85
           ++ AT+T+PLT      + +   + PL+ L  LA++SLSRA HLK  KT P T+ S    
Sbjct: 27  NTPATITIPLT----STFTNSPSTKPLRFLQHLATASLSRAHHLKHGKTSPLTQIS---- 78

Query: 86  NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
                     LS HSYGG+SI LSFGTPPQ  + F+ DTGS +VW PCT+ Y C +C+F 
Sbjct: 79  ----------LSPHSYGGHSIPLSFGTPPQKLS-FLVDTGSHVVWAPCTTHYTCTNCSFS 127

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
           + +P ++P F PK SSSS+++GC+NPKC     P+V   C  C+  +K C  ACP Y LQ
Sbjct: 128 DAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQ 187

Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQLGLKK 263
           YG G ++G  L E L FP KT+  FL GC  S + +   A +AGFGRS  SLP Q+G+KK
Sbjct: 188 YGTGASSGDFLLENLNFPGKTIHEFLVGCTTSAVGEVTSAALAGFGRSMFSLPMQMGVKK 247

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           F+YCL S  +DD   SS L+LD      D +T GLSY PF KNP      F  +YY+G++
Sbjct: 248 FAYCLNSHDYDDTRNSSKLILDY----SDGETKGLSYAPFLKNP----PDFPIYYYLGVK 299

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I +G+K ++IP  YL PGSDG GG+++DSG  + +M GP+F+ V  E  ++M  Y R+ 
Sbjct: 300 DIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSL 359

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
           + E + G+ PC++ +G+KS+ +P+LI +F+GGA M +P +NYF L+    L     T +A
Sbjct: 360 EAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDA 419

Query: 444 AGPAL--GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               L    GP+IILG+ Q  ++Y+EFDL N+R GF +Q C
Sbjct: 420 GTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTC 460


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 206/436 (47%), Positives = 278/436 (63%), Gaps = 30/436 (6%)

Query: 51  DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
           D  + ++  A SSLSRARHLK   +P T    +           P    SYGGYS+  S 
Sbjct: 33  DKWESINLAALSSLSRARHLK---RPPTLTGKV---------TLPAYPRSYGGYSVIFSL 80

Query: 111 GTPPQASTPFIFDTGSSLVWFPCT---SRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
           GTPPQ  +  + DTGSSLVW PCT   + Y C +C F  VDP++IP +   +SS+ Q + 
Sbjct: 81  GTPPQKVS-LVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLP 139

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS-KT 226
           C++PKC+W+FG ++      CS   +     CP Y L+YGLG T G L+S+ L       
Sbjct: 140 CRSPKCNWVFGSDLN-----CSTTKR-----CPYYGLEYGLGSTTGQLVSDVLGLSKLNR 189

Query: 227 VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDT 286
           +P+FL GCS++S+RQP GIAGFGR   S+P+QLGL KFSYCL+S +FDD P S +LVL  
Sbjct: 190 IPDFLFGCSLVSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHR 249

Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
           G    D+   G++Y PF K+P  + S + E+YY+ L +I+VG K V IP  YLVP  +G+
Sbjct: 250 GRRHADAAANGVAYAPFTKSP--ALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGD 307

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
           GG+IVDSGSTFTFME  +F+ VA+E  + M  Y RA ++E  SGL PC++I+G+  V +P
Sbjct: 308 GGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVP 367

Query: 407 ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYL 466
           +L   FKGGA M LP  +YF+LV + V+C+ + TD    P    GPAIILG++Q QNFY+
Sbjct: 368 KLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDE-PGSTTGPAIILGNYQQQNFYI 426

Query: 467 EFDLANDRFGFAKQKC 482
           E+DL   RFGF  Q+C
Sbjct: 427 EYDLKKQRFGFKPQQC 442


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 207/481 (43%), Positives = 295/481 (61%), Gaps = 35/481 (7%)

Query: 6   FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
           FS+  LFS L+L     +   +  AT+T+PLTP  TK+      ++PL  L  LA++S+S
Sbjct: 9   FSVFTLFSRLVL---ASSSKNNIPATITIPLTPTFTKN----PSTEPLLFLQHLATASMS 61

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R+ HLK                ++ LI+T L  HS+GG++I LSFGTPPQ  + F+ DTG
Sbjct: 62  RSHHLK-------------HGKASPLIQTSLFPHSHGGHTIPLSFGTPPQKLS-FLVDTG 107

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S +VW PCT+ Y C +C+F N  P ++P F P+ SSS +++GC++PKC+    P+V   C
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSN--PKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGC 165

Query: 186 KGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA-- 243
             C+  +K C  ACP Y LQYG G  +G  L E L FP KT+  FL GC+  +DR+P+  
Sbjct: 166 PRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSD 225

Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
            +AGFGR+  SLP Q+G+KKF+YCL S  +DD   S  L+LD      D +T GLSY PF
Sbjct: 226 ALAGFGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILDYS----DGETQGLSYAPF 281

Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
            KNP      +  +YY+G++ + +G+K ++IP  YL PGSD  GGV++DSG  + +M  P
Sbjct: 282 LKNP----PDYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLP 337

Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
           +F+ V  E  +QM  Y R+ + E +SGL PC++ +G KS+ +P+LI +F GGA M +P  
Sbjct: 338 VFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397

Query: 424 NYFALVGNEVL-CLILFTDNAAGP-ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
           NYF L     L C  + TD+         GP+IILG++Q  + Y+EFDL N+R GF +Q 
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457

Query: 482 C 482
           C
Sbjct: 458 C 458


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 204/481 (42%), Positives = 297/481 (61%), Gaps = 35/481 (7%)

Query: 6   FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
           FS+  LFS L+L     +   +  AT+T+PLTP+ TK+      ++PL  L  LA++S+S
Sbjct: 9   FSVFTLFSHLVL---ASSSKNNIPATITIPLTPIFTKN----PSTEPLLFLQHLATASMS 61

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R+ HLK                ++ LI+T L  HSYG ++I LSFGTPPQ  + F+ DTG
Sbjct: 62  RSHHLK-------------HGKASPLIQTSLFPHSYGAHTIPLSFGTPPQKLS-FLMDTG 107

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S +VW PCT+ Y C +C+F N  P ++P F P+ SSS +++GC++PKC+    PBV    
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSN--PKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGX 165

Query: 186 KGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA-- 243
             C+  +K C  ACP Y LQYG G  +G  L E L FP KT+  FL GC+  +DR+P+  
Sbjct: 166 PRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSD 225

Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
            +AGFGR+  SLP Q+G+KKF+YCL S  +DD   S  L+LD      D +T GLSY PF
Sbjct: 226 ALAGFGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILD----YSDGETQGLSYAPF 281

Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
            KNP      +  +YY+G++ + +G+K ++IP  YL PGSD  GGV++DSG  +++M  P
Sbjct: 282 XKNP----PDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLP 337

Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
           +F+ V  E  +QM  Y R+ ++E ++G+ PC++ +G KS+ +P+LI +F GGA M +P  
Sbjct: 338 VFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397

Query: 424 NYFALVGNEVL-CLILFTDN-AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
           NYF L     L C  + TD+  +      GP+IILG++Q  + Y+EFDL N+R GF +Q 
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457

Query: 482 C 482
           C
Sbjct: 458 C 458


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 214/477 (44%), Positives = 292/477 (61%), Gaps = 43/477 (9%)

Query: 11  LFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
           +F L  +LF       +  AT+T+PLT   T        S PL      AS+SLSRA HL
Sbjct: 12  VFILFSILFLASCSKDNIPATITIPLTSTFT--------SKPL------ASASLSRAHHL 57

Query: 71  KT-KTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
           K  KT P               +KT L  HSYGG+SISLSFGTPPQ  + F+ DTGS +V
Sbjct: 58  KHGKTNPP--------------VKTSLFPHSYGGHSISLSFGTPPQKLS-FLVDTGSDVV 102

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
           W PCT+ Y C +C+F   DP ++P F PK SSSS+++ C+NPKC   + P V   C  C+
Sbjct: 103 WAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCPRCN 162

Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA--GIAG 247
             +K C  ACP Y  QYG G ++G  L E L+FP KT+ NFL GC+  + R+ +   +AG
Sbjct: 163 GNSKHCSYACP-YSTQYGTGASSGYFLLENLKFPRKTIRNFLLGCTTSAARELSSDALAG 221

Query: 248 FGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP 307
           FGRS  SLP Q+G+KKF+YCL S  +DD   S  L+LD      D KT GLSYTPF K+P
Sbjct: 222 FGRSMFSLPIQMGVKKFAYCLNSHDYDDTRNSGKLILDY----RDGKTKGLSYTPFLKSP 277

Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG-STFTFMEGPLFE 366
              +SAF  +Y++G++ I +G+K ++IP  YL PGSDG  GVI+DSG     +M GP+F+
Sbjct: 278 --PASAF--YYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFK 333

Query: 367 AVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF 426
            V  E  +QM  Y R+ + E ++GL PC++ +G KS+ +P LI +F+GGA M +P +NYF
Sbjct: 334 IVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYF 393

Query: 427 ALVGNEVL-CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +   E L C ++ T+      +   P+IILG+ Q  ++Y+E+DL NDRFGF +Q C
Sbjct: 394 GISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 450


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score =  350 bits (897), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 201/440 (45%), Positives = 269/440 (61%), Gaps = 28/440 (6%)

Query: 61  SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTP-LSVHSYGGYSISLSFGTPPQASTP 119
           ++SL+RA HLK +       S  GS    S+  T  L  HSYGGY+ + S GTPPQ   P
Sbjct: 25  AASLARALHLKRRDP--NHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQ-PLP 81

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-G 178
            + DTGS L W PCTS Y C +C+ P+   S +P F PK SSSS+L+GC+NP C W+   
Sbjct: 82  VLLDTGSHLTWVPCTSSYECRNCSSPSA--SAVPVFHPKNSSSSRLVGCRNPSCQWVHSA 139

Query: 179 PNVESRCKG--CSPRNKTCPLA----CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLA 232
            N+ ++C+   CSP    CP A    CP Y + YG G TAGLL+++TLR P + VP F+ 
Sbjct: 140 ANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVL 199

Query: 233 GCSILSDRQP-AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGS 290
           GCS++S  QP +G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL      
Sbjct: 200 GCSLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGG---- 255

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
                 G+ Y P  K+  G    +G +YY+ LR + VG K V++P       + G+GG I
Sbjct: 256 -TGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTI 314

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPEL 408
           VDSG+TFT+++  +F+ VA   +  + G Y R+ D E + GL PCF +  G +S+ LPEL
Sbjct: 315 VDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPEL 374

Query: 409 ILKFKGGAKMALPPENYFALVGN---EVLCLILFTDNAAGPALGR---GPAIILGDFQLQ 462
              F+GGA M LP ENYF + G    E +CL + TD + G   G    GPAIILG FQ Q
Sbjct: 375 SFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQ 434

Query: 463 NFYLEFDLANDRFGFAKQKC 482
           N+ +E+DL  +R GF +Q C
Sbjct: 435 NYLVEYDLEKERLGFRRQSC 454


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score =  335 bits (859), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 194/428 (45%), Positives = 258/428 (60%), Gaps = 26/428 (6%)

Query: 73  KTKPKTKDSNIGSNYSNSLIKTP-LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
           K +     S  GS    S+  T  L  HSYGGY+ + S GTPPQ   P + DTGS L W 
Sbjct: 67  KRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQ-PLPVLLDTGSHLTWV 125

Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-GPNVESRCKG--C 188
           PCTS Y C +C+ P+   S +P F PK SSSS+L+GC+NP C W+    N+ ++C+   C
Sbjct: 126 PCTSSYECRNCSSPSA--SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPC 183

Query: 189 SPRNKTCPLA----CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP-A 243
           SP    CP A    CP Y + YG G TAGLL+++TLR P + VP F+ GCS++S  QP +
Sbjct: 184 SPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSVHQPPS 243

Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTP 302
           G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL            G+ Y P
Sbjct: 244 GLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGG-----TGGGEGMQYVP 298

Query: 303 FYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
             K+  G    +G +YY+ LR + VG K V++P       + G+GG IVDSG+TFT+++ 
Sbjct: 299 LVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDP 358

Query: 363 PLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMAL 420
            +F+ VA   +  + G Y R+ D E   GL PCF +  G +S+ LPEL   F+GGA M L
Sbjct: 359 TVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQL 418

Query: 421 PPENYFALVGN---EVLCLILFTD---NAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
           P ENYF + G    E +CL + TD    +     G GPAIILG FQ QN+ +E+DL  +R
Sbjct: 419 PVENYFVVAGRGAVEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKER 478

Query: 475 FGFAKQKC 482
            GF +Q C
Sbjct: 479 LGFRRQSC 486


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score =  316 bits (809), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 190/439 (43%), Positives = 257/439 (58%), Gaps = 34/439 (7%)

Query: 69  HLKTKTKPKTKDSNIGSNYSNSLIKTPLSV--HSYGGYSISLSFGTPPQASTPFIFDTGS 126
           HLK + +         S+  +  I    ++  HSYGGY+ + S GTPPQ   P + DTGS
Sbjct: 66  HLKRRGRASHHSQKGSSSGGHKSIPATAALYPHSYGGYAFTASLGTPPQ-PLPVLLDTGS 124

Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
            L W PCTS Y C +C+ P    + +P F PK SSSS+L+GC+NP C W+      ++C+
Sbjct: 125 QLTWVPCTSNYDCRNCSSPFA--AAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCR 182

Query: 187 GCSPRNKTCPLA---CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP- 242
               R   C  A   CP Y + YG G TAGLL+++TLR P + V  F+ GCS++S  QP 
Sbjct: 183 APCSRGANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVSGFVLGCSLVSVHQPP 242

Query: 243 AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYT 301
           +G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL      GD+   G+ Y 
Sbjct: 243 SGLAGFGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSGSLVL-----GGDND--GMQYV 295

Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
           P  K+  G    +  +YY+ L  + VG K V++P       + G+GG IVDSG+TFT+++
Sbjct: 296 PLVKSAAGDKQPYAVYYYLALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLD 355

Query: 362 GPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMA 419
             +F+ VA   +  + G Y R+ DVE+  GL PCF +  G KS+ LPEL L FKGGA M 
Sbjct: 356 PTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQ 415

Query: 420 LPPENYFALVGNE-------------VLCLILFTD--NAAGPALGRGPAIILGDFQLQNF 464
           LP ENYF + G                +CL + TD   +     G GPAIILG FQ QN+
Sbjct: 416 LPLENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNY 475

Query: 465 YLEFDLANDRFGFAKQKCA 483
            +E+DL  +R GF +Q CA
Sbjct: 476 LVEYDLEKERLGFRRQPCA 494


>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
          Length = 452

 Score =  315 bits (806), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 173/376 (46%), Positives = 234/376 (62%), Gaps = 24/376 (6%)

Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-GPNVE 182
           +GS L W PCTS Y C +C+ P+   S +P F PK SSSS+L+GC+NP C W+    N+ 
Sbjct: 79  SGSHLTWVPCTSSYECRNCSSPSA--SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLA 136

Query: 183 SRCKG--CSPRNKTCPLA----CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSI 236
           ++C+   CSP    CP A    CP Y + YG G TAGLL+++TLR P + VP F+ GCS+
Sbjct: 137 TKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSL 196

Query: 237 LSDRQP-AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSK 294
           +S  QP +G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL          
Sbjct: 197 VSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGG-----TGG 251

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
             G+ Y P  K+  G    +G +YY+ LR + VG K V++P       + G+GG IVDSG
Sbjct: 252 GEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSG 311

Query: 355 STFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKF 412
           +TFT+++  +F+ VA   +  + G Y R+ D E + GL PCF +  G +S+ LPEL   F
Sbjct: 312 TTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHF 371

Query: 413 KGGAKMALPPENYFALVGN---EVLCLILFTDNAAGPALGR---GPAIILGDFQLQNFYL 466
           +GGA M LP ENYF + G    E +CL + TD + G   G    GPAIILG FQ QN+ +
Sbjct: 372 EGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLV 431

Query: 467 EFDLANDRFGFAKQKC 482
           E+DL  +R GF +Q C
Sbjct: 432 EYDLEKERLGFRRQSC 447


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 188/418 (44%), Positives = 250/418 (59%), Gaps = 32/418 (7%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           ++  L  HSYGGY+ ++S GTPPQ   P + DTGS L W PCTS Y+C +C+  +   S 
Sbjct: 77  VRASLYPHSYGGYAFTVSLGTPPQP-LPVLLDTGSHLSWVPCTSSYQCRNCSSLSAA-SP 134

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG--------CSPRNKTCPLACPSYL 203
           +  F PK SSSS+LIGC+NP C WI  P+  S C+         C+PRN      CP YL
Sbjct: 135 LHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYL 194

Query: 204 LQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSESLPSQLGLK 262
           + YG G TAGLL+S+TLR P + V NF+ GCS+ S  Q P+G+AGFGR + S+PSQLGL 
Sbjct: 195 VVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLT 254

Query: 263 KFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           KFSYCLLSR+FDD A VS  L+L      G     G+ Y P  ++   +   +  +YY+ 
Sbjct: 255 KFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGMQYAPLARS-ASARPPYSVYYYLA 311

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
           L  I VG K V++P    V      GG IVDSG+TF++ +  +FE VA   +  + G YS
Sbjct: 312 LTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYS 370

Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG--------- 430
           R+  VE+  GL PCF +  G K++ LPE+ L FKGG+ M LP ENYF + G         
Sbjct: 371 RSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPA 430

Query: 431 -NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             E +CL + +D    +        GPAIILG FQ QN+Y+E+DL  +R GF +Q+CA
Sbjct: 431 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCA 488


>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
          Length = 648

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 188/418 (44%), Positives = 250/418 (59%), Gaps = 32/418 (7%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           ++  L  HSYGGY+ ++S GTPPQ   P + DTGS L W PCTS Y+C +C+  +   S 
Sbjct: 77  VRASLYPHSYGGYAFTVSLGTPPQP-LPVLLDTGSHLSWVPCTSSYQCRNCSSLSAA-SP 134

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG--------CSPRNKTCPLACPSYL 203
           +  F PK SSSS+LIGC+NP C WI  P+  S C+         C+PRN      CP YL
Sbjct: 135 LHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYL 194

Query: 204 LQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSESLPSQLGLK 262
           + YG G TAGLL+S+TLR P + V NF+ GCS+ S  Q P+G+AGFGR + S+PSQLGL 
Sbjct: 195 VVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLT 254

Query: 263 KFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           KFSYCLLSR+FDD A VS  L+L      G     G+ Y P  ++   +   +  +YY+ 
Sbjct: 255 KFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGMQYAPLARS-ASARPPYSVYYYLA 311

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
           L  I VG K V++P    V      GG IVDSG+TF++ +  +FE VA   +  + G YS
Sbjct: 312 LTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYS 370

Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG--------- 430
           R+  VE+  GL PCF +  G K++ LPE+ L FKGG+ M LP ENYF + G         
Sbjct: 371 RSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPA 430

Query: 431 -NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             E +CL + +D    +        GPAIILG FQ QN+Y+E+DL  +R GF +Q+CA
Sbjct: 431 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCA 488


>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
          Length = 490

 Score =  298 bits (763), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 184/418 (44%), Positives = 244/418 (58%), Gaps = 33/418 (7%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           ++  L  HSYGGY+ ++S GTPPQ   P + +TGS L W P TS Y     +     P  
Sbjct: 77  VRASLYPHSYGGYAFTVSLGTPPQP-LPVLLETGSHLSWVPSTSSYSANCSSLSAASPLH 135

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG--------CSPRNKTCPLACPSYL 203
           +  F PK SSSS+LIGC+NP C WI  P+  S C+         C+PRN      CP YL
Sbjct: 136 V--FHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYL 193

Query: 204 LQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSESLPSQLGLK 262
           + YG G TAGLL+S+TLR P + V NF+ GCS+ S  Q P+G+AGFGR + S+PSQLGL 
Sbjct: 194 VVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLT 253

Query: 263 KFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           KFSYCLLSR+FDD A VS  L+L      G     G+ Y P  ++   +   +  +YY+ 
Sbjct: 254 KFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGMQYAPLARS-ASARPPYSVYYYLA 310

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
           L  I VG K V++P    V      GG IVDSG+TF++ +  +FE VA   +  + G YS
Sbjct: 311 LTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYS 369

Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG--------- 430
           R+  VE+  GL PCF +  G K++ LPE+ L FKGG+ M LP ENYF + G         
Sbjct: 370 RSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPA 429

Query: 431 -NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             E +CL + +D    +        GPAIILG FQ QN+Y+E+DL  +R GF +Q+CA
Sbjct: 430 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCA 487


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  287 bits (735), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 158/377 (41%), Positives = 223/377 (59%), Gaps = 27/377 (7%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS LVW PCT  Y C++C  P  D +    F+P+ SSS  L+ C +  C  ++G N 
Sbjct: 1   MDTGSDLVWVPCTRNYSCINC--PE-DSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNT 57

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP------SKTVPNFLAGCS 235
           E  C+ C+   K C   CP Y +QYG G TAGLLL+ETL  P      ++ + +F  GCS
Sbjct: 58  ELLCQSCAGSLKNCSETCPPYGIQYGRGSTAGLLLTETLNLPLENGEGARAITHFAVGCS 117

Query: 236 ILSDRQPAGIAGFGRSSESLPSQLGLK----KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
           I+S +QP+GIAGFGR + S+PSQLG      +F+YCL S +FD+    S +VL      G
Sbjct: 118 IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVL------G 171

Query: 292 DSKTPG---LSYTPFYKNPVG-SSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGN 346
           D   P    L+YTPF  N     SS +G +YY+GLR + +G K +K +P   L   + GN
Sbjct: 172 DKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGN 231

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
           GG I+DSG+TFT     +F+ +A  F  Q+G Y RA +VE K+G+  C+D++G +++ LP
Sbjct: 232 GGTIIDSGTTFTVFSDEIFKHIAAGFASQIG-YRRAGEVEDKTGMGLCYDVTGLENIVLP 290

Query: 407 ELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
           E    FKGG+ M LP  NYF+   + + +CL + +       +  GPA+ILG+ Q Q+FY
Sbjct: 291 EFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGL-LEVDSGPAVILGNDQQQDFY 349

Query: 466 LEFDLANDRFGFAKQKC 482
           L +D   +R GF +Q C
Sbjct: 350 LLYDREKNRLGFTQQTC 366


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  284 bits (727), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 184/421 (43%), Positives = 241/421 (57%), Gaps = 29/421 (6%)

Query: 85  SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           S+ + + ++T L  HSYGGY+ S+S GTPPQ   P + DTGS L W PCTS Y+C +C+ 
Sbjct: 72  SSQAPAAVRTALYPHSYGGYAFSVSLGTPPQP-LPVLLDTGSHLSWVPCTSSYQCRNCSS 130

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
                S +  F PK SSSS+L+GC+NP C WI      S C   S  N      CP YL+
Sbjct: 131 SPSAMSAMAVFHPKNSSSSRLVGCRNPACRWIHS-KSPSTCG--STGNNGNGDVCPPYLV 187

Query: 205 QYGLGFTAGLLLSETLRFPSKTVP-------NFLAGCSILSDRQ-PAGIAGFGRSSESLP 256
            YG G T+GLL+S+TLR    +         NF  GCSI+S  Q P+G+AGFGR + S+P
Sbjct: 188 VYGSGSTSGLLISDTLRLSPSSSSSAPAPFRNFAIGCSIVSVHQPPSGLAGFGRGAPSVP 247

Query: 257 SQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG 315
           SQL + KFSYCLLSR+FDD + VS  LVL         K   + Y P   N   S   + 
Sbjct: 248 SQLKVPKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNN-AASKPPYS 306

Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
            +YY+ L  I VG K V +P    VP S   GG I+DSG+TFT+++  +F+ VA      
Sbjct: 307 VYYYLALTGISVGGKPVNLPSRAFVPSS--GGGAIIDSGTTFTYLDPTVFKPVAAAMESA 364

Query: 376 M-GNYSRAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYF------ 426
           + G Y+R+  VE   GLRPCF +      ++ LP+L LKFKGGA M LP ENYF      
Sbjct: 365 VGGRYNRSRPVEDALGLRPCFALPPGPGGAMELPDLELKFKGGAVMRLPVENYFVAAGPA 424

Query: 427 --ALVGNEVLCLILFTD--NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                G   +CL + +D   + G     GPAIILG FQ QN+++E+DL  +R GF +Q C
Sbjct: 425 GGPAAGPVAICLAVVSDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484

Query: 483 A 483
           A
Sbjct: 485 A 485


>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  284 bits (727), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 201/475 (42%), Positives = 267/475 (56%), Gaps = 52/475 (10%)

Query: 35  PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKT 94
           PL P + +H+           L  LA +SL+RA  L+   + +          ++S ++ 
Sbjct: 36  PLPPAAAQHH----------PLSRLARASLARASRLRGHHQGQA---------ASSPVRA 76

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
            L  HSYGGY+ SLS GTPPQ   P + DTGS L W PCTS Y+C +C+         P 
Sbjct: 77  ALYPHSYGGYAFSLSLGTPPQ-PLPVLLDTGSHLTWVPCTSNYQCQNCS---AAAGSFPV 132

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG----CSPRNKTCPL----ACPSYLLQY 206
           F PK SSSS L+ C +P C WI   +  S C      C P    C       CP YL+ Y
Sbjct: 133 FHPKSSSSSLLVSCSSPSCLWIHSKSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVY 192

Query: 207 GLGFTAGLLLSETLRFPSKTVP--NFLAGCSILSDRQP-AGIAGFGRSSESLPSQLGLKK 263
           G G TAGLL+S+TLR   +     NF  GCS+ S  QP +G+AGFGR + S+P+QLG+ K
Sbjct: 193 GSGSTAGLLVSDTLRLSPRGAASRNFAVGCSLASVHQPPSGLAGFGRGAPSVPAQLGVNK 252

Query: 264 FSYCLLSRKF-DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           FSYCLLSR+F DDA +S  LVL  G  S       + Y P  KN  G+   +  +YY+ L
Sbjct: 253 FSYCLLSRRFDDDAAISGELVL--GASSAGKAKAMMQYAPLLKN-AGARPPYSVYYYLSL 309

Query: 323 RQIIVGSKHVKIPYSYLVPGS-DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
             I VG K V +P   L P S  G GG I+DSG+TFT+++  +F+ VA   +  + G Y+
Sbjct: 310 TGIAVGGKSVALPARALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYN 369

Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG------NEV 433
           R+ DVE   GLRPCF + +G +++ LPEL L F GGA+M LP ENYF   G       E 
Sbjct: 370 RSKDVEGALGLRPCFALPAGARTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEA 429

Query: 434 LCLILFTD-----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +CL + +D       AG + G GPAIILG FQ QN+ +E+DL  +R GF +Q C+
Sbjct: 430 ICLAVVSDVSSASGGAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPCS 484


>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
          Length = 446

 Score =  247 bits (631), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 130/235 (55%), Positives = 155/235 (65%), Gaps = 15/235 (6%)

Query: 51  DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
           DP + L  L S+SL RARHLK      T  + +               HSYG YSI LSF
Sbjct: 50  DPYRNLRHLVSASLIRARHLKNPKTTPTSTTPL-------------FTHSYGAYSIPLSF 96

Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           GTPPQ + P I DTGS LVWFPCT RY C +C+F   +PS    FIPK SSSS+++GC N
Sbjct: 97  GTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-NIFIPKSSSSSKVLGCVN 154

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
           PKC WI G  V+SRC+ C P +  C   CP YL+ YG G T G++LSETL  P K VPNF
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGKGVPNF 214

Query: 231 LAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
           + GCS+LS  QPAGI+GFGR   SLPSQLGLKKFSYCLLSR++DD   SS+L+ +
Sbjct: 215 IVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLIFE 269



 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 66/120 (55%), Positives = 86/120 (71%), Gaps = 2/120 (1%)

Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
           +FE VA EF +Q+ +  RA +VE  +GLRPCF+ISG  +   PEL LKF+GGA+M LP  
Sbjct: 267 IFELVAAEFEKQVQS-KRATEVEGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLA 325

Query: 424 NYFALVG-NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           NY A +G ++V+CL + TD AAG     GPAIILG+FQ QNFY+E+DL N+R GF +Q C
Sbjct: 326 NYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 385


>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
 gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
          Length = 330

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 144/321 (44%), Positives = 191/321 (59%), Gaps = 24/321 (7%)

Query: 183 SRCKG--CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR 240
           S C G  C+PRN      CP YL+ YG G TAGLL+S+TLR P + V NF+ GCS+ S  
Sbjct: 11  SSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVH 70

Query: 241 Q-PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGL 298
           Q P+G+AGFGR + S+PSQLGL KFSYCLLSR+FDD A VS  L+L      G     G+
Sbjct: 71  QPPSGLAGFGRGAPSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGM 128

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
            Y P  ++   +   +  +YY+ L  I VG K V++P    V      GG IVDSG+TF+
Sbjct: 129 QYAPLARS-ASARPPYSVYYYLALTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFS 186

Query: 359 FMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGA 416
           + +  +FE VA   +  + G YSR+  VE+  GL PCF +  G K++ LPE+ L FKGG+
Sbjct: 187 YFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGS 246

Query: 417 KMALPPENYFALVG----------NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQ 462
            M LP ENYF + G           E +CL + +D    +        GPAIILG FQ Q
Sbjct: 247 VMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQ 306

Query: 463 NFYLEFDLANDRFGFAKQKCA 483
           N+Y+E+DL  +R GF +Q+CA
Sbjct: 307 NYYIEYDLEKERLGFRRQQCA 327


>gi|297740191|emb|CBI30373.3| unnamed protein product [Vitis vinifera]
          Length = 218

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 101/226 (44%), Positives = 147/226 (65%), Gaps = 10/226 (4%)

Query: 259 LGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
           +G+KKF+YCL S  +DD   S  L+LD      D KT GLSYTPF K+P   +SAF  +Y
Sbjct: 1   MGVKKFAYCLNSHDYDDTRNSGKLILDYR----DGKTKGLSYTPFLKSP--PASAF--YY 52

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG-STFTFMEGPLFEAVAKEFIRQMG 377
           ++G++ I +G+K ++IP  YL PGSDG  GVI+DSG     +M GP+F+ V  E  +QM 
Sbjct: 53  HLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMS 112

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CL 436
            Y R+ + E ++GL PC++ +G KS+ +P LI +F+GGA M +P +NYF +   E L C 
Sbjct: 113 KYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACF 172

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ++ T+      +   P+IILG+ Q  ++Y+E+DL NDRFGF +Q C
Sbjct: 173 LMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 218


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 148/460 (32%), Positives = 213/460 (46%), Gaps = 39/460 (8%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           L + HSL+ S  +   HL   T  +   S    ++ +  I  PLS  S   Y++S + G+
Sbjct: 27  LPLTHSLSKSQFNSTPHLLKFTSAR---SATRFHHRHRQISLPLSPGS--DYTLSFNLGS 81

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
            P        DTGS LVWFPC   + C+ C     D +      P   +SS  + C++P 
Sbjct: 82  HPPQPISLYMDTGSDLVWFPCAP-FECILCE-GKYDTAATGGLSPPNITSSASVSCKSPA 139

Query: 173 CSWIFGPNVES------RCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETLRFPSK 225
           CS        S      RC         C   +CP +   YG G     L  ++L  P+ 
Sbjct: 140 CSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLYRDSLSMPAS 199

Query: 226 T---VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDA 276
           +   + NF  GC+  +  +P G+AGFGR   SLP+QL         +FSYCL+S  FD  
Sbjct: 200 SPLVLHNFTFGCAHTALGEPVGVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDAD 259

Query: 277 PVSSNLVLDTGPGSGDS---KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            V     L  G  S D    K  G     F    +  +     FY VGL  I VG++ + 
Sbjct: 260 RVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKIP 319

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEKKSGLR 392
           +P         GNGG++VDSG+TFT +   L+E++  EF  +MG  Y RA  +E+++GL 
Sbjct: 320 VPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLG 379

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------ALVGNEVLCLILFTDNA 443
           PC+  S   +  +P + L F G + + LP  NY+              +V CL+L   N 
Sbjct: 380 PCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLM--NG 436

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              A   GPA  LG++Q Q F + +DL   R GFA++KCA
Sbjct: 437 GDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCA 476


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 147/458 (32%), Positives = 217/458 (47%), Gaps = 45/458 (9%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           L + H+L+ +  +   HL   T      S + +      +  PLS  S   Y++S + G 
Sbjct: 5   LPLTHTLSQTQFNNTHHLLKST------STLSAKRFRRQLSLPLSPGS--DYTLSFNLGP 56

Query: 113 PPQASTPFIF-DTGSSLVWFPCTSRYRCVDCN-FPNVDPSRIPAFIPKRSSSSQLIGCQN 170
             QA    ++ DTGS LVWFPC   ++C+ C   PN  P       P  ++ S  + C++
Sbjct: 57  RAQAQPITLYMDTGSDLVWFPCAP-FKCILCEGKPNASP-------PVNTTRSVAVSCKS 108

Query: 171 PKCSW---IFGPN---VESRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETLRFP 223
           P CS    +  P+     +RC   S     C    CP +   YG G     L  +TL   
Sbjct: 109 PACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDTLSLS 168

Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAP 277
           S  + NF  GC+  +  +P G+AGFGR   SLP+QL         +FSYCL+S  FD   
Sbjct: 169 SLFLRNFTFGCAYTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSER 228

Query: 278 VS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           V   S L+L       + +  G     F   P+  +     FY VGL  I VG + V  P
Sbjct: 229 VRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGKRIVPAP 288

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RAADVEKKSGLRPC 394
                  + G+GGV+VDSG+TFT +    + +V  EF R +G  + RA  +E+K+GL PC
Sbjct: 289 EMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLAPC 348

Query: 395 FDISGKKSVYLPELILKFKGG-AKMALPPENYF--------ALVGNEVLCLILFTDNAAG 445
           + ++    V  P L L+F GG + + LP +NYF        A  G   +  ++  +    
Sbjct: 349 YYLNSVAEV--PVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDE 406

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             L  GP   LG++Q Q F +E+DL   R GFA+++CA
Sbjct: 407 AELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 444


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 143/407 (35%), Positives = 198/407 (48%), Gaps = 46/407 (11%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y++S + G+ P  S     DTGS LVWFPC   + C+ C        +  A  P   + S
Sbjct: 19  YTLSFNLGSHPSQSITLYMDTGSDLVWFPCAP-FECILCE------GKFNATKPLNITRS 71

Query: 164 QLIGCQNPKCSWIFGPNVE------SRCKGCSPRNKTCPLA-CPSYLLQYGLGFTAGLLL 216
             + CQ+P CS              +RC   +     C  A CP +   YG G     L 
Sbjct: 72  HRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFIAHLH 131

Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLS 270
            +TL      + NF  GC+  +  +P G+AGFGR   SLP+QL         +FSYCL+S
Sbjct: 132 RDTLSMSQLFLKNFTFGCAHTALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVS 191

Query: 271 RKFDDAPVS--SNLVLDTGPGSGD---SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
             FD   V   S L+L    G  D   S+     YT   +NP  S      FY VGL  I
Sbjct: 192 HSFDKERVRKPSPLIL----GHYDDYSSERVEFVYTSMLRNPKHSY-----FYCVGLTGI 242

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
            VG + +  P         G+GGV+VDSG+TFT +   L+ +V  EF R++G  + RA++
Sbjct: 243 SVGKRTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASE 302

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYFA--LVGNE-----VLCL 436
           VE+K+GL PC+ + G   V +P +   F G  + + LP  NYF   L G +     V CL
Sbjct: 303 VEEKTGLGPCYFLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCL 360

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +L  +      L  GP  ILG++Q Q F + +DL N R GFAK++CA
Sbjct: 361 MLM-NGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCA 406


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  191 bits (485), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 153/467 (32%), Positives = 213/467 (45%), Gaps = 64/467 (13%)

Query: 55  ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
           + H+L+ +  +   HL   T          S  S    +  LS+    G   +LSF   P
Sbjct: 29  LTHTLSKAQFNSTHHLLKST----------STRSAKRFRRQLSLPLSPGSDYTLSFNLGP 78

Query: 115 QASTPFI---FDTGSSLVWFPCTSRYRCVDC----NFPNVDPSRIPAFIPKRSSSSQLIG 167
           QA    I    DTGS LVWFPC   ++C+ C    N PN  P       P   + S  + 
Sbjct: 79  QAQAQPITLYMDTGSDLVWFPCAP-FKCILCEGKPNEPNASP-------PTNITQSVAVS 130

Query: 168 CQNPKCSWIFG---PN---VESRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETL 220
           C++P CS       P+     +RC   S     C    CP +   YG G     L  +TL
Sbjct: 131 CKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDTL 190

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFD 274
              S  + NF  GC+  +  +P G+AGFGR   SLP+QL         +FSYCL+S  FD
Sbjct: 191 SLSSLFLRNFTFGCAHTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFD 250

Query: 275 DAPVS--SNLVLDTGPGSGDSKTPG----LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
              V   S L+L         K  G      YT   +NP         FY V L  I VG
Sbjct: 251 SERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENP-----KHPYFYTVSLIGIAVG 305

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEK 387
            + +  P       + G+GGV+VDSG+TFT +    + +V  EF R++G +  RA  +E+
Sbjct: 306 KRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIEE 365

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGG--AKMALPPENYF---------ALVGNEVLCL 436
           K+GL PC+ ++    V  P L L+F GG  + + LP +NYF         A    +V CL
Sbjct: 366 KTGLAPCYYLNSVADV--PALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGCL 423

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +L          G GP   LG++Q Q F +E+DL   R GFA+++CA
Sbjct: 424 MLMNGGDEADLSG-GPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 152/463 (32%), Positives = 227/463 (49%), Gaps = 49/463 (10%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKTKDS-NIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
           L + HS++ +  +   HL   T  ++K   +   +   + +  PL+  S   Y++S + G
Sbjct: 25  LPLTHSISKTKFNSTHHLLKSTSTRSKARFHHQHHKHQTQVSLPLAPGS--DYTLSFNLG 82

Query: 112 T-PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           + PPQ  T ++ DTGS LVWFPC S + C+ C       +  PA I K++ S   + CQ+
Sbjct: 83  SNPPQLITLYM-DTGSDLVWFPC-SPFECILCE--GKPQTTKPANITKQTHS---VSCQS 135

Query: 171 PKCSWIFGPNVE------SRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETLRFP 223
           P CS              SRC         C   +CP +   YG G     L  +TL   
Sbjct: 136 PACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFVANLYQQTLSLS 195

Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAP 277
           S  + NF  GC+  +  +P G+AGFGR   SLP+QL         +FSYCL+S  FD   
Sbjct: 196 SLHLQNFTFGCAHTALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDGDR 255

Query: 278 VS--SNLVL----DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
           +   S L+L    DT  G+GD ++    YT    NP         +Y VGL  I VG + 
Sbjct: 256 LRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNP-----KHPYYYCVGLAGISVGKRT 310

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY-SRAADVEKKSG 390
           V  P         GNGG++VDSG+TFT +    + AV  EF +++  +  RA+++E K+G
Sbjct: 311 VPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTG 370

Query: 391 LRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYFALVGN---------EVLCLILFT 440
           L PC+ ++G   +  P L L F G  + + LP +NYF    +         +V C++L  
Sbjct: 371 LGPCYYLNGLSQI--PVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLM- 427

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +      L  GP   LG++Q Q F + +DL  +R GFAK++CA
Sbjct: 428 NGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKECA 470


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 141/412 (34%), Positives = 198/412 (48%), Gaps = 45/412 (10%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCN---FPNVDPSRIPAFIPK 158
           GY I+L+ GTPPQA   ++ DTGS L W PC +  + C+DCN     N+  S I  F P 
Sbjct: 10  GYLITLNIGTPPQAVQVYM-DTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSI--FSPL 66

Query: 159 RSSSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FT 211
            SSSS    C +  C+ I      F P   + C        TC   CPS+   YG G   
Sbjct: 67  HSSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLV 126

Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSYCLL 269
           +G+L  + L+  ++ VP F  GC   +  +P GIAGFGR   SLPSQLG   K FS+C L
Sbjct: 127 SGILTRDILKARTRDVPRFSFGCVTSTYHEPIGIAGFGRGLLSLPSQLGFLEKGFSHCFL 186

Query: 270 SRKFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
             KF + P +SS L+L     S +  T  L +TP    PV  +S     YY+GL  I +G
Sbjct: 187 PFKFVNNPNISSPLILGASALSIN-LTDSLQFTPMLNTPVYPNS-----YYIGLESITIG 240

Query: 329 SKH--VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +     ++P +     S GNGG++VDSG+T+T +  P +  +    ++    Y RA + E
Sbjct: 241 TNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLT-ILQSTITYPRATETE 299

Query: 387 KKSGLRPCFDI----------SGKKSVYLPELILKFKGGAKMALPPENYFALV-----GN 431
            ++G   C+ +               +  P +   F   A + LP  N F  +     G+
Sbjct: 300 SRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGS 359

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            V CL LF +   G     GPA + G FQ QN  + +DL  +R GF    C 
Sbjct: 360 VVQCL-LFQNMEDG---NYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 407


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 143/412 (34%), Positives = 205/412 (49%), Gaps = 48/412 (11%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
           GY ISL+ GTPP+    ++ DTGS L W PC +  + C+DCN  +   +++ +      S
Sbjct: 28  GYLISLNLGTPPKVIQVYM-DTGSDLTWVPCGNLSFDCMDCN--DYRNNKLMSTYSPSYS 84

Query: 162 SSQLIG-CQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
           SS L   C +P CS +      + P   + C   +    TCP  CPS+   YG G    G
Sbjct: 85  SSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIG 144

Query: 214 LLLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFS 265
            L  +TL         ++ VPNF  GC   + R+P GIAGFGR   SLPSQLG   K FS
Sbjct: 145 TLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFS 204

Query: 266 YCLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           +C L  KF + P +SS LV+ D    S D     L +T   KNP+     +  +YY+GL 
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNPM-----YPNYYYIGLE 255

Query: 324 QIIVG-SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
            I VG +  +++P S     S GNGG+I+DSG+T+T + GP +  +    ++ +  Y RA
Sbjct: 256 AITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLL-SMLQSIITYPRA 314

Query: 383 ADVEKKSGLRPCFDISGKKSVY------LPELILKFKGGAKMALPPENYFALVG-----N 431
            + E ++G   C+ I    +V       LP +   F     + LP  N+F  +G      
Sbjct: 315 QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST 374

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            V CL+L   + +      GPA + G FQ QN  + +DL  +R GF    CA
Sbjct: 375 VVKCLLLQNMDDS----DSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 422


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 143/412 (34%), Positives = 205/412 (49%), Gaps = 48/412 (11%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
           GY ISL+ GTPP+    ++ DTGS L W PC +  + C+DCN  +   +++ +      S
Sbjct: 11  GYLISLNLGTPPKVIQVYM-DTGSDLTWVPCGNLSFDCMDCN--DYRNNKLMSTYSPSYS 67

Query: 162 SSQLIG-CQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
           SS L   C +P CS +      + P   + C   +    TCP  CPS+   YG G    G
Sbjct: 68  SSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIG 127

Query: 214 LLLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFS 265
            L  +TL         ++ VPNF  GC   + R+P GIAGFGR   SLPSQLG   K FS
Sbjct: 128 TLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFS 187

Query: 266 YCLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           +C L  KF + P +SS LV+ D    S D     L +T   KNP+     +  +YY+GL 
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNPM-----YPNYYYIGLE 238

Query: 324 QIIVG-SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
            I VG +  +++P S     S GNGG+I+DSG+T+T + GP +  +    ++ +  Y RA
Sbjct: 239 AITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLS-MLQSIITYPRA 297

Query: 383 ADVEKKSGLRPCFDISGKKSVY------LPELILKFKGGAKMALPPENYFALVG-----N 431
            + E ++G   C+ I    +V       LP +   F     + LP  N+F  +G      
Sbjct: 298 QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST 357

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            V CL+L   + +      GPA + G FQ QN  + +DL  +R GF    CA
Sbjct: 358 VVKCLLLQNMDDS----DSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 405


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 151/475 (31%), Positives = 221/475 (46%), Gaps = 62/475 (13%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSL-----IKTPLSVHSYGGYSIS 107
           L + HSL+++  +   HL   T  ++       +    L     +  PLS  S   Y++S
Sbjct: 28  LPLTHSLSNTQFTSTHHLLKSTSSRSASRFQHQHQKRHLRNRHQVSLPLSPGS--DYTLS 85

Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN--FPNVDPSRIPAFIPKRSSSSQL 165
            +  + P        DTGS LVWFPC   + C+ C     N   S  P   P+ SS+++ 
Sbjct: 86  FTLNSNPPQHVSLYLDTGSDLVWFPCKP-FECILCEGKAENTTASTPP---PRLSSTARS 141

Query: 166 IGCQNPKCSWIFG--PNVE----SRCKGCSPRNKTC-PLACPSYLLQYGLGFTAGLLLSE 218
           + C++  CS      P  +    + C   S     C   +CPS+   YG G     L  +
Sbjct: 142 VHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHD 201

Query: 219 TLRFP----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCL 268
           +++ P    S ++ NF  GC+  +  +P G+AGFGR   SLP+QL         +FSYCL
Sbjct: 202 SIKLPLATPSLSLHNFTFGCAHTALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCL 261

Query: 269 LSRKF--DDAPVSSNLVLDTGPGSGDSKTPGLS-------YTPFYKNPVGSSSAFGEFYY 319
           +S  F  D   + S L+L    G  D K   ++       YT    NP         FY 
Sbjct: 262 VSHSFNSDRLRLPSPLIL----GHSDDKEKRVNKDDVQFVYTSMLDNP-----KHPYFYC 312

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN- 378
           VGL  I +G K +  P        +G+GGV+VDSG+TFT +   L+ +V  EF  ++G  
Sbjct: 313 VGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRV 372

Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG-AKMALPPENYFA--LVGNE--- 432
           Y RA +VE K+GL PC+       V +P L+L F G  + + LP +NYF   L G +   
Sbjct: 373 YERAKEVEDKTGLGPCYYY--DTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVR 430

Query: 433 ----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               V CL+L          G GP   LG++Q   F + +DL   R GFA++KCA
Sbjct: 431 RKRRVGCLMLMNGGEEAELTG-GPGATLGNYQQHGFEVVYDLEQRRVGFARRKCA 484


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 137/410 (33%), Positives = 196/410 (47%), Gaps = 41/410 (10%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDC-NFPNVDPSRIPAFIPKRS 160
           GY I+L+ GTPPQA   ++ DTGS L W PC +  + C++C +  N D      F P  S
Sbjct: 82  GYLITLNIGTPPQAVQVYL-DTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHS 140

Query: 161 SSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
           S+S    C +  C  I      F P   + C        TC   CPS+   YG G   +G
Sbjct: 141 STSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISG 200

Query: 214 LLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSYCLLSR 271
           +L  + L+  ++ VP F  GC   + R+P GIAGFGR   SLPSQLG   K FS+C L  
Sbjct: 201 ILTRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPF 260

Query: 272 KFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           KF + P +SS L+L     S +  T  L +TP    P+  +S     YY+GL  I +G+ 
Sbjct: 261 KFVNNPNISSPLILGASALSIN-LTDSLQFTPMLNTPMYPNS-----YYIGLESITIGTN 314

Query: 331 --HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
               ++P +     S GNGG++VDSG+T+T +  P +  +    ++    Y RA + E +
Sbjct: 315 ITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTT-LQSTITYPRATETESR 373

Query: 389 SGLRPCFDI----------SGKKSVYLPELILKFKGGAKMALPPENYFALV-----GNEV 433
           +G   C+ +               +  P +   F   A + LP  N F  +     G+ V
Sbjct: 374 TGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVV 433

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            CL LF +   G     GPA + G FQ QN  + +DL  +R GF    C 
Sbjct: 434 QCL-LFQNMEDG---DYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 479


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 150/465 (32%), Positives = 218/465 (46%), Gaps = 52/465 (11%)

Query: 53  LKILHSLASSSLSRARHL--KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
           L + HSL+    +   HL   T T   ++      ++ N L   PLS  S   Y++S + 
Sbjct: 25  LPLTHSLSMIEFNTTHHLLKSTSTHSLSRFHRHKHHHHNQL-SLPLSPGS--DYTLSFNL 81

Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP---NVDPSRIPAFIPKRSSSSQLIG 167
           G   Q  T ++ DTGS LVWFPCT  + C+ C        DPS      P   S S  I 
Sbjct: 82  GPHSQPITLYM-DTGSDLVWFPCTP-FNCILCELKPKLTSDPSP-----PTNISHSTPIS 134

Query: 168 CQNPKCSWIFGPNVES------RCKGCSPRNKTC-PLACPSYLLQYGLGFTAGLLLSETL 220
           C +  CS        S       C   S   K C    CP +   YG G     L  +TL
Sbjct: 135 CNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLIASLYRDTL 194

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFD 274
              +  + NF  GC+  +  +P G+AGFGR   SLP+QL         +FSYCL+S  F 
Sbjct: 195 SLSTLQLTNFTFGCAHTTFSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFR 254

Query: 275 DAPVS--SNLVL----DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
              +   S L+L    D    +GD +     YT   +NP  S      FY VGL+ I VG
Sbjct: 255 SERIRKPSPLILGRYNDEKQSNGD-EVVEFVYTSMLENPKHS-----YFYTVGLKGISVG 308

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEK 387
            K V  P         G+GGV+VDSG+TFT +    + +V + F R+    +R A ++E+
Sbjct: 309 KKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQ 368

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYF--------ALVGNEVLCLIL 438
           K+GL PC+ ++   +  +P + L+F G  + + LP +NYF         +   E +  ++
Sbjct: 369 KTGLSPCYYLN--TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLM 426

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           F +      +  GP  +LG++Q Q F +E+DL   R GFA++KCA
Sbjct: 427 FMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKCA 471


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 153/485 (31%), Positives = 218/485 (44%), Gaps = 79/485 (16%)

Query: 53  LKILHSLASSSLSRARHL--KTKTKPKTKDSNIGSNYSNSL------------IKTPLSV 98
           L + HSL+ +  +   HL   T  K   +  +  + +SN L            I  PLS 
Sbjct: 31  LPLSHSLSKTKFTSTHHLLKSTTIKSTARHHHHRTRHSNKLKNHHRHHQHQQQISLPLS- 89

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
               G   +L+F    Q  + ++ DTGS +VWFPC S + C+ C     +P  +    P 
Sbjct: 90  ---PGTDYTLTFSINSQTLSVYM-DTGSDIVWFPC-SPFECILCE-GKFEPGTL---TPL 140

Query: 159 RSSSSQLIGCQNPKCSWIFG-PNVESRCKGCSPRNKTCPL-----------ACPSYLLQY 206
             S S LI C++  CS     P+    C         CPL            CPS+   Y
Sbjct: 141 NVSKSSLISCKSRACSTAHNSPSTSDLCAIAK-----CPLDEIETSDCSNYHCPSFYYAY 195

Query: 207 GLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL 261
           G G     L    L  PS +     + +F  GC+  +  +P G+AGFG  S SLP+QL  
Sbjct: 196 GDGSLIAKLHKHNLIMPSTSNKPFSLKDFTFGCAHSALGEPIGVAGFGFGSLSLPAQLAN 255

Query: 262 ------KKFSYCLLSRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
                  +FSYCL+S  FD   +   S L+L         +     YTP   NP      
Sbjct: 256 LSPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNP-----K 310

Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
              FY V +  I VGS  V+ P + +    DGNGGV+VDSG+T+T +    + +VA E  
Sbjct: 311 HPYFYSVSMEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELD 370

Query: 374 RQMGN-YSRAADVEKKSGLRPCFDISG----KKSVYLPELILKFKGGAKMALPPENYF-- 426
           R++G  + RA++ E K+GL PC+ + G    +  + +P L   F G   + LP  NYF  
Sbjct: 371 RRVGRVFKRASETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYE 430

Query: 427 ------ALVGNEVLCLILFT--DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
                    G +V CL+L    D + G     GP   LG++Q Q F + +DL   R GFA
Sbjct: 431 FLDGEDEKKGRKVGCLMLMDGGDESEG-----GPGATLGNYQQQGFQVVYDLEERRVGFA 485

Query: 479 KQKCA 483
            +KCA
Sbjct: 486 PRKCA 490


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 144/433 (33%), Positives = 213/433 (49%), Gaps = 51/433 (11%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
           I  P++ ++  GY +SL+ GTPPQ    ++ DTGS L W PC  +S Y+C+DC   +V P
Sbjct: 14  IIEPVTAYT-DGYLLSLNLGTPPQVFQVYL-DTGSDLTWVPCGSSSSYQCLDCG-SSVKP 70

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYL 203
           +  P F+P  S+S+    C +  C  +      F P   + C   +     CP  CP + 
Sbjct: 71  T--PTFLPSESTSNTRDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFS 128

Query: 204 LQYGLG-FTAGLLLSETLRFPSKT-------------VPNFLAGCSILSDRQPAGIAGFG 249
             YG G    G L  +++     T              P F  GC   S R+P GIAGFG
Sbjct: 129 YTYGGGALVLGSLSRDSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREPLGIAGFG 188

Query: 250 RSSESLPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKN 306
           R + SLPSQLG   K FS+C L  +F   P  +S LV+     S  S   G  +TP    
Sbjct: 189 RGALSLPSQLGFLGKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPML-- 246

Query: 307 PVGSSSAFGEFYYVGLRQIIVGSKH----VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
              +S+ +  FYYVGL  +++G       +  P S     + GNGGV+VD+G+T+T +  
Sbjct: 247 ---TSATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPD 303

Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKM 418
           P + +V    I     Y R+ D+E ++G   CF +   ++      LP + L   GGA++
Sbjct: 304 PFYASVLASLISAAPPYERSRDLEARTGFDLCFKVPCARAPCADDELPPITLHLAGGARL 363

Query: 419 ALPP-ENYF---ALVGNEVLCLILF----TDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
           ALP   +Y+   A+  + V+  +LF     ++      G GPA +LG FQ+QN  + +DL
Sbjct: 364 ALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDL 423

Query: 471 ANDRFGFAKQKCA 483
           A  R GF  + CA
Sbjct: 424 AAGRVGFRPRDCA 436


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 138/410 (33%), Positives = 203/410 (49%), Gaps = 45/410 (10%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
           GY ISLS GTPPQ    ++ DTGS L W PC +  + C++C+  N   +R+ A      S
Sbjct: 79  GYLISLSIGTPPQVIQVYM-DTGSDLTWAPCGNISFDCIECD--NYRNNRMMASFSPSHS 135

Query: 162 SSQLIG-CQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
           SS     C +P C  +        P   + C   +    TC   CP +   YG G    G
Sbjct: 136 SSSHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTG 195

Query: 214 LLLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFS 265
            L  +TLR        ++ +P F  GC   S R+P GIAGFGR + SLPSQLG   K FS
Sbjct: 196 TLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYREPIGIAGFGRGALSLPSQLGFLRKGFS 255

Query: 266 YCLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           +C L+ K+ + P +SS L++ D    S D     + +TP  K+P+     +  +YYVGL 
Sbjct: 256 HCFLAFKYANNPNISSPLIIGDIALTSKDD----MQFTPMLKSPM-----YPNYYYVGLE 306

Query: 324 QIIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
            I VG+    ++P S     S GNGG++VDSG+T+T +  P +  V    ++ + NY RA
Sbjct: 307 AITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVL-SVLQSIINYPRA 365

Query: 383 ADVEKKSGLRPCFDISGKKSV-----YLPELILKFKGGAKMALPPENYF----ALVGNEV 433
            D+E ++G   C+ +  + +       LP +   F   A + L   ++F    A   + V
Sbjct: 366 TDMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTV 425

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +  +LF     G     GPA +LG FQ Q+  + +D+  +R GF    CA
Sbjct: 426 VKCLLFQSMDDG---DYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDCA 472


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score =  176 bits (445), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 141/422 (33%), Positives = 198/422 (46%), Gaps = 50/422 (11%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNF--PNVDPSRIPAFIPKR 159
           GY +SLS GTPPQ    ++ DTGS L W PC +  + C DC     N+   R+ AF+P  
Sbjct: 20  GYLMSLSIGTPPQVVQVYM-DTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTH 78

Query: 160 SSSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGL-GFTA 212
           SS+S    C +  C  I      F P   + C   S    TCP  CPS+   YG  G   
Sbjct: 79  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVT 138

Query: 213 GLLLSETL---------RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL-- 261
           G L  + L            +K +P F  GC   + R+P GIAGFGR   SLP QLG   
Sbjct: 139 GSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSH 198

Query: 262 KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           K FS+C L  KF + P  SS L+L  G  +  SK   L +TP  K+P+     +  +YY+
Sbjct: 199 KGFSHCFLPFKFSNNPNFSSPLIL--GNLAISSKDENLQFTPLLKSPM-----YPNYYYI 251

Query: 321 GLRQIIVGS--KHVKIPYSYLVPGSD--GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           GL  I +G+   + +   S+ +   D  GNGG+++DSG+T+T +  PL+  +       +
Sbjct: 252 GLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVI 311

Query: 377 GNYSRAADVEKKSGLRPCFDISGKKS-------VYLPELILKFKGGAKMALPP-ENYFAL 428
           G Y RA  VE  +G   C+ +  K +         LP +   F     + LP   N++A+
Sbjct: 312 G-YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAM 370

Query: 429 VG----NEVLCLI---LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
                   V CL+   +        +   GPA I G FQ QN  + +DL  +R GF    
Sbjct: 371 AAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMD 430

Query: 482 CA 483
           C 
Sbjct: 431 CV 432


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 139/410 (33%), Positives = 200/410 (48%), Gaps = 44/410 (10%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
           GY ISL+ GTPPQ    ++ DTGS L W PC +  + C+DC+    +   + AF P  SS
Sbjct: 11  GYLISLNIGTPPQVIQVYM-DTGSDLTWVPCGNLSFDCMDCD-DYRNSKLMSAFSPSHSS 68

Query: 162 SSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGL 214
           SS    C +P C+ I      F P   + C   +    TC   CPS+   YG G    G 
Sbjct: 69  SSYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGT 128

Query: 215 LLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSY 266
           L  +TLR        +K +P F  GC   +  +P GIAGF R + S PSQLGL  K FS+
Sbjct: 129 LTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHEPIGIAGFVRGTLSFPSQLGLLKKGFSH 188

Query: 267 CLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           C L+ K+ + P +SS LV+ DT   S D+    + +TP  K+P+     +  +YY+GL  
Sbjct: 189 CFLAFKYANNPNISSPLVIGDTALSSKDN----MQFTPMLKSPM-----YPNYYYIGLEA 239

Query: 325 IIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           I VG+     +P +     S GNGG+++DSG+T+T +  P +  +   F + +  Y RA 
Sbjct: 240 ITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIF-KAIITYPRAT 298

Query: 384 DVEKKSGLRPCFDIS------GKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEV 433
           +VE ++G   C+ +              P +   F       LP  N+F    A   + V
Sbjct: 299 EVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTV 358

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +  +LF   A       GPA + G FQ QN  + +DL  +R GF    CA
Sbjct: 359 VKCLLFQSMADS---DYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCA 405


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 165/494 (33%), Positives = 216/494 (43%), Gaps = 68/494 (13%)

Query: 6   FSLICLFSLLILLFTTDAGAGSSAAT------VTVPLTPLSTKHYLHHSDSDPLKILHSL 59
           FS I L SLL++       A SS         + VPLT     H   H +   L++L   
Sbjct: 25  FSWIVLVSLLLVSMAIVLAAASSHPAAGLLDGLRVPLT-----HVDAHGNYTKLQLLRRA 79

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQAST 118
           A  S  R   L  +T         GS  + +     + VH+  G + + +S GTP  A  
Sbjct: 80  ARRSHHRMSRLVARTA-------TGSVKAAAAPDLQVPVHAGNGEFLMDMSIGTPALAYA 132

Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
             I DTGS LVW  C     CV+C   +      P F P  SS+   + C +  CS +  
Sbjct: 133 A-IVDTGSDLVWTQCKP---CVECFNQST-----PVFDPSSSSTYSTLPCSSSLCSDLPT 183

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSIL 237
               S  K C             Y   YG    T G+L +ET       +P    GC   
Sbjct: 184 STCTSAAKDCG------------YTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDT 231

Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
           ++     Q AG+ G GR   SL SQLGL KFSYCL S   DD   S  L+      S D+
Sbjct: 232 NEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTS--LDDTSKSPLLLGSLAAISTDT 289

Query: 294 KTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
            +   +  TP  KNP   S     FYYV L+ + VGS  + +P S      DG GGVIVD
Sbjct: 290 ASAAAIQTTPLIKNPSQPS-----FYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVD 344

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSVYLPELIL 410
           SG++ T++E   +  + K F  QM      AD     GL  CF    SG   V +P+L+L
Sbjct: 345 SGTSITYLELQGYRPLKKAFAAQM--KLPVAD-GSAVGLDLCFKAPASGVDDVEVPKLVL 401

Query: 411 KFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
            F GGA + LP ENY  L   +  LCL +           RG +II G+FQ QN    +D
Sbjct: 402 HFDGGADLDLPAENYMVLDSASGALCLTVMGS--------RGLSII-GNFQQQNIQFVYD 452

Query: 470 LANDRFGFAKQKCA 483
           +  D   FA  +CA
Sbjct: 453 VDKDTLSFAPVQCA 466


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  171 bits (433), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 142/396 (35%), Positives = 184/396 (46%), Gaps = 53/396 (13%)

Query: 98  VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           VH+  G + + +S GTP  A +  I DTGS LVW  C     CVDC          P F 
Sbjct: 88  VHAGNGEFLMDVSIGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFD 138

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           P  SS+   + C +  CS +             P +K    +   Y   YG    T G+L
Sbjct: 139 PSSSSTYATVPCSSASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVL 185

Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
            +ET       +P  + GC   ++     Q AG+ G GR   SL SQLGL KFSYCL S 
Sbjct: 186 ATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS- 244

Query: 272 KFDDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
             DD   +S L+L +  G     +    +  TP  KNP   S     FYYV L+ I VGS
Sbjct: 245 -LDDTN-NSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGS 297

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + +P S      DG GGVIVDSG++ T++E   + A+ K F  QM     AAD     
Sbjct: 298 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGV 354

Query: 390 GLRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
           GL  CF     G   V +P L+  F GGA + LP ENY  L G    LCL +        
Sbjct: 355 GLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS----- 409

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              RG +II G+FQ QNF   +D+ +D   FA  +C
Sbjct: 410 ---RGLSII-GNFQQQNFQFVYDVGHDTLSFAPVQC 441


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  171 bits (432), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 142/396 (35%), Positives = 184/396 (46%), Gaps = 53/396 (13%)

Query: 98  VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           VH+  G + + +S GTP  A +  I DTGS LVW  C     CVDC          P F 
Sbjct: 98  VHAGNGEFLMDVSIGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFD 148

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           P  SS+   + C +  CS +             P +K    +   Y   YG    T G+L
Sbjct: 149 PSSSSTYATVPCSSASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVL 195

Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
            +ET       +P  + GC   ++     Q AG+ G GR   SL SQLGL KFSYCL S 
Sbjct: 196 ATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS- 254

Query: 272 KFDDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
             DD   +S L+L +  G     +    +  TP  KNP   S     FYYV L+ I VGS
Sbjct: 255 -LDDTN-NSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGS 307

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + +P S      DG GGVIVDSG++ T++E   + A+ K F  QM     AAD     
Sbjct: 308 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGV 364

Query: 390 GLRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
           GL  CF     G   V +P L+  F GGA + LP ENY  L G    LCL +        
Sbjct: 365 GLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS----- 419

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              RG +II G+FQ QNF   +D+ +D   FA  +C
Sbjct: 420 ---RGLSII-GNFQQQNFQFVYDVGHDTLSFAPVQC 451


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 143/476 (30%), Positives = 216/476 (45%), Gaps = 61/476 (12%)

Query: 53  LKILHSLASSSLSRARHL--KTKTKPKTK---DSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
           L ++HSL+ +  +   HL   T T+  T+     +  +++++  +  PLS  S   Y++S
Sbjct: 28  LPLIHSLSKTQFTSTHHLLKSTSTRSTTRFHHHHHNKNSHNHRQVSLPLSPGS--DYTLS 85

Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
            +  + P +      DTGS LVWFPC   + C+ C     + S      PK S ++  + 
Sbjct: 86  FTINSQPIS---LYLDTGSDLVWFPCQP-FECILCEGKAENASLASTPPPKLSKTATPVS 141

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPL-----------ACPSYLLQYGLGFTAGLLL 216
           C++  CS +   N+ S    C+  N  CPL           +CP +   YG G     L 
Sbjct: 142 CKSSACSAVHS-NLPSS-DLCAISN--CPLESIEISDCRKHSCPQFYYAYGDGSLIARLY 197

Query: 217 SETLRFP-----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFS 265
            +++R P     +    NF  GC+  +  +P G+AGFGR   SLP+QL         +FS
Sbjct: 198 RDSIRLPLSNQTNLIFNNFTFGCAHTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFS 257

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKT--------PGLSYTPFYKNPVGSSSAFGEF 317
           YCL+S  FD   V     L  G    D K         P   YT    NP         F
Sbjct: 258 YCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNP-----RHPYF 312

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y VGL  I +G K +  P         G+GGV+VDSG+TFT +   L++ V  EF  ++G
Sbjct: 313 YCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVG 372

Query: 378 NYS-RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------A 427
             + RA+ +E+ +GL PC+            ++     G+ + LP  NYF          
Sbjct: 373 RVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGK 432

Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               +V CL+L  +      L  GP   LG++Q Q F + +DL N R GFA+++CA
Sbjct: 433 GKKRKVGCLMLM-NGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQCA 487


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 142/396 (35%), Positives = 184/396 (46%), Gaps = 53/396 (13%)

Query: 98  VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           VH+  G + + +S GTP  A +  I DTGS LVW  C     CVDC          P F 
Sbjct: 67  VHAGNGEFLMDVSIGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFD 117

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           P  SS+   + C +  CS +             P +K    +   Y   YG    T G+L
Sbjct: 118 PSSSSTYATVPCSSASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVL 164

Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
            +ET       +P  + GC   ++     Q AG+ G GR   SL SQLGL KFSYCL S 
Sbjct: 165 ATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS- 223

Query: 272 KFDDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
             DD   +S L+L +  G     +    +  TP  KNP   S     FYYV L+ I VGS
Sbjct: 224 -LDDTN-NSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGS 276

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + +P S      DG GGVIVDSG++ T++E   + A+ K F  QM     AAD     
Sbjct: 277 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGV 333

Query: 390 GLRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
           GL  CF     G   V +P L+  F GGA + LP ENY  L G    LCL +        
Sbjct: 334 GLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS----- 388

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              RG +II G+FQ QNF   +D+ +D   FA  +C
Sbjct: 389 ---RGLSII-GNFQQQNFQFVYDVGHDTLSFAPVQC 420


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 154/460 (33%), Positives = 215/460 (46%), Gaps = 50/460 (10%)

Query: 55  ILHSLASSSL-SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
           I H + SSSL S ARH + +T       ++ S+  +  +  PL+  S   Y++SLS G P
Sbjct: 41  IHHLIRSSSLRSAARHGRHRTH------HLPSSRRHRQLSLPLAPGS--DYTLSLSVG-P 91

Query: 114 PQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP-KRSSSSQLIGCQN 170
              + P     DTGS LVWFPC   + C+ C      P    +  P    + S+ I C +
Sbjct: 92  LSTANPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCAS 150

Query: 171 PKCSWIF--GPNVE----SRCKGCSPRNKTCPL--ACPSYLLQYGLG-FTAGLLLSETLR 221
           P CS      P  +    +RC        +C    ACP     YG G   A L       
Sbjct: 151 PFCSAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRGRVGI 210

Query: 222 FPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK----KFSYCLLSRKFD-DA 276
             S  V NF   C+  +  +P G+AGFGR   SLP+QL       +FSYCL++  F  D 
Sbjct: 211 AASVAVENFTFACAHTALGEPVGVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRADR 270

Query: 277 PVS-SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           P+  S L+L   PG   +   G+ YTP   NP         FY V L  + VG   +   
Sbjct: 271 PIRPSPLILGRSPGEDPASETGIVYTPLLHNP-----KHPYFYSVALEAVSVGGTRIPAR 325

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLRP 393
                 G  G+GG++VDSG+TFT +    +  VA+EF R M    + RA   E ++GL P
Sbjct: 326 PELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAP 385

Query: 394 CF----DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNE----VLCLILFTDN 442
           C+    D S  +   +  +P L + F+G A + LP  NYF    +E    V CL+L    
Sbjct: 386 CYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMN-- 443

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             G   G GPA  LG+FQ Q F + +D+   R GFA+++C
Sbjct: 444 -GGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|297740193|emb|CBI30375.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 103/250 (41%), Positives = 139/250 (55%), Gaps = 30/250 (12%)

Query: 1   MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLA 60
           MA+    L  +F+L  +LF   +   +  AT+T+PLT   T        ++P   L  LA
Sbjct: 29  MASSTSLLFPVFTLFSILFLASSSNDNIPATITIPLTSTFTSKL----STEPRVFLQHLA 84

Query: 61  SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
           S+SLSRA HLK  T             ++ L+K  L  HSYGG++I LSFGTPPQ  + F
Sbjct: 85  SASLSRAHHLKHGT-------------TSPLVKASLFPHSYGGHTIPLSFGTPPQKLS-F 130

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           + DTGS +VW PCT+ Y C +C+F N  P ++P F PK SSS +++ C+NPKCS      
Sbjct: 131 LVDTGSHVVWAPCTTHYTCTNCSFSN--PKKVPIFNPKLSSSYKILECRNPKCSL----- 183

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR 240
               C  C+  +K C  ACP Y LQYG G  +G  L E L FP KT+  FL GC+  +  
Sbjct: 184 ---GCPRCNGNSKNCSHACPQYSLQYGTGSASGFFLLENLNFPGKTIHKFLVGCTTSAAH 240

Query: 241 QPA--GIAGF 248
           +P    +AGF
Sbjct: 241 EPTSDALAGF 250


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 141/402 (35%), Positives = 191/402 (47%), Gaps = 30/402 (7%)

Query: 104 YSISLSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y++SLS G P  AS+  +F DTGS LVWFPC   + C+ C           + +P     
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145

Query: 163 SQLIGCQNPKCSWIF--GPNVE----SRCKGCSPRNKTCP-LACPSYLLQYGLG-FTAGL 214
           S+ I C +P CS      P  +    +RC   +    +C   ACP     YG G   A L
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205

Query: 215 LLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
                    S  V NF   C+  +  +P G+AGFGR   SLP+QL      +FSYCL++ 
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265

Query: 272 KF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
            F  D    SS L+L     S D+   G S T F   P+  +     FY V L  + VG 
Sbjct: 266 SFRADRLIRSSPLILGR---STDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGG 322

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA--DVEK 387
           K ++          DGNGG++VDSG+TFT +    F  VA EF R M           E 
Sbjct: 323 KRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEA 382

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFT--- 440
           ++GL PC+  S      +P + L F+G A +ALP  NYF    +  G  V CL+L     
Sbjct: 383 QTGLAPCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGG 441

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +N  G   G GPA  LG+FQ Q F + +D+   R GFA+++C
Sbjct: 442 NNDDGED-GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 141/402 (35%), Positives = 191/402 (47%), Gaps = 30/402 (7%)

Query: 104 YSISLSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y++SLS G P  AS+  +F DTGS LVWFPC   + C+ C           + +P     
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145

Query: 163 SQLIGCQNPKCSWIF--GPNVE----SRCKGCSPRNKTCP-LACPSYLLQYGLG-FTAGL 214
           S+ I C +P CS      P  +    +RC   +    +C   ACP     YG G   A L
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205

Query: 215 LLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
                    S  V NF   C+  +  +P G+AGFGR   SLP+QL      +FSYCL++ 
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265

Query: 272 KF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
            F  D    SS L+L     S D+   G S T F   P+  +     FY V L  + VG 
Sbjct: 266 SFRADRLIRSSPLILGR---STDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGG 322

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD--VEK 387
           K ++          DGNGG++VDSG+TFT +    F  VA EF R M           E 
Sbjct: 323 KRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEA 382

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFT--- 440
           ++GL PC+  S      +P + L F+G A +ALP  NYF    +  G  V CL+L     
Sbjct: 383 QTGLAPCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGG 441

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +N  G   G GPA  LG+FQ Q F + +D+   R GFA+++C
Sbjct: 442 NNDDGED-GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 140/410 (34%), Positives = 197/410 (48%), Gaps = 44/410 (10%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
           GY ISL+ GTPPQ     + DTGS L W PC +  + C++C+    +   +  F P  SS
Sbjct: 81  GYLISLNIGTPPQV-IQVLMDTGSDLTWVPCGNLSFDCMECD-DYRNNKLMATFSPSYSS 138

Query: 162 SSQLIGCQNPKCSWIF---GPNVESRCKGCSPRN---KTCPLACPSYLLQYGLG-FTAGL 214
           SS    C +P C  I     P       GCS       TC   CPS+   YG G    G+
Sbjct: 139 SSYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGI 198

Query: 215 LLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSY 266
           L  +TLR        +K +P F  GC   + R+P GIAGFGR + S+ SQLG   K FS+
Sbjct: 199 LTRDTLRVNGSSPGVAKEIPKFCFGCVGSAYREPIGIAGFGRGTLSMVSQLGFLQKGFSH 258

Query: 267 CLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           C L+ K+ + P +SS LV+ D    S D     + +TP   +P+     +  FYYVGL  
Sbjct: 259 CFLAFKYANNPNISSPLVVGDIALTSKDD----MQFTPMLNSPM-----YPNFYYVGLEA 309

Query: 325 IIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           I VG+    ++P S     S GNGG+ +DSG+T+T +  P +  V    ++   NY R  
Sbjct: 310 ITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVL-SILQSTINYPRDT 368

Query: 384 DVEKKSGLRPCFDI------SGKKSVYLPELILKFKGGAKMALPPENYFALV---GN-EV 433
            +E ++G   C+ +      +      LP +   F     + LP  N+F  V   GN  V
Sbjct: 369 GMEMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAV 428

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +  ++F     G     GPA + G FQ QN  + +DL  +R GF    CA
Sbjct: 429 VKCLMFQSTDDG---DDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 475


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 143/489 (29%), Positives = 220/489 (44%), Gaps = 73/489 (14%)

Query: 24  GAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTK---- 79
           GA    + +   +  L+    LH       +++ +   +++SR + L+ K +PK      
Sbjct: 113 GAEPKNSVIDSTVRDLTRIQNLHR------RVIENRNQNTISRLQRLQ-KEQPKQSFKPV 165

Query: 80  ---DSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPFIFDTGSSLVWFPCT 134
               ++  S  S  L+ T  S  S G   Y + +  GTPP+  +  I DTGS L W  C 
Sbjct: 166 FAPAASSTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFS-LILDTGSDLNWIQCV 224

Query: 135 SRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT 194
               C + + P  DP        K SSS + I C +P+C  +  P+  + CK     N++
Sbjct: 225 PCIACFEQSGPYYDP--------KDSSSFRNISCHDPRCQLVSSPDPPNPCKA---ENQS 273

Query: 195 CPLACPSYLLQYGLGF-TAGLLLSET----LRFPS-----KTVPNFLAGCSILS------ 238
           CP     Y   YG G  T G    ET    L  P+     K V N + GC   +      
Sbjct: 274 CP-----YFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFHG 328

Query: 239 ----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
                    G   F    +SL  Q     FSYCL+ R   +A VSS L+   G       
Sbjct: 329 AAGLLGLGKGPLSFASQMQSLYGQ----SFSYCLVDRN-SNASVSSKLIF--GEDKELLS 381

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
            P L++T F     G   +   FYYV +  ++V  + +KIP       S+G GG I+DSG
Sbjct: 382 HPNLNFTSFGG---GKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSG 438

Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
           +T T+   P +E + + F+R++  Y     VE    L+PC+++SG + + LP+  + F  
Sbjct: 439 TTLTYFAEPAYEIIKEAFVRKIKGYEL---VEGLPPLKPCYNVSGIEKMELPDFGILFAD 495

Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
           GA    P ENYF  +  +V+CL +  +        R    I+G++Q QNF++ +D+   R
Sbjct: 496 GAVWNFPVENYFIQIDPDVVCLAILGN-------PRSALSIIGNYQQQNFHILYDMKKSR 548

Query: 475 FGFAKQKCA 483
            G+A  KCA
Sbjct: 549 LGYAPMKCA 557


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 138/383 (36%), Positives = 176/383 (45%), Gaps = 52/383 (13%)

Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
            GTP  A +  I DTGS LVW  C     CVDC          P F P  SS+   + C 
Sbjct: 173 IGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFDPSSSSTYATVPCS 223

Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVP 228
           +  CS +             P +K    +   Y   YG    T G+L +ET       +P
Sbjct: 224 SASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLP 270

Query: 229 NFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVL 284
             + GC   ++     Q AG+ G GR   SL SQLGL KFSYCL S   DD   +S L+L
Sbjct: 271 GVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS--LDDTN-NSPLLL 327

Query: 285 DT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
            +  G     +    +  TP  KNP   S     FYYV L+ I VGS  + +P S     
Sbjct: 328 GSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGSTRISLPSSAFAVQ 382

Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGK 400
            DG GGVIVDSG++ T++E   + A+ K F  QM     AAD     GL  CF     G 
Sbjct: 383 DDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGVGLDLCFRAPAKGV 439

Query: 401 KSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDF 459
             V +P L+  F GGA + LP ENY  L G    LCL +           RG +II G+F
Sbjct: 440 DQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS--------RGLSII-GNF 490

Query: 460 QLQNFYLEFDLANDRFGFAKQKC 482
           Q QNF   +D+ +D   FA  +C
Sbjct: 491 QQQNFQFVYDVGHDTLSFAPVQC 513


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 126/402 (31%), Positives = 185/402 (46%), Gaps = 57/402 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTPP+  +  I DTGS L W  C     C + + P  DP        K SS
Sbjct: 195 GEYFMDVFVGTPPKHFS-LILDTGSDLNWIQCVPCIACFEQSGPYYDP--------KDSS 245

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET- 219
           S + I C +P+C  +  P+    CK     N++CP     Y   YG G  T G    ET 
Sbjct: 246 SFRNISCHDPRCQLVSAPDPPKPCKA---ENQSCP-----YFYWYGDGSNTTGDFALETF 297

Query: 220 ---LRFPS-----KTVPNFLAGCSILS----------DRQPAGIAGFGRSSESLPSQLGL 261
              L  P+     K V N + GC   +               G   F    +SL  Q   
Sbjct: 298 TVNLTTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQ--- 354

Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
             FSYCL+ R   +A VSS L+   G        P L++T F     G   +   FYYV 
Sbjct: 355 -SFSYCLVDRN-SNASVSSKLIF--GEDKELLSHPNLNFTSFGG---GKDGSVDTFYYVQ 407

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           ++ ++V  + +KIP       S+G GG I+DSG+T T+   P +E + + F+R++  Y  
Sbjct: 408 IKSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQL 467

Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
              VE    L+PC+++SG + + LP+  + F   A    P ENYF  +  EV+CL +  +
Sbjct: 468 ---VEGLPPLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGN 524

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   R    I+G++Q QNF++ +D+   R G+A  KCA
Sbjct: 525 -------PRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCA 559


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  165 bits (417), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 134/402 (33%), Positives = 191/402 (47%), Gaps = 56/402 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTPP+  +  I DTGS L W  C   Y C + N P  DP        K SS
Sbjct: 193 GEYFMDVFVGTPPKHFS-LILDTGSDLNWIQCVPCYACFEQNGPYYDP--------KDSS 243

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET- 219
           S + I C +P+C  +  P+    CKG     ++CP     Y   YG    T G    ET 
Sbjct: 244 SFKNITCHDPRCQLVSSPDPPQPCKG---ETQSCP-----YFYWYGDSSNTTGDFALETF 295

Query: 220 ---LRFPS-----KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
              L  P      K V N + GC   +       AG+ G GR   S  +QL       FS
Sbjct: 296 TVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFS 355

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY---KNPVGSSSAFGEFYYVGL 322
           YCL+ R   ++ VSS L+   G        P L++T F    +NPV +      FYYV +
Sbjct: 356 YCLVDRN-SNSSVSSKLIF--GEDKELLSHPNLNFTSFVGGKENPVDT------FYYVLI 406

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           + I+VG + +KIP       + G GG I+DSG+T T+   P +E + + F+R++  +   
Sbjct: 407 KSIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL- 465

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTD 441
             VE    L+PC+++SG + + LPE  + F  GA    P ENYF  +  E V+CL +   
Sbjct: 466 --VETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAIL-- 521

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   R    I+G++Q QNF++ +DL   R G+A  KCA
Sbjct: 522 -----GTPRSALSIIGNYQQQNFHILYDLKKSRLGYAPMKCA 558


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 137/472 (29%), Positives = 214/472 (45%), Gaps = 53/472 (11%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKT-----KDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
           L + HSL+ +  +   HL   T   +     +  +  + +++  +  PLS  S   Y++S
Sbjct: 28  LPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQKNTHNHRQVSLPLSPGS--DYTLS 85

Query: 108 LSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
            +  + P     F++ DTGS LVWFPC   + C+ C     + S      PK S ++  +
Sbjct: 86  FTLDSQPI----FLYLDTGSDLVWFPCQP-FECILCEGKAENTSLASTPPPKLSKTATPV 140

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPL-----------ACPSYLLQYGLGFTAGLL 215
            C++  CS     N+ S    C+  N  CPL           +CP +   YG G     L
Sbjct: 141 SCKSSACSAAHS-NLPSS-DLCAISN--CPLESIETSDCQKHSCPQFYYAYGDGSLIARL 196

Query: 216 LSETLRFP-----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKF 264
             +++  P     +  V NF  GC+  +  +P G+AGFGR   SLP+QL         +F
Sbjct: 197 YRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQF 256

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSK---TPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           SYCL+S  FD   +     L  G    D K     G++   F    +  +     FY VG
Sbjct: 257 SYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVG 316

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS- 380
           L  I +G K +  P        +G+GG++VDSG+TFT +   L+ +V  EF  ++G  + 
Sbjct: 317 LEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNE 376

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------ALVGN 431
           RA  +E+ +GL PC+            ++     G+ + LP  NYF              
Sbjct: 377 RARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKR 436

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +V CL+L  +      L  GP   LG++Q Q F + +DL N R GFA+++CA
Sbjct: 437 KVGCLMLM-NGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCA 487


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 137/472 (29%), Positives = 214/472 (45%), Gaps = 53/472 (11%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKT-----KDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
           L + HSL+ +  +   HL   T   +     +  +  + +++  +  PLS  S   Y++S
Sbjct: 28  LPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQKNTHNHRQVSLPLSPGS--DYTLS 85

Query: 108 LSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
            +  + P     F++ DTGS LVWFPC   + C+ C     + S      PK S ++  +
Sbjct: 86  FTLDSQPI----FLYLDTGSDLVWFPCQP-FECILCEGKAENTSLASTPPPKLSKTATPV 140

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPL-----------ACPSYLLQYGLGFTAGLL 215
            C++  CS     N+ S    C+  N  CPL           +CP +   YG G     L
Sbjct: 141 SCKSSACSAAHS-NLPSS-DLCAISN--CPLESIETSDCQKHSCPQFYYAYGDGSLIARL 196

Query: 216 LSETLRFP-----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKF 264
             +++  P     +  V NF  GC+  +  +P G+AGFGR   SLP+QL         +F
Sbjct: 197 YRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQF 256

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSK---TPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           SYCL+S  FD   +     L  G    D K     G++   F    +  +     FY VG
Sbjct: 257 SYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVG 316

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS- 380
           L  I +G K +  P        +G+GG++VDSG+TFT +   L+ +V  EF  ++G  + 
Sbjct: 317 LEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNE 376

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------ALVGN 431
           RA  +E+ +GL PC+            ++     G+ + LP  NYF              
Sbjct: 377 RARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKR 436

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +V CL+L  +      L  GP   LG++Q Q F + +DL N R GFA+++CA
Sbjct: 437 KVGCLMLM-NGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCA 487


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 135/416 (32%), Positives = 196/416 (47%), Gaps = 53/416 (12%)

Query: 87  YSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           YS+ L+ T  S  S G   Y + +  GTPP+  +  I DTGS L W  C     C + + 
Sbjct: 173 YSSQLVATLESGVSLGSGEYFMDVFIGTPPKHYS-LILDTGSDLNWIQCVPCIACFEQSG 231

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
           P  DP        K SSS + I C +P+C  +  P+    CK     N+TCP     Y  
Sbjct: 232 PYYDP--------KESSSFENITCHDPRCKLVSSPDPPKPCKD---ENQTCP-----YFY 275

Query: 205 QYG-LGFTAGLLLSETL---------RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRS 251
            YG    T G    ET          +   K V N + GC   +       AG+ G GR 
Sbjct: 276 WYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRG 335

Query: 252 SESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPV 308
             S  SQL       FSYCL+ R   D  VSS L+   G        P L++T F     
Sbjct: 336 PLSFASQLQSIYGHSFSYCLVDRN-SDTSVSSKLIF--GEDKELLSHPNLNFTSFVG--- 389

Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
           G  ++   FYYVG++ I+V  + +KIP        +G GG I+DSG+T T+   P +E +
Sbjct: 390 GEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEII 449

Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
            + F++++  Y     VE    L+PC+++SG + + LP+  + F  GA    P ENYF  
Sbjct: 450 KEAFMKKIKGYEL---VEGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQ 506

Query: 429 VGNEVLCL-ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +  +++CL IL T  +A          I+G++Q QNF++ +D+   R G+A  KC 
Sbjct: 507 IEPDLVCLAILGTPKSA--------LSIIGNYQQQNFHILYDMKKSRLGYAPMKCT 554


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 148/453 (32%), Positives = 199/453 (43%), Gaps = 62/453 (13%)

Query: 43  HYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
           H   H +   L++L   A  S  R   L  +       + + +      ++ P  VH+  
Sbjct: 46  HVDAHGNYSRLQLLQRAARRSHHRMSRLVARA------TGVKAVAGGGDLQVP--VHAGN 97

Query: 103 G-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G + + ++ GTP   S   I DTGS LVW  C     CVDC          P F P  SS
Sbjct: 98  GEFLMDVAIGTPA-LSYAAIVDTGSDLVWTQCKP---CVDCF-----KQSTPVFDPSSSS 148

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   + C +  CS +             P +     +   Y   YG    T G+L SET 
Sbjct: 149 TYATVPCSSALCSDL-------------PTSTCTSASKCGYTYTYGDASSTQGVLASETF 195

Query: 221 RF--PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
                 K +P    GC   ++     Q AG+ G GR   SL SQLGL KFSYCL S   D
Sbjct: 196 TLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS--LD 253

Query: 275 DAPVSSNLVLDTGPGSGDSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
           D    S L+L     +         +  TP  KNP   S     FYYV L  + VGS  +
Sbjct: 254 DGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPS-----FYYVSLTGLTVGSTRI 308

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            +P S      DG GGVIVDSG++ T++E   + A+ K F+ QM   +       + GL 
Sbjct: 309 TLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDG---SEIGLD 365

Query: 393 PCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALG 449
            CF     G   V +P+L+L F GGA + LP ENY  L   +  LCL +        A  
Sbjct: 366 LCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTV--------APS 417

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           RG +II G+FQ QNF   +D+A D   FA  +C
Sbjct: 418 RGLSII-GNFQQQNFQFVYDVAGDTLSFAPVQC 449


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 150/481 (31%), Positives = 217/481 (45%), Gaps = 67/481 (13%)

Query: 27  SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK-------TKPKTK 79
           S  A+ T  LT + T H          +IL     ++LSR    + K         P++ 
Sbjct: 118 SFVASTTRDLTRIQTLHK---------RILEKKNQNALSRLNKEEPKQPVVAPAASPESY 168

Query: 80  DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC 139
            +N  S    + +++ +S+ S G Y + +  GTPP+  +  I DTGS L W  C   Y C
Sbjct: 169 PANGLSGQLMATLESGVSLGS-GEYFMDVFIGTPPRHFS-LILDTGSDLNWIQCVPCYDC 226

Query: 140 VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC 199
              N P  DP        K SSS + IGC +P+C  +  P+    CK     N+TCP   
Sbjct: 227 FVQNGPYYDP--------KESSSFKNIGCHDPRCHLVSSPDPPQPCKA---ENQTCP--- 272

Query: 200 PSYLLQYG-LGFTAGLLLSET----LRFPS-----KTVPNFLAGCSILSD---RQPAGIA 246
             Y   YG    T G    ET    L  P+     K V N + GC   +       AG+ 
Sbjct: 273 --YFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLL 330

Query: 247 GFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
           G GR   S  SQL       FSYCL+ R   D  VSS L+   G        P +++T  
Sbjct: 331 GLGRGPLSFSSQLQSLYGHSFSYCLVDRN-SDTNVSSKLIF--GEDKDLLNHPEVNFTSL 387

Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
                G  +    FYYV ++ I+VG + +KIP        +G GG IVDSG+T ++   P
Sbjct: 388 V---AGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEP 444

Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
            +E +   F++++  Y    D      L PC+++SG + + LPE  + F+ GA    P E
Sbjct: 445 SYEIIKDAFVKKVKGYPVIKDFPI---LDPCYNVSGVEKMELPEFRILFEDGAVWNFPVE 501

Query: 424 NYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           NYF  L   E++CL +           R    I+G++Q QNF++ +D    R G+A  KC
Sbjct: 502 NYFIKLEPEEIVCLAIL-------GTPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKC 554

Query: 483 A 483
           A
Sbjct: 555 A 555


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 149/499 (29%), Positives = 216/499 (43%), Gaps = 68/499 (13%)

Query: 1   MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLA 60
           M  C  S +  ++L+  L  T A   ++  T+   LT     H          + L  +A
Sbjct: 1   MKDCSMSELLAYALIFTLLFTAAATPTAGLTMRADLT-----HVDKGRGFTRWERLSRMA 55

Query: 61  SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
             S +RA  L  +          G +Y   +  T  +V S G Y I  + GTP       
Sbjct: 56  VRSRARAASLYQR----------GGHYGQPVTAT--AVPSSGEYLIHFNIGTPRPQRVAL 103

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
             DTGS LVW  CT    C D  FP  DPS         SS+ + + C +P C    G +
Sbjct: 104 TMDTGSDLVWTQCTPCPVCFDQPFPLFDPSV--------SSTFRAVACPDPICRPSSGLS 155

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--------PSKTVPNFL 231
           V +    C+ +   C      YL  YG    TAG +  +T  F        P   V    
Sbjct: 156 VSA----CALKTFRC-----FYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLA 206

Query: 232 AGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
            GC   +        +GIAGFGR   SLPSQL + +FSYCL S    ++  +S + L T 
Sbjct: 207 FGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHDETESNKTSAVFLGTP 266

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           P    + + G    PF   P+  S +F  FYY+ L  I VG   + +  S      DG+G
Sbjct: 267 PNGLRAHSSG----PFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSG 322

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQ--MGNYSRAADVEKKSGLRPCFDI-SGKKSVY 404
           G ++DSG+  T     +FE +  EF+ Q  +  Y   ++V    G   CF    G K V 
Sbjct: 323 GTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEV----GNLLCFQRPKGGKQVP 378

Query: 405 LPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
           +P+LI      A M LP ENY      + V+CL++   N A   +     +++G+FQ QN
Sbjct: 379 VPKLIFHL-ASADMDLPRENYIPEDTDSGVMCLMI---NGAEVDM-----VLIGNFQQQN 429

Query: 464 FYLEFDLANDRFGFAKQKC 482
            ++ +D+ N +  FA  +C
Sbjct: 430 MHIVYDVENSKLLFASAQC 448


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 188/402 (46%), Gaps = 56/402 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y I +  GTPP+  +  I DTGS L W  C   Y C + N P+ DP +        SS
Sbjct: 179 GEYFIDVFVGTPPKHFS-LILDTGSDLNWIQCVPCYECFEQNGPHYDPGQ--------SS 229

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-----------F 210
           S + IGC + +C  +  P+    CK     N+TCP     Y   YG             F
Sbjct: 230 SYRNIGCHDSRCHLVSSPDPPQPCKA---ENQTCP-----YYYWYGDSSNTTGDFALETF 281

Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKF 264
           T  L +S + +   + V N + GC   +       AG+ G GR   S  SQL       F
Sbjct: 282 TVNLTMS-SGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSF 340

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYCL+ R   DA VSS L+   G        P L++T       G  +    FYYV ++ 
Sbjct: 341 SYCLVDRN-SDANVSSKLIF--GEDKDLLSHPELNFTTLV---AGKENPVDTFYYVQIKS 394

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I+VG + V IP       +DG+GG I+DSG+T ++   P ++ + + F+ ++  Y    D
Sbjct: 395 IVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKD 454

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDNA 443
                 L PC++++G +   LP+  + F  GA    P ENYF  +   EV+CL +     
Sbjct: 455 FPV---LEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAI----- 506

Query: 444 AGPALGRGPAI--ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               LG  P+   I+G++Q QNF++ +D    R GFA  KCA
Sbjct: 507 ----LGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCA 544


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 141/460 (30%), Positives = 207/460 (45%), Gaps = 66/460 (14%)

Query: 54  KILHSLASSSLSRARHLKTKTKPKTKD--------SNIGSNYSNSLIKTPLSVHSYGG-- 103
           +I+     + +SR +  K + + + K          + G+  S  L+ T  S  + G   
Sbjct: 30  RIIEKKNQNDISRLKKDKERPEKQIKTVVATAASPESYGTGLSGQLMATLESGVTLGSGE 89

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + +  GTPP+  +  I DTGS L W  C   + C + N P  DP        K SSS 
Sbjct: 90  YFMDVFIGTPPKHYS-LILDTGSDLNWIQCVPCHDCFEQNGPYYDP--------KESSSF 140

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET--- 219
           + IGC +P+C  +  P+    CK     N+TCP     Y   YG    T G   +ET   
Sbjct: 141 RNIGCHDPRCHLVSSPDPPLPCKA---ENQTCP-----YFYWYGDSSNTTGDFATETFTV 192

Query: 220 -LRFPS-----KTVPNFLAGCSILSDRQPAGIAGFGRSSE---SLPSQLGL---KKFSYC 267
            L  P+     K V N + GC   +     G +G         S  SQL       FSYC
Sbjct: 193 NLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 252

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY---KNPVGSSSAFGEFYYVGLRQ 324
           L+ R   D  VSS L+   G        P L++T      +NPV +      FYYV ++ 
Sbjct: 253 LVDRN-SDTNVSSKLIF--GEDKDLLNHPELNFTTLVGGKENPVDT------FYYVQIKS 303

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I+VG + + IP S     SDG GG IVDSG+T ++   P ++ +   F++++  Y    D
Sbjct: 304 IMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQD 363

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
                 L PC+++SG + + LP+  + F  GA    P ENYF  L   EV+CL +     
Sbjct: 364 FPI---LDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAIL---- 416

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 R    I+G++Q QNF++ +D    R G+A   CA
Sbjct: 417 ---GTPRSALSIIGNYQQQNFHVLYDTKKSRLGYAPMNCA 453


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 133/411 (32%), Positives = 195/411 (47%), Gaps = 43/411 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y++S + G+     + ++ DTGS LVWFPC S + C+ C       S +P     +S 
Sbjct: 74  GDYTLSFNLGSESHKISLYM-DTGSDLVWFPC-SPFECILCEGKPKIQSPLPKIANNKSV 131

Query: 162 SSQLIGCQNPKCSWIFGPNV--ESRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSE 218
           S     C       +   ++   SRC   S     C   +CP +   YG G     L  +
Sbjct: 132 SCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRD 191

Query: 219 TLRFPSK------TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSY 266
           +L  P+        V NF  GC+  +  +P G+AGFGR   S+PSQL         +FSY
Sbjct: 192 SLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSY 251

Query: 267 CLLSRKF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           CL+S  F  D     S L+L    G   +      YT   +NP         FY VGL  
Sbjct: 252 CLVSHSFAADRVRRPSPLIL----GRYYTGETEFIYTSLLENP-----KHPYFYSVGLAG 302

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RAA 383
           I VG+  +  P         G+GGV+VDSG+TFT +   L+E+V  EF  + G  + RA 
Sbjct: 303 ISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRAR 362

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYF--------ALVG--NE 432
            +E+ +GL PC+    + SV +P ++L F G  + + LP +NYF         +VG   +
Sbjct: 363 RIEENTGLSPCYYY--ENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           V CL+L          G GP   LG++Q Q F + +DL  +R GFA+++C+
Sbjct: 421 VGCLMLMNGGDEAELAG-GPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 470


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 129/445 (28%), Positives = 199/445 (44%), Gaps = 53/445 (11%)

Query: 59  LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQA 116
           L  S++ R + ++  + P     +    +S  L+ T  S  S G   Y I +  G+PP+ 
Sbjct: 149 LKKSNVERKKPMEEVSSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKH 208

Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
            +  I DTGS L W  C   + C + N P  DP        K S S + I C +P+C  +
Sbjct: 209 FS-LILDTGSDLNWIQCVPCFDCFEQNGPYYDP--------KDSISFRNITCNDPRCQLV 259

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-----------LGFTAGLLLSETLRFPSK 225
             P+    CK      ++CP     Y   YG             FT  L  S T +   +
Sbjct: 260 SSPDPPRPCKF---ETQSCP-----YFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311

Query: 226 TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVS 279
            V N + GC   +       AG+ G GR   S  SQL       FSYCL+ R   D  VS
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRD-SDTSVS 370

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           S L+   G        P L++T       G  +    FYY+ ++ I VG + ++IP    
Sbjct: 371 SKLIF--GEDKDLLTHPELNFTSLI---AGKENPVDTFYYLQIKSIFVGGEKLQIPEENW 425

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
              +DG GG I+DSG+T ++   P +  + + F+R++  Y    D      L PC+++SG
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDF---PILHPCYNVSG 482

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGD 458
              +  PE +++F  GA    P ENYF  +   +++CL +           +    I+G+
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAML-------GTPKSALSIIGN 535

Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
           +Q QNF++ +D  N R G+A  +CA
Sbjct: 536 YQQQNFHILYDTKNSRLGYAPMRCA 560


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 129/445 (28%), Positives = 199/445 (44%), Gaps = 53/445 (11%)

Query: 59  LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQA 116
           L  S++ R + ++  + P     +    +S  L+ T  S  S G   Y I +  G+PP+ 
Sbjct: 149 LKKSNVERKKPMEEVSSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKH 208

Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
            +  I DTGS L W  C   + C + N P  DP        K S S + I C +P+C  +
Sbjct: 209 FS-LILDTGSDLNWIQCVPCFDCFEQNGPYYDP--------KDSISFRNITCNDPRCQLV 259

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-----------LGFTAGLLLSETLRFPSK 225
             P+    CK      ++CP     Y   YG             FT  L  S T +   +
Sbjct: 260 SSPDPPRPCKF---ETQSCP-----YFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311

Query: 226 TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVS 279
            V N + GC   +       AG+ G GR   S  SQL       FSYCL+ R   D  VS
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRD-SDTSVS 370

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           S L+   G        P L++T       G  +    FYY+ ++ I VG + ++IP    
Sbjct: 371 SKLIF--GEDKDLLTHPELNFTSLI---AGKENPVDTFYYLQIKSIFVGGEKLQIPEENW 425

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
              +DG GG I+DSG+T ++   P +  + + F+R++  Y    D      L PC+++SG
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDF---PILHPCYNVSG 482

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGD 458
              +  PE +++F  GA    P ENYF  +   +++CL +           +    I+G+
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAML-------GTPKSALSIIGN 535

Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
           +Q QNF++ +D  N R G+A  +CA
Sbjct: 536 YQQQNFHILYDTKNSRLGYAPMRCA 560


>gi|118484651|gb|ABK94196.1| unknown [Populus trichocarpa]
          Length = 125

 Score =  160 bits (404), Expect = 2e-36,   Method: Composition-based stats.
 Identities = 70/123 (56%), Positives = 97/123 (78%)

Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMA 419
           ME P++E VAKEF +Q+ +Y+ A +V+ ++GLRPCF+ISG+KSV +PE I  FKGGAKMA
Sbjct: 1   MEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMA 60

Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           LP  NYF+ V + V+CL + +DN +G  +G GPAIILG++Q +NF++EFDL N+RFGF +
Sbjct: 61  LPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQ 120

Query: 480 QKC 482
           Q C
Sbjct: 121 QNC 123


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 147/439 (33%), Positives = 203/439 (46%), Gaps = 67/439 (15%)

Query: 62  SSLSRARHLKTKTKPKTKDSNI----GSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQA 116
           + L R +H   + K + +  N      S+  +S  +    +H+  G Y I L+ GTPP  
Sbjct: 61  TKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPP-V 119

Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
           S P + DTGS L+W  C    RC         P+  P F PK+SSS   + C +  CS +
Sbjct: 120 SYPAVLDTGSDLIWTQCKPCTRCYK------QPT--PIFDPKKSSSFSKVSCGSSLCSAL 171

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK----TVPNFL 231
                          + TC   C  Y+  YG    T G+L +ET  F       +V N  
Sbjct: 172 --------------PSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIG 216

Query: 232 AGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
            GC   ++     Q +G+ G GR   SL SQL  ++FSYCL     DD    S L+L + 
Sbjct: 217 FGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQRFSYCL--TPIDDTK-ESVLLLGSL 273

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
               D+K   +  TP  KNP+  S     FYY+ L  I VG   + I  S    G DGNG
Sbjct: 274 GKVKDAKE--VVTTPLLKNPLQPS-----FYYLSLEAISVGDTRLSIEKSTFEVGDDGNG 326

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLP 406
           GVI+DSG+T T+++   +EA+ KEFI Q      A D    +GL  CF + SG   V +P
Sbjct: 327 GVIIDSGTTITYVQQKAYEALKKEFISQT---KLALDKTSSTGLDLCFSLPSGSTQVEIP 383

Query: 407 ELILKFKGGAKMALPPENYFALVGNE---VLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
           +L+  FKGG  + LP ENY  ++G+    V CL      A G + G     I G+ Q QN
Sbjct: 384 KLVFHFKGG-DLELPAENY--MIGDSNLGVACL------AMGASSGMS---IFGNVQQQN 431

Query: 464 FYLEFDLANDRFGFAKQKC 482
             +  DL  +   F    C
Sbjct: 432 ILVNHDLEKETISFVPTSC 450


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 135/405 (33%), Positives = 188/405 (46%), Gaps = 60/405 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           +K P+ V   G + + L+ G+PP++ +  I DTGS L+W  C    +C D          
Sbjct: 355 VKAPV-VAGNGEFLMKLAIGSPPRSFSA-IMDTGSDLIWTQCKPCQQCFD--------QS 404

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
            P F PK+SSS   I C +  C  +  P       GC             YL  YG    
Sbjct: 405 TPIFDPKQSSSFYKISCSSELCGAL--PTSTCSSDGCE------------YLYTYGDSSS 450

Query: 211 TAGLLLSETLRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGL 261
           T G+L  ET  F   T     +P    GC   ++     Q AG+ G GR   SL SQL  
Sbjct: 451 TQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKE 510

Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           +KF+YCL +   DD+  SS L+      +  +    +  TP  KNP   S     FYY+ 
Sbjct: 511 QKFAYCLTA--IDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS-----FYYLS 563

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L+ I VG   + IP S      DG+GGVI+DSG+T T++E   F ++  EFI QM   + 
Sbjct: 564 LQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM---NL 620

Query: 382 AADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLI 437
             D     GL  CF++ +G   V +P+L   FK GA + LP ENY  ++G+    +LCL 
Sbjct: 621 PVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENY--MIGDSKAGLLCL- 676

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                A G + G     I G+ Q QNF +  DL  +   F   +C
Sbjct: 677 -----AIGSSRGMS---IFGNLQQQNFMVVHDLQEETLSFLPTQC 713


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 137/396 (34%), Positives = 182/396 (45%), Gaps = 53/396 (13%)

Query: 98  VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           VH+  G + + +S GTP  A    I DTGS LVW  C     CV+C   +      P F 
Sbjct: 95  VHAGNGEFLMDMSIGTPAVAYAAII-DTGSDLVWTQCKP---CVECFNQST-----PVFD 145

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           P  SS+   + C +  CS +             P +K     C  Y   YG    T G+L
Sbjct: 146 PSSSSTYAALPCSSTLCSDL-------------PSSKCTSAKC-GYTYTYGDSSSTQGVL 191

Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
            +ET       +P+   GC   ++     Q AG+ G GR   SL SQLGL KFSYCL S 
Sbjct: 192 AAETFTLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTS- 250

Query: 272 KFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
             DD   S  L+          +    +  TP  +NP   S     FYYV L+ + VGS 
Sbjct: 251 -LDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPS-----FYYVNLKGLTVGST 304

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
           H+ +P S      DG GGVIVDSG++ T++E   + A+ K F  QM     AAD     G
Sbjct: 305 HITLPSSAFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQM--KLPAAD-GSGIG 361

Query: 391 LRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPA 447
           L  CF+   SG   V +P+L+     GA + LP ENY  L  G+  LCL +         
Sbjct: 362 LDTCFEAPASGVDQVEVPKLVFHLD-GADLDLPAENYMVLDSGSGALCLTVMGS------ 414

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             RG +II G+FQ QN    +D+  +   FA  +CA
Sbjct: 415 --RGLSII-GNFQQQNIQFVYDVGENTLSFAPVQCA 447


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 135/405 (33%), Positives = 188/405 (46%), Gaps = 60/405 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           +K P+ V   G + + L+ G+PP++ +  I DTGS L+W  C    +C D          
Sbjct: 100 VKAPV-VAGNGEFLMKLAIGSPPRSFSA-IMDTGSDLIWTQCKPCQQCFD--------QS 149

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
            P F PK+SSS   I C +  C  +  P       GC             YL  YG    
Sbjct: 150 TPIFDPKQSSSFYKISCSSELCGAL--PTSTCSSDGCE------------YLYTYGDSSS 195

Query: 211 TAGLLLSETLRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGL 261
           T G+L  ET  F   T     +P    GC   ++     Q AG+ G GR   SL SQL  
Sbjct: 196 TQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKE 255

Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           +KF+YCL +   DD+  SS L+      +  +    +  TP  KNP   S     FYY+ 
Sbjct: 256 QKFAYCLTA--IDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS-----FYYLS 308

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L+ I VG   + IP S      DG+GGVI+DSG+T T++E   F ++  EFI QM   + 
Sbjct: 309 LQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM---NL 365

Query: 382 AADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLI 437
             D     GL  CF++ +G   V +P+L   FK GA + LP ENY  ++G+    +LCL 
Sbjct: 366 PVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENY--MIGDSKAGLLCL- 421

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                A G + G     I G+ Q QNF +  DL  +   F   +C
Sbjct: 422 -----AIGSSRGMS---IFGNLQQQNFMVVHDLQEETLSFLPTQC 458


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 136/396 (34%), Positives = 182/396 (45%), Gaps = 45/396 (11%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y++++S GTPP    P I DTGS+L+W  C    RC    FP   P+  P   P RSS
Sbjct: 89  GAYNMNISLGTPP-LDFPVIVDTGSNLIWAQCAPCTRC----FPR--PTPAPVLQPARSS 141

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           +   + C    C ++      SR     PR      AC +Y   YG G+TAG L +ETL 
Sbjct: 142 TFSRLPCNGSFCQYL---PTSSR-----PRTCNATAAC-AYNYTYGSGYTAGYLATETLT 192

Query: 222 FPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
               T P    GCS  +     +GI G GR   SL SQL + +FSYCL S   D    +S
Sbjct: 193 VGDGTFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGG--AS 250

Query: 281 NLVLDTGPGSGDSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
            ++     GS    T G  +  TP  KNP    S     YYV L  I V S  + +  S 
Sbjct: 251 PILF----GSLAKLTEGSVVQSTPLLKNPYLQRSTH---YYVNLTGIAVDSTELPVTGST 303

Query: 339 LVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFD 396
                 G  GG IVDSG+T T++    +  V + F  QM N ++          L  C+ 
Sbjct: 304 FGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYK 363

Query: 397 IS---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPA 447
            S   G K+V +P L L+F GGAK  +P +NYFA V  +      V CL++       PA
Sbjct: 364 PSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVL------PA 417

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               P  I+G+    + +L +D+    F FA   CA
Sbjct: 418 TDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCA 453


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 140/400 (35%), Positives = 185/400 (46%), Gaps = 51/400 (12%)

Query: 98  VHSYGG-YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
           VH+  G + + LS GTP  P A+   I DTGS LVW  C     CV+C F        P 
Sbjct: 109 VHAGNGEFLMDLSVGTPALPYAA---IVDTGSDLVWTQCKP---CVEC-FNQT----TPV 157

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAG 213
           F P  SS+   + C +  C+ +      S     S  +         Y   YG    T G
Sbjct: 158 FDPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPC------GYTYTYGDASSTQG 211

Query: 214 LLLSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
           +L +ET     + VP    GC   ++     Q AG+ G GR   SL SQLG+ +FSYCL 
Sbjct: 212 VLATETFTLARQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLT 271

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
           S   DDA   S L+L +  G   S        TP  KNP   S     FYYV L  + VG
Sbjct: 272 S--LDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPS-----FYYVSLTGLTVG 324

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
           S  + +P S      DG GGVIVDSG++ T++E   + A+ K F+  M   +  A    +
Sbjct: 325 STRLALPSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDA---SE 381

Query: 389 SGLRPCFD-----ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDN 442
            GL  CF      +     V +P+L+L F GGA + LP ENY  L   +  LCL +    
Sbjct: 382 IGLDLCFQGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMAS- 440

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                  RG +II G+FQ QNF   +D+A D   FA  +C
Sbjct: 441 -------RGLSII-GNFQQQNFQFVYDVAGDTLSFAPAEC 472


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 133/394 (33%), Positives = 182/394 (46%), Gaps = 41/394 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y++++S GTPP    P I DTGS+L+W  C    RC    FP   P+  P   P RSS
Sbjct: 89  GAYNMNISLGTPP-LDFPVIVDTGSNLIWAQCAPCTRC----FPR--PTPAPVLQPARSS 141

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           +   + C    C ++      SR     PR      AC +Y   YG G+TAG L +ETL 
Sbjct: 142 TFSRLPCNGSFCQYL---PTSSR-----PRTCNATAAC-AYNYTYGSGYTAGYLATETLT 192

Query: 222 FPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
               T P    GCS  +     +GI G GR   SL SQL + +FSYCL S   D    +S
Sbjct: 193 VGDGTFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGG--AS 250

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            ++   G  +  ++   +  TP  KNP    S     YYV L  I V S  + +  S   
Sbjct: 251 PILF--GSLAKLTERSVVQSTPLLKNPYLQRSTH---YYVNLTGIAVDSTELPVTGSTFG 305

Query: 341 PGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFDIS 398
               G  GG IVDSG+T T++    +  V + F  QM N ++          L  C+  S
Sbjct: 306 FTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPS 365

Query: 399 ---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPALG 449
              G K+V +P L L+F GGAK  +P +NYFA V  +      V CL++       PA  
Sbjct: 366 AGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVL------PATD 419

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             P  I+G+    + +L +D+    F FA   CA
Sbjct: 420 DLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCA 453


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 146/441 (33%), Positives = 204/441 (46%), Gaps = 72/441 (16%)

Query: 62  SSLSRARHLKTKTKPKTKDSN---IGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQAS 117
           + L R +H   + K + +  N   + ++  +S  +    +H+  G Y + L+ GTPP  S
Sbjct: 62  TKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGEYLMELAIGTPP-VS 120

Query: 118 TPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
            P + DTGS L+W    PCT  Y+          P+  P F PK+SSS   + C +  CS
Sbjct: 121 YPAVLDTGSDLIWTQCKPCTQCYK---------QPT--PIFDPKKSSSFSKVSCGSSLCS 169

Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK----TVPN 229
            +               + TC   C  Y+  YG    T G+L +ET  F       +V N
Sbjct: 170 AV--------------PSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214

Query: 230 FLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
              GC   ++     Q +G+ G GR   SL SQL   +FSYCL     DD    S L+L 
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCL--TPMDDTK-ESILLLG 271

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
           +     D+K   +  TP  KNP+  S     FYY+ L  I VG   + I  S    G DG
Sbjct: 272 SLGKVKDAKE--VVTTPLLKNPLQPS-----FYYLSLEGISVGDTRLSIEKSTFEVGDDG 324

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVY 404
           NGGVI+DSG+T T++E   FEA+ KEFI Q        D    +GL  CF + SG   V 
Sbjct: 325 NGGVIIDSGTTITYIEQKAFEALKKEFISQT---KLPLDKTSSTGLDLCFSLPSGSTQVE 381

Query: 405 LPELILKFKGGAKMALPPENYFALVGNE---VLCLILFTDNAAGPALGRGPAIILGDFQL 461
           +P+++  FKGG  + LP ENY  ++G+    V CL      A G + G     I G+ Q 
Sbjct: 382 IPKIVFHFKGG-DLELPAENY--MIGDSNLGVACL------AMGASSGMS---IFGNVQQ 429

Query: 462 QNFYLEFDLANDRFGFAKQKC 482
           QN  +  DL  +   F    C
Sbjct: 430 QNILVNHDLEKETISFVPTSC 450


>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 441

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 140/437 (32%), Positives = 201/437 (45%), Gaps = 59/437 (13%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
           I  P++ ++  GY +SL+ GTPPQ    ++ DTGS L W PC   + Y+C++C   +   
Sbjct: 14  IIEPIATYT-DGYLLSLNLGTPPQVFQVYL-DTGSDLTWVPCGTNTSYQCLECGNEHSIS 71

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFG-PNVESRCK--GCSP---RNKTCPLACPSYL 203
              PAF   +S SS    C +  C  +    N    C   GCS     +  C   CP + 
Sbjct: 72  KPTPAFSLSQSYSSTRDLCGSRFCVDVHSSDNSHDACAAAGCSIPVFMSGLCTRLCPPFA 131

Query: 204 LQYG-LGFTAGLLLSETLRFPSKT--------VPNFLAGCSILSDRQPAGIAGFGRSSES 254
             YG      G L  +T+               P F  GC   S R+P GIAGFG+   S
Sbjct: 132 YTYGGRALVLGSLARDTIALHGSIYGISVPIEFPGFCFGCVGSSIREPIGIAGFGKGKLS 191

Query: 255 LPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGD---SKTPGLSYTPFYKNPV 308
           LPSQLG   K FS+C L   F   P ++S +V+      GD   S   G  +TP  K   
Sbjct: 192 LPSQLGFLDKGFSHCFLGFWFARNPNITSPMVI------GDLALSVKDGFLFTPMLK--- 242

Query: 309 GSSSAFGEFYYVGLRQIIVGSK-HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
             S  +  FYY+GL  + +G    +  P S     S+GNGGVIVD+G+T+T +  P F A
Sbjct: 243 --SLTYPNFYYIGLEGVTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDP-FYA 299

Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPE 423
                +     Y+R+ ++E ++G   C  +    +      LP + +   G   +ALP E
Sbjct: 300 SVLSSLSSTVPYNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKE 359

Query: 424 N-YFALVG--NEVL--CL---------ILFTDNAAGPAL---GRGPAIILGDFQLQNFYL 466
           + Y+A+    N V+  CL         +   DN  G        GPA +LG FQ+QN  +
Sbjct: 360 SCYYAVTAPRNSVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEV 419

Query: 467 EFDLANDRFGFAKQKCA 483
            +DL + R GF  + CA
Sbjct: 420 VYDLESGRVGFQPRDCA 436


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 133/393 (33%), Positives = 186/393 (47%), Gaps = 57/393 (14%)

Query: 98  VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           VH+  G + + L+ GTP +  +  I DTGS L+W  C     C D       P+  P F 
Sbjct: 90  VHAGNGEFLMKLAIGTPAETYSA-IMDTGSDLIWTQCKPCKDCFD------QPT--PIFD 140

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           PK+SSS   + C +  C+ +    + S   GC             YL  YG    T G+L
Sbjct: 141 PKKSSSFSKLPCSSDLCAAL---PISSCSDGCE------------YLYSYGDYSSTQGVL 185

Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
            +ET  F   +V     GC   +D     Q AG+ G GR   SL SQLG  KFSYCL S 
Sbjct: 186 ATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLGEPKFSYCLTS- 244

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
             DD+   S+L++ +     ++ T     TP  +NP   S     FYY+ L  I VG   
Sbjct: 245 -MDDSKGISSLLVGSEATMKNAIT-----TPLIQNPSQPS-----FYYLSLEGISVGDTL 293

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + I  S     +DG+GG+I+DSG+T T++E   F A+ KEFI Q+       D    +GL
Sbjct: 294 LPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQL---KLDVDESGSTGL 350

Query: 392 RPCFDISGKKS-VYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
             CF +    S V +P+L+  F+ GA + LP ENY  A  G  V+CL + + +       
Sbjct: 351 DLCFTLPPDASTVDVPQLVFHFE-GADLKLPAENYIIADSGLGVICLTMGSSSGMS---- 405

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I G+FQ QN  +  DL  +   FA  +C
Sbjct: 406 -----IFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 130/412 (31%), Positives = 193/412 (46%), Gaps = 73/412 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L  GTP       I DTGS + W  C     C DC      P+  P F P+ SSS 
Sbjct: 139 YYVPLQVGTP-AVEVVLIMDTGSDVSWIQCVP---CKDCV-----PALRPPFNPRHSSSF 189

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
             + C +  C+ ++    +     CSP  +TC  +     +QYG G  ++GLL  ET+  
Sbjct: 190 FKLPCASSTCTNVY----QGVKPFCSPSGRTCLFS-----IQYGDGSLSSGLLAMETI-- 238

Query: 223 PSKTVPNF-----------LAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG---LKK 263
            +   PNF             GC+ + DR+      +G+ G  R   S PSQL     +K
Sbjct: 239 -AGNTPNFGDGEPVKLSNITLGCADI-DREGLPTGASGLLGMDRRPISFPSQLSSRYARK 296

Query: 264 FSYCLLSRKFDDAPV---SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           FS+C     F D      SS LV     G  D  +P L YTP  +NP   S++  ++YYV
Sbjct: 297 FSHC-----FPDKIAHLNSSGLVFF---GESDIISPYLRYTPLVQNPAVPSASL-DYYYV 347

Query: 321 GLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           GL  I V    + + + ++ +    G+GG I+DSG+ FT+++ P F+A+ +EF   +   
Sbjct: 348 GLVGISVDESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF---LART 404

Query: 380 SRAADVEKKSGLRPCFDIS----GKKSVYLPELILKFKGGAKMALPPENYFALVGNE--- 432
           S  A V+  SG  PC++I+      +S  LP + L F+GG  + LP  +    V +    
Sbjct: 405 SHLAKVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQ 464

Query: 433 -VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             LCL            G  P  I+G++Q QN ++E+DL   R G A  +CA
Sbjct: 465 TTLCLAFLMS-------GDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQCA 509


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 119/410 (29%), Positives = 194/410 (47%), Gaps = 48/410 (11%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPS 150
           +  P+       Y      G PPQ +   I DTGS+L+W  C+  R  C   N P  DPS
Sbjct: 59  VTAPIHWGGQSQYIAEYLIGDPPQRAEAII-DTGSNLIWTQCSRCRPTCFRQNLPYYDPS 117

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
           R        S +++ +GC +  C+   G   +     C   NKTC     + +  YG G 
Sbjct: 118 R--------SRAARAVGCNDAACA--LGSETQ-----CLSDNKTC-----AVVTGYGAGN 157

Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQLGLKKF 264
            AG L +E L F S+TV + + GC +++   P      +GI G GR   SLPSQLG  +F
Sbjct: 158 IAGTLATENLTFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRF 216

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPG--SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           SYC L+  F+D    S++V+    G  +G + +  ++  PF ++P  S   F  FYY+ L
Sbjct: 217 SYC-LTPYFEDTIEPSHMVVGASAGLINGSASSTPVTTVPFVRSP--SDDPFSTFYYLPL 273

Query: 323 RQIIVGSKHVKIPYS-----YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
             I  G   + +P +      + PG     G  +DSG+  T +    ++A+  E  RQ+G
Sbjct: 274 TGITAGKVKLAVPSAAFDLRQVAPGM--WTGTFIDSGAPLTSLVDVAYQALRAELARQLG 331

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA----KMALPPENYFALVGNEV 433
             +    +   +G   C  +   + + +P L+L F GG+     + +PP NY+A V +  
Sbjct: 332 A-ALVQPLAGTTGFDLCVALKDAERL-VPPLVLHFGGGSGTGTDLVVPPANYWAPVDSAT 389

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            C+++F+ +    +L      ++G++  QN ++ +DLA     F    C+
Sbjct: 390 ACMVVFS-SVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADCS 438


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 130/396 (32%), Positives = 186/396 (46%), Gaps = 54/396 (13%)

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           +  GGY++++S GTP   + P + DTGS L+W  C    +C         P+  P F P 
Sbjct: 81  NGVGGYNMNISVGTP-LLTFPVVADTGSDLIWTQCAPCTKCFQ------QPA--PPFQPA 131

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            SS+   + C +  C ++  PN            +TC      Y  +YG G+TAG L +E
Sbjct: 132 SSSTFSKLPCTSSFCQFL--PN----------SIRTCNATGCVYNYKYGSGYTAGYLATE 179

Query: 219 TLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
           TL+    + P+   GCS  +      +GIAG GR + SL  QLG+ +FSYCL  R    A
Sbjct: 180 TLKVGDASFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCL--RSGSAA 237

Query: 277 PVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
             S  L      GS  + T G +  TPF  NP    S    +YYV L  I VG   + + 
Sbjct: 238 GASPILF-----GSLANLTDGNVQSTPFVNNPAVHPS----YYYVNLTGITVGETDLPVT 288

Query: 336 YSYLVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
            S      +G  GG IVDSG+T T++    +E V + F+ Q  N +    V    GL  C
Sbjct: 289 TSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTT---VNGTRGLDLC 345

Query: 395 F-DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPA 447
           F    G   + +P L+L+F GGA+ A+P   YFA V  +      V CL++       PA
Sbjct: 346 FKSTGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMML------PA 397

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            G  P  ++G+    + +L +DL    F F+   CA
Sbjct: 398 KGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADCA 433


>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
          Length = 503

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 135/415 (32%), Positives = 186/415 (44%), Gaps = 47/415 (11%)

Query: 104 YSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y++SLS G P  A+ P     DTGS LVWFPC   + C+ C      P+           
Sbjct: 90  YTLSLSVG-PASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEG---KPTPGRLGPLPPPP 144

Query: 162 SSQLIGCQNPKCSWIFGPN------VESRCKGCSPRNKTC--PLACPSYLLQYGLGFTAG 213
            S+ I C +P CS              +RC        +C    ACP     YG G    
Sbjct: 145 DSRRIPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVA 204

Query: 214 LLLSETLRFPSKT-------VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---K 263
            L    +   +         V NF   C+  +  +P G+AGFGR   SLP QL  +   +
Sbjct: 205 HLRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGR 264

Query: 264 FSYCLLSRKF--DDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYY 319
           FSYCL+S  F  D     S L+L   P    +  +T G  YTP   NP         FY 
Sbjct: 265 FSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNP-----KHPYFYS 319

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           V L  + VG+  ++           GNGG++VDSG+TFT +   ++  VA+ F R M   
Sbjct: 320 VALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAA 379

Query: 380 SRAA--DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-------ALVG 430
             A     E+++GL PC+  +      +P L L F+G A +ALP  NYF       A  G
Sbjct: 380 GFARAERAEEQTGLTPCYRYAASDR-GVPPLALHFRGNATVALPRRNYFMGFKSEDAGAG 438

Query: 431 ---NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              ++V CL+L     A    G GPA  LG+FQ Q F + +D+   R GFA+++C
Sbjct: 439 TRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 128/402 (31%), Positives = 182/402 (45%), Gaps = 55/402 (13%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L   S G Y + L+ GTPP   T  + DTGS L+W  C     C D           P F
Sbjct: 84  LVAASQGEYLMDLAIGTPPLRYTAMV-DTGSDLIWTQCAPCVLCAD--------QPTPYF 134

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
            P RS++ +L+ C++P C+ +  P    R          C      Y   YG    TAG+
Sbjct: 135 RPARSATYRLVPCRSPLCAALPYPACFQR--------SVC-----VYQYYYGDEASTAGV 181

Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
           L SET  F +       V +   GC  ++  Q A   G+ G GR   SL SQLG  +FSY
Sbjct: 182 LASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSY 241

Query: 267 CLLSRKFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           CL S     +P  S L   V  T  G+  S     S +P    P+  ++A    Y++ L+
Sbjct: 242 CLTSFL---SPEPSRLNFGVFATLNGTNASS----SGSPVQSTPLVVNAALPSLYFMSLK 294

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I +G K + I         DG GGV +DSG++ T+++   ++AV +E +  +       
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTN 354

Query: 384 DVEKKSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFT 440
           D E   GL  CF      S  V +P++ L F GGA M +PPENY  + G    LCL +  
Sbjct: 355 DTEI--GLETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIR 412

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                     G A I+G++Q QN ++ +D+AN    F    C
Sbjct: 413 S---------GDATIIGNYQQQNMHILYDIANSLLSFVPAPC 445


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 143/447 (31%), Positives = 207/447 (46%), Gaps = 61/447 (13%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
           L H DSD            + RA H     + +  ++ + +  SN+ I +P+ +   G +
Sbjct: 47  LKHVDSDKNLTKFQRIQHGIKRANH-----RLERLNAMVLAASSNAEINSPV-LSGNGEF 100

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
            ++L+ GTPP+  +  I DTGS L+W  C    +C D       PS  P F PK+SSS  
Sbjct: 101 LMNLAIGTPPETYSA-IMDTGSDLIWTQCKPCTQCFD------QPS--PIFDPKKSSSFS 151

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFP 223
            + C +  C  +             P++ +C  +C  YL  YG    T G + +ET  F 
Sbjct: 152 KLSCSSQLCKAL-------------PQS-SCSDSC-EYLYTYGDYSSTQGTMATETFTFG 196

Query: 224 SKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
             ++PN   GC   ++     Q +G+ G GR   SL SQL   KFSYCL S   DD   S
Sbjct: 197 KVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTS--IDDTKTS 254

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           + L+      +G S    +  TP  +NP+  S     FYY+ L  I VG   + I  S  
Sbjct: 255 TLLMGSLASVNGTSA--AIRTTPLIQNPLQPS-----FYYLSLEGISVGGTRLPIKESTF 307

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-S 398
               DG GG+I+DSG+T T++E   F+ V KEF  QMG      D    +GL  C+++ S
Sbjct: 308 QLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMG---LPVDNSGATGLELCYNLPS 364

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALG-RGPAIIL 456
               + +P+L+L F  GA + LP ENY  A     V+CL          A+G  G   I 
Sbjct: 365 DTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVICL----------AMGSSGGMSIF 413

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G+ Q QN ++  DL  +   F    C 
Sbjct: 414 GNVQQQNMFVSHDLEKETLSFLPTNCG 440


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 131/412 (31%), Positives = 194/412 (47%), Gaps = 73/412 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L  GTP       I DTGS + W  C     C DC      P+  P F P+ SSS 
Sbjct: 138 YYVPLQLGTP-AVEVVLIMDTGSDVSWIQCVP---CKDCV-----PALRPPFNPRHSSSF 188

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
             + C +  C+ ++    +     CSP  +TC  +     +QYG G  ++GLL  ET+  
Sbjct: 189 FKLPCASSTCTNVY----QGVKPFCSPSGRTCLFS-----IQYGDGSLSSGLLAMETI-- 237

Query: 223 PSKTVPNF-----------LAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG---LKK 263
            +   PNF             GC+ + DR+      +G+ G  R   S PSQL     +K
Sbjct: 238 -AGNTPNFGDGEPVKLSNITLGCADI-DREGLPTGASGLLGMDRRPISFPSQLSSRYARK 295

Query: 264 FSYCLLSRKFDDAPV---SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           FS+C     F D      SS LV     G  D  +P L YTP  +NP   S++  ++YYV
Sbjct: 296 FSHC-----FPDKIAHLNSSGLVFF---GESDIISPYLRYTPLVQNPAVPSASL-DYYYV 346

Query: 321 GLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           GL  I V    + + + ++ +    G+GG I+DSG+ FT+++ P F+A+ +EF   +   
Sbjct: 347 GLVGISVDESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF---LART 403

Query: 380 SRAADVEKKSGLRPCFDIS----GKKSVYLPELILKFKGGAKMALPPENYFALVGNE--- 432
           S  A V+  SG  PC++I+      +S  LP + L F+GG  + LP  +    V +    
Sbjct: 404 SHLAKVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQ 463

Query: 433 -VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             LCL       A    G  P  I+G++Q QN ++E+DL   R G A  +CA
Sbjct: 464 TTLCL-------AFQMSGDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQCA 508


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 144/487 (29%), Positives = 212/487 (43%), Gaps = 52/487 (10%)

Query: 12  FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
           FS+L++L  T   A  +AA V V LT +          +DP          +L R  H  
Sbjct: 4   FSVLLILACTIL-ASDAAAAVRVGLTRI---------HADPEVTASEFVRGALRRDMHRH 53

Query: 72  TK-TKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
            +  + +   S+  +        T   + + G Y ++LS GTPP  S   I DTGS L+W
Sbjct: 54  ARFAREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPL-SYRAIADTGSDLIW 112

Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNP--KCSWIFGPNVESRCKGC 188
             C      V              + P  S++  ++ C +P   C+ + GP+    C   
Sbjct: 113 TQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGC--- 169

Query: 189 SPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGCSILSDRQ- 241
                    AC  Y   YG G+TAG+   ET  F      P+  VPN   GCS  S    
Sbjct: 170 ---------AC-MYNQTYGTGWTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDW 219

Query: 242 --PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS 299
              AG+ G GR S SL SQLG   FSYCL    F DA  +S L+L     +    T  + 
Sbjct: 220 NGSAGLVGLGRGSMSLVSQLGAGAFSYCL--TPFQDANSTSTLLLGPSAAAALKGTGPVR 277

Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
            TPF   P  S +    +YY+ L  I VG   + IP       +DG GG+I+DSG+T T 
Sbjct: 278 STPFVAGP--SKAPMSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITT 335

Query: 360 MEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLRPCFDISGKK-SVYLPELILKFKGGA 416
           +    ++ V +  +R +       A   +  +GL  CF +        +P + L F+GGA
Sbjct: 336 LVDSAYQQV-RAAVRSLLVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGA 394

Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
            M LP ENY  ++G+ V CL +            G   ++G++Q QN ++ +D+  +   
Sbjct: 395 DMVLPVENYM-ILGSGVWCLAMRNQTV-------GAMSMVGNYQQQNIHVLYDVRKETLS 446

Query: 477 FAKQKCA 483
           FA   C+
Sbjct: 447 FAPAVCS 453


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 134/397 (33%), Positives = 181/397 (45%), Gaps = 46/397 (11%)

Query: 104 YSISLSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y++SLS G P  AS+  +F DTGS LVWFPC + + C+ C           + +P     
Sbjct: 88  YTLSLSVGPPSTASSVSLFLDTGSDLVWFPC-APFTCMLCEGKATPGGNHSSPLPP-PID 145

Query: 163 SQLIGCQNPKCSWIF--GPNVE----SRCKGCSPRNKTCP-LACPSYLLQYGLG-FTAGL 214
           S+ I C +P CS      P  +    +RC   +    +C   ACP     YG G   A L
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205

Query: 215 LLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
                    S  V NF   C+  +  +P G+AGFGR   SLP+QL               
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQL--------------- 250

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            AP  S        GS D+   G S T F   P+  +     FY V L  + VG K ++ 
Sbjct: 251 -APSLS--------GSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQA 301

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA--DVEKKSGLR 392
                    DGNGG++VDSG+TFT +    F  VA EF R M           E ++GL 
Sbjct: 302 QPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLA 361

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFT---DNAAG 445
           PC+  S      +P + L F+G A +ALP  NYF    +  G  V CL+L     +N  G
Sbjct: 362 PCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDG 420

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              G GPA  LG+FQ Q F + +D+   R GFA+++C
Sbjct: 421 ED-GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 456


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 128/402 (31%), Positives = 181/402 (45%), Gaps = 55/402 (13%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L   S G Y + L+ GTPP   T  + DTGS L+W  C     C D           P F
Sbjct: 84  LVAASQGEYLMDLAIGTPPLRYTAMV-DTGSDLIWTQCAPCVLCAD--------QPTPYF 134

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
            P RS++ +L+ C++P C+ +  P    R          C      Y   YG    TAG+
Sbjct: 135 RPARSATYRLVPCRSPLCAALPYPACFQR--------SVC-----VYQYYYGDEASTAGV 181

Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
           L SET  F +       V +   GC  ++  Q A   G+ G GR   SL SQLG  +FSY
Sbjct: 182 LASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSY 241

Query: 267 CLLSRKFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           CL S     +P  S L   V  T  G+  S     S +P    P+  ++A    Y++ L+
Sbjct: 242 CLTSFL---SPEPSRLNFGVFATLNGTNASS----SGSPVQSTPLVVNAALPSLYFMSLK 294

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I +G K + I         DG GGV +DSG++ T+++   ++AV  E +  +       
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTN 354

Query: 384 DVEKKSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFT 440
           D E   GL  CF      S  V +P++ L F GGA M +PPENY  + G    LCL +  
Sbjct: 355 DTEI--GLETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIR 412

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                     G A I+G++Q QN ++ +D+AN    F    C
Sbjct: 413 S---------GDATIIGNYQQQNMHILYDIANSLLSFVPAPC 445


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 128/402 (31%), Positives = 188/402 (46%), Gaps = 54/402 (13%)

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           +S G Y+++LS GTPP  +   + DTGSSL+W  C     C +C      P+  P F P 
Sbjct: 85  NSAGAYNMNLSIGTPP-VTFSVLADTGSSLIWTQCAP---CTECA---ARPA--PPFQPA 135

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            SS+   + C +  C ++  P +     GC             Y   YG+GFTAG L +E
Sbjct: 136 SSSTFSKLPCASSLCQFLTSPYLTCNATGCV------------YYYPYGMGFTAGYLATE 183

Query: 219 TLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
           TL     + P    GCS  +      +GI G GRS  SL SQ+G+ +FSYCL S    DA
Sbjct: 184 TLHVGGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS----DA 239

Query: 277 PVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
               + +L    GS    T G +  TP  +NP   SS+   +YYV L  I VG+  + + 
Sbjct: 240 DAGDSPILF---GSLAKVTGGNVQSTPLLENPEMPSSS---YYYVNLTGITVGATDLPVT 293

Query: 336 YSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE-KKSG 390
            +        G+   GG IVDSG+T T++    +  V + F+ QM   +    V   + G
Sbjct: 294 STTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG 353

Query: 391 LRPCFDIS---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTD 441
              CFD +   G   V +P L+L+F GGA+ A+   +Y  +V  +      V CL++   
Sbjct: 354 FDLCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVL-- 411

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               PA  +    I+G+    + ++ +DL    F FA   CA
Sbjct: 412 ----PASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 449


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 136/416 (32%), Positives = 186/416 (44%), Gaps = 48/416 (11%)

Query: 104 YSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y++SLS G P  A+ P     DTGS LVWFPC   + C+ C      P R     P    
Sbjct: 90  YTLSLSVG-PASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEG-KPTPGRSGPLPPP--P 144

Query: 162 SSQLIGCQNPKCSWIFGPN------VESRCKGCSPRNKTC--PLACPSYLLQYGLGFTAG 213
            S+ I C +P CS              +RC        +C    ACP     YG G    
Sbjct: 145 DSRRIPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVA 204

Query: 214 LLLSETLRFPSKT-------VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---K 263
            L    +   +         V NF   C+  +  +P G+AGFGR   SLP QL  +   +
Sbjct: 205 HLRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGR 264

Query: 264 FSYCLLSRKF--DDAPVSSNLVLDTGPGSGDS---KTPGLSYTPFYKNPVGSSSAFGEFY 318
           FSYCL+S  F  D     S L+L   P   D+   +T G  YTP   NP         FY
Sbjct: 265 FSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNP-----KHPYFY 319

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            V L  + VG+  ++           GNGG++VDSG+TFT +   ++  VA+ F R M  
Sbjct: 320 SVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAA 379

Query: 379 YSRAA--DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------- 426
              A     E+++GL PC+  +      +P L L F+G A +ALP  NYF          
Sbjct: 380 AGFARAERAEEQTGLTPCYRYAASDR-GVPPLALHFRGNATVALPRRNYFMGFKSEDAGA 438

Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               ++V CL+L     A    G GPA  LG+FQ Q F + +D+   R GFA+++C
Sbjct: 439 GTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 135/414 (32%), Positives = 187/414 (45%), Gaps = 44/414 (10%)

Query: 104 YSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCN---FPNVDPSRIPAFIPK 158
           Y++SLS G P  A+ P     DTGS LVWFPC   + C+ C     P+   S        
Sbjct: 94  YTLSLSVG-PASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPSGGHSSSAPLPLP 151

Query: 159 RSSSSQLIGCQNPKCSWIFG---PNVESRCKGCSPRN------KTCPLACPSYLLQYGLG 209
               S+ + C +P CS       P+      GC   +      +    ACP     YG G
Sbjct: 152 PPPDSRRVPCASPLCSAAHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDG 211

Query: 210 -FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
              A L         S  V NF   C+  +  +P G+AGFGR   SLP QL  +   +FS
Sbjct: 212 SLVAHLRRGRVGLGASVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLAPQLSGRFS 271

Query: 266 YCLLSRKF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           YCL+S  F  D     S L+L   P +  ++T G  YTP   NP         FY V L 
Sbjct: 272 YCLVSHSFRADRLIRPSPLILGRSPDAA-AETGGFVYTPLLHNP-----KHPYFYSVALE 325

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            + VG+  ++           GNGG++VDSG+TFT +    +  VA+ F R M     A 
Sbjct: 326 AVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFAR 385

Query: 384 --DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-----------ALVG 430
               E+++GL PC+  +      +P L L F+G A +ALP  NYF           A   
Sbjct: 386 AERAEEQTGLTPCYHYAASDR-GVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRK 444

Query: 431 NEVLCLILFT--DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ++V CL+L    D +       GPA  LG+FQ Q F + +D+   R GFA+++C
Sbjct: 445 DDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 132/402 (32%), Positives = 182/402 (45%), Gaps = 59/402 (14%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L + S G Y + +  GTP +  +  I DTGS L+W  C     CVD           P F
Sbjct: 84  LVLASDGEYLMEMGIGTPARFYSA-ILDTGSDLIWTQCAPCLLCVD--------QPTPYF 134

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTA 212
            P  SS+ + +GC  P C+ ++ P     C       KTC       + QY  G    TA
Sbjct: 135 DPANSSTYRSLGCSAPACNALYYP----LC-----YQKTC-------VYQYFYGDSASTA 178

Query: 213 GLLLSETLRFPSK----TVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFS 265
           G+L +ET  F +     T+P    GC  L+    A   G+ GFGR S SL SQLG  +FS
Sbjct: 179 GVLANETFTFGTNDTRVTLPRISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFS 238

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL S     +PV S L          +    +  TPF  NP     A    Y++ +  I
Sbjct: 239 YCLTSFL---SPVRSRLYFGAYATLNSTNASTVQSTPFIINP-----ALPTMYFLNMTGI 290

Query: 326 IVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
            VG   + I P    +  +DG GG I+DSG+T T++  P + AV + F+  + +     D
Sbjct: 291 SVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLD 350

Query: 385 VEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFT 440
           V + S L  CF      ++SV LP+L+L F  GA   LP +NY  LV      LCL + T
Sbjct: 351 VTETSVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYM-LVDPSTGGLCLAMAT 408

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +            I+G +Q QNF + +DL N    F    C
Sbjct: 409 SSDGS---------IIGSYQHQNFNVLYDLENSLLSFVPAPC 441


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 127/400 (31%), Positives = 184/400 (46%), Gaps = 51/400 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y I +  GTPP+     I DTGS L W  C   Y C + N P+ +P+         SS
Sbjct: 168 GEYFIDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNE--------SS 218

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET- 219
           S + I C +P+C  +  P+    CK     N+TCP     Y   Y  G  T G    ET 
Sbjct: 219 SYRNISCYDPRCQLVSSPDPLQHCK---TENQTCP-----YFYDYADGSNTTGDFALETF 270

Query: 220 ---LRFPS-----KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
              L +P+     K V + + GC   +        G+ G GR   S PSQL       FS
Sbjct: 271 TVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFS 330

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL +  F +  VSS L+        D +        F K   G  +    FYY+ ++ I
Sbjct: 331 YCL-TDLFSNTSVSSKLIF-----GEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSI 384

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
           +VG + + IP       S+G GG I+DSGST TF     ++ + + F +++     AAD 
Sbjct: 385 VVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAAD- 443

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCL-ILFTDNA 443
                + PC+++SG   V LP+  + F  GA    P ENYF     +EV+CL IL T N 
Sbjct: 444 --DFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNH 501

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +          I+G+   QNF++ +D+   R G++ ++CA
Sbjct: 502 SH-------LTIIGNLLQQNFHILYDVKRSRLGYSPRRCA 534


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 143/453 (31%), Positives = 195/453 (43%), Gaps = 61/453 (13%)

Query: 45  LHHSD-----SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH 99
           LHH D     S P  +  +      +R   +    +       +G+ +S+S+I       
Sbjct: 64  LHHVDALSFNSTPETLFTTRLQRDAARVEAISYLAETAGTGKRVGTGFSSSVISGL--AQ 121

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
             G Y   +  GTPP+     + DTGS +VW  C    RC    +   DP     F P++
Sbjct: 122 GSGEYFTRIGVGTPPRY-VYMVLDTGSDIVWIQCAPCKRC----YAQSDP----VFDPRK 172

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           S S   I C++P C  +  P       GC+ + +TC      Y + YG G FT G   +E
Sbjct: 173 SRSFASIACRSPLCHRLDSP-------GCNTQKQTC-----MYQVSYGDGSFTFGDFSTE 220

Query: 219 TLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
           TL F    V     GC   ++      AG+ G GR   S PSQ G +   KFSYCL+ R 
Sbjct: 221 TLTFRRTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRS 280

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
               P  S++V       GDS     + +TP   NP         FYYV L  I VG   
Sbjct: 281 ASSKP--SSMVF------GDSAVSRTARFTPLVSNP-----KLDTFYYVELLGISVGGTR 327

Query: 332 V-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
           V  I  S       GNGGVI+DSG++ T +  P + A    F     N  RA      S 
Sbjct: 328 VPGITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQF---SL 384

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
              CFD+SGK  V +P ++L F+ GA ++LP  NY   V       + F     G +   
Sbjct: 385 FDTCFDLSGKTEVKVPTVVLHFR-GADVSLPASNYLIPVDTSGNFCLAFAGTMGGLS--- 440

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               I+G+ Q Q F + +DLA  R GFA   CA
Sbjct: 441 ----IIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/403 (30%), Positives = 176/403 (43%), Gaps = 56/403 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTPP+  +  I DTGS L W  C   Y C   N    DP        K S+
Sbjct: 160 GEYFMDVLVGTPPKHFS-LILDTGSDLNWLQCLPCYDCFHQNEAFYDP--------KTSA 210

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-----------LGF 210
           S + I C +P+CS I  P    +CK     N++CP     Y   YG             F
Sbjct: 211 SFKNITCNDPRCSLISSPEPPVQCKS---DNQSCP-----YFYWYGDRSNTTGDFAVETF 262

Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILS-------DRQPAGIAGFGRSSESLPSQLGLKK 263
           T  L  +E  R     V N + GC   +               G    S  L S  G   
Sbjct: 263 TVNLTTTEG-RSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYG-HS 320

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL+ R   D  VSS L+   G          L++T F     G  ++   FYY+ ++
Sbjct: 321 FSYCLVDRN-SDTNVSSKLIF--GEDKDLLNHTNLNFTSFVN---GKENSVETFYYIQIK 374

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRA 382
            I+VG + + IP        DG GG I+DSG+T ++   P +E +  +F  +M  NY   
Sbjct: 375 SILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVF 434

Query: 383 ADVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
            D      L PCF++SG  + +++LPEL + F  GA    P EN F  +  +++CL +  
Sbjct: 435 RDFPV---LDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSEDLVCLAIL- 490

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                    +    I+G++Q QNF++ +D    R GF   KCA
Sbjct: 491 ------GTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKCA 527


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 130/407 (31%), Positives = 186/407 (45%), Gaps = 58/407 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           IK P    S G + + LS G P    +  I DTGS L+W  C     C D       P+ 
Sbjct: 96  IKAPTHGGS-GEFLMELSIGNPAVKYSA-IVDTGSDLIWTQCKPCTECFD------QPT- 146

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
            P F P++SSS   +GC +  C+ +   N       C+     C      YL  YG    
Sbjct: 147 -PIFDPEKSSSYSKVGCSSGLCNALPRSN-------CNEDKDAC-----EYLYTYGDYSS 193

Query: 211 TAGLLLSETLRFPSK-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
           T GLL +ET  F  + ++     GC + ++     Q +G+ G GR   SL SQL   KFS
Sbjct: 194 TRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFS 253

Query: 266 YCLLSRKFDDAP-------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
           YCL S +  +A        ++S +V  TG       T  +S     +NP   S     FY
Sbjct: 254 YCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMS---LLRNPDQPS-----FY 305

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
           Y+ L+ I VG+K + +  S      DG GG+I+DSG+T T++E   F+ + +EF  +M  
Sbjct: 306 YLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-- 363

Query: 379 YSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCL 436
            S   D    +GL  CF +    K++ +P++I  FK GA + LP ENY  A     VLCL
Sbjct: 364 -SLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCL 421

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + + N            I G+ Q QNF +  DL  +   F   +C 
Sbjct: 422 AMGSSNGMS---------IFGNVQQQNFNVLHDLEKETVSFVPTECG 459


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 131/407 (32%), Positives = 187/407 (45%), Gaps = 58/407 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           IK P    S G + + LS G P       I DTGS L+W  C     C D       P+ 
Sbjct: 97  IKAPTHGGS-GEFLMELSIGNPA-VKYAAIVDTGSDLIWTQCKPCTECFD------QPT- 147

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
            P F P++SSS   +GC +  C+ +   N       C+    +C      YL  YG    
Sbjct: 148 -PIFDPEKSSSYSKVGCSSGLCNALPRSN-------CNEDKDSC-----EYLYTYGDYSS 194

Query: 211 TAGLLLSETLRFPSK-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
           T GLL +ET  F  + ++     GC + ++     Q +G+ G GR   SL SQL   KFS
Sbjct: 195 TRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFS 254

Query: 266 YCLLSRKFDDAP-------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
           YCL S +  +A        ++S +V  TG       T  +S     +NP   S     FY
Sbjct: 255 YCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMS---LLRNPDQPS-----FY 306

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
           Y+ L+ I VG+K + +  S      DG GG+I+DSG+T T++E   F+ + +EF  +M  
Sbjct: 307 YLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-- 364

Query: 379 YSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCL 436
            S   D    +GL  CF + +  K++ +P+LI  FK GA + LP ENY  A     VLCL
Sbjct: 365 -SLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFK-GADLELPGENYMVADSSTGVLCL 422

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + + N            I G+ Q QNF +  DL  +   F   +C 
Sbjct: 423 AMGSSNGMS---------IFGNVQQQNFNVLHDLEKETVTFVPTECG 460


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 138/456 (30%), Positives = 202/456 (44%), Gaps = 71/456 (15%)

Query: 50  SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH-SYGGYSISL 108
           +DP          +L R  H     +     SN       + +  P  +  + G Y ++L
Sbjct: 37  ADPSVTASQFVRDALRRDMHRHNARQLAASSSN------GTTVSAPTQISPTAGEYLMTL 90

Query: 109 SFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF--IPKRSS-- 161
           + GTPP  S   I DTGS L+W    PC+S+  C     P  +PS    F  +P  SS  
Sbjct: 91  AIGTPP-VSYQAIADTGSDLIWTQCAPCSSQ--CFQQPTPLYNPSSSTTFAVLPCNSSLS 147

Query: 162 --SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
             ++ L G   P               GC     TC      Y + YG G+T+    SET
Sbjct: 148 MCAAALAGTTPPP--------------GC-----TC-----MYNMTYGSGWTSVYQGSET 183

Query: 220 LRFPSKT------VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
             F S T      VP    GCS  S        +G+ G GR S SL SQLG+ KFSYCL 
Sbjct: 184 FTFGSSTPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSYCL- 242

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
              + D   +S L+L  GP +  + T G+S TPF  +P  S +    +YY+ L  I +G+
Sbjct: 243 -TPYQDTNSTSTLLL--GPSASLNDTGGVSSTPFVASP--SDAPMSTYYYLNLTGISLGT 297

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + IP + L   +DG GG I+DSG+T T +    ++ V +  +  +            +
Sbjct: 298 TALSIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQV-RAAVVSLVTLPTTDGGSAAT 356

Query: 390 GLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
           GL  CF++    S    +P + L F  GA M LP ++Y  L  N + CL +      G +
Sbjct: 357 GLDLCFELPSSTSAPPTMPSMTLHFD-GADMVLPADSYMMLDSN-LWCLAMQNQTDGGVS 414

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                  ILG++Q QN ++ +D+  +   FA  KC+
Sbjct: 415 -------ILGNYQQQNMHILYDVGQETLTFAPAKCS 443


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 132/400 (33%), Positives = 188/400 (47%), Gaps = 55/400 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + L  GTPP+     I DTGS L W  C     C+DC        R P F P  S 
Sbjct: 150 GEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPATSL 200

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S + + C +P+C  +  P     C+   P +  CP     Y   YG    T G L  E  
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACR--RPHSDPCP-----YYYWYGDQSNTTGDLALEAF 253

Query: 221 RF------PSKTVPNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGL---KKFSYC 267
                    S+ V + + GC   S+R      AG+ G GR + S  SQL       FSYC
Sbjct: 254 TVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYC 312

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           L+      + V S +V     G  D+    P L+YT        +++A   FYYV L+ +
Sbjct: 313 LVDHG---SSVGSKIVF----GDDDALLGHPRLNYT---AFAPSAAAAADTFYYVQLKGV 362

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
           +VG + + I  S    G DG+GG I+DSG+T ++   P +E + + F+ +M   Y   AD
Sbjct: 363 LVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVAD 422

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
                 L PC+++SG + V +PE  L F  GA    P ENYF  L  + ++CL +     
Sbjct: 423 FPV---LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVL---- 475

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 R    I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 476 ---GTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCA 512


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 130/401 (32%), Positives = 187/401 (46%), Gaps = 59/401 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTPP+     I DTGS L W  C     C+DC        R P F P  S+
Sbjct: 148 GEYLVEVYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----DQRGPVFDPMAST 198

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S + + C + +C  +  P     C+  S R+  CP     Y   YG    T G L  E  
Sbjct: 199 SYRNVTCGDTRCGLVSPPAAPRTCR--SSRSDPCP-----YYYWYGDQSNTTGDLALEAF 251

Query: 221 RF-----PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
                   S+ V   + GC   +       AG+ G GR   S  SQL       FSYCL+
Sbjct: 252 TVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLV 311

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKT----PGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
                 + V S +V       GD       P L+YT F       S+A   FYYV L+ I
Sbjct: 312 DHG---SAVGSKIVF------GDDNVLLSHPQLNYTAF-----APSAAENTFYYVQLKGI 357

Query: 326 IVGSKHVKIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAA 383
           +VG + + IP  ++ V   DG+GG I+DSG+T ++   P ++A+ + F+ +M   Y   A
Sbjct: 358 LVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIA 417

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
           D      L PC+++SG + V +PE  L F  GA    P ENYF  +  E ++CL +    
Sbjct: 418 DFPV---LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVL--- 471

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                  R    I+G++Q QNF++ +DL ++R GFA ++CA
Sbjct: 472 ----GTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRCA 508


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 132/400 (33%), Positives = 188/400 (47%), Gaps = 55/400 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + L  GTPP+     I DTGS L W  C     C+DC        R P F P  S 
Sbjct: 150 GEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASL 200

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S + + C +P+C  +  P     C+   P +  CP     Y   YG    T G L  E  
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACR--RPHSDPCP-----YYYWYGDQSNTTGDLALEAF 253

Query: 221 RF------PSKTVPNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGL---KKFSYC 267
                    S+ V + + GC   S+R      AG+ G GR + S  SQL       FSYC
Sbjct: 254 TVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYC 312

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           L+      + V S +V     G  D+    P L+YT        +++A   FYYV L+ +
Sbjct: 313 LVDHG---SSVGSKIVF----GDDDALLGHPRLNYT---AFAPSAAAAADTFYYVQLKGV 362

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
           +VG + + I  S    G DG+GG I+DSG+T ++   P +E + + F+ +M   Y   AD
Sbjct: 363 LVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVAD 422

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
                 L PC+++SG + V +PE  L F  GA    P ENYF  L  + ++CL +     
Sbjct: 423 FPV---LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVL---- 475

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 R    I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 476 ---GTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCA 512


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 125/393 (31%), Positives = 180/393 (45%), Gaps = 57/393 (14%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + LS G P    +  I DTGS L+W  C     C D       P+  P F P++SSS   
Sbjct: 1   MELSIGNPAVKYSA-IVDTGSDLIWTQCKPCTECFD------QPT--PIFDPEKSSSYSK 51

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPS 224
           +GC +  C+ +   N       C+     C      YL  YG    T GLL +ET  F  
Sbjct: 52  VGCSSGLCNALPRSN-------CNEDKDAC-----EYLYTYGDYSSTRGLLATETFTFED 99

Query: 225 K-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP-- 277
           + ++     GC + ++     Q +G+ G GR   SL SQL   KFSYCL S +  +A   
Sbjct: 100 ENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSS 159

Query: 278 -----VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
                ++S +V  TG       T  +S     +NP   S     FYY+ L+ I VG+K +
Sbjct: 160 LFIGSLASGIVNKTGASLDGEVTKTMS---LLRNPDQPS-----FYYLELQGITVGAKRL 211

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            +  S      DG GG+I+DSG+T T++E   F+ + +EF  +M   S   D    +GL 
Sbjct: 212 SVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM---SLPVDDSGSTGLD 268

Query: 393 PCFDI-SGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALGR 450
            CF +    K++ +P++I  FK GA + LP ENY  A     VLCL + + N        
Sbjct: 269 LCFKLPDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNGMS----- 322

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               I G+ Q QNF +  DL  +   F   +C 
Sbjct: 323 ----IFGNVQQQNFNVLHDLEKETVSFVPTECG 351


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 152/483 (31%), Positives = 212/483 (43%), Gaps = 59/483 (12%)

Query: 7   SLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR 66
           SL  + +L I  F       +S   +  P      +  L H DS             + R
Sbjct: 6   SLSLVVALAIFAFVFSHAFSTSRRVLEHPKVQNGFRAKLKHVDSGKNLTKFERIQHGVKR 65

Query: 67  ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
            RH   + K     ++     SNS I  P+ +   G + + L+ GTPP+  +  I DTGS
Sbjct: 66  GRHRLQRFKAMALVAS-----SNSEIDAPV-LPGNGEFLMKLAIGTPPETYSA-IMDTGS 118

Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
            L+W  C    +C D       P+  P F PK+SSS   + C +  C  +          
Sbjct: 119 DLIWTQCKPCTQCFD------QPT--PIFDPKKSSSFSKLSCSSKLCEAL---------- 160

Query: 187 GCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSD----RQ 241
              P++ TC   C  YL  YG    T G+L SETL F   +VP    GC   ++     Q
Sbjct: 161 ---PQS-TCSDGC-EYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQ 215

Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
            +G+ G GR   SL SQL   KFSYCL S   DD   S+ L+       G   +   S +
Sbjct: 216 GSGLVGLGRGPLSLVSQLKEPKFSYCLTS--VDDTKASTLLM-------GSLASVKASDS 266

Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
                P+  +SA   FYY+ L  I VG   + I  S      DG+GG+I+DSG+T T++E
Sbjct: 267 EIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLE 326

Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMAL 420
              F+ VAKEF  Q+   +   D    +GL  CF + SG   + +P+L+  F  GA + L
Sbjct: 327 QSAFDLVAKEFTSQI---NLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFD-GADLEL 382

Query: 421 PPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           P ENY  A     V CL      A G + G     I G+ Q QN  +  DL  +   F  
Sbjct: 383 PAENYMIADASMGVACL------AMGSSSGMS---IFGNIQQQNMLVLHDLEKETLSFLP 433

Query: 480 QKC 482
            +C
Sbjct: 434 TQC 436


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 122/402 (30%), Positives = 177/402 (44%), Gaps = 54/402 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTPP+  +  I DTGS L W  C   Y C   N    DP        K S+
Sbjct: 158 GEYFMDVLVGTPPKHFS-LILDTGSDLNWLQCLPCYDCFHQNGMFYDP--------KTSA 208

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S + I C +P+CS I  P+   +C+     N++CP     Y   YG    T G    ET 
Sbjct: 209 SFKNITCNDPRCSLISSPDPPVQCES---DNQSCP-----YFYWYGDRSNTTGDFAVETF 260

Query: 221 RFPSKT---------VPNFLAGCSILS-------DRQPAGIAGFGRSSESLPSQLGLKKF 264
                T         V N + GC   +               G    S  L S  G   F
Sbjct: 261 TVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYG-HSF 319

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYCL+ R   +  VSS L+   G          L++T F     G  ++   FYY+ ++ 
Sbjct: 320 SYCLVDRN-SNTNVSSKLIF--GEDKDLLNHTNLNFTSFVN---GKENSVETFYYIQIKS 373

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAA 383
           I+VG K + IP       SDG+GG I+DSG+T ++   P +E +  +F  +M  NY    
Sbjct: 374 ILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFR 433

Query: 384 DVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
           D      L PCF++SG  + +++LPEL + F  G     P EN F  +  +++CL +   
Sbjct: 434 DFPV---LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAIL-- 488

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   +    I+G++Q QNF++ +D    R GF   KCA
Sbjct: 489 -----GTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 129/394 (32%), Positives = 185/394 (46%), Gaps = 55/394 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           GGY++++S GTP   +   + DTGS L+W  C    +C         P+  P F P  SS
Sbjct: 84  GGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQ------QPA--PPFQPASSS 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           +   + C +  C ++  PN            +TC      Y  +YG G+TAG L +ETL+
Sbjct: 135 TFSKLPCTSSFCQFL--PN----------SIRTCNATGCVYNYKYGSGYTAGYLATETLK 182

Query: 222 FPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
               + P+   GCS  +      +GIAG GR + SL  QLG+ +FSYCL  R    A  S
Sbjct: 183 VGDASFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCL--RSGSAAGAS 240

Query: 280 SNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
             L      GS  + T G +  TPF  NP    S    +YYV L  I VG   + +  S 
Sbjct: 241 PILF-----GSLANLTDGNVQSTPFVNNPAVHPS----YYYVNLTGITVGETDLPVTTST 291

Query: 339 LVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                +G  GG IVDSG+T T++    +E V + F+ Q  + +    V    GL  CF  
Sbjct: 292 FGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTT---VNGTRGLDLCFKS 348

Query: 398 S--GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPALG 449
           +  G   + +P L+L+F GGA+ A+P   YFA V  +      V CL++       PA G
Sbjct: 349 TGGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMML------PAKG 400

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             P  ++G+    + +L +DL    F FA   CA
Sbjct: 401 DQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 434


>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
          Length = 429

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 140/430 (32%), Positives = 201/430 (46%), Gaps = 55/430 (12%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
           I  P++ ++  GY +SL+ G PPQ    ++ DTGS L W PC   S Y+C++C   +   
Sbjct: 14  IIEPVTTYT-DGYLLSLNLGMPPQVFQVYL-DTGSDLTWVPCGTNSSYQCLECGNEHSTS 71

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFG-PNVESRCK--GC---SPRNKTCPLACPSYL 203
             IP+F P +SSS+    C +  C  I    N    C   GC   S  +  C   CP + 
Sbjct: 72  KPIPSFSPSQSSSNMKELCGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSGLCTRPCPPFS 131

Query: 204 LQYGLG-FTAGLLLSETLRFPSKT--------VPNFLAGCSILSDRQPAGIAGFGRSSES 254
             YG G    G L  + +              VP F  GC   S R+P GIAGFG+   S
Sbjct: 132 YTYGGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILS 191

Query: 255 LPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGD---SKTPGLSYTPFYK--- 305
           LPSQLG   K FS+C L  +F   P  +S+L++      GD   S      +TP  K   
Sbjct: 192 LPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIM------GDLALSAKDDFLFTPMLKSIT 245

Query: 306 NPVGSSSAFGEFYYVGLRQIIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
           NP         FYY+GL  + +G    +  P S     S+GNGG+IVD+G+T+T +  P 
Sbjct: 246 NP--------NFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPF 297

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMAL 420
           + A+       +  Y R+ D+E ++G   CF I    +      LP +   F G  K+ L
Sbjct: 298 YTAILSSLASVI-LYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTL 356

Query: 421 PPEN-YFALVG--NEVL--CLIL--FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
           P ++ Y+A+    N V+  CL+     D         GP  +LG FQ+QN  + +D+   
Sbjct: 357 PKDSCYYAVTAPKNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAG 416

Query: 474 RFGFAKQKCA 483
           R GF  + CA
Sbjct: 417 RIGFQPKDCA 426


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 122/402 (30%), Positives = 173/402 (43%), Gaps = 53/402 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + L  G PPQ S   I DTGS LVW  C++   C +C+  +  P+ +  F P+ SS
Sbjct: 81  GQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSA---CRNCS--HHSPATV--FFPRHSS 132

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +     C +P C  +  P    RC      + TCP     Y   Y  G  T+GL   ET 
Sbjct: 133 TFSPAHCYDPVCRLVPKPGRAPRCNHTR-IHSTCP-----YEYGYADGSLTSGLFARETT 186

Query: 221 RFPSKT-----VPNFLAGCSILSDRQPA---------GIAGFGRSSESLPSQLGLK---K 263
              + +     + +   GC      Q           G+ G GR   S  SQLG +   K
Sbjct: 187 SLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK 246

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL+       P S  ++     G G      L +TP   NP+  +     FYYV L+
Sbjct: 247 FSYCLMDYTLSPPPTSYLII-----GDGGDAVSKLFFTPLLTNPLSPT-----FYYVKLK 296

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            + V    ++I  S       GNGG ++DSG+T  F+  P +  V    ++Q      A 
Sbjct: 297 SVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAA-VKQRIKLPNAD 355

Query: 384 DVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
             E   G   C ++SG  K    LP L  +F GGA    PP NYF     ++ CL +   
Sbjct: 356 --ELTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAI--- 410

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +  P +G     ++G+   Q F  EFD    R GF+++ CA
Sbjct: 411 QSVDPKVGFS---VIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 157/490 (32%), Positives = 220/490 (44%), Gaps = 70/490 (14%)

Query: 16  ILLFTTDAGAGSSAATVTV-------PLTPLSTKHYLHHSDSDPLKI-LHSLASSSLSRA 67
           +LLF   + A S   T+T+       PL        L  S   PL + LH L S SL++ 
Sbjct: 10  LLLFFFISTAASEFQTLTLRSLPTPSPLPLFPDSQSLQSSPDAPLTLDLHHLDSLSLNKT 69

Query: 68  ------RHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFI 121
                   L   T      ++  + +S+S++ + LS  S G Y   L  GTPP+     +
Sbjct: 70  PTDLFNLRLHRDTLRVHALNSRAAGFSSSVV-SGLSQGS-GEYFTRLGVGTPPRY-LYMV 126

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS +VW  C+   +C    +   DP     F P +S S   I C +P C  +     
Sbjct: 127 LDTGSDVVWLQCSPCRKC----YSQSDP----IFNPYKSKSFAGIPCSSPLCRRL----- 173

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD- 239
                GCS R  TC      Y + YG G FT G   +ETL F    +     GC   ++ 
Sbjct: 174 --DSSGCSTRRHTC-----LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHHNEG 226

Query: 240 --RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
                AG+ G GR   S PSQ G++   KFSYCL+ R     P  S++V       GD+ 
Sbjct: 227 LFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKP--SSMVF------GDAA 278

Query: 295 TPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVD 352
              L+ +TP  +NP         FYYVGL  I VG   V+ +  S     S GNGGVI+D
Sbjct: 279 ISRLARFTPLIRNP-----KLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIID 333

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG++ T +  P + A+   F     +  R  +    S    C+D+SG+ SV +P ++L F
Sbjct: 334 SGTSVTRLTRPAYTALRDAFRVGARHLKRGPEF---SLFDTCYDLSGQSSVKVPTVVLHF 390

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
           + GA MALP  NY   V         F    +G +       I+G+ Q Q F + +DLA 
Sbjct: 391 R-GADMALPATNYLIPVDENGSFCFAFAGTISGLS-------IIGNIQQQGFRVVYDLAG 442

Query: 473 DRFGFAKQKC 482
            R GFA + C
Sbjct: 443 SRIGFAPRGC 452


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 130/406 (32%), Positives = 187/406 (46%), Gaps = 61/406 (15%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L+ +  G Y + LS GTPP A  P I DTGS L W  C     C    F    P+  P +
Sbjct: 88  LAENGAGAYHMILSVGTPPLA-FPAIIDTGSDLTWTQCAP---CTTACF--AQPT--PLY 139

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCK--GCSPRNKTCPLACPSYLLQYGLGFTAG 213
            P RSS+   + C +P C  +  P+    C   GC             Y  +Y +GFTAG
Sbjct: 140 DPARSSTFSKLPCASPLCQAL--PSAFRACNATGCV------------YDYRYAVGFTAG 185

Query: 214 LLLSETLRF--------PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLK 262
            L ++TL           S +      GCS  +       +GI G GRS+ SL SQ+G+ 
Sbjct: 186 YLAADTLAIGDGDGDGDASSSFAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVG 245

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           +FSYCL  R   DA  S  L       +GD     +  T   +NPV +      +YYV L
Sbjct: 246 RFSYCL--RSDADAGASPILFGALANVTGDK----VQSTALLRNPVAARRR-APYYYVNL 298

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             I VGS  + +  S     + G GGVIVDSG+TFT++    +  + + F+ Q      A
Sbjct: 299 TGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQT-----A 353

Query: 383 ADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALV--GNEVLCLI 437
             + + SG +  FD+   +G     +P L+ +F GGA+ A+P ++YF  V  G  V CL+
Sbjct: 354 GLLTRVSGAQFDFDLCFEAGAADTPVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLL 413

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +           RG ++I G+    + ++ +DL    F FA   CA
Sbjct: 414 VLPT--------RGVSVI-GNVMQMDLHVLYDLDGATFSFAPADCA 450


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 129/394 (32%), Positives = 186/394 (47%), Gaps = 51/394 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y ++L+ GTPP  S P I DTGS L+W       +C  C       +  P + P  S+
Sbjct: 86  GEYIMTLAIGTPP-LSYPAIADTGSDLIW------TQCAPCGSQCFKQAGQP-YNPSSST 137

Query: 162 SSQLIGCQNP--KCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
           +  ++ C +    C+ + GP   S   GCS         C  Y   YG G+TAG+   ET
Sbjct: 138 TFGVLPCNSSVSMCAALAGP---SPPPGCS---------C-MYNQTYGTGWTAGIQSVET 184

Query: 220 LRFPSK-----TVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
             F S       VP    GCS  S       AG+ G GR S SL SQLG   FSYCL   
Sbjct: 185 FTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCL--T 242

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
            F DA  +S L+L  GP +  + T G+  TPF  +P  S +    +YY+ L  I +G+  
Sbjct: 243 PFQDANSTSTLLL--GPSAALNGT-GVLTTPFVASP--SKAPMSTYYYLNLTGISIGTTA 297

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP +     +DG GG+I+DSG+T T +    ++ V +  I  +      AD    +GL
Sbjct: 298 LSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQV-RAAIESLVTLP-VADGSDSTGL 355

Query: 392 RPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
             CF ++ + S    +P +   F  GA M LP +NY  ++G+ V CL +           
Sbjct: 356 DLCFALTSETSTPPSMPSMTFHFD-GADMVLPVDNYM-ILGSGVWCLAMRNQTV------ 407

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            G     G++Q QN +L +D+  +   FA  KC+
Sbjct: 408 -GAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 135/470 (28%), Positives = 207/470 (44%), Gaps = 64/470 (13%)

Query: 29  AATVTVPLTPLSTKHYLHHSDSDP---LKILHSLASSSLSRARHLKTKTKPKTKDSNIGS 85
            + +  P +  S    LHH    P   L+++     S ++  ++   K   K  +  + S
Sbjct: 15  VSAIVAPTSSTSRGTLLHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRS 74

Query: 86  N----YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
                 S+S I+TP+   S G Y ++++ GTP  +S   I DTGS L+W  C    +C  
Sbjct: 75  INAMLQSSSGIETPVYAGS-GEYLMNVAIGTPA-SSLSAIMDTGSDLIWTQCEPCTQCFS 132

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
                  P+  P F P+ SSS   + C++  C  +   +  + C+               
Sbjct: 133 ------QPT--PIFNPQDSSSFSTLPCESQYCQDLPSESCYNDCQ--------------- 169

Query: 202 YLLQYGLGF-TAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSES 254
           Y   YG G  T G + +ET  F + +VPN   GC    D Q       AG+ G G    S
Sbjct: 170 YTYGYGDGSSTQGYMATETFTFETSSVPNIAFGCG--EDNQGFGQGNGAGLIGMGWGPLS 227

Query: 255 LPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           LPSQLG+ +FSYC+ S     +   S L L +        +P  +      NP       
Sbjct: 228 LPSQLGVGQFSYCMTSSG---SSSPSTLALGSAASGVPEGSPSTTLIHSSLNPT------ 278

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
             +YY+ L+ I VG  ++ IP S      DG GG+I+DSG+T T++    + AVA+ F  
Sbjct: 279 --YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTD 336

Query: 375 QMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
           Q+ N S     E  SGL  CF + S   +V +PE+ ++F GG  + L  EN        V
Sbjct: 337 QI-NLSPVD--ESSSGLSTCFQLPSDGSTVQVPEISMQFDGGV-LNLGEENVLISPAEGV 392

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +CL + + +  G +       I G+ Q Q   + +DL N    F   +C 
Sbjct: 393 ICLAMGSSSQQGIS-------IFGNIQQQETQVLYDLQNLAVSFVPTQCG 435


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 126/401 (31%), Positives = 183/401 (45%), Gaps = 58/401 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDP 149
           ++TP+     G Y ++LS GTP Q   PF  I DTGS L+W  C    +C +        
Sbjct: 84  VETPVYAGD-GEYLMNLSIGTPAQ---PFSAIMDTGSDLIWTQCQPCTQCFN-------- 131

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
              P F P+ SSS   + C +  C  +  P        CS  N +C      Y   YG G
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPT-------CS--NNSC-----QYTYGYGDG 177

Query: 210 F-TAGLLLSETLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKF 264
             T G + +ETL F S ++PN   GC            AG+ G GR   SLPSQL + KF
Sbjct: 178 SETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKF 237

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYC+      +   SS L+L +   S  + +P  +           SS    FYY+ L  
Sbjct: 238 SYCMTPIGSSN---SSTLLLGSLANSVTAGSPNTTLI--------QSSQIPTFYYITLNG 286

Query: 325 IIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           + VGS  + I P  + +  ++G GG+I+DSG+T T+     ++AV + FI QM N S   
Sbjct: 287 LSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQM-NLSVVN 345

Query: 384 DVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
                SG   CF + S + ++ +P  ++ F GG  + LP ENYF    N ++CL + + +
Sbjct: 346 G--SSSGFDLCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNGLICLAMGSSS 402

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                       I G+ Q QN  + +D  N    F   +C 
Sbjct: 403 QG--------MSIFGNIQQQNLLVVYDTGNSVVSFLSAQCG 435


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 126/401 (31%), Positives = 183/401 (45%), Gaps = 58/401 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDP 149
           ++TP+     G Y ++LS GTP Q   PF  I DTGS L+W  C    +C +        
Sbjct: 84  VETPVYAGD-GEYLMNLSIGTPAQ---PFSAIMDTGSDLIWTQCQPCTQCFN-------- 131

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
              P F P+ SSS   + C +  C  +  P        CS  N +C      Y   YG G
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPT-------CS--NNSC-----QYTYGYGDG 177

Query: 210 F-TAGLLLSETLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKF 264
             T G + +ETL F S ++PN   GC            AG+ G GR   SLPSQL + KF
Sbjct: 178 SETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKF 237

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYC+       +  SS L+L +   S  + +P  +           SS    FYY+ L  
Sbjct: 238 SYCMTPIG---SSTSSTLLLGSLANSVTAGSPNTTLI--------ESSQIPTFYYITLNG 286

Query: 325 IIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           + VGS  + I P  + +  ++G GG+I+DSG+T T+     ++AV + FI QM N S   
Sbjct: 287 LSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQM-NLSVVN 345

Query: 384 DVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
                SG   CF + S + ++ +P  ++ F GG  + LP ENYF    N ++CL + + +
Sbjct: 346 G--SSSGFDLCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNGLICLAMGSSS 402

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                       I G+ Q QN  + +D  N    F   +C 
Sbjct: 403 QG--------MSIFGNIQQQNLLVVYDTGNSVVSFLFAQCG 435


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 144/484 (29%), Positives = 216/484 (44%), Gaps = 56/484 (11%)

Query: 11  LFSLLILLF-TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
           + SL +L+F    A   S AA+V V LT       +H   SDP          +L R  H
Sbjct: 8   MASLAVLVFLVVCATLASGAASVRVGLT------RIH---SDPDITAPEFVRDALRRDMH 58

Query: 70  LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
            + +++         S+ +    +T   + + G Y ++LS GTPP  S P I DTGS L+
Sbjct: 59  -RQQSRSLFGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPP-LSYPAIADTGSDLI 116

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
           W       +C  C+         P + P  S++  ++ C +            S C G  
Sbjct: 117 W------TQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSL----------SMCAGVL 160

Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQ--- 241
                 P     Y   YG G+TAG+  SET  F S       VP    GCS  S      
Sbjct: 161 AGKAPPPGCACMYNQTYGTGWTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNG 220

Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
            AG+ G GR S SL SQLG  +FSYCL    F D   +S L+L  GP +  + T G+  T
Sbjct: 221 SAGLVGLGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLL--GPSAALNGT-GVRST 275

Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
           PF  +P  + +    +YY+ L  I +G+K + I        +DG GG+I+DSG+T T + 
Sbjct: 276 PFVASP--AKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLV 333

Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV--YLPELILKFKGGAKMA 419
              ++ V +  ++ +     A D    +GL  C+ +    S    +P + L F  GA M 
Sbjct: 334 NAAYQQV-RAAVQSLVTLP-AIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHFD-GADMV 390

Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           LP ++Y  + G+ V CL +            G     G++Q QN ++ +D+ N+   FA 
Sbjct: 391 LPADSYM-ISGSGVWCLAMRNQT-------DGAMSTFGNYQQQNMHILYDVRNEMLSFAP 442

Query: 480 QKCA 483
            KC+
Sbjct: 443 AKCS 446


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 130/404 (32%), Positives = 185/404 (45%), Gaps = 58/404 (14%)

Query: 88  SNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
           S+S I+ P+ +   G + + L+ GTPP+  +  I DTGS L+W  C    +C   + P  
Sbjct: 82  SSSEIEAPV-LPGNGEFLMKLAIGTPPETYSA-ILDTGSDLIWTQCKPCTQCFHQSTPIF 139

Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
           DP +  +F     SS          C+            GC             YL  YG
Sbjct: 140 DPKKSSSFSKLSCSSQLCEALPQSSCN-----------NGCE------------YLYSYG 176

Query: 208 -LGFTAGLLLSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLK 262
               T G+L SETL F   +VPN   GC   ++     Q AG+ G GR   SL SQL   
Sbjct: 177 DYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEP 236

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           KFSYCL +   DD   S+  +L     S ++ +  +  TP   +P     A   FYY+ L
Sbjct: 237 KFSYCLTT--VDDTKTST--LLMGSLASVNASSSAIKTTPLIHSP-----AHPSFYYLSL 287

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             I VG   + I  S      DG+GG+I+DSG+T T++E   F  VAKEF  ++   +  
Sbjct: 288 EGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI---NLP 344

Query: 383 ADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLIL 438
            D    +GL  CF + SG  ++ +P+L+  F  GA + LP ENY  ++G+    V CL  
Sbjct: 345 VDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFD-GADLELPAENY--MIGDSSMGVACL-- 399

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               A G + G     I G+ Q QN  +  DL  +   F   +C
Sbjct: 400 ----AMGSSSGMS---IFGNVQQQNMLVLHDLEKETLSFLPTQC 436


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 138/457 (30%), Positives = 201/457 (43%), Gaps = 62/457 (13%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
           L    +DP          +L R  H     K     S+ G+  S     +P +    G Y
Sbjct: 36  LTRVHADPSVTASQFVRGALRRDMHRHNARKLALAASS-GATVSAPTQNSPTA----GEY 90

Query: 105 SISLSFGTPPQASTPF--IFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF--IP 157
            ++L+ GTPP    P+  I DTGS L+W    PCTS+  C     P  +PS    F  +P
Sbjct: 91  LMALAIGTPP---LPYQAIADTGSDLIWTQCAPCTSQ--CFRQPTPLYNPSSSTTFAVLP 145

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
             SS S         C+        +   GC         AC +Y + YG G+T+    S
Sbjct: 146 CNSSLS--------VCAAALAGTGTAPPPGC---------AC-TYNVTYGSGWTSVFQGS 187

Query: 218 ETLRFPS-----KTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
           ET  F S       VP    GCS  S        +G+ G GR   SL SQLG+ KFSYCL
Sbjct: 188 ETFTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL 247

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               + D   +S L+L  GP +  + T G+S TPF  +P  S++    FYY+ L  I +G
Sbjct: 248 --TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASP--STAPMNTFYYLNLTGISLG 301

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
           +  + IP    +  +DG GG+I+DSG+T T +    ++ V    +  +       D    
Sbjct: 302 TTALSIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV--TLPTTDGSAA 359

Query: 389 SGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
           +GL  CF +    S    +P + L F  GA M LP ++Y     + + CL +        
Sbjct: 360 TGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDSGLWCLAMQNQT---- 414

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               G   ILG++Q QN ++ +D+  +   FA  KC+
Sbjct: 415 ---DGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 448


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 122/391 (31%), Positives = 177/391 (45%), Gaps = 57/391 (14%)

Query: 102 GGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           G Y ++LS GTP Q   PF  I DTGS L+W  C    +C +           P F P+ 
Sbjct: 93  GEYLMNLSIGTPAQ---PFSAIMDTGSDLIWTQCQPCTQCFN--------QSTPIFNPQG 141

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
           SSS   + C +  C  +  P        CS  N  C      Y   YG G  T G + +E
Sbjct: 142 SSSFSTLPCSSQLCQALSSPT-------CS--NNFC-----QYTYGYGDGSETQGSMGTE 187

Query: 219 TLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
           TL F S ++PN   GC            AG+ G GR   SLPSQL + KFSYC+      
Sbjct: 188 TLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG-- 245

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            +   SNL+L +   S  + +P  +           SS    FYY+ L  + VGS  + I
Sbjct: 246 -SSTPSNLLLGSLANSVTAGSPNTTLI--------QSSQIPTFYYITLNGLSVGSTRLPI 296

Query: 335 -PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
            P ++ +  ++G GG+I+DSG+T T+     +++V +EFI Q+   +        SG   
Sbjct: 297 DPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI---NLPVVNGSSSGFDL 353

Query: 394 CFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
           CF   S   ++ +P  ++ F GG  + LP ENYF    N ++CL + + +          
Sbjct: 354 CFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLICLAMGSSSQG-------- 404

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             I G+ Q QN  + +D  N    FA  +C 
Sbjct: 405 MSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 121/397 (30%), Positives = 182/397 (45%), Gaps = 62/397 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ       DTGS L+W  C     C D          +P F   RSS++
Sbjct: 35  YLVHLAIGTPPQP-VQLTLDTGSDLIWTQCKPCVSCFD--------QPLPYFDTSRSSTN 85

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
            L+ C++ +C     P V + C   +   +TC     +Y   YG    T GLL ++   F
Sbjct: 86  ALLPCESTQCK--LDPTV-TVCVKLNQTVQTC-----AYYTSYGDNSVTIGLLAADKFTF 137

Query: 223 PSKT-VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
            + T +P    GC +    + +    GIAGFGR   SLPSQL +  FS+C  +       
Sbjct: 138 VAGTSLPGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTIT---GA 194

Query: 278 VSSNLVLD-------TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           + S ++LD        G G+  + TP + Y     NP          YY+ L+ I VGS 
Sbjct: 195 IPSTVLLDLPADLFSNGQGAVQT-TPLIQYAKNEANPT--------LYYLSLKGITVGST 245

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + +P S     ++G GG I+DSG++ T +   +++ V  EF  Q+            +G
Sbjct: 246 RLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVVPGNATG 301

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGP 446
              CF    +    +P+L+L F+ GA M LP ENY   V    GN ++CL          
Sbjct: 302 HYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL---------- 350

Query: 447 ALGRG-PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           A+ +G    I+G+FQ QN ++ +DL N+   F   +C
Sbjct: 351 AINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 387


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 115/386 (29%), Positives = 174/386 (45%), Gaps = 36/386 (9%)

Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
            G PPQ +   I DTGS+L+W  C++      C         +  + P RS +++ + C 
Sbjct: 77  IGDPPQQAEAII-DTGSNLIWTQCST------CQPAGCFSQNLSFYDPSRSRTARPVACN 129

Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN 229
           +  C+       E+RC   +  NK C     + L  YG G   G+L +E   F  ++   
Sbjct: 130 DTACAL----GSETRC---ARDNKAC-----AVLTAYGAGVIGGVLGTEAFTFQPQSENV 177

Query: 230 FLA-GCSILSDRQP------AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
            LA GC   +   P      +GI G GR + SL SQLG  KFSYCL +  F  +  +S L
Sbjct: 178 SLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCL-TPYFSQSTNTSRL 236

Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
            +    G      P  S  PF KNP      F  FYY+ L  I VG   + +P +     
Sbjct: 237 FVGASAGLSSGGAPATS-VPFLKNP--DVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLR 293

Query: 343 SDGNG---GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS- 398
               G   G ++DSGS FT +    ++A+  E ++Q+G  S         GL  C  ++ 
Sbjct: 294 QVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLG-ASIVPPPAGAEGLDLCAAVAH 352

Query: 399 GKKSVYLPELILKF-KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
           G     +P L+L F  GG  +A+PPENY+  V +   C+++F+       L      I+G
Sbjct: 353 GDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIG 412

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKCA 483
           ++  Q+ +L +DL      F    C+
Sbjct: 413 NYMQQDMHLLYDLEKGMLSFQPADCS 438


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 138/457 (30%), Positives = 200/457 (43%), Gaps = 62/457 (13%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
           L    +DP          +L R  H     K     S+ G+  S     +P +    G Y
Sbjct: 38  LTRVHADPSVTASQFVRGALRRDMHRHNARKLALAASS-GATVSAPTQDSPTA----GEY 92

Query: 105 SISLSFGTPPQASTPF--IFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF--IP 157
            ++L+ GTPP    P+  I DTGS L+W    PCTS+  C     P  +PS    F  +P
Sbjct: 93  LMALAIGTPP---LPYQAIADTGSDLIWTQCAPCTSQ--CFRQPTPLYNPSSSTTFAVLP 147

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
             SS S         C+        +   GC         AC +Y + YG G+T+    S
Sbjct: 148 CNSSLS--------VCAAALAGTGTAPPPGC---------AC-TYNVTYGSGWTSVFQGS 189

Query: 218 ETLRFPSK-----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
           ET  F S       VP    GCS  S        +G+ G GR   SL SQLG+ KFSYCL
Sbjct: 190 ETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL 249

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               + D   +S L+L  GP +  + T G+S TPF  +P  S++    FYY+ L  I +G
Sbjct: 250 --TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASP--STAPMNTFYYLNLTGISLG 303

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
           +  + IP       +DG GG+I+DSG+T T +    ++ V    +  +       D    
Sbjct: 304 TTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV--TLPTTDGSAD 361

Query: 389 SGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
           +GL  CF +    S    +P + L F  GA M LP ++Y     + + CL +        
Sbjct: 362 TGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDSGLWCLAMQNQT---- 416

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               G   ILG++Q QN ++ +D+  +   FA  KC+
Sbjct: 417 ---DGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 450


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 124/404 (30%), Positives = 178/404 (44%), Gaps = 56/404 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP--AFIPKR 159
           G Y +SL  GTPPQ +   + DTGS L+W  C+    C +C+       R P  AF  + 
Sbjct: 84  GQYFVSLRIGTPPQ-TLLLVADTGSDLIWVKCSP---CRNCS------HRSPGSAFFARH 133

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S++   I C +P+C  +  P+         P N+T   +   Y   Y     T G    E
Sbjct: 134 STTYSAIHCYSPQCQLVPHPHPN-------PCNRTRLHSPCRYQYTYADSSTTTGFFSKE 186

Query: 219 TLRFPSKT-----VPNFLAGCSI---------LSDRQPAGIAGFGRSSESLPSQLGLK-- 262
            L   + T     +     GC            S     G+ G GR+  S  SQLG +  
Sbjct: 187 ALTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFG 246

Query: 263 -KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
            KFSYCL+       P +S L +        SK   +S+TP   NP+  +     FYY+ 
Sbjct: 247 SKFSYCLMDYTLSPPP-TSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPT-----FYYIA 300

Query: 322 LRQIIVGSKHVKIPYSYLVPGSD--GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           ++ + V    VK+P +  V   D  GNGG I+DSG+T TF+  P +  + K F +++   
Sbjct: 301 IKGVYVNG--VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLP 358

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           S A   E   G   C ++SG     LP +     GG+  + PP NYF   G+++ CL   
Sbjct: 359 SPA---EPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCL--- 412

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              A  P    G   +LG+   Q F LEFD    R GF ++ CA
Sbjct: 413 ---AVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCA 453


>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
 gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
 gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
          Length = 432

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 142/434 (32%), Positives = 204/434 (47%), Gaps = 60/434 (13%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
           I  P++ ++  GY +SL+ G PPQ    ++ DTGS L W PC   S Y+C++C   +   
Sbjct: 14  IIEPVTTYT-DGYLLSLNLGMPPQVFQVYL-DTGSDLTWVPCGTNSSYQCLECGNEHSTS 71

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFG-PNVESRCK--GC---SPRNKTCPLACPSYL 203
             IP+F P +SSS+    C +  C  I    N    C   GC   S  +  C   CP + 
Sbjct: 72  KPIPSFSPSQSSSNMKELCGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFS 131

Query: 204 LQYGLG-FTAGLLLSETLRFPSKT--------VPNFLAGCSILSDRQPAGIAGFGRSSES 254
             YG G    G L  + +              VP F  GC   S R+P GIAGFG+   S
Sbjct: 132 YTYGGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILS 191

Query: 255 LPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGD---SKTPGLSYTPFYK--- 305
           LPSQLG   K FS+C L  +F   P  +S+L++      GD   S      +TP  K   
Sbjct: 192 LPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIM------GDLALSAKDDFLFTPMLKSIT 245

Query: 306 NPVGSSSAFGEFYYVGLRQIIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
           NP         FYY+GL  + +G    +  P S     S+GNGG+IVD+G+T+T +  P 
Sbjct: 246 NP--------NFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPF 297

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMAL 420
           + A+       +  Y R+ D+E ++G   CF I    +      LP +   F G  K+ L
Sbjct: 298 YTAILSSLASVI-LYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTL 356

Query: 421 PPEN-YFALVG--NEVL--CLIL------FTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
           P ++ Y+A+    N V+  CL+         D+  G A   GP  +LG FQ+QN  + +D
Sbjct: 357 PKDSCYYAVTAPKNSVVVKCLLFQRMDNDDDDDDVGGA-NNGPGAVLGSFQMQNVEVVYD 415

Query: 470 LANDRFGFAKQKCA 483
           +   R GF  + CA
Sbjct: 416 MEAGRIGFQPKDCA 429


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 138/409 (33%), Positives = 190/409 (46%), Gaps = 71/409 (17%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L + S G Y +S+  GTPP+  +  I DTGS L+W  C     CVD   P  DP++ P++
Sbjct: 81  LVLASEGEYLMSMGIGTPPRYYSA-ILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSY 139

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTA 212
                     + C +P C+ ++ P     C     RN  C       + QY  G    TA
Sbjct: 140 AK--------LPCNSPMCNALYYP----LCY----RN-VC-------VYQYFYGDSANTA 175

Query: 213 GLLLSETLRFPSK----TVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFS 265
           G+L +ET  F +     TVP    GC  L   S    +G+ GFGR   SL SQLG  +FS
Sbjct: 176 GVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFS 235

Query: 266 YCLLSRKFDDAPVSSNLVLD---TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           YCL S     +PV S L      T   +  S    +  TPF  NP          YY+ +
Sbjct: 236 YCLTSFM---SPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNP-----GLPTMYYLNM 287

Query: 323 RQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG---- 377
             I VG + + I P  + +  +DG GGVI+DSGST T++    ++ V + F  Q+G    
Sbjct: 288 TGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLT 347

Query: 378 NYSRAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VL 434
           N +  ADV     L  CF      +K V +PEL   F+ GA M LP ENY  + G+   L
Sbjct: 348 NATSLADV-----LDTCFVWPPPPRKIVTMPELAFHFE-GANMELPLENYMLIDGDTGNL 401

Query: 435 CL-ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           CL I  +D+ +          I+G FQ QNF++ +D  N    F    C
Sbjct: 402 CLAIAASDDGS----------IIGSFQHQNFHVLYDNENSLLSFTPATC 440


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 128/401 (31%), Positives = 183/401 (45%), Gaps = 59/401 (14%)

Query: 102 GGYSISLSFGTPP---QASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF 155
           G Y ++L+ GTPP   QA    I DTGS L+W    PCTS+  C     P  +PS    F
Sbjct: 30  GEYLMALAIGTPPLPYQA----IADTGSDLIWTQCAPCTSQ--CFRQPTPLYNPSSSTTF 83

Query: 156 --IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAG 213
             +P  SS S         C+        +   GC         AC +Y + YG G+T+ 
Sbjct: 84  AVLPCNSSLS--------VCAAALAGTGTAPPPGC---------AC-TYNVTYGSGWTSV 125

Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKF 264
              SET  F S       VP    GCS  S        +G+ G GR   SL SQLG+ KF
Sbjct: 126 FQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKF 185

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYCL    + D   +S L+L  GP +  + T G+S TPF  +P  S++    FYY+ L  
Sbjct: 186 SYCL--TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASP--STAPMNTFYYLNLTG 239

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I +G+  + IP       +DG GG+I+DSG+T T +    ++ V    +  +       D
Sbjct: 240 ISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV--TLPTTD 297

Query: 385 VEKKSGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
               +GL  CF +    S    +P + L F  GA M LP ++Y     + + CL +    
Sbjct: 298 GSADTGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDSGLWCLAMQNQT 356

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   G   ILG++Q QN ++ +D+  +   FA  KC+
Sbjct: 357 -------DGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 390


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 148/448 (33%), Positives = 200/448 (44%), Gaps = 75/448 (16%)

Query: 49  DSDPLKILHSLASS----SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
           D+  +K L SLA++    +L+RAR               G  +S+S+I         G Y
Sbjct: 103 DAARVKSLISLAATVGGTNLTRAR---------------GPGFSSSVISGL--AQGSGEY 145

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
              L  GTP +     + DTGS +VW  C     C+ C +   DP     F P +S S  
Sbjct: 146 FTRLGVGTPARY-VYMVLDTGSDIVWIQCAP---CIKC-YSQTDP----VFDPTKSRSFA 196

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP 223
            I C +P C  +  P       GCS + + C      Y + YG G FT G   +ETL F 
Sbjct: 197 NIPCGSPLCRRLDYP-------GCSTKKQICL-----YQVSYGDGSFTVGEFSTETLTFR 244

Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
              V   + GC   ++      AG+ G GR   S PSQ+G +   KFSYCL  R     P
Sbjct: 245 GTRVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRP 304

Query: 278 VSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IP 335
             S++V       GDS     + +TP   NP         FYYV L  I VG   V  I 
Sbjct: 305 --SSIVF------GDSAISRTTRFTPLLSNP-----KLDTFYYVELLGISVGGTRVSGIS 351

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S     S GNGGVI+DSG++ T +    + A+   F+    N  RA +    S    CF
Sbjct: 352 ASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEF---SLFDTCF 408

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           D+SGK  V +P ++L F+ GA + LP  NY   V N       F   A+G +       I
Sbjct: 409 DLSGKTEVKVPTVVLHFR-GADVPLPASNYLIPVDNSGSFCFAFAGTASGLS-------I 460

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +G+ Q Q F + +DLA  R GFA + CA
Sbjct: 461 IGNIQQQGFRVVYDLATSRVGFAPRGCA 488


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 125/402 (31%), Positives = 178/402 (44%), Gaps = 53/402 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + L  G PPQ S   I DTGS LVW  C++   C +C+  +  P+ +  F P+ SS
Sbjct: 82  GQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSA---CRNCS--HHSPATV--FFPRHSS 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +     C +P C  +  P+    C      N T   +   Y   Y  G  T+GL   ET 
Sbjct: 134 TFSPAHCYDPVCRLVPKPDRAPIC------NHTRIHSTCHYEYGYADGSLTSGLFARETT 187

Query: 221 RFPSKT-----VPNFLAGCSILSDRQPA---------GIAGFGRSSESLPSQLGLK---K 263
              + +     + +   GC      Q           G+ G GR   S  SQLG +   K
Sbjct: 188 SLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK 247

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL+       P +S L++  G G G SK   L +TP   NP+  +     FYYV L+
Sbjct: 248 FSYCLMDYTLSPPP-TSYLIIGNG-GDGISK---LFFTPLLTNPLSPT-----FYYVKLK 297

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            + V    ++I  S       GNGG +VDSG+T  F+  P + +V     R++      A
Sbjct: 298 SVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK--LPIA 355

Query: 384 DVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
           D     G   C ++SG  K    LP L  +F GGA    PP NYF     ++ CL +   
Sbjct: 356 DA-LTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAI--- 411

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +  P +G     ++G+   Q F  EFD    R GF+++ CA
Sbjct: 412 QSVDPKVGFS---VIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 144/458 (31%), Positives = 193/458 (42%), Gaps = 68/458 (14%)

Query: 45  LHH-----SDSDPLKILHSLASSSLSRARHLKT---KTKPKTKDSNIGSNYSNSLIKTPL 96
           LHH     SD  P  + +S  +   SR + L +         +    G  +S+S+  T  
Sbjct: 82  LHHLDALSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSV--TSG 139

Query: 97  SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
                G Y   L  GTP +     + DTGS +VW  C    +C    +   DP     F 
Sbjct: 140 LAQGSGEYFTRLGVGTPARY-VFMVLDTGSDVVWIQCAPCKKC----YSQTDP----VFN 190

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLL 215
           P +S S   I C +P C  +  P       GCS +   C      Y + YG G FT G  
Sbjct: 191 PTKSRSFANIPCGSPLCRRLDSP-------GCSTKKHIC-----LYQVSYGDGSFTYGEF 238

Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLK---KFSYC 267
            +ETL F    V     GC    D +   I   G         S PSQ+G +   KFSYC
Sbjct: 239 STETLTFRGTRVGRVALGCG--HDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYC 296

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQII 326
           L+ R     P  S +V       GDS     + +TP   NP         FYYV L  + 
Sbjct: 297 LVDRSASSKP--SYMVF------GDSAISRTARFTPLVSNP-----KLDTFYYVELLGVS 343

Query: 327 VGSKHV-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
           VG   V  I  S     S GNGGVI+DSG++ T +  P + A+   F     N  RA + 
Sbjct: 344 VGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEF 403

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
              S    CFD+SGK  V +P ++L F+ GA ++LP  NY   V N       F    +G
Sbjct: 404 ---SLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPASNYLIPVDNSGSFCFAFAGTMSG 459

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +       I+G+ Q Q F + +DLA  R GFA + CA
Sbjct: 460 LS-------IVGNIQQQGFRVVYDLAASRVGFAPRGCA 490


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 168/370 (45%), Gaps = 54/370 (14%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS + W  C     C DC +   DP     + P  S+S   +GC +P+C  +   
Sbjct: 178 MVLDTGSDVTWLQCQP---CADC-YAQSDP----VYDPSVSTSYATVGCDSPRCRDL--- 226

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
                   C     +C      Y + YG G +T G   +ETL    S  V N   GC   
Sbjct: 227 ----DAAACRNSTGSCL-----YEVAYGDGSYTVGDFATETLTLGDSAPVSNVAIGCG-- 275

Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
            D +   +   G  +      S PSQ+    FSYCL+ R   D+P SS L        GD
Sbjct: 276 HDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSSTLQF------GD 326

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
           S+ P ++  P  ++P  ++     FYYV L  I VG + + IP S       G+GGVIVD
Sbjct: 327 SEQPAVT-APLIRSPRTNT-----FYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVD 380

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T ++   + A+ + F++   +  RA+ V   S    C+D++G+ SV +P + L F
Sbjct: 381 SGTAVTRLQSGAYGALREAFVQGTQSLPRASGV---SLFDTCYDLAGRSSVQVPAVALWF 437

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
           +GG ++ LP +NY   V       + F   +       GP  I+G+ Q Q   + FD A 
Sbjct: 438 EGGGELKLPAKNYLIPVDAAGTYCLAFAGTS-------GPVSIIGNVQQQGVRVSFDTAK 490

Query: 473 DRFGFAKQKC 482
           +  GF   KC
Sbjct: 491 NTVGFTADKC 500


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 124/399 (31%), Positives = 176/399 (44%), Gaps = 50/399 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP+  +  I DTGS L W  C   Y C   N    DP        K S+
Sbjct: 168 GEYFMDVLVGSPPKHFS-LILDTGSDLNWIQCLPCYDCFQQNGAFYDP--------KASA 218

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S + I C + +C+ +  P+    CK     N++CP     Y   YG    T G    ET 
Sbjct: 219 SYKNITCNDQRCNLVSSPDPPMPCKS---DNQSCP-----YYYWYGDSSNTTGDFAVETF 270

Query: 221 RFPSKT---------VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
                T         V N + GC   +       AG+ G GR   S  SQL       FS
Sbjct: 271 TVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFS 330

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL+ R   D  VSS L+   G        P L++T F     G  +    FYYV ++ I
Sbjct: 331 YCLVDRN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV---AGKENLVDTFYYVQIKSI 384

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD 384
           +V  + + IP       SDG GG I+DSG+T ++   P +E +  +   +  G Y    D
Sbjct: 385 LVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRD 444

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
                 L PCF++SG  +V LPEL + F  GA    P EN F  +  +++CL +      
Sbjct: 445 FPI---LDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAML----- 496

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                +    I+G++Q QNF++ +D    R G+A  KCA
Sbjct: 497 --GTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 158/469 (33%), Positives = 212/469 (45%), Gaps = 75/469 (15%)

Query: 35  PLTPLSTKHYLHHSDS-----DPLKILHSLASSSLSRAR---HLKTKTKPKTKDSNIGSN 86
           P T LS    LHH D+      P ++ H       +R +   HL   T  KT+ +N GS 
Sbjct: 60  PTTSLS----LHHIDALSFNKTPSQLFHLRLERDAARVKTLTHLAAATN-KTRPANPGSG 114

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCN 143
           +S+S++   LS  S G Y   L  GTPP+     + DTGS +VW    PCT  Y   D  
Sbjct: 115 FSSSVVSG-LSQGS-GEYFTRLGVGTPPKYLY-MVLDTGSDVVWLQCKPCTKCYSQTD-- 169

Query: 144 FPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
                      F P +S S   I C +P C  +  P       GCS +N  C      Y 
Sbjct: 170 ---------QIFDPSKSKSFAGIPCYSPLCRRLDSP-------GCSLKNNLC-----QYQ 208

Query: 204 LQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQL 259
           + YG G FT G   +ETL F    VP    GC   ++      AG+ G GR   S P+Q 
Sbjct: 209 VSYGDGSFTFGDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQT 268

Query: 260 GLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFG 315
           G +   KFSYCL  R     P  S++V       GDS     + +TP  KNP        
Sbjct: 269 GTRFNNKFSYCLTDRTASAKP--SSIVF------GDSAVSRTARFTPLVKNP-----KLD 315

Query: 316 EFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
            FYYV L  I VG   V+ I  S+    S GNGGVI+DSG++ T +  P + ++   F  
Sbjct: 316 TFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRV 375

Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
              +  RA +    S    C+D+SG   V +P ++L F+G A ++LP  NY   V N   
Sbjct: 376 GASHLKRAPEF---SLFDTCYDLSGLSEVKVPTVVLHFRG-ADVSLPAANYLVPVDNSGS 431

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               F    +G +       I+G+ Q Q F + FDLA  R GFA + CA
Sbjct: 432 FCFAFAGTMSGLS-------IIGNIQQQGFRVVFDLAGSRVGFAPRGCA 473


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 129/406 (31%), Positives = 185/406 (45%), Gaps = 42/406 (10%)

Query: 90  SLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDP 149
           S    P++  S  G+S+++  GTPPQ  T  I DTGS L+W  C+   R       +   
Sbjct: 70  SAADVPVAPLSDQGHSLTVGIGTPPQPRT-LIVDTGSDLIWTQCSMLSRRTR-TAASASR 127

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
            R P + P+RSSS   + C +  C        +   K C+ RN  C      Y   YG  
Sbjct: 128 QREPLYEPRRSSSFAYLPCSDRLCQ-----EGQFSYKNCA-RNNRC-----MYDELYGSA 176

Query: 210 FTAGLLLSETLRF--PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKF 264
              G+L SET  F   +K       GC  LS       +G+ G      SL SQL + +F
Sbjct: 177 EAGGVLASETFTFGVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRF 236

Query: 265 SYCLL---SRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYV 320
           SYCL     RK      +S L+          +T G +  T   +NP   ++    +YYV
Sbjct: 237 SYCLTPFAERK------TSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETA----YYYV 286

Query: 321 GLRQIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L  + +G+K + +P + L +   DG+GG IVDSGST +++E   F AV K  +  +   
Sbjct: 287 PLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLP 346

Query: 380 SRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
                 E       CF +      ++V  P L+L F GGA M LP +NYF      ++CL
Sbjct: 347 VANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCL 406

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            + T        G G +II G+ Q QN ++ FD+ N +F FA  KC
Sbjct: 407 AVGTSPD-----GFGVSII-GNVQQQNMHVLFDVRNQKFSFAPTKC 446


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 128/400 (32%), Positives = 179/400 (44%), Gaps = 52/400 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP+  +  I DTGS L W  C   + C   N    DP        K S+
Sbjct: 153 GEYFMDVLVGSPPKHFS-LILDTGSDLNWIQCLPCHDCFQQNGAFYDP--------KASA 203

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S + I C +P+C+ +  P+    CK     N++CP     Y   YG    T G    ET 
Sbjct: 204 SYKNITCNDPRCNLVSPPDPPKPCKS---DNQSCP-----YYYWYGDSSNTTGDFAVETF 255

Query: 221 RFPSKT---------VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
                T         V N + GC   +       AG+ G GR   S  SQL       FS
Sbjct: 256 TVNLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFS 315

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL+ R   D  VSS L+   G        P L++T F        +    FYYV ++ I
Sbjct: 316 YCLVDRN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFVAR---KENLVDTFYYVQIKSI 369

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD 384
           IV  + + IP       SDG GG I+DSG+T ++   P +E +  +   +  G Y    D
Sbjct: 370 IVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRD 429

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL-ILFTDNA 443
                 L PCF++SG  S+ LPEL + F  GA    P EN F  +  +++CL IL T  +
Sbjct: 430 FPI---LDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKS 486

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A          I+G++Q QNF++ +D    R G+A  KCA
Sbjct: 487 AFS--------IIGNYQQQNFHILYDTKRSRLGYAPTKCA 518


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 136/447 (30%), Positives = 191/447 (42%), Gaps = 53/447 (11%)

Query: 57  HSLASSSLSR---ARH--LKTKTKPKTKDSNIGSNYSN-----SLIKTPLSVHSYGGYSI 106
           H  A SSLSR    RH    +KT+     + +    SN     S     LS  S  G+S+
Sbjct: 34  HPYAGSSLSRHDVVRHGARASKTRAAWLTAKLAGVLSNRRGGVSPADVRLSPLSDQGHSL 93

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           ++  GTPPQ     I DTGS L+W  C    +              P + P  SS+   +
Sbjct: 94  TVGIGTPPQPRK-LIVDTGSDLIWTQC----KLSSSTAVAARHGSPPVYDPGESSTFAFL 148

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
            C +  C        +   K C+ +N+        Y   YG     G+L SET  F ++ 
Sbjct: 149 PCSDRLCQ-----EGQFSFKNCTSKNRCV------YEDVYGSAAAVGVLASETFTFGARR 197

Query: 227 VPNFLAG--CSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
             +   G  C  LS        GI G    S SL +QL +++FSYCL    F D   S  
Sbjct: 198 AVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLT--PFADKKTSPL 255

Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
           L       S    T  +  T    NPV +      +YYV L  I +G K + +P + L  
Sbjct: 256 LFGAMADLSRHKTTRPIQTTAIVSNPVKTV-----YYYVPLVGISLGHKRLAVPAASLAM 310

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI---- 397
             DG GG IVDSGST  ++    FEAV KE +  +     A    +   L  CF +    
Sbjct: 311 RPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL--CFVLPRRT 367

Query: 398 --SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
             +  ++V +P L+L F GGA M LP +NYF      ++CL      A G         I
Sbjct: 368 AAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCL------AVGKTTDGSGVSI 421

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +G+ Q QN ++ FD+ + +F FA  +C
Sbjct: 422 IGNVQQQNMHVLFDVQHHKFSFAPTQC 448


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 136/457 (29%), Positives = 203/457 (44%), Gaps = 55/457 (12%)

Query: 42  KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
           + +L  ++ D ++I      ++ S    +   + P+       S    + +++ ++V S 
Sbjct: 92  ESFLDKAEKDAVRIETMHRRAARSGVARMPASSSPR----RALSERMVATVESGVAVGS- 146

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y I +  GTPP+     I DTGS L W  C     C+DC        R P F P  SS
Sbjct: 147 GEYLIDVYVGTPPRRFR-MIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASS 197

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S + + C + +C  +  P     C+   P   +CP     Y   YG    T G L  E+ 
Sbjct: 198 SYRNVTCGDQRCGLVAPPEAPRACR--RPAEDSCP-----YYYWYGDQSNTTGDLALESF 250

Query: 221 RF------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
                    S+ V   + GC   +       AG+ G GR   S  SQL       FSYCL
Sbjct: 251 TVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCL 310

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
           +    D     S +V   G        P L YT F      +SS    FYYV L+ ++VG
Sbjct: 311 VEHGSD---AGSKVVF--GEDYLVLAHPQLKYTAF----APTSSPADTFYYVKLKGVLVG 361

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEK 387
              + I       G DG+GG I+DSG+T ++   P ++ + + F+  M   Y    D   
Sbjct: 362 GDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPV 421

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNAAGP 446
              L PC+++SG +   +PEL L F  GA    P ENYF  L  + ++CL +      G 
Sbjct: 422 ---LNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGM 478

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +       I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 479 S-------IIGNFQQQNFHVVYDLQNNRLGFAPRRCA 508


>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 129/413 (31%), Positives = 184/413 (44%), Gaps = 71/413 (17%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS----------------SSQL 165
            DTGS LVWFPC   + C+ C    + PS  P      ++                SS L
Sbjct: 98  LDTGSDLVWFPCRP-FTCILCESKPLPPSPPPTLSSSATTVSCSSPSCSAAHSSLPSSDL 156

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK 225
               N    +I           C+    T    CP +   YG G     L S++L  PS 
Sbjct: 157 CAISNCPLDYI-------ETGDCN----TSSYPCPPFYYAYGDGSLVAKLFSDSLSLPSV 205

Query: 226 TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAPVS 279
           +V NF  GC+  +  +P G+AGFGR   SLP+QL +        FSYCL+S  FD   V 
Sbjct: 206 SVANFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRVR 265

Query: 280 --SNLVLDTGPGSGDSKTPG----------------LSYTPFYKNPVGSSSAFGEFYYVG 321
             S L+L       + +                     +T    NP         FY V 
Sbjct: 266 RPSPLILGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNP-----KHPYFYSVS 320

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YS 380
           L+ I +G +++  P        +G GGV+VDSG+TFT +    + +V +EF  ++G  + 
Sbjct: 321 LQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHE 380

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYFALVGN-------- 431
           RA  VE  SG+ PC+ ++  ++V +P L+L F G G+ + LP  NYF    +        
Sbjct: 381 RADRVEPSSGMSPCYYLN--QTVKVPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEEK 438

Query: 432 -EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +V CL+L          G G   ILG++Q Q F + +DL N R GFAK+KCA
Sbjct: 439 RKVGCLMLMNGGDESELRG-GTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCA 490


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 134/391 (34%), Positives = 181/391 (46%), Gaps = 55/391 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTPP+     + DTGS +VW  C    +C    +   D    P F PK+S 
Sbjct: 145 GEYFTRLGVGTPPKY-VYMVLDTGSDVVWIQCAPCRKC----YSQTD----PVFDPKKSG 195

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   I C++P C  +  P       GC+ R ++C      Y + YG G FT G   +ETL
Sbjct: 196 SFSSISCRSPLCLRLDSP-------GCNSR-QSCL-----YQVAYGDGSFTFGEFSTETL 242

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
            F    VP    GC   ++      AG+ G GR   S P+Q GL   +KFSYCL+ R   
Sbjct: 243 TFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSAS 302

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK- 333
             P  S++V      S  S+T    +TP   NP         FYY+ L  I VG   V  
Sbjct: 303 SKP--SSVVFGQ---SAVSRTA--VFTPLITNP-----KLDTFYYLELTGISVGGARVAG 350

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           I  S     + GNGGVI+DSG++ T +    + ++   F     +  RA D    S    
Sbjct: 351 ITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDY---SLFDT 407

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDNAAGPALGRGP 452
           CFD+SGK  V +P +++ F+ GA ++LP  NY   V  N V C        AG   G   
Sbjct: 408 CFDLSGKTEVKVPTVVMHFR-GADVSLPATNYLIPVDTNGVFCFAF-----AGTMSGLS- 460

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             I+G+ Q Q F + FD+A  R GFA + CA
Sbjct: 461 --IIGNIQQQGFRVVFDVAASRIGFAARGCA 489


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 138/424 (32%), Positives = 185/424 (43%), Gaps = 73/424 (17%)

Query: 84  GSNYSNSLIKTPLS---VHSYGGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYR 138
           G+  + S +  P+        G Y   +  GTP   +TP   + DTGS +VW  C    R
Sbjct: 119 GTRRTGSGVVAPVVSGLAQGSGEYFTKIGVGTP---ATPALMVLDTGSDVVWLQCAPCRR 175

Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
           C D +           F P+RS S   +GC  P C  +          GC  R K C   
Sbjct: 176 CYDQSGQ--------VFDPRRSRSYGAVGCSAPLCRRL-------DSGGCDLRRKAC--- 217

Query: 199 CPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSE 253
              Y + YG G  TAG   +ETL F     V     GC   ++      AG+ G GR S 
Sbjct: 218 --LYQVAYGDGSVTAGDFATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSL 275

Query: 254 SLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
           S P+Q+  +    FSYCL+ R     P S +  +  G G+  S T   S+TP  KNP   
Sbjct: 276 SFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGS-TVAASFTPMVKNP--- 331

Query: 311 SSAFGEFYYVGLRQIIVGSKHVK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
                 FYYV L  I VG   V  +  S L +  S G GGVIVDSG++ T +  P + A+
Sbjct: 332 --RMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSAL 389

Query: 369 AKEFIRQMGNYSRAADVEKKSGLR----------PCFDISGKKSVYLPELILKFKGGAKM 418
              F        RAA     +GLR           C+D+SG+K V +P + + F GGA+ 
Sbjct: 390 RDAF--------RAA----AAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEA 437

Query: 419 ALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           ALPPENY   V ++      F     G +       I+G+ Q Q F + FD    R GF 
Sbjct: 438 ALPPENYLIPVDSKGTFCFAFAGTDGGVS-------IIGNIQQQGFRVVFDGDGQRVGFV 490

Query: 479 KQKC 482
            + C
Sbjct: 491 PKGC 494


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 130/393 (33%), Positives = 183/393 (46%), Gaps = 57/393 (14%)

Query: 98  VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           VH+  G + ++L+ GTP +  +  I DTGS L+W  C     C D       P+  P F 
Sbjct: 90  VHAGNGEFLMNLAIGTPAETYSA-IMDTGSDLIWTQCKPCKVCFD------QPT--PIFD 140

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           P++SSS   + C +  C  +    + S   GC             Y   YG    T G+L
Sbjct: 141 PEKSSSFSKLPCSSDLCVAL---PISSCSDGCE------------YRYSYGDHSSTQGVL 185

Query: 216 LSETLRFPSKTVPNFLAGCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
            +ET  F   +V     GC    +R     Q AG+ G GR   SL SQLG+ KFSYCL S
Sbjct: 186 ATETFTFGDASVSKIGFGCG-EDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS 244

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
              DD+   S L++ +   +  S  P    TP  +NP   S     FYY+ L  I VG  
Sbjct: 245 --IDDSKGISTLLVGS-EATVKSAIP----TPLIQNPSRPS-----FYYLSLEGISVGDT 292

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  S      DG+GG+I+DSG+T T+++   F A+ KEFI QM       D    + 
Sbjct: 293 LLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQM---KLDVDASGSTE 349

Query: 391 LRPCFDISGKKS-VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
           L  CF +    S V +P+L+  F+ G  + LP ENY  ++ +  L +I  T    G + G
Sbjct: 350 LELCFTLPPDGSPVEVPQLVFHFE-GVDLKLPKENY--IIEDSALRVICLT---MGSSSG 403

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I G+FQ QN  +  DL  +   FA  +C
Sbjct: 404 MS---IFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 130/393 (33%), Positives = 183/393 (46%), Gaps = 57/393 (14%)

Query: 98  VHS-YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           VH+  G + ++L+ GTP +  +  I DTGS L+W  C     C D       P+  P F 
Sbjct: 90  VHAGNGEFLMNLAIGTPAETYSA-IMDTGSDLIWTQCKPCKVCFD------QPT--PIFD 140

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           P++SSS   + C +  C  +    + S   GC             Y   YG    T G+L
Sbjct: 141 PEKSSSFSKLPCSSDLCVAL---PISSCSDGCE------------YRYSYGDHSSTQGVL 185

Query: 216 LSETLRFPSKTVPNFLAGCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
            +ET  F   +V     GC    +R     Q AG+ G GR   SL SQLG+ KFSYCL S
Sbjct: 186 ATETFTFGDASVSKIGFGCG-EDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS 244

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
              DD+   S L++ +   +  S  P    TP  +NP   S     FYY+ L  I VG  
Sbjct: 245 --IDDSKGISTLLVGS-EATVKSAIP----TPLIQNPSRPS-----FYYLSLEGISVGDT 292

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  S      DG+GG+I+DSG+T T+++   F A+ KEFI QM       D    + 
Sbjct: 293 LLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQM---KLDVDASGSTE 349

Query: 391 LRPCFDISGKKS-VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
           L  CF +    S V +P+L+  F+ G  + LP ENY  ++ +  L +I  T    G + G
Sbjct: 350 LELCFTLPPDGSPVDVPQLVFHFE-GVDLKLPKENY--IIEDSALRVICLT---MGSSSG 403

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I G+FQ QN  +  DL  +   FA  +C
Sbjct: 404 MS---IFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 119/400 (29%), Positives = 176/400 (44%), Gaps = 43/400 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPC-TSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           G Y +S++FGTPPQ     I DTGS L+W  C T+      C  P    SR PAF+  +S
Sbjct: 52  GQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPPAFC--PKKACSRRPAFVASKS 108

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
           ++  ++ C   +C  +  P        CSP     P+ C  Y   Y  G  T G L  +T
Sbjct: 109 ATLSVVPCSAAQCLLV--PAPRGHGPSCSP---AAPVPC-GYAYDYADGSSTTGFLARDT 162

Query: 220 LRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLG---LKKFSYC 267
               + T     V     GC   +         G+ G G+   S P+Q G    + FSYC
Sbjct: 163 ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYC 222

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
           LL  +      SS+ +    P     +    +YTP   NP+  +     FYYVG+  I V
Sbjct: 223 LLDLEGGRRGRSSSFLFLGRP----ERRAAFAYTPLVSNPLAPT-----FYYVGVVAIRV 273

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G++ + +P S       GNGG ++DSGST T++    +  +   F   +      +    
Sbjct: 274 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF 333

Query: 388 KSGLRPCFDISGKKSVY-----LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
             GL  C+++S   S+       P L + F  G  + LP  NY   V ++V CL      
Sbjct: 334 FQGLELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCL------ 387

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           A  P L      +LG+   Q +++EFD A+ R GFA+ +C
Sbjct: 388 AIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 427


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 120/412 (29%), Positives = 172/412 (41%), Gaps = 82/412 (19%)

Query: 92  IKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
           I+ PL   +    G Y   +  G P +     + DTGS + W  CT    C DC      
Sbjct: 133 IEAPLISGTTQGSGEYFTRVGIGKPAR-EVYMVLDTGSDVNWLQCTP---CADCYHQTE- 187

Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
               P F P  SSS + + C  P+C+ +      S C     RN TC      Y + YG 
Sbjct: 188 ----PIFEPSSSSSYEPLSCDTPQCNAL----EVSEC-----RNATCL-----YEVSYGD 229

Query: 209 G-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESL------------ 255
           G +T G   +ETL   S  V N   GC              G S+E L            
Sbjct: 230 GSYTVGDFATETLTIGSTLVQNVAVGC--------------GHSNEGLFVGAAGLLGLGG 275

Query: 256 -----PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
                PSQL    FSYCL+ R  D A              G S +P     P  +N    
Sbjct: 276 GLLALPSQLNTTSFSYCLVDRDSDSASTVD---------FGTSLSPDAVVAPLLRN---- 322

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
                 FYY+GL  I VG + ++IP S       G+GG+I+DSG+  T ++  ++ ++  
Sbjct: 323 -HQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRD 381

Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
            F++   +  +AA V   +    C+++S K +V +P +   F GG  +ALP +NY   V 
Sbjct: 382 SFVKGTLDLEKAAGV---AMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVD 438

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +     + F   A+  A       I+G+ Q Q   + FDLAN   GF+  KC
Sbjct: 439 SVGTFCLAFAPTASSLA-------IIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 126/396 (31%), Positives = 179/396 (45%), Gaps = 47/396 (11%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + +  GTPP+     I DTGS L W  C     C+DC        R P F P  SSS 
Sbjct: 146 YLMDVYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASSSY 196

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
           + + C +P+C  +  P   +      P    CP     Y   YG    + G L  E+   
Sbjct: 197 RNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCP-----YYYWYGDQSNSTGDLALESFTV 251

Query: 223 ------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQL----GLKKFSYCLL 269
                  S  V   + GC   +       AG+ G GR   S  SQL    G   FSYCL+
Sbjct: 252 NLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLV 311

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
               D   V+S +V         +  P L YT F      +SS    FYYV L  ++VG 
Sbjct: 312 DHGSD---VASKVVFGEDDALALAAHPRLKYTAFAP----ASSPADTFYYVRLTGVLVGG 364

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKK 388
           + + I          G+GG I+DSG+T ++   P ++ + + FI +M G+Y    D    
Sbjct: 365 ELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPV- 423

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPA 447
             L PC+++SG +   +PEL L F  GA    P ENYF  L  + ++CL +      G +
Sbjct: 424 --LSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMS 481

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                  I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 482 -------IIGNFQQQNFHVAYDLHNNRLGFAPRRCA 510


>gi|15450651|gb|AAK96597.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 110

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 64/109 (58%), Positives = 82/109 (75%), Gaps = 1/109 (0%)

Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVL 434
           M NY+R  D+EK++GL PCF+ISGK  V +PELI +FKGGAK+ LP  NYF  VGN + +
Sbjct: 1   MSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTV 60

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           CL + +D    P+ G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 61  CLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 109


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 142/490 (28%), Positives = 210/490 (42%), Gaps = 66/490 (13%)

Query: 14  LLILLF-TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKT 72
           L +L+F    A   S AA+V V LT +          SDP          +L R  H + 
Sbjct: 27  LAVLVFLVVCATLASGAASVRVGLTRI---------HSDPDTTAPQFVRDALRRDMH-RQ 76

Query: 73  KTKPKTKDSNIGSNYSNSLIKTPLSVHSY------GGYSISLSFGTPPQASTPF--IFDT 124
           +++   +D +     S+    T +S  +       G Y ++L+ GTPP    P+  + DT
Sbjct: 77  RSRSFGRDRDRELAESDGRTSTTVSARTRKDLPNGGEYLMTLAIGTPP---LPYAAVADT 133

Query: 125 GSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
           GS L+W       +C  C     +    P + P  S++  ++ C +            S 
Sbjct: 134 GSDLIW------TQCAPCGTQCFE-QPAPLYNPASSTTFSVLPCNSSL----------SM 176

Query: 185 CKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSD 239
           C G        P     Y   YG G+TAG+  SET  F S       VP    GCS  S 
Sbjct: 177 CAGALAGAAPPPGCACMYYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASS 236

Query: 240 RQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
                 AG+ G GR S SL SQLG  +FSYCL    F D   +S L+L  GP +  + T 
Sbjct: 237 SDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLL--GPSAALNGT- 291

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
           G+  TPF  +P  + +    +YY+ L  I +G+K + I         DG GG+I+DSG+T
Sbjct: 292 GVRSTPFVASP--ARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTT 349

Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS---VYLPELILKFK 413
            T +    ++ V      Q+       D    +GL  CF +    S     LP + L F 
Sbjct: 350 ITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD 409

Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
            GA M LP ++Y  + G+ V CL +            G     G++Q QN ++ +D+  +
Sbjct: 410 -GADMVLPADSYM-ISGSGVWCLAMRNQT-------DGAMSTFGNYQQQNMHILYDVREE 460

Query: 474 RFGFAKQKCA 483
              FA  KC+
Sbjct: 461 TLSFAPAKCS 470


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 128/468 (27%), Positives = 203/468 (43%), Gaps = 76/468 (16%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG- 103
           L H DS      H L    ++R+         K + +++ S+  ++ +  P+    +GG 
Sbjct: 40  LTHVDSGRGFTKHELLRRMVARS---------KARLASLRSSACDTALTAPVD---HGGS 87

Query: 104 ------YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
                 Y I L  GTP         DTGS LVW  C     C D          +P F  
Sbjct: 88  DVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTV-CFD--------QPVPVFRA 138

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL---GFTAGL 214
             S +   + C +P C    G  V     GC+ R+++C          YG      T G 
Sbjct: 139 SVSHTFSRVPCSDPLC----GHAVYLPLSGCAARDRSC-------FYAYGYMDHSITTGK 187

Query: 215 LLSETLRFPS-------KTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKK 263
           +  +T  F +         VPN   GC +++        +GIAGFG    SLPSQL +++
Sbjct: 188 MAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRR 247

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGL 322
           FSYC  +   +++ VS  ++L   P + ++   G +  TPF   P G+      FY++ L
Sbjct: 248 FSYCFTA--MEESRVSP-VILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSL 304

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           R + VG   +    S      DG+GG  +DSG+  TF    +F ++ + F+ Q+      
Sbjct: 305 RGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAK 364

Query: 383 ADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYF-------ALVGNEVL 434
              +  + L  CF +  KK    +P+LIL  + GA   LP ENY        +  G ++ 
Sbjct: 365 GYTDPDNLL--CFSVPAKKKAPAVPKLILHLE-GADWELPRENYVLDNDDDGSGAGRKLC 421

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +IL   N+ G         I+G+FQ QN ++ +DL +++  FA  +C
Sbjct: 422 VVILSAGNSNG--------TIIGNFQQQNMHIVYDLESNKMVFAPARC 461


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 118/401 (29%), Positives = 184/401 (45%), Gaps = 57/401 (14%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           +S +++ G+S+++  GTPPQ S   I D GS L+W  C+     V      ++P     F
Sbjct: 99  ISPYAHQGHSLTVGVGTPPQPSK-VILDLGSDLLWTQCS----LVGPTAKQLEP----VF 149

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
              RSSS  ++ C +              C+  +  NKTC     +Y   YG+    G+L
Sbjct: 150 DAARSSSFSVLPCDS------------KLCEAGTFTNKTCTDRKCAYENDYGIMTATGVL 197

Query: 216 LSETLRFPSK--TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLL- 269
            +ET  F +      N   GC  L++    + +GI G      S+  QL + KFSYCL  
Sbjct: 198 ATETFTFGAHHGVSANLTFGCGKLANGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTP 257

Query: 270 --SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT-PFYKNPVGSSSAFGEFYYVGLRQII 326
              RK   +PV    + D G      KT G   T P  KNPV        +YYV +  + 
Sbjct: 258 FADRK--TSPVMFGAMADLG----KYKTTGKVQTIPLLKNPVEDI-----YYYVPMVGMS 306

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR--QMGNYSRAAD 384
           VGSK + +P   L    DG GG ++DS +T  ++  P F  + K  +   ++   +R+ D
Sbjct: 307 VGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVD 366

Query: 385 VEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
                    CF++      + V +P L+L F G A+M+LP +NYF      ++CL +   
Sbjct: 367 -----DYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQEPSPGMMCLAVMQ- 420

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                A   G   ++G+ Q QN ++ +D+ N +F +A  KC
Sbjct: 421 -----APFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 131/429 (30%), Positives = 192/429 (44%), Gaps = 59/429 (13%)

Query: 64  LSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH-SYGGYSISLSFGTPPQASTPFIF 122
           + RA     +   K + ++  + +    I+TP++     G Y I ++ GTP   S   I 
Sbjct: 1   MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPA-LSLSAIM 59

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           DTGS LVW  C     C DC+  ++      +   K    S L  CQ P    IF  N +
Sbjct: 60  DTGSDLVWTKCNP---CTDCSTSSIYDPSSSSTYSKVLCQSSL--CQPPS---IFSCNND 111

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ 241
             C+               Y+  YG    T+G+L  ET    S+++PN   GC    D Q
Sbjct: 112 GDCE---------------YVYPYGDRSSTSGILSDETFSISSQSLPNITFGCG--HDNQ 154

Query: 242 ----PAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
                 G+ GFGR S SL SQLG     KFSYCL+SR   D+  +S L +          
Sbjct: 155 GFDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRT--DSSKTSPLFI--------GN 204

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
           T  L  T     P+  SS+    YY+ L  I VG + + IP       SDG+GG+I+DSG
Sbjct: 205 TASLEATTVGSTPLVQSSSTNH-YYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSG 263

Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
           +T TF++   ++AV +  +  + N  +A        L  CF+  G  +   P +   FK 
Sbjct: 264 TTLTFLQQTAYDAVKEAMVSSI-NLPQA-----DGQLDLCFNQQGSSNPGFPSMTFHFK- 316

Query: 415 GAKMALPPENY-FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
           GA   +P ENY F    ++++CL +   N+       G   I G+ Q QN+ + +D  N+
Sbjct: 317 GADYDVPKENYLFPDSTSDIVCLAMMPTNS-----NLGNMAIFGNVQQQNYQILYDNENN 371

Query: 474 RFGFAKQKC 482
              FA   C
Sbjct: 372 VLSFAPTAC 380


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 131/396 (33%), Positives = 172/396 (43%), Gaps = 48/396 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAF--IPK 158
           G Y ++L+ GTPPQ S P I DTGS LVW  C     RC     P  +PS  P F  +P 
Sbjct: 90  GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLP- 147

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLS 217
              SS L  C             E+R  G +P     P  C   Y   YG G+T+GL  S
Sbjct: 148 --CSSALNLCA-----------AEARLAGATP-----PPGCACRYNQTYGTGWTSGLQGS 189

Query: 218 ETLRFPSK-----TVPNFLAGCSILSDRQPAGIAGFGRSSESLP---SQLGLKKFSYCLL 269
           ET  F S       VP    GCS  S     G AG            SQL    FSYCL 
Sbjct: 190 ETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL- 248

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
              F D    S L+L     +      G+  TPF  +P  S      +YY+ L  I VG+
Sbjct: 249 -TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSP--SKPPMSTYYYLNLTGISVGA 305

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + IP       +DG GG+I+DSG+T T +    ++ V +  +R +       D    +
Sbjct: 306 AALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLP-VTDGSNAT 363

Query: 390 GLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
           GL  CF +  S      LP + L F GGA M LP ENY  L G  + CL + +       
Sbjct: 364 GLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGG-MWCLAMRSQT----- 417

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              G    LG++Q QN ++ +D+  +   FA  KC+
Sbjct: 418 --DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 131/470 (27%), Positives = 200/470 (42%), Gaps = 63/470 (13%)

Query: 29  AATVTVPLTPLSTKHYLHHSDSDP---LKILHSLASSSLSRARHLKTKTKPKTKDSNIGS 85
            + +  P +  S    LHH    P   L++      S  +  ++   K   K  +  + S
Sbjct: 15  VSAIVAPTSSTSRGTLLHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRS 74

Query: 86  N----YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
                 S+S I+TP+     G Y ++++ GTP  +S   I DTGS L+W  C    +C  
Sbjct: 75  INAMLQSSSGIETPVYAGD-GEYLMNVAIGTP-DSSFSAIMDTGSDLIWTQCEPCTQCFS 132

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
                      P F P+ SSS   + C++  C  +               ++TC      
Sbjct: 133 --------QPTPIFNPQDSSSFSTLPCESQYCQDL--------------PSETCNNNECQ 170

Query: 202 YLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSES 254
           Y   YG G  T G + +ET  F + +VPN   GC    D Q       AG+ G G    S
Sbjct: 171 YTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCG--EDNQGFGQGNGAGLIGMGWGPLS 228

Query: 255 LPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           LPSQLG+ +FSYC+ S     +   S L L +        +P  +      NP       
Sbjct: 229 LPSQLGVGQFSYCMTSYG---SSSPSTLALGSAASGVPEGSPSTTLIHSSLNPT------ 279

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
             +YY+ L+ I VG  ++ IP S      DG GG+I+DSG+T T++    + AVA+ F  
Sbjct: 280 --YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTD 337

Query: 375 QMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
           Q+   +     E  SGL  CF   S   +V +PE+ ++F GG  + L  +N        V
Sbjct: 338 QI---NLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGV 393

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +CL + + +  G +       I G+ Q Q   + +DL N    F   +C 
Sbjct: 394 ICLAMGSSSQLGIS-------IFGNIQQQETQVLYDLQNLAVSFVPTQCG 436


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  138 bits (347), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 171/412 (41%), Gaps = 82/412 (19%)

Query: 92  IKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
           I+ PL   +    G Y   +  G P +     + DTGS + W  CT    C DC      
Sbjct: 136 IEAPLISGTTQGSGEYFTRVGIGNPAR-EVYMVLDTGSDVNWLQCTP---CADCYHQTE- 190

Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
               P F P  SSS + + C  P+C+ +      S C     RN TC      Y + YG 
Sbjct: 191 ----PIFEPSSSSSYEPLSCDTPQCNAL----EVSEC-----RNATCL-----YEVSYGD 232

Query: 209 G-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESL------------ 255
           G +T G   +ETL   S  V N   GC              G S+E L            
Sbjct: 233 GSYTVGDFATETLTIGSTLVQNVAVGC--------------GHSNEGLFVGAAGLLGLGG 278

Query: 256 -----PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
                PSQL    FSYCL+ R  D A              G S  P     P  +N    
Sbjct: 279 GLLALPSQLNTTSFSYCLVDRDSDSASTVE---------FGTSLPPDAVVAPLLRN---- 325

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
                 FYY+GL  I VG + ++IP S       G+GG+I+DSG+  T ++  ++ ++  
Sbjct: 326 -HQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGIYNSLRD 384

Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
            F++   +  +AA V   +    C+++S K ++ +P +   F GG  +ALP +NY   V 
Sbjct: 385 SFLKGTSDLEKAAGV---AMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKNYMIPVD 441

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +     + F   A+  A       I+G+ Q Q   + FDLAN   GF+  KC
Sbjct: 442 SVGTFCLAFAPTASSLA-------IIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 132/397 (33%), Positives = 188/397 (47%), Gaps = 53/397 (13%)

Query: 97  SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           SVH S   Y + ++ GTPP   T  + DTGS L+W  C +   C  C FP   P+  P +
Sbjct: 84  SVHASTATYLVDIAIGTPPLPLTA-VLDTGSDLIWTQCDAP--CRRC-FPQ--PA--PLY 135

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
            P RS++   + C++P C  +  P   SRC   SP +  C     +Y   YG G  T G+
Sbjct: 136 APARSATYANVSCRSPMCQALQSP--WSRC---SPPDTGC-----AYYFSYGDGTSTDGV 185

Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
           L +ET    S T    +A GC   ++ S    +G+ G GR   SL SQLG+ +FSYC   
Sbjct: 186 LATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTP 245

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
               +A  +S L L    GS    +     TPF  +P G +     +YY+ L  I VG  
Sbjct: 246 F---NATAASPLFL----GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  +       G+GGVI+DSG+TFT +E   F A+A+    ++     A+      G
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEESAFVALARALASRV-RLPLASGAHL--G 355

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA-LG 449
           L  CF  +  ++V +P L+L F  GA M L  E+Y            +  D +AG A LG
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY------------VVEDRSAGVACLG 402

Query: 450 ----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               RG + +LG  Q QN ++ +DL      F   KC
Sbjct: 403 MVSARGMS-VLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 132/397 (33%), Positives = 188/397 (47%), Gaps = 53/397 (13%)

Query: 97  SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           SVH S   Y + ++ GTPP   T  + DTGS L+W  C +   C  C FP   P+  P +
Sbjct: 84  SVHASTATYLVDIAIGTPPLPLTA-VLDTGSDLIWTQCDAP--CRRC-FPQ--PA--PLY 135

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
            P RS++   + C++P C  +  P   SRC   SP +  C     +Y   YG G  T G+
Sbjct: 136 APARSATYANVSCRSPMCQALQSP--WSRC---SPPDTGC-----AYYFSYGDGTSTDGV 185

Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
           L +ET    S T    +A GC   ++ S    +G+ G GR   SL SQLG+ +FSYC   
Sbjct: 186 LATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTP 245

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
               +A  +S L L    GS    +     TPF  +P G +     +YY+ L  I VG  
Sbjct: 246 F---NATAASPLFL----GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  +       G+GGVI+DSG+TFT +E   F A+A+    ++     A+      G
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRV-RLPLASGAHL--G 355

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA-LG 449
           L  CF  +  ++V +P L+L F  GA M L  E+Y            +  D +AG A LG
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY------------VVEDRSAGVACLG 402

Query: 450 ----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               RG + +LG  Q QN ++ +DL      F   KC
Sbjct: 403 MVSARGMS-VLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 117/395 (29%), Positives = 164/395 (41%), Gaps = 58/395 (14%)

Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIP 157
           YG + + +  GTPPQ +   I DTGS L W    PC + +   D           P F P
Sbjct: 22  YGEFLVPIYLGTPPQKAV-VIIDTGSDLTWIQSEPCRACFEQAD-----------PIFDP 69

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLL 216
            +SS+   I C +  C+ + G    S    C             Y   YG G  T G   
Sbjct: 70  SKSSTYNKIACSSSACADLLGTQTCSAAANCI------------YAYGYGDGSVTRGYFS 117

Query: 217 SETLRFPSKTVPNFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
            ET+            G S+       D    GI G G+   S+PSQLG     KFSYCL
Sbjct: 118 KETITATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCL 177

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
           +      +  S+    D    SG+     + YTP   N     +    +YY+ ++ I VG
Sbjct: 178 VDWLSAGSETSTMYFGDAAVPSGE-----VQYTPIVPN-----ADHPTYYYIAVQGISVG 227

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
              + I  S     S G+GG I+DSG+T T+++  +F A+   +  Q+    R       
Sbjct: 228 GSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQV----RYPTTTSA 283

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           +GL  CF+  G  S   P + +    G  + LP  N F  +   ++CL      A   AL
Sbjct: 284 TGLDLCFNTRGTGSPVFPAMTIHLD-GVHLELPTANTFISLETNIICL------AFASAL 336

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              P  I G+ Q QNF + +DL N R GFA   CA
Sbjct: 337 DF-PIAIFGNIQQQNFDIVYDLDNMRIGFAPADCA 370


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 139/487 (28%), Positives = 214/487 (43%), Gaps = 63/487 (12%)

Query: 14  LLILLF-TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH-LK 71
           L +L+F    A   S AA+V V LT +          SDP          +L R  H  +
Sbjct: 27  LAVLVFLVVCATLASGAASVRVGLTRI---------HSDPDTTAPQFVRDALRRDMHRQR 77

Query: 72  TKTKPKTKDSNIGSNYSNSLI--KTPLSVHSYGGYSISLSFGTPPQASTPF--IFDTGSS 127
           +++  + +D  +  +   + +  +T   + + G Y ++L+ GTPP    P+  + DTGS 
Sbjct: 78  SRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPP---LPYAAVADTGSD 134

Query: 128 LVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG 187
           L+W       +C  C     +    P + P  S++  ++ C +            S C G
Sbjct: 135 LIW------TQCAPCGTQCFE-QPAPLYNPASSTTFSVLPCNSSL----------SMCAG 177

Query: 188 CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQ- 241
                   P     Y   YG G+TAG+  SET  F S       VP    GCS  S    
Sbjct: 178 ALAGAAPPPGCACMYNQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDW 237

Query: 242 --PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS 299
              AG+ G GR S SL SQLG  +FSYCL    F D   +S L+L  GP +  + T G+ 
Sbjct: 238 NGSAGLVGLGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLL--GPSAALNGT-GVR 292

Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
            TPF  +P  + +    +YY+ L  I +G+K + I         DG GG+I+DSG+T T 
Sbjct: 293 STPFVASP--ARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITS 350

Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS---VYLPELILKFKGGA 416
           +    ++ V +  ++ +       D    +GL  CF +    S     LP + L F  GA
Sbjct: 351 LANAAYQQV-RAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD-GA 408

Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
            M LP ++Y  + G+ V CL +            G     G++Q QN ++ +D+  +   
Sbjct: 409 DMVLPADSYM-ISGSGVWCLAMRNQT-------DGAMSTFGNYQQQNMHILYDVREETLS 460

Query: 477 FAKQKCA 483
           FA  KC+
Sbjct: 461 FAPAKCS 467


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 131/396 (33%), Positives = 171/396 (43%), Gaps = 48/396 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAF--IPK 158
           G Y ++L+ GTPPQ S P I DTGS LVW  C     RC     P  +PS  P F  +P 
Sbjct: 90  GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLP- 147

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLS 217
              SS L  C             E+R  G +P     P  C   Y   YG G+T+GL  S
Sbjct: 148 --CSSALNLCA-----------AEARLAGATP-----PPGCACRYNQTYGTGWTSGLQGS 189

Query: 218 ETLRFPSK-----TVPNFLAGCSILSDRQPAGIAGFGRSSESLP---SQLGLKKFSYCLL 269
           ET  F S       VP    GCS  S     G AG            SQL    FSYCL 
Sbjct: 190 ETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL- 248

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
              F D    S L+L     +      G+  TPF  +P  S      +YY+ L  I VG 
Sbjct: 249 -TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSP--SKPPMSTYYYLNLTGISVGP 305

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + IP       +DG GG+I+DSG+T T +    ++ V +  +R +       D    +
Sbjct: 306 AALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLP-VTDGSNAT 363

Query: 390 GLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
           GL  CF +  S      LP + L F GGA M LP ENY  L G  + CL + +       
Sbjct: 364 GLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGG-MWCLAMRSQT----- 417

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              G    LG++Q QN ++ +D+  +   FA  KC+
Sbjct: 418 --DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 122/390 (31%), Positives = 172/390 (44%), Gaps = 56/390 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+P +     + DTGS + W  C     C DC +   DP     F P  S+
Sbjct: 167 GEYFSRVGIGSPAR-ELYMVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSA 217

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +P+C  +           C  RN T   AC  Y + YG G +T G   +ETL
Sbjct: 218 SYAAVSCDSPRCRDL-------DTAAC--RNATG--AC-LYEVAYGDGSYTVGDFATETL 265

Query: 221 RFPSKT-VPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFD 274
                T V N   GC    D +   +   G  +      S PSQ+    FSYCL+ R   
Sbjct: 266 TLGDSTPVTNVAIGCG--HDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR--- 320

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
           D+P +S L         D+ T      P  ++P       G FYYV L  I VG + + I
Sbjct: 321 DSPAASTLQFGADGAEADTVT-----APLVRSP-----RTGTFYYVALSGISVGGQALSI 370

Query: 335 PYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           P S + +  + G+GGVIVDSG+  T ++   + A+   F+R   +  R + V   S    
Sbjct: 371 PSSAFAMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGV---SLFDT 427

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGP 452
           C+D+S + SV +P + L+F+GG  + LP +NY   V G    CL     NAA        
Sbjct: 428 CYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAA-------- 479

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             I+G+ Q Q   + FD A    GF   KC
Sbjct: 480 VSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 119/371 (32%), Positives = 166/371 (44%), Gaps = 56/371 (15%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS + W  C     C DC +   DP     F P  S+S   + C NP+C  +   
Sbjct: 182 MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSTSYASVACDNPRCHDL--- 230

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
                    + RN T   AC  Y + YG G +T G   +ETL    S  V +   GC   
Sbjct: 231 ------DAAACRNSTG--AC-LYEVAYGDGSYTVGDFATETLTLGDSAPVSSVAIGCG-- 279

Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
            D +   +   G  +      S PSQ+    FSYCL+ R   D+P SS L        GD
Sbjct: 280 HDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSSTLQF------GD 330

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
           +    ++  P  ++P  S+     FYYVGL  + VG + + IP S     S G GGVIVD
Sbjct: 331 AADAEVT-APLIRSPRTST-----FYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVD 384

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T ++   + A+   F+R   +  R + V   S    C+D+S + SV +P + L+F
Sbjct: 385 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLRF 441

Query: 413 KGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
            GG ++ LP +NY   V G    CL     NAA          I+G+ Q Q   + FD A
Sbjct: 442 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDTA 493

Query: 472 NDRFGFAKQKC 482
               GF   KC
Sbjct: 494 KSTVGFTTNKC 504


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 131/396 (33%), Positives = 171/396 (43%), Gaps = 48/396 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAF--IPK 158
           G Y ++L+ GTPPQ S P I DTGS LVW  C     RC     P  +PS  P F  +P 
Sbjct: 95  GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLP- 152

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLS 217
              SS L  C             E+R  G +P     P  C   Y   YG G+T+GL  S
Sbjct: 153 --CSSALNLCA-----------AEARLAGATP-----PPGCACRYNQTYGTGWTSGLQGS 194

Query: 218 ETLRFPSK-----TVPNFLAGCSILSDRQPAGIAGFGRSSESLP---SQLGLKKFSYCLL 269
           ET  F S       VP    GCS  S     G AG            SQL    FSYCL 
Sbjct: 195 ETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL- 253

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
              F D    S L+L     +      G+  TPF  +P  S      +YY+ L  I VG 
Sbjct: 254 -TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSP--SKPPMSTYYYLNLTGISVGP 310

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + IP       +DG GG+I+DSG+T T +    ++ V +  +R +       D    +
Sbjct: 311 AALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLP-VTDGSNAT 368

Query: 390 GLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
           GL  CF +  S      LP + L F GGA M LP ENY  L G  + CL + +       
Sbjct: 369 GLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGG-MWCLAMRSQT----- 422

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              G    LG++Q QN ++ +D+  +   FA  KC+
Sbjct: 423 --DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 456


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 126/410 (30%), Positives = 173/410 (42%), Gaps = 66/410 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + LS GTPP+       DTGS LVW  C     C++C     D   IP   P  SS+ 
Sbjct: 94  YLVHLSVGTPPR-PVALTLDTGSDLVWTQCAP---CLNC----FDQGAIPVLDPAASSTH 145

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C  P C  +  P       G S   ++C      Y+  YG    T G L S+   F
Sbjct: 146 AAVRCDAPVCRAL--PFTSCGRGGSSWGERSC-----VYVYHYGDKSITVGKLASDRFTF 198

Query: 223 -PSKTVP-------NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
            P                GC   +         GIAGFGR   SLPSQLG+  FSYC  S
Sbjct: 199 GPGDNADGGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTS 258

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
             F+    SS + L   P      T  +  TP  ++P   S      Y++ L+ I VG+ 
Sbjct: 259 -MFES--TSSLVTLGVAPAE-LHLTGQVQSTPLLRDPSQPS-----LYFLSLKAITVGAT 309

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + IP              I+DSG++ T +   ++EAV  EF+ Q+G    A +    S 
Sbjct: 310 RIPIPERRQ---RLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVE---GSA 363

Query: 391 LRPCFDISGKKS-----------------VYLPELILKFKGGAKMALPPENY-FALVGNE 432
           L  CF +    +                 V +P L+    GGA   LP ENY F   G  
Sbjct: 364 LDLCFALPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGAR 423

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           V+CL+L  D A G   G    +++G++Q QN ++ +DL ND   FA  +C
Sbjct: 424 VMCLVL--DAATG---GGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 131/422 (31%), Positives = 192/422 (45%), Gaps = 61/422 (14%)

Query: 89  NSLIKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
           N  +K+PL   +    G Y + +  GTPPQ S   + DTGS LVW  C++   C +C+  
Sbjct: 70  NPTLKSPLISGASTGSGQYFVDIRLGTPPQ-SLLLVADTGSDLVWVKCSA---CRNCS-- 123

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
           +  PS   AF+P+ SSS     C +P C  +  P+        +  +  C      +L  
Sbjct: 124 HHPPSS--AFLPRHSSSFSPFHCFDPHCRLL--PHAPHHLCNHTRLHSPC-----RFLYS 174

Query: 206 YGLG-FTAGLLLSETLRFPSKT-----VPNFLAGCSI------LSDRQ---PAGIAGFGR 250
           Y  G  ++G    ET    S +     +     GC        +S  Q     G+ G GR
Sbjct: 175 YADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGR 234

Query: 251 SSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV---LDTGPGSGDSKTPGLSYTPFY 304
            S S  SQLG +   KFSYCL+       P S  ++   L + P +  +K   +SYTP  
Sbjct: 235 GSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATK---ISYTPLQ 291

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD--GNGGVIVDSGSTFTFMEG 362
            NP+  +     FYY+ +  I +    VK+P +  V   D  GNGG +VDSG+T T++  
Sbjct: 292 INPLSPT-----FYYITIHSITIDG--VKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTK 344

Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK-KSVYLPELILKFKGGAKMALP 421
             +E V K  +R+      AA++    G   C + SG+ +   LP L  +  GGA  A P
Sbjct: 345 TAYEEVLKS-VRRRVKLPNAAELTP--GFDLCVNASGESRRPSLPRLRFRLGGGAVFAPP 401

Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
           P NYF      V+CL +    +     G G ++I G+   Q F LEFD    R GF ++ 
Sbjct: 402 PRNYFLETEEGVMCLAIRAVES-----GNGFSVI-GNLMQQGFLLEFDKEESRLGFTRRG 455

Query: 482 CA 483
           C 
Sbjct: 456 CG 457


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 126/402 (31%), Positives = 177/402 (44%), Gaps = 67/402 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + +S GTPP+       DTGS LVW  C     C+DC     +    P   P  SS+ 
Sbjct: 90  YLMHVSVGTPPR-PVALTLDTGSDLVWTQCAP---CLDC----FEQGAAPVLDPAASSTH 141

Query: 164 QLIGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLR 221
             + C  P C  + F     + C G S  +++C      Y+  YG    T G L +++  
Sbjct: 142 AALPCDAPLCRALPF-----TSCGGRSWGDRSC-----VYVYHYGDRSLTVGQLATDSFT 191

Query: 222 FPSKTVPNFLA------GCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           F        LA      GC  ++         GIAGFGR   SLPSQL +  FSYC  S 
Sbjct: 192 FGGDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTS- 250

Query: 272 KFDDAPVSSNLVLDTGPGSGD-------SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
            FD     S+ V+  G  + +       + T  +  T   KNP   S      Y+V LR 
Sbjct: 251 MFD---TKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPS-----LYFVPLRG 302

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I VG   V +P S L          I+DSG++ T +   ++EAV  EF+ Q+G     A 
Sbjct: 303 ISVGGARVAVPESRL------RSSTIIDSGASITTLPEDVYEAVKAEFVSQVG---LPAA 353

Query: 385 VEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFT 440
               + L  CF +   +  +   +P L L   GGA   LP  NY F      VLC++L  
Sbjct: 354 AAGSAALDLCFALPVAALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVL-- 411

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           D AA      G  +++G++Q QN ++ +DL ND   FA  +C
Sbjct: 412 DAAA------GEQVVIGNYQQQNTHVVYDLENDVLSFAPARC 447


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 119/371 (32%), Positives = 165/371 (44%), Gaps = 56/371 (15%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS + W  C     C DC +   DP     F P  S+S   + C NP+C  +   
Sbjct: 178 MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSTSYASVACDNPRCHDL--- 226

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
                    + RN T   AC  Y + YG G +T G   +ETL    S  V +   GC   
Sbjct: 227 ------DAAACRNSTG--AC-LYEVAYGDGSYTVGDFATETLTLGDSAPVSSVAIGCG-- 275

Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
            D +   +   G  +      S PSQ+    FSYCL+ R   D+P SS L        GD
Sbjct: 276 HDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSSTLQF------GD 326

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
           +    ++  P  ++P  S+     FYYVGL  I VG + + IP S       G GGVIVD
Sbjct: 327 AADAEVT-APLIRSPRTST-----FYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVD 380

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T ++   + A+   F+R   +  R + V   S    C+D+S + SV +P + L+F
Sbjct: 381 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLRF 437

Query: 413 KGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
            GG ++ LP +NY   V G    CL     NAA          I+G+ Q Q   + FD A
Sbjct: 438 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDTA 489

Query: 472 NDRFGFAKQKC 482
               GF   KC
Sbjct: 490 KSTVGFTSNKC 500


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 119/400 (29%), Positives = 175/400 (43%), Gaps = 43/400 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPC-TSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           G Y +S++FGTPPQ     I DTGS L+W  C T+      C  P    SR PAF+  +S
Sbjct: 51  GQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPPAFC--PKKACSRRPAFVASKS 107

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
           ++  ++ C   +C  +  P        CSP     P+ C  Y   Y  G  T G L  +T
Sbjct: 108 ATLSVVPCSAAQCLLV--PAPRGHGPACSP---AAPVPC-GYAYDYADGSSTTGFLARDT 161

Query: 220 LRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLG---LKKFSYC 267
               + T     V     GC   +         G+ G G+   S P+Q G    + FSYC
Sbjct: 162 ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYC 221

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
           LL  +      SS+ +    P     +    +YTP   NP+  +     FYYVG+  I V
Sbjct: 222 LLDLEGGRRGRSSSFLFLGRP----ERRAAFAYTPLVSNPLAPT-----FYYVGVVAIRV 272

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G++ + +P S       GNGG ++DSGST T++    +  +   F   +      +    
Sbjct: 273 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF 332

Query: 388 KSGLRPCFDISGKKSVY-----LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
             GL  C+++S   S        P L + F  G  + LP  NY   V ++V CL      
Sbjct: 333 FQGLELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCL------ 386

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           A  P L      +LG+   Q +++EFD A+ R GFA+ +C
Sbjct: 387 AIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 426


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 126/392 (32%), Positives = 171/392 (43%), Gaps = 58/392 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS +VW  C    +C    +   DP     F P +S 
Sbjct: 127 GEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKC----YTQADP----VFDPTKSR 177

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C  P C  +  P       GC+ +NK C      Y + YG G FT G   +ETL
Sbjct: 178 TYAGIPCGAPLCRRLDSP-------GCNNKNKVC-----QYQVSYGDGSFTFGDFSTETL 225

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLK---KFSYCLLSRK 272
            F    V     GC    D +   I   G         S P Q G +   KFSYCL+ R 
Sbjct: 226 TFRRTRVTRVALGCG--HDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRS 283

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
               P  S++V       GDS     + +TP  KNP         FYY+ L  I VG   
Sbjct: 284 ASAKP--SSVVF------GDSAVSRTARFTPLIKNP-----KLDTFYYLELLGISVGGSP 330

Query: 332 VK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
           V+ +  S     + GNGGVI+DSG++ T +  P + A+   F     +  RAA+    S 
Sbjct: 331 VRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEF---SL 387

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
              CFD+SG   V +P ++L F+ GA ++LP  NY   V N       F    +G +   
Sbjct: 388 FDTCFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLS--- 443

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               I+G+ Q Q F + FDLA  R GFA + C
Sbjct: 444 ----IIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 145/462 (31%), Positives = 202/462 (43%), Gaps = 74/462 (16%)

Query: 42  KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
           K  L H D+D       L S +L R+   +  T         G   + + I   L + S 
Sbjct: 32  KATLRHVDADAGYTEEQLLSRALRRSSA-RVATLQSLAALAPGDAITAARI---LVLASD 87

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTP +  +  I DTGS L+W  C     CVD           P F P RS+
Sbjct: 88  GEYLMEMGIGTPTRYYSA-ILDTGSDLIWTQCAPCLLCVD--------QPTPYFDPARSA 138

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTAGLLLSE 218
           + + +GC +P C+ ++ P     C       K C       + QY  G    TAG+L +E
Sbjct: 139 TYRSLGCASPACNALYYP----LCY-----QKVC-------VYQYFYGDSASTAGVLANE 182

Query: 219 TLRFPSK----TVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           T  F +     ++P    GC  L+    A   G+ GFGR S SL SQLG  +FSYCL S 
Sbjct: 183 TFTFGTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSF 242

Query: 272 KFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               +PV S L   V  T   +  S  P  S TPF  NP     A    Y++ +  I VG
Sbjct: 243 L---SPVPSRLYFGVYATLNSTNASSEPVQS-TPFVVNP-----ALPTMYFLNMTGISVG 293

Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
              + I P  + +  +DG GG I+DSG+T T++  P ++AV   F  Q+       +V  
Sbjct: 294 GYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQIT--LPLLNVTD 351

Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
            S L  CF      ++SV LP+L+L F  GA   LP +NY            +  D + G
Sbjct: 352 ASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY------------MLVDPSTG 398

Query: 446 PALGRGPA-----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             L    A      I+G +Q QNF + +DL N    F    C
Sbjct: 399 GGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 118/395 (29%), Positives = 172/395 (43%), Gaps = 47/395 (11%)

Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
            G PPQ +   I DTGS+L+W  C++      C         +  + P RS +++ + C 
Sbjct: 90  IGDPPQQAAAII-DTGSNLIWTQCST------CRANGCFGQDLTFYDPSRSRTAKPVACN 142

Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF----PSK 225
           +  C  + G   E+RC   +   K C     + L  YG G   G L +E   F     S+
Sbjct: 143 DTAC--LLGS--ETRC---ARDGKAC-----AVLTAYGAGAIGGFLGTEVFTFGHGQSSE 190

Query: 226 TVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
              +   GC   S   P      +GI G GR   SLPSQLG  KFSYCL +  F DA  +
Sbjct: 191 NNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSYCL-TPYFSDAANT 249

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-- 337
           S L +    G      P  S  PF KNP      F  FYY+ L  I VG+  + +P +  
Sbjct: 250 STLFVGASAGLSGGGAPATS-VPFLKNP--DDDPFDSFYYLPLTGITVGTAKLDVPAAAF 306

Query: 338 ---YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
               + P     GG ++DSGS FT +    ++A+  E +RQ+G  S         GL  C
Sbjct: 307 DLREVAPAK--WGGTLIDSGSPFTSLIDVAYQALRDELVRQLG-ASVVPPPAGAEGLDLC 363

Query: 395 FD--ISGKKSVYLPELILKFKGGA----KMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
                 G     +P L+L F  G      + +PPENY+  V +   C+++F+       L
Sbjct: 364 VGGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTL 423

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 I+G++  Q+ +L +DL      F    C+
Sbjct: 424 PLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADCS 458


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 125/402 (31%), Positives = 181/402 (45%), Gaps = 59/402 (14%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L   S G Y + L+ GTPP   T  I DTGS L+W       +C  C      P+  P F
Sbjct: 81  LVTASSGEYLVDLAIGTPPLYYTA-IMDTGSDLIW------TQCAPCLLCAAQPT--PYF 131

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
             KRS++ + + C++ +C+ +  P         S   K C      Y   YG    TAG+
Sbjct: 132 DVKRSATYRALPCRSSRCAALSSP---------SCFKKMC-----VYQYYYGDTASTAGV 177

Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
           L +ET  F + +       N   GC  L+  + A   G+ GFGR   SL SQLG  +FSY
Sbjct: 178 LANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSY 237

Query: 267 CLLSRKFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           CL S     +P  S L   V      +  S    +  TPF  NP     A    Y++ ++
Sbjct: 238 CLTSYL---SPTPSRLYFGVFANLNSTNTSSGSPVQSTPFVINP-----ALPNMYFLSVK 289

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I +G+K + I         DG GGVI+DSG++ T+++   +EAV +     +      A
Sbjct: 290 GISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTI---PLPA 346

Query: 384 DVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFT 440
             +   GL  CF        +V +P+ +  F  GA M LPPENY  +      LCL +  
Sbjct: 347 MNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFD-GANMTLPPENYMLIASTTGYLCLAM-- 403

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              A  ++G     I+G++Q QN +L +D+AN    F    C
Sbjct: 404 ---APTSVG----TIIGNYQQQNLHLLYDIANSFLSFVPAPC 438


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 144/462 (31%), Positives = 202/462 (43%), Gaps = 74/462 (16%)

Query: 42  KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
           K  L H D+D       L S +L R+   +  T         G   + + I   L + S 
Sbjct: 32  KATLRHVDADAGYTEEQLLSRALRRSSA-RVATLQSLAALAPGDAITAARI---LVLASD 87

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTP +  +  I DTGS L+W  C     CVD           P F P RS+
Sbjct: 88  GEYLMEMGIGTPTRYYSA-ILDTGSDLIWTQCAPCLLCVD--------QPTPYFDPARSA 138

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTAGLLLSE 218
           + + +GC +P C+ ++ P     C       K C       + QY  G    TAG+L +E
Sbjct: 139 TYRSLGCASPACNALYYP----LCY-----QKVC-------VYQYFYGDSASTAGVLANE 182

Query: 219 TLRFPSK----TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           T  F +     ++P    GC  L+       +G+ GFGR S SL SQLG  +FSYCL S 
Sbjct: 183 TFTFGTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSF 242

Query: 272 KFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               +PV S L   V  T   +  S  P  S TPF  NP     A    Y++ +  I VG
Sbjct: 243 L---SPVPSRLYFGVYATLNSTNASSEPVQS-TPFVVNP-----ALPTMYFLNMTGISVG 293

Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
              + I P  + +  +DG GG I+DSG+T T++  P ++AV   F  Q+       +V  
Sbjct: 294 GYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQIT--LPLLNVTD 351

Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
            S L  CF      ++SV LP+L+L F  GA   LP +NY            +  D + G
Sbjct: 352 ASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY------------MLVDPSTG 398

Query: 446 PALGRGPA-----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             L    A      I+G +Q QNF + +DL N    F    C
Sbjct: 399 GGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 125/392 (31%), Positives = 178/392 (45%), Gaps = 58/392 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ       DTGS LVW  C     C + + P  D SR        SS+ 
Sbjct: 35  YLLHLAIGTPPQP-VQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASR--------SSTF 85

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
            L  C + +C     P+V + C   +   +TC     +Y   YG    T G L  ET+ F
Sbjct: 86  ALPSCDSTQCK--LDPSV-TMC--VNQTVQTC-----AYSYSYGDKSATIGFLDVETVSF 135

Query: 223 PS-KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---RKFD 274
            +  +VP  + GC +    +      GIAGFGR   SLPSQL +  FS+C  +   RK  
Sbjct: 136 VAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRK-- 193

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                S ++ D       +    +  TP  KNP     A   FYY+ L+ I VGS  + +
Sbjct: 194 ----PSTVLFDLPADLYKNGRGTVQTTPLIKNP-----AHPTFYYLSLKGITVGSTRLPV 244

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P S      +G GG I+DSG+ FT +   ++  V  EF   +           ++G   C
Sbjct: 245 PESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV---KLPVVPSNETGPLLC 300

Query: 395 FDISG-KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGR 450
           F      K+ ++P+L+L F+ GA M LP ENY       GN  +CL          A+  
Sbjct: 301 FSAPPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICL----------AIIE 349

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           G   I+G+FQ QN ++ +DL N +  F + KC
Sbjct: 350 GEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 126/390 (32%), Positives = 169/390 (43%), Gaps = 52/390 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTP +     + DTGS +VW  C    RC    +   DP     F P++S 
Sbjct: 140 GEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRC----YSQSDP----IFDPRKSK 190

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C +P C  +          GC+ R KTC      Y + YG G FT G   +ETL
Sbjct: 191 TYATIPCSSPHCRRL-------DSAGCNTRRKTC-----LYQVSYGDGSFTVGDFSTETL 238

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F    V     GC   ++      AG+ G G+   S P Q G +   KFSYCL+ R   
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSAS 298

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-K 333
             P  S++V      S  ++     +TP   NP         FYYVGL  I VG   V  
Sbjct: 299 SKP--SSVVFGNAAVSRIAR-----FTPLLSNP-----KLDTFYYVGLLGISVGGTRVPG 346

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +  S       GNGGVI+DSG++ T +  P + A+   F        RA D    S    
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDF---SLFDT 403

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           CFD+S    V +P ++L F+G A ++LP  NY   V         F     G +      
Sbjct: 404 CFDLSNMNEVKVPTVVLHFRG-ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLS------ 456

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I+G+ Q Q F + +DLA+ R GFA   CA
Sbjct: 457 -IIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 128/398 (32%), Positives = 181/398 (45%), Gaps = 70/398 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ       DTGS LVW  C     C + + P  D SR        SS+ 
Sbjct: 91  YLLHLAIGTPPQP-VQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASR--------SSTF 141

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
            L  C + +C     P+V + C   +   +TC     +Y   YG    T G L  ET+ F
Sbjct: 142 ALPSCDSTQCK--LDPSV-TMC--VNQTVQTC-----AYSYSYGDKSATIGFLDVETVSF 191

Query: 223 PS-KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---RKFD 274
            +  +VP  + GC +    +      GIAGFGR   SLPSQL +  FS+C  +   RK  
Sbjct: 192 VAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRK-- 249

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                S ++ D       +    +  TP  KNP     A   FYY+ L+ I VGS  + +
Sbjct: 250 ----PSTVLFDLPADLYKNGRGTVQTTPLIKNP-----AHPTFYYLSLKGITVGSTRLPV 300

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE------KK 388
           P S      +G GG I+DSG+ FT +   ++  V  EF         AA V+       +
Sbjct: 301 PESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEF---------AAHVKLPVVPSNE 350

Query: 389 SGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAA 444
           +G   CF      K+ ++P+L+L F+ GA M LP ENY       GN  +CL        
Sbjct: 351 TGPLLCFSAPPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICL-------- 401

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             A+  G   I+G+FQ QN ++ +DL N +  F + KC
Sbjct: 402 --AIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 123/401 (30%), Positives = 182/401 (45%), Gaps = 64/401 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  ++S GTP +  +  I DTGS L+W  C     C +         + P F P+ SS
Sbjct: 38  GDYVTTISLGTPAKVFS-VIADTGSDLIWIQCKPCQACFN--------QKDPIFDPEGSS 88

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S   + C +  C  +             PR K+C   C  Y   YG G  T G L SET+
Sbjct: 89  SYTTMSCGDTLCDSL-------------PR-KSCSPNC-DYSYGYGDGSGTRGTLSSETV 133

Query: 221 RFPSK-----TVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
              S         N   GC  L   S    +G+ G GR + S  SQLG     KFSYCL+
Sbjct: 134 TLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLV 193

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIV 327
             +  DAP  ++ +      S  S    L Y  TP   NP     A   FYYV L+ I +
Sbjct: 194 PWR--DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP-----AMESFYYVKLKDISI 246

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
             + ++IP        DG+GG+I DSG+T T +    ++ V    +R + +     +++ 
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIV----LRALRSKVSFPEIDG 302

Query: 388 KS-GLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTD 441
            S GL  C+D+SG K+ Y   +P ++  F+ GA   LP ENYF    +   ++CL + + 
Sbjct: 303 SSAGLDLCYDVSGSKASYKKKIPAMVFHFE-GADHQLPVENYFIAANDAGTIVCLAMVSS 361

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           N     +G     I G+   QNF + +D+ + + G+A  +C
Sbjct: 362 NM---DIG-----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 130/431 (30%), Positives = 187/431 (43%), Gaps = 76/431 (17%)

Query: 63  SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFI 121
           +L+RA H K+  +     + +    S S  +TPL + S GG Y ++ S GTPPQ  +  +
Sbjct: 42  NLTRAAH-KSHQRLSMLAARLDDAASGS-AQTPLQLDSGGGAYDMTFSIGTPPQELSA-L 98

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS L+W  C +  RCV        P   P++ P +SSS   + C    CS +  P+ 
Sbjct: 99  ADTGSDLIWAKCGACTRCV--------PQGSPSYYPNKSSSFSKLPCSGSLCSDL--PSS 148

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-----FTAGLLLSETLRFPSKTVPNFLAGCSI 236
           +     CS     C      Y   YGL      +T G L SET    S  VP    GC+ 
Sbjct: 149 Q-----CSAGGAEC-----DYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGFGCTT 198

Query: 237 L---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP--GSG 291
           +        +G+ G GR   SL SQL +  FSYCL S    DA  +S L+  +G   G+G
Sbjct: 199 MSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTS----DAAKTSPLLFGSGALTGAG 254

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
              TP L  + +Y             Y V L  I +G+               G+ G+I 
Sbjct: 255 VQSTPLLRTSTYY-------------YTVNLESISIGAA---------TTAGTGSSGIIF 292

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DSG+T  F+  P +    +  + Q  N + A+    + G   CF  SG      P ++L 
Sbjct: 293 DSGTTVAFLAEPAYTLAKEAVLSQTTNLTMAS---GRDGYEVCFQTSG---AVFPSMVLH 346

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F GG  M LP ENYF  V + V C I+       P+L      I+G+    N+++ +D+ 
Sbjct: 347 FDGG-DMDLPTENYFGAVDDSVSCWIV----QKSPSLS-----IVGNIMQMNYHIRYDVE 396

Query: 472 NDRFGFAKQKC 482
                F    C
Sbjct: 397 KSMLSFQPANC 407


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 129/398 (32%), Positives = 180/398 (45%), Gaps = 70/398 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ       DTGS LVW  C     C + + P  D SR        SS+ 
Sbjct: 91  YLLHLAIGTPPQP-VQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASR--------SSTF 141

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
            L  C + +C     P+V + C      N+T      SY   YG    T G L  ET+ F
Sbjct: 142 ALPSCDSTQCK--LDPSV-TMCV-----NQTVQTCAFSY--SYGDKSATIGFLDVETVSF 191

Query: 223 PS-KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---RKFD 274
            +  +VP  + GC +    +      GIAGFGR   SLPSQL +  FS+C  +   RK  
Sbjct: 192 VAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRK-- 249

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                S ++ D       +    +  TP  KNP     A   FYY+ L+ I VGS  + +
Sbjct: 250 ----PSTVLFDLPADLYKNGRGTVQTTPLIKNP-----AHPTFYYLSLKGITVGSTRLPV 300

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE------KK 388
           P S      +G GG I+DSG+ FT +   ++  V  EF         AA V+       +
Sbjct: 301 PESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEF---------AAHVKLPVVPSNE 350

Query: 389 SGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAA 444
           +G   CF      K+ ++P+L+L F+ GA M LP ENY       GN  +CL        
Sbjct: 351 TGPLLCFSAPPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICL-------- 401

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             A+  G   I+G+FQ QN ++ +DL N +  F + KC
Sbjct: 402 --AIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 122/399 (30%), Positives = 180/399 (45%), Gaps = 60/399 (15%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           S G Y + +  G+PP+  +  I DTGS L+W  C     CV+       P+  P F P +
Sbjct: 84  SEGEYLMDVGIGSPPRYFSAMI-DTGSDLIWTQCAPCLLCVE------QPT--PYFEPAK 134

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S+S   + C +  C+ ++ P               C      Y   YG    +AG+L +E
Sbjct: 135 STSYASLPCSSAMCNALYSP--------------LCFQNACVYQAFYGDSASSAGVLANE 180

Query: 219 TLRFPSKT----VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           T  F + +    VP    GC  ++       +G+ GFGR + SL SQLG  +FSYCL S 
Sbjct: 181 TFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSF 240

Query: 272 KFDDAPVSSNLVLD---TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               +P +S L      T   +  S +  +  TPF  NP     A    Y++ +  I V 
Sbjct: 241 M---SPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP-----ALPTMYFLNMTGISVA 292

Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
              + I P  + +  +DG GGVI+DSG+T TF+  P +  V   F+  +G     A+   
Sbjct: 293 GDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATP 350

Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILF-TDNA 443
                 CF      ++ V LPE++L F  GA M LP ENY  + G    LCL +  +D+ 
Sbjct: 351 SDTFDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDG 409

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +          I+G FQ QNF++ +DL N    F    C
Sbjct: 410 S----------IIGSFQHQNFHMLYDLENSLLSFVPAPC 438


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 130/400 (32%), Positives = 179/400 (44%), Gaps = 64/400 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ     I DTGS LVW  C     C       +DPS         SS+ 
Sbjct: 415 YLVHLAIGTPPQ-PVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSN--------SSTF 465

Query: 164 QLIGCQNPKC---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
            ++ C +P C   +W       S C   +  N+TC      Y+  Y  G  T G L +ET
Sbjct: 466 DVLPCSSPVCDNLTW-------SSCGKHNWGNQTC-----VYVYAYADGSITTGHLDAET 513

Query: 220 LRFPSK------TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
             F +       TVP+   GC + ++        GIAGFGR + SLPSQL +  FS+C  
Sbjct: 514 FTFAAADGTGQATVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFT 573

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
           +    + P S  L L   P +  S   G +  TP  +N   S  A    YY+ L+ I VG
Sbjct: 574 AITGSE-PSSVLLGL---PANLYSDADGAVQSTPLVQN-FSSLRA----YYLSLKGITVG 624

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
           S  + IP S      DG GG I+DSG+  T +    ++ V   F  Q+       D    
Sbjct: 625 STRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQV---RLPVDNATS 681

Query: 389 SGL-RPCFDISGKKSVY--LPELILKFKGGAKMALPPENY---FALVGNEVLCLILFTDN 442
           S L R CF  S  +     +P+L+L F+ GA + LP ENY   F   G  V CL +    
Sbjct: 682 SSLSRLCFSFSVPRRAKPDVPKLVLHFE-GATLDLPRENYMFEFEDAGGSVTCLAI---- 736

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            AG  L      I+G++Q QN ++ +DL  +   F   +C
Sbjct: 737 NAGDDL-----TIIGNYQQQNLHVLYDLVRNMLSFVPAQC 771


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 129/407 (31%), Positives = 184/407 (45%), Gaps = 57/407 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTPP+     I DTGS L W  C     C+DC        R P F P  SS
Sbjct: 149 GEYLMDVYVGTPPRRFR-MIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASS 199

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA----CPSYLLQYGLGFTAGLLLS 217
           S + + C + +C      +V    +  +   +TC       CP Y        T G L  
Sbjct: 200 SYRNVTCGDHRCG-----HVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLAL 254

Query: 218 ETLRF------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
           E+          S+ V   + GC   +       AG+ G GR   S  SQL       FS
Sbjct: 255 ESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFS 314

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGD-----SKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           YCL+    D   V S +V     G  D     +  P L YT F      SSS    FYYV
Sbjct: 315 YCLVDHGSD---VGSKVVF----GEDDDALALAAHPQLKYTAFAPASS-SSSPADTFYYV 366

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ ++VG + + I       G DG+GG I+DSG+T ++   P ++ +   F+ +M   S
Sbjct: 367 KLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRM---S 423

Query: 381 RAAD-VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCL 436
           R+   V +   L PC+++SG +   +PEL L F  GA    P ENYF  +   G  ++CL
Sbjct: 424 RSYPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCL 483

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +      G +       I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 484 AVLGTPRTGMS-------IIGNFQQQNFHVVYDLQNNRLGFAPRRCA 523


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 121/399 (30%), Positives = 178/399 (44%), Gaps = 60/399 (15%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           S G Y + +  G+PP+  +  I DTGS L+W  C     CV+           P F P +
Sbjct: 81  SEGEYLMDVGIGSPPRYFSAMI-DTGSDLIWTQCAPCLLCVE--------QPTPYFEPAK 131

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S+S   + C +  C+ ++ P               C      Y   YG    +AG+L +E
Sbjct: 132 STSYASLPCSSAMCNALYSP--------------LCFQNACVYQAFYGDSASSAGVLANE 177

Query: 219 TLRFPSKT----VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           T  F + +    VP    GC  ++       +G+ GFGR + SL SQLG  +FSYCL S 
Sbjct: 178 TFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSF 237

Query: 272 KFDDAPVSSNLVLD---TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               +P +S L      T   +  S +  +  TPF  NP     A    Y++ +  I V 
Sbjct: 238 M---SPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP-----ALPTMYFLNMTGISVA 289

Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
              + I P  + +  +DG GGVI+DSG+T TF+  P +  V   F+  +G     A+   
Sbjct: 290 GDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATP 347

Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILF-TDNA 443
                 CF      ++ V LPE++L F  GA M LP ENY  + G    LCL +  +D+ 
Sbjct: 348 SDTFDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDG 406

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +          I+G FQ QNF++ +DL N    F    C
Sbjct: 407 S----------IIGSFQHQNFHMLYDLENSLLSFVPAPC 435


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 122/400 (30%), Positives = 181/400 (45%), Gaps = 62/400 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  ++S GTP +  +  I DTGS L+W  C     C +         + P F P+ SS
Sbjct: 38  GDYVTTISLGTPAKVFS-VIADTGSDLIWIQCKPCQACFN--------QKDPIFDPEGSS 88

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S   + C +  C  +             PR K+C   C  Y   YG G  T G L SET+
Sbjct: 89  SYTTMSCGDTLCDSL-------------PR-KSCSPDC-DYSYGYGDGSGTRGTLSSETV 133

Query: 221 RFPSK-----TVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
              S         N   GC  L   S    +G+ G GR + S  SQLG     KFSYCL+
Sbjct: 134 TLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLV 193

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIV 327
             +  DAP  ++ +      S  S    L Y  TP   NP     A   FYYV L+ I +
Sbjct: 194 PWR--DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP-----AMESFYYVKLKDISI 246

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
             + ++IP        DG+GG+I DSG+T T +    ++ V +  +R   ++ +      
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-LRSKISFPKIDG--S 303

Query: 388 KSGLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDN 442
            +GL  C+D+SG K+ Y   +P ++  F+ GA   LP ENYF    +   ++CL + + N
Sbjct: 304 SAGLDLCYDVSGSKASYKMKIPAMVFHFE-GADYQLPVENYFIAANDAGTIVCLAMVSSN 362

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                +G     I G+   QNF + +D+ + + G+A  +C
Sbjct: 363 M---DIG-----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 142/495 (28%), Positives = 204/495 (41%), Gaps = 77/495 (15%)

Query: 14  LLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK 73
           LL++L T  A A     T+ +P+        +H     P +        SL R RH    
Sbjct: 6   LLVVLVTFTADATHRPKTLHIPV--------VHRGAVFPSR--RGAPPGSLRRCRHAAPF 55

Query: 74  TKPKTKDSNIGSNYSNSLIKTPLSVHSY--GGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
           T       +I ++  + L    +S   +  G Y   ++ G PP  +   + DTGS L+W 
Sbjct: 56  TAQVASFHSIAADDDDRLRSPVMSGVPFDSGEYFAVINVGDPPTRAL-VVIDTGSDLIWL 114

Query: 132 ---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGC 188
              PC   YR V            P + P+ SS+ + I C +P+C  +       R  GC
Sbjct: 115 QCVPCRHCYRQV-----------TPLYDPRSSSTHRRIPCASPRCRDVL------RYPGC 157

Query: 189 SPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGC---SILSDRQPA 243
             R   C      Y++ YG G  ++G L ++ L FP  T V N   GC   ++      A
Sbjct: 158 DARTGGC-----VYMVVYGDGSASSGDLATDRLVFPDDTHVHNVTLGCGHDNVGLLESAA 212

Query: 244 GIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY 300
           G+ G GR   S P+QL       FSYCL  R       SS LV    P     + P  ++
Sbjct: 213 GLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTP-----EPPSTAF 267

Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK--IPYSYLVPGSDGNGGVIVDSGSTFT 358
           TP   NP   S      YYV +    VG + V      S  +  + G GG++VDSG+  +
Sbjct: 268 TPLRTNPRRPS-----LYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVDSGTAIS 322

Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKK-----SGLRPCFDISGK----KSVYLPELI 409
                 + AV   F     +++ AA   +K     S    C+D+ G      +V +P ++
Sbjct: 323 RFARDAYAAVRDAF----DSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIV 378

Query: 410 LKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           L F GGA MALP  NY   V G +          AA   L      +LG+ Q Q F L F
Sbjct: 379 LHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLN-----VLGNVQQQGFGLVF 433

Query: 469 DLANDRFGFAKQKCA 483
           D+   R GF    C+
Sbjct: 434 DVERGRIGFTPNGCS 448


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 175/392 (44%), Gaps = 56/392 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+P +     + DTGS + W  C     C DC +   DP     F P  SS
Sbjct: 194 GEYFSRIGIGSPAR-QLYMVLDTGSDVTWLQCAP---CADC-YAQSDP----LFDPALSS 244

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +P C  +   +  +     +  N +C      Y + YG G +T G   +ETL
Sbjct: 245 SYATVPCDSPHCRAL---DASACHNNAANGNSSC-----VYEVAYGDGSYTVGDFATETL 296

Query: 221 RFP---SKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRK 272
                 S  V +   GC    D +   +   G  +      S PSQ+   +FSYCL+ R 
Sbjct: 297 TLGGDGSAAVHDVAIGCG--HDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDR- 353

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
             D+P +S L      G+ DS T      P  ++P  ++     FYYV L  I VG + +
Sbjct: 354 --DSPSASTLQF----GASDSST---VTAPLMRSPRSNT-----FYYVALNGISVGGETL 399

Query: 333 -KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
             IP +       G+GGVIVDSG+  T ++   + A+   F+R      RA+ V   S  
Sbjct: 400 SDIPPAAFAMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGV---SLF 456

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGR 450
             C+D++G+ SV +P + L+F+GG ++ LP +NY   V G    CL          A   
Sbjct: 457 DTCYDLAGRSSVQVPAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAF--------AATG 508

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           G   I+G+ Q Q   + FD A +  GF+  KC
Sbjct: 509 GAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 139/453 (30%), Positives = 188/453 (41%), Gaps = 73/453 (16%)

Query: 55  ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS---VHSYGGYSISLSFG 111
           + H L       AR  K         +N G+      +  P+        G Y   +  G
Sbjct: 89  LRHRLQRDKRRAARISKAAAGGGAGAAN-GTRSRGGAVAAPVVSGLAQGSGEYFTKIGVG 147

Query: 112 TPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
           TP   STP   + DTGS +VW  C    RC D + P         F P+RSSS   + C 
Sbjct: 148 TP---STPALMVLDTGSDVVWLQCAPCRRCYDQSGP--------VFDPRRSSSYGAVDCA 196

Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-V 227
            P C  +          GC  R + C      Y + YG G  TAG   +ETL F     V
Sbjct: 197 APLCRRL-------DSGGCDLRRRAC-----LYQVAYGDGSVTAGDFATETLTFAGGARV 244

Query: 228 PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSN 281
                GC   ++      AG+ G GR S S P+Q+     K FSYCL+ R    +  +++
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAAS 304

Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYSYL- 339
               +    G       S+TP  +NP         FYYV L  I VG   V  +  S L 
Sbjct: 305 RSRSSTVTFGPPSASAASFTPMVRNP-----RMETFYYVQLVGISVGGARVPGVAESDLR 359

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR------- 392
           +  S G GGVIVDSG++ T +  P + A+   F        RAA     +GLR       
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAF--------RAA----AAGLRLSPGGFS 407

Query: 393 ---PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
               C+D+ G+K V +P + + F GGA+ ALPPENY   V +       F     G +  
Sbjct: 408 LFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVS-- 465

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I+G+ Q Q F + FD    R GFA + C
Sbjct: 466 -----IIGNIQQQGFRVVFDGDGQRVGFAPKGC 493


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 132/456 (28%), Positives = 195/456 (42%), Gaps = 82/456 (17%)

Query: 54  KILHSLASSSLSR-ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           ++LH +A+ S +R AR L  +      D     +Y++ +  T   VH        ++ GT
Sbjct: 71  ELLHRMAARSKARSARLLSGRAASARVDPG---SYTDGVPDTEYLVH--------MAIGT 119

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           PPQ     I DTGS L W  C     CV C   +     +P F P RS +  ++ C    
Sbjct: 120 PPQ-PVQLILDTGSDLTWTQCAP---CVSCFRQS-----LPRFNPSRSMTFSVLPCDLRI 170

Query: 173 C---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK--- 225
           C   +W       S C   S  N  C      Y   Y     T G L S+T  F S    
Sbjct: 171 CRDLTW-------SSCGEQSWGNGIC-----VYAYAYADHSITTGHLDSDTFSFASADHA 218

Query: 226 ----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
               +VP+   GC + ++        GIAGF R + S+P+QL +  FSYC  +    +  
Sbjct: 219 IGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPS 278

Query: 278 -----VSSNLVLDT-GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                V  NL  D  G G G  ++  L         +   S+  + YY+ L+ + VG+  
Sbjct: 279 PVFLGVPPNLYSDAAGGGHGVVQSTAL---------IRYHSSQLKAYYISLKGVTVGTTR 329

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP S      DG GG IVDSG+  T +   ++  V   F+ Q    ++       S L
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ----TKLTVHNSTSSL 385

Query: 392 -RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGP 446
            + CF +       +P L+L F+ GA + LP ENY   +    G  + CL +     AG 
Sbjct: 386 SQLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAI----NAGE 440

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            L      ++G+FQ QN ++ +DLAND   F   +C
Sbjct: 441 DLS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 471


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 122/411 (29%), Positives = 176/411 (42%), Gaps = 50/411 (12%)

Query: 84  GSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN 143
            SNY  +++  P+       +++++S GTPPQ  T  I DTGS L+W  C          
Sbjct: 70  ASNY-GTIVPMPIRPFGRLHHTLTVSIGTPPQPRT-LILDTGSDLIWTQCKL-------- 119

Query: 144 FPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
           F        P + P +SSS     C    C            K CS RNK        Y 
Sbjct: 120 FDTRQHREKPLYDPAKSSSFAAAPCDGRLCE-----TGSFNTKNCS-RNKCI------YT 167

Query: 204 LQYGLGFTAGLLLSETLRFPS--KTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQ 258
             YG   T G L SET  F    +   +   GC  L+       +GI G      SL SQ
Sbjct: 168 YNYGSATTKGELASETFTFGEHRRVSVSLDFGCGKLTSGSLPGASGILGISPDRLSLVSQ 227

Query: 259 LGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEF 317
           L + +FSYCL    F D   +S++           +T G +  T    NP GS+     +
Sbjct: 228 LQIPRFSYCLT--PFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSN----YY 281

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           YYV L  I VG+K + +P S    G DG+GG  VDSG T   +   + EA+ KE + +  
Sbjct: 282 YYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEAL-KEAMVEAV 340

Query: 378 NYSRAADVEKKSGLRPCFDI------SGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
                   +       CF +      + + +V +P L+  F GGA M L  ++Y   V  
Sbjct: 341 KLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSA 400

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             +CL++ +         RG   I+G++Q QN ++ FD+ N  F FA  +C
Sbjct: 401 GRMCLVISSG-------ARGA--IIGNYQQQNMHVLFDVENHEFSFAPTQC 442


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 139/459 (30%), Positives = 210/459 (45%), Gaps = 58/459 (12%)

Query: 45  LHHSDSDPLKI--LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
           L  +D D ++I  +H  A+    R+   +T   P +      S    + +++ ++V S G
Sbjct: 95  LDLADKDAVRIETMHRRAA----RSGGDRTPASPSSSPRRALSERMVATVESGVAVGS-G 149

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
            Y + +  GTPP+     I DTGS L W  C     C+DC F  V P     F P  SSS
Sbjct: 150 EYLMDVYVGTPPRRFR-MIMDTGSDLNWLQCAP---CLDC-FDQVGP----VFDPAASSS 200

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLR 221
            + + C + +C  +  P     C+   P   +CP     Y   YG    T G L  E+  
Sbjct: 201 YRNVTCGDQRCGLVAPPEPPRACR--RPGEDSCP-----YYYWYGDQSNTTGDLALESFT 253

Query: 222 F------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
                   S+ V + + GC   +       AG+ G GR   S  SQL       FSYCL+
Sbjct: 254 VNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLV 313

Query: 270 SRKFDDAPVSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               D   V+S +V  +    +  +  P L+YT F      +SS    FYYV L+ ++VG
Sbjct: 314 DHGSD---VASKVVFGEDDALALAAAHPQLNYTAF----APASSPADTFYYVKLKGVLVG 366

Query: 329 SKHVKIPYSYLVPGSDGNGG--VIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADV 385
            + + I       G    G    I+DSG+T ++   P ++ + + FI +MG +Y    D 
Sbjct: 367 GELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDF 426

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAA 444
                L PC+++SG     +PEL L F  GA    P ENYF  L  + ++CL +      
Sbjct: 427 PV---LSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRT 483

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G +       I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 484 GMS-------IIGNFQQQNFHVVYDLKNNRLGFAPRRCA 515


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 140/455 (30%), Positives = 203/455 (44%), Gaps = 67/455 (14%)

Query: 37  TPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL 96
           TP    H     D+  +K L SL ++S    R+L   +KP        + +S+S+I    
Sbjct: 76  TPEELFHLRLQRDAIRVKKLSSLGATS----RNL---SKPGGT-----TGFSSSVISGL- 122

Query: 97  SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
                G Y   +  GTPP+     + DTGS +VW  C     C +C +   DP     F 
Sbjct: 123 -AQGSGEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAP---CKNC-YSQTDP----VFN 172

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLL 215
           P +S S   + C+ P C  +  P       GC+ R +TC      Y + YG G +T G  
Sbjct: 173 PVKSGSFAKVLCRTPLCRRLESP-------GCNQR-QTCL-----YQVSYGDGSYTTGEF 219

Query: 216 LSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
           ++ETL F    V     GC   ++      AG+ G GR   S PSQ G    +KFSYCL+
Sbjct: 220 VTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLV 279

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
            R     P  S++V      S  ++     +TP   NP         FYYV L  I VG 
Sbjct: 280 DRSASSKP--SSVVFGNSAVSRTAR-----FTPLLTNP-----RLDTFYYVELLGISVGG 327

Query: 330 KHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
             V  I  S+      GNGGVI+D G++ T +  P + A+   F     +   A +    
Sbjct: 328 TPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEF--- 384

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           S    C+D+SGK +V +P ++L F+G A ++LP  NY   V         F    +G + 
Sbjct: 385 SLFDTCYDLSGKTTVKVPTVVLHFRG-ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLS- 442

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 I+G+ Q Q F + +DLA+ R GF+ + CA
Sbjct: 443 ------IIGNIQQQGFRVVYDLASSRVGFSPRGCA 471


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 124/434 (28%), Positives = 194/434 (44%), Gaps = 60/434 (13%)

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-SVHSYGGYSISLSFGTPPQAST 118
           +++ + R   L+ K+       N  +    + +KT + + H  GGY++++  GTP +  +
Sbjct: 87  SAAEILRRDQLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFS 146

Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
             +FDTGS L W  C     C    FP  D      F P +S+S + + C +  C  I  
Sbjct: 147 -LLFDTGSDLTWTQCEP---CSGGCFPQNDEK----FDPTKSTSYKNLSCSSEPCKSIG- 197

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
              +   +GCS  N         Y ++YG G+T G L +ETL   PS    NF+ GC   
Sbjct: 198 ---KESAQGCSSSNSCL------YGVKYGTGYTVGFLATETLTITPSDVFENFVIGCGER 248

Query: 238 SDRQ---PAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
           +  +    AG+ G GRS  +LPSQ        FSYCL        P SS+       G G
Sbjct: 249 NGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCL--------PASSSSTGHLSFGGG 300

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
            S+     +TP        +S   E Y + +  I VG + + I  S          G I+
Sbjct: 301 VSQAA--KFTPI-------TSKIPELYGLDVSGISVGGRKLPIDPSVFR-----TAGTII 346

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS--GKKSVYLPELI 409
           DSG+T T++      A++  F   M NY+     +  SGL+PC+D S     ++ +P++ 
Sbjct: 347 DSGTTLTYLPSTAHSALSSAFQEMMTNYTL---TKGTSGLQPCYDFSKHANDNITIPQIS 403

Query: 410 LKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           + F+GG ++ +     F A  G E +CL  F DN     +      I G+ Q + + + +
Sbjct: 404 IFFEGGVEVDIDDSGIFIAANGLEEVCLA-FKDNGNDTDVA-----IFGNVQQKTYEVVY 457

Query: 469 DLANDRFGFAKQKC 482
           D+A    GFA   C
Sbjct: 458 DVAKGMVGFAPGGC 471


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 139/494 (28%), Positives = 200/494 (40%), Gaps = 91/494 (18%)

Query: 12  FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
           F ++ LL        ++AATV + LT         H+D+        LA+  L +   L+
Sbjct: 6   FVIVTLLAALAISRCNAAATVRMQLT---------HADAG-----RGLAARELMQRMALR 51

Query: 72  TKTKPKTK------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           +K +   +             Y N +  T   VH        L+ GTPPQ       DTG
Sbjct: 52  SKARAARRLSSSASAPVSPGTYDNGVPTTEYLVH--------LAIGTPPQ-PVQLTLDTG 102

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG-----CQNPKCSWIFGPN 180
           S L+W  C     C D   P  DPS           S+   G     C +PK    F PN
Sbjct: 103 SDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPK----FWPN 158

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPNFLAGCSIL 237
                       +TC      Y   YG    T G L  +   F     +VP    GC + 
Sbjct: 159 ------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201

Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
           ++        GIAGFGR   SLPSQL +  FS+C  +    +    S ++LD       S
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPSTVLLDLPADLYKS 258

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
               +  TP  +NP   +     FYY+ L+ I VGS  + +P S      +G GG I+DS
Sbjct: 259 GRGAVQSTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312

Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG--KKSVYLPELILK 411
           G+  T +   ++  V   F  Q+        V   +   P F +S   +   Y+P+L+L 
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLH 367

Query: 412 FKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           F+ GA M LP ENY   V   G+ +LCL +            G    +G+FQ QN ++ +
Sbjct: 368 FE-GATMDLPRENYVFEVEDAGSSILCLAIIEG---------GEVTTIGNFQQQNMHVLY 417

Query: 469 DLANDRFGFAKQKC 482
           DL N +  F   +C
Sbjct: 418 DLQNSKLSFVPAQC 431


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 125/390 (32%), Positives = 168/390 (43%), Gaps = 52/390 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTP +     + DTGS +VW  C    RC    +   DP     F P++S 
Sbjct: 140 GEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRC----YSQSDP----IFDPRKSK 190

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C +P C  +          GC+ R KTC      Y + YG G FT G   +ETL
Sbjct: 191 TYATIPCSSPHCRRL-------DSAGCNTRRKTC-----LYQVSYGDGSFTVGDFSTETL 238

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F    V     GC   ++      AG+ G G+   S P Q G +   KFSYCL+ R   
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSAS 298

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-K 333
             P  S++V      S  ++     +TP   NP         FYYV L  I VG   V  
Sbjct: 299 SKP--SSVVFGNAAVSRIAR-----FTPLLSNP-----KLDTFYYVELLGISVGGTRVPG 346

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +  S       GNGGVI+DSG++ T +  P + A+   F        RA D    S    
Sbjct: 347 VAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDF---SLFDT 403

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           CFD+S    V +P ++L F+G A ++LP  NY   V         F     G +      
Sbjct: 404 CFDLSNMNEVKVPTVVLHFRG-ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLS------ 456

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I+G+ Q Q F + +DLA+ R GFA   CA
Sbjct: 457 -IIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 139/494 (28%), Positives = 200/494 (40%), Gaps = 91/494 (18%)

Query: 12  FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
           F ++ LL        ++AATV + LT         H+D+        LA+  L +   L+
Sbjct: 6   FVIVTLLAALAISRCNAAATVRMQLT---------HADAG-----RGLAARELMQRMALR 51

Query: 72  TKTKPKTK------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           +K +   +             Y N +  T   VH        L+ GTPPQ       DTG
Sbjct: 52  SKARAARRLSSSASAPVSPGTYDNGVPTTEYLVH--------LAIGTPPQ-PVQLTLDTG 102

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG-----CQNPKCSWIFGPN 180
           S L+W  C     C D   P  DPS           S+   G     C +PK    F PN
Sbjct: 103 SDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPK----FWPN 158

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPNFLAGCSIL 237
                       +TC      Y   YG    T G L  +   F     +VP    GC + 
Sbjct: 159 ------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201

Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
           ++        GIAGFGR   SLPSQL +  FS+C  +    +    S ++LD       S
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPSTVLLDLPADLYKS 258

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
               +  TP  +NP   +     FYY+ L+ I VGS  + +P S      +G GG I+DS
Sbjct: 259 GRGAVQSTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPESEFTL-KNGTGGTIIDS 312

Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG--KKSVYLPELILK 411
           G+  T +   ++  V   F  Q+        V   +   P F +S   +   Y+P+L+L 
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLH 367

Query: 412 FKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           F+ GA M LP ENY   V   G+ +LCL +            G    +G+FQ QN ++ +
Sbjct: 368 FE-GATMDLPRENYVFEVEDAGSSILCLAIIEG---------GEVTTIGNFQQQNMHVLY 417

Query: 469 DLANDRFGFAKQKC 482
           DL N +  F   +C
Sbjct: 418 DLQNSKLSFVPAQC 431


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 130/458 (28%), Positives = 195/458 (42%), Gaps = 71/458 (15%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY-GG 103
           L    +DP         ++L R  H     K       + ++ S+  +  P+S  +  G 
Sbjct: 32  LTRVHADPSVTASQFVRAALHRDMHRHNARK-------LAASSSDGTVSAPVSPTTVPGE 84

Query: 104 YSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAF--IPK 158
           + ++L+ GTPP    PF  I DTGS L+W  C    R C     P  +PS    F  +P 
Sbjct: 85  FLMTLAIGTPP---LPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPC 141

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            SS    +G   P C+ +                         Y + YG G+T     +E
Sbjct: 142 NSS----LGLCAPACACM-------------------------YNMTYGSGWTYVFQGTE 172

Query: 219 TLRFPSKT------VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
           T  F S T      VP    GCS  S        +G+ G GR S SL SQLG  KFSYCL
Sbjct: 173 TFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSYCL 232

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               + D   +S L+L  GP +  + T  +S TPF  +P         +YY+ L  I +G
Sbjct: 233 --TPYQDTNSTSTLLL--GPSASLNDTGVVSSTPFVASPSS------IYYYLNLTGISLG 282

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
           +  + IP +     +DG GG+I+DSG+T T +    ++ V    +  +       D    
Sbjct: 283 TTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAVLSLV--TLPTTDGSAA 340

Query: 389 SGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
           +GL  CF++    S    +P + L F  GA M LP +NY   + +      L+       
Sbjct: 341 TGLDLCFELPSSTSAPPSMPSMTLHFD-GADMVLPADNYMMSLSDPDSDSSLWCLAMQNQ 399

Query: 447 ALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               G  + ILG++Q QN ++ +D+  +   FA  KC+
Sbjct: 400 TDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCS 437


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 131/466 (28%), Positives = 208/466 (44%), Gaps = 69/466 (14%)

Query: 54  KILHSLASSSLSRARHLKTKTKPKTKDSNIGS----------NYSNSLIKTPLSVHSYG- 102
           KI+    + S+SR + +K     + +++   +           +S +++ T  S  S G 
Sbjct: 109 KIIEKKDTKSMSRKQEVKESITIQQQNNLANAFVASLESSKGEFSGNIMATLESGASLGT 168

Query: 103 -GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
             Y + +  GTPP+     I DTGS L W  C   Y C + N  +        + PK SS
Sbjct: 169 GEYFLDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGSH--------YYPKDSS 219

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET- 219
           + + I C +P+C  +   +    CK     N+TCP     Y   Y  G  T G   SET 
Sbjct: 220 TYRNISCYDPRCQLVSSSDPLQHCKA---ENQTCP-----YFYDYADGSNTTGDFASETF 271

Query: 220 ---LRFPS-----KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
              L +P+     K V + + GC   +       +G+ G GR   S PSQ+       FS
Sbjct: 272 TVNLTWPNGKEKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFS 331

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL +  F +  VSS L+   G          L++T       G  +    FYY+ ++ I
Sbjct: 332 YCL-TDLFSNTSVSSKLIF--GEDKELLNNHNLNFTTLL---AGEETPDETFYYLQIKSI 385

Query: 326 IVGSKHVKIP-----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
           +VG + + I      +S     +D  GG I+DSGST TF     ++ + + F +++    
Sbjct: 386 MVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQ 445

Query: 381 RAADVEKKSGLRPCFDISGK-KSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCL-I 437
            AAD      + PC+++SG    V LP+  + F  G     P ENYF     +EV+CL I
Sbjct: 446 IAAD---DFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAI 502

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           + T N +          I+G+   QNF++ +D+   R G++ ++CA
Sbjct: 503 MKTPNHSH-------LTIIGNLLQQNFHILYDVKRSRLGYSPRRCA 541


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 123/404 (30%), Positives = 172/404 (42%), Gaps = 72/404 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS--RYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y      G PPQ +   I DTGSSL+W  CT+  R  CV  + P  + S   +F P    
Sbjct: 86  YIAEYMVGDPPQRAEALI-DTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAP---- 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
               + CQ+  C+     N    C      + TC     ++ + YG G   G L ++   
Sbjct: 141 ----VPCQDKACA----GNYLHFCA----LDGTC-----TFRVTYGAGGIIGFLGTDAFT 183

Query: 222 FPSK--------------TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
           F S                 P+ L G S        G+ G GR   SL SQ G K+FSYC
Sbjct: 184 FQSGGATLAFGCVSFTRFAAPDVLHGAS--------GLIGLGRGRLSLASQTGAKRFSYC 235

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           L +  F +   SS+L +    G+  S + G   +    F ++P      +  FYY+ L  
Sbjct: 236 L-TPYFHNNGASSHLFV----GAAASLSGGGGAVMSMAFVESP--KDYPYSTFYYLPLVG 288

Query: 325 IIVGSKHVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
           I VG   + IP +      V      GGVI+DSGS FT +    +E +  E  RQ+    
Sbjct: 289 ITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSL 348

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
                E   G+  C    G     +P L+L F GGA MALPPENY+A +     C+    
Sbjct: 349 VPPPGEDDGGMALCV-ARGDLDRVVPTLVLHFSGGADMALPPENYWAPLEKSTACM---- 403

Query: 441 DNAAGPALGRG-PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 A+ RG    I+G+FQ QN ++ FD+   R  F    C+
Sbjct: 404 ------AIVRGYLQSIIGNFQQQNMHILFDVGGGRLSFQNADCS 441


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 137/465 (29%), Positives = 210/465 (45%), Gaps = 79/465 (16%)

Query: 44  YLHHSDSDPLKI--LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
           +L  ++ D ++I  +H  A+ S S A    +  +    +  + +      +++ + V S 
Sbjct: 94  FLDSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVAT------VESGVPVGS- 146

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTPP+     I DTGS L W  C     C+DC F    P     F P  S 
Sbjct: 147 GEYLVDVYLGTPPRRFR-MIMDTGSDLNWLQCAP---CLDC-FEQSGP----IFDPAASI 197

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCS-PRNKTCPLACPSYLLQYG-----------LG 209
           S + + C + +C  +  P  ES  + C  PR+  CP     Y   YG             
Sbjct: 198 SYRNVTCGDDRCRLV-SPPAESAPRECRRPRSDPCP-----YYYWYGDQSNTTGDLALEA 251

Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQL----GLK 262
           FT  L  S T R     V     GC   +       AG+ G GR   S  SQL    G  
Sbjct: 252 FTVNLTQSGTRR-----VDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGH 306

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            FSYCL+      +   S ++     G  D+    P L+YT F       ++    FYY+
Sbjct: 307 AFSYCLVEHG---SAAGSKIIF----GHDDALLAHPQLNYTAF-----APTTDADTFYYL 354

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NY 379
            L+ I+VG + V I    L       GG I+DSG+T ++   P ++A+ + FI +M  +Y
Sbjct: 355 QLKSILVGGEAVNISSDTL-----SAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSY 409

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLIL 438
                +     L PC+++SG + V +PEL L F  GA    P ENYF  +  E ++CL +
Sbjct: 410 PL---ILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAV 466

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                +G +       I+G++Q QNF++ +DL ++R GFA ++CA
Sbjct: 467 LGTPRSGMS-------IIGNYQQQNFHVLYDLEHNRLGFAPRRCA 504


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 125/405 (30%), Positives = 176/405 (43%), Gaps = 65/405 (16%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L   S G Y + L+ GTPP   T  I DTGS L+W  C     C D           P F
Sbjct: 81  LVTASSGEYLVDLAIGTPPLYYTA-IMDTGSDLIWTQCAPCLLCAD--------QPTPYF 131

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
             K+S++ + + C++ +C+ +  P         S   K C      Y   YG    TAG+
Sbjct: 132 DVKKSATYRALPCRSSRCASLSSP---------SCFKKMC-----VYQYYYGDTASTAGV 177

Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
           L +ET  F +         N   GC  L+    A   G+ GFGR   SL SQLG  +FSY
Sbjct: 178 LANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSY 237

Query: 267 CLLSRKFDDAP------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           CL S      P      V +NL   T   SG      +  TPF  NP     A    Y++
Sbjct: 238 CLTSY-LSATPSRLYFGVYANLS-STNTSSGSP----VQSTPFVINP-----ALPNMYFL 286

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I +G+K + I         DG GGVI+DSG++ T+++   +EAV +  +  +    
Sbjct: 287 SLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI---P 343

Query: 381 RAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLI 437
             A  +   GL  CF        +V +P+L+  F   A M L PENY  +      LCL+
Sbjct: 344 LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFD-SANMTLLPENYMLIASTTGYLCLV 402

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +       P    G   I+G++Q QN +L +D+ N    F    C
Sbjct: 403 M------APT---GVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 438


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 130/442 (29%), Positives = 190/442 (42%), Gaps = 83/442 (18%)

Query: 55  ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
           IL+ ++ S L   + L+T+ +P+   + + S  S             G Y   +  G P 
Sbjct: 123 ILNGVSKSDL---KPLQTEIQPQDLSTPVSSGTS----------QGSGEYFTRVGVGNPA 169

Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
           + S   + DTGS + W  C     C DC +   DP     F P  SSS   + C + +C+
Sbjct: 170 K-SYYMVLDTGSDINWIQCQP---CSDC-YQQSDP----IFTPAASSSYSPLTCDSQQCN 220

Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP-SKTVPNFLA 232
            +         +  S RN  C      Y + YG G FT G  ++ET+ F  S TV +   
Sbjct: 221 SL---------QMSSCRNGQC-----RYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIAL 266

Query: 233 GCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
           GC    D +   +   G         SL SQL    FSYCL++R   D+  SS L  ++ 
Sbjct: 267 GCG--HDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCLVNR---DSAASSTLDFNSA 321

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           P  GDS             P+  SS    FYYVGL  + VG + ++IP         G+G
Sbjct: 322 P-VGDSVIA----------PLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDG 370

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL---RPCFDISGKKSVY 404
           GVIVD G+  T ++   + ++   F+      S +  +   SG+     C+D+SG+ SV 
Sbjct: 371 GVIVDCGTAITRLQSEAYNSLRDSFV------SMSRHLRSTSGVALFDTCYDLSGQSSVK 424

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQ 460
           +P +   F GG    LP  NY   V           D+A        P      I+G+ Q
Sbjct: 425 VPTVSFHFDGGKSWDLPAANYLIPV-----------DSAGTYCFAFAPTTSSLSIIGNVQ 473

Query: 461 LQNFYLEFDLANDRFGFAKQKC 482
            Q   + FDLAN+R GF+  KC
Sbjct: 474 QQGTRVSFDLANNRVGFSTNKC 495


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 121/403 (30%), Positives = 172/403 (42%), Gaps = 73/403 (18%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ       DTGS L+W  C     C D   P  DPS           S+
Sbjct: 35  YLVHLAIGTPPQP-VQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 93

Query: 164 QLIG-----CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
              G     C +PK    F PN            +TC      Y   YG    T G L  
Sbjct: 94  LCQGLPVASCGSPK----FWPN------------QTC-----VYTYSYGDKSVTTGFLEV 132

Query: 218 ETLRF--PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           +   F     +VP    GC + ++        GIAGFGR   SLPSQL +  FS+C  + 
Sbjct: 133 DKFTFVGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTI 192

Query: 272 KFDDAPVSSNLVLD-------TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
                 + S ++LD        G G+  + TP + Y     NP          YY+ L+ 
Sbjct: 193 T---GAIPSTVLLDLPADLFSNGQGAVQT-TPLIQYAKNEANPT--------LYYLSLKG 240

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I VGS  + +P S     ++G GG I+DSG++ T +   +++ V  EF  Q+        
Sbjct: 241 ITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVV 296

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFT 440
               +G   CF    +    +P+L+L F+ GA M LP ENY   V    GN ++CL    
Sbjct: 297 PGNATGHYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL---- 351

Query: 441 DNAAGPALGRG-PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 A+ +G    I+G+FQ QN ++ +DL N+   F   +C
Sbjct: 352 ------AINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 388


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 124/431 (28%), Positives = 182/431 (42%), Gaps = 57/431 (13%)

Query: 59  LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
           L +   +RA +L ++  P       G + S S + + L   S G Y + +  G+PP    
Sbjct: 84  LVARDNARAEYLASRLSPAAYQPT-GFSGSESKVVSGLDEGS-GEYFVRVGIGSPPTEQY 141

Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
             + D+GS ++W  C     C++C +   DP     F P  S++   + C +  C  +  
Sbjct: 142 -LVVDSGSDVIWVQCKP---CLEC-YAQADP----LFDPATSATFSAVPCGSAVCRTL-- 190

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
                R  GC             Y + YG G +T G L  ETL      V     GC   
Sbjct: 191 -----RTSGCGDSGGC------DYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHR 239

Query: 238 SDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
           +       AG+ G G    SL  QLG      FSYCL SR       + +LVL    G  
Sbjct: 240 NRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG------AGSLVL----GRS 289

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
           ++   G  + P  +NP   S     FYYVGL  I VG + + +         DG GGV++
Sbjct: 290 EAVPEGAVWVPLVRNPQAPS-----FYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVM 344

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           D+G+  T +    + A+   F+  +G   RA  V   S L  C+D+SG  SV +P +   
Sbjct: 345 DTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGV---SLLDTCYDLSGYTSVRVPTVSFY 401

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F G A + LP  N    V   + CL  F  +++GP+       ILG+ Q +   +  D A
Sbjct: 402 FDGAATLTLPARNLLLEVDGGIYCLA-FAPSSSGPS-------ILGNIQQEGIQITVDSA 453

Query: 472 NDRFGFAKQKC 482
           N   GF    C
Sbjct: 454 NGYIGFGPTTC 464


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 124/390 (31%), Positives = 177/390 (45%), Gaps = 53/390 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     + DTGS +VW  C     C +C +   DP     F P +S 
Sbjct: 40  GEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAP---CKNC-YSQTDP----VFNPVKSG 90

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C+ P C  +  P       GC+ R +TC      Y + YG G +T G  ++ETL
Sbjct: 91  SFAKVLCRTPLCRRLESP-------GCNQR-QTCL-----YQVSYGDGSYTTGEFVTETL 137

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
            F    V     GC   ++      AG+ G GR   S PSQ G    +KFSYCL+ R   
Sbjct: 138 TFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSAS 197

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK- 333
             P  S++V      S  ++     +TP   NP         FYYV L  I VG   V  
Sbjct: 198 SKP--SSVVFGNSAVSRTAR-----FTPLLTNP-----RLDTFYYVELLGISVGGTPVSG 245

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           I  S+      GNGGVI+D G++ T +  P + A+   F     +   A +    S    
Sbjct: 246 ITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEF---SLFDT 302

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C+D+SGK +V +P ++L F+G A ++LP  NY   V         F    +G +      
Sbjct: 303 CYDLSGKTTVKVPTVVLHFRG-ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLS------ 355

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I+G+ Q Q F + +DLA+ R GF+ + CA
Sbjct: 356 -IIGNIQQQGFRVVYDLASSRVGFSPRGCA 384


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 116/372 (31%), Positives = 165/372 (44%), Gaps = 55/372 (14%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS + W  C     C DC +   DP     F P  S+S   + C + +C  +   
Sbjct: 181 MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSASYAAVSCDSQRCRDL--- 229

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSIL 237
                   C  RN T   AC  Y + YG G +T G   +ETL     T V N   GC   
Sbjct: 230 ----DTAAC--RNATG--AC-LYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCG-- 278

Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
            D +   +   G  +      S PSQ+    FSYCL+ R   D+P +S L    G     
Sbjct: 279 HDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAASTLQFGDGAAEAG 335

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIV 351
           + T      P  ++P  S+     FYYV L  I VG + + IP S + +  + G+GGVIV
Sbjct: 336 TVT-----APLVRSPRTST-----FYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIV 385

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DSG+  T ++   + A+   F++   +  R + V   S    C+D+S + SV +P + L+
Sbjct: 386 DSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLR 442

Query: 412 FKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
           F+GG  + LP +NY   V G    CL     NAA          I+G+ Q Q   + FD 
Sbjct: 443 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDT 494

Query: 471 ANDRFGFAKQKC 482
           A    GF   KC
Sbjct: 495 ARGAVGFTPNKC 506


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 124/377 (32%), Positives = 168/377 (44%), Gaps = 53/377 (14%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P+     + DTGS + W  C     C DC +   DP   PA     SSS +L+GCQ   C
Sbjct: 154 PRRDQLMVLDTGSDVTWIQCEP---CSDC-YQQSDPIYNPAL----SSSYKLVGCQANLC 205

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA 232
             +          GCS RN +C      Y + YG G +T G   +ETL      + N   
Sbjct: 206 QQL-------DVSGCS-RNGSCL-----YQVSYGDGSYTQGNFATETLTLGGAPLQNVAI 252

Query: 233 GCSILSD---RQPAGIAGFGRSSESLPSQL---GLKKFSYCLLSRKFDDAPVSSNLVLDT 286
           GC   ++      AG+ G G  S S PSQL     K FSYCL+ R   D+  SS L    
Sbjct: 253 GCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDR---DSESSSTLQF-- 307

Query: 287 GPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
               G +  P G    P  KN     S    FYYV L  I VG K + I  S     + G
Sbjct: 308 ----GRAAVPNGAVLAPMLKN-----SRLDTFYYVSLSGISVGGKMLSISDSVFGIDASG 358

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
           NGGVIVDSG+  T ++   ++++   F     N      V   S    C+D+S K+SV +
Sbjct: 359 NGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGV---SLFDTCYDLSSKESVDV 415

Query: 406 PELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
           P ++  F GG  M+LP +NY   V +       F   ++  +       I+G+ Q Q   
Sbjct: 416 PTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLS-------IVGNIQQQGIR 468

Query: 466 LEFDLANDRFGFAKQKC 482
           + FD AN++ GFA  KC
Sbjct: 469 VSFDRANNQVGFAVNKC 485


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 122/393 (31%), Positives = 169/393 (43%), Gaps = 60/393 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTPP+  T  + DTGS ++W  C     C  C +   DP     F P  SS
Sbjct: 151 GEYFTRLGVGTPPRY-TYMVLDTGSDIMWIQCLP---CAKC-YGQTDP----LFNPAASS 201

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT-CPLACPSYLLQYGLG-FTAGLLLSET 219
           + + + C  P C  +          GC  RNK  C      Y + YG G FT G   +ET
Sbjct: 202 TYRKVPCATPLCKKL-------DISGC--RNKRYCE-----YQVSYGDGSFTVGDFSTET 247

Query: 220 LRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSES-----LPSQLGL---KKFSYCLLSR 271
           L F  + +     GC    D +   I   G           PSQ G    K+FSYCL+ R
Sbjct: 248 LTFRGQVIRRVALGCG--HDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDR 305

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
               +  +S+L+       G +  P  + +TP   NP         FYYV L  I VG +
Sbjct: 306 S--ASGTASSLIF------GKAAIPKSAIFTPLLSNP-----KLDTFYYVELVGISVGGR 352

Query: 331 HV-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
            +  IP S     + GNGGVI+DSG++ T +    +  +   F    GN   A      S
Sbjct: 353 RLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGF---S 409

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
               C+D+SG K+V +P L+  F+GGA ++LP  NY   V +       F  N  G +  
Sbjct: 410 LFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLS-- 467

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I+G+ Q Q + + FD   +R GF    C
Sbjct: 468 -----IIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 123/391 (31%), Positives = 171/391 (43%), Gaps = 51/391 (13%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + L+ GTPP    PF+   DTGS L W  C     C         P   P +    SS
Sbjct: 93  YLMELAIGTPP---VPFVALADTGSDLTWTQCQPCKLCF--------PQDTPIYDTAVSS 141

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +  C  I+           S RN T   +   Y   YG G ++AG+L +ETL
Sbjct: 142 SFSPVPCASATCLPIW-----------SSRNCTASSSPCRYRYAYGDGAYSAGVLGTETL 190

Query: 221 RFPSK---TVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
            FP     +V     GC + +        G  G GR S SL +QLG+ KFSYCL +  F+
Sbjct: 191 TFPGAPGVSVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL-TDFFN 249

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            +  S  L       +  S    +  TP  ++P         +YYV L  I +G   + I
Sbjct: 250 TSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPY-----VPTWYYVSLEGISLGDARLPI 304

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P        DG+GG+IVDSG+TFTF    L E+  +  +  +    R   V   S   PC
Sbjct: 305 PNGTFDLRDDGSGGMIVDSGTTFTF----LVESAFRVVVDHVAGVLRQPVVNASSLDSPC 360

Query: 395 F-DISGKKSV-YLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRG 451
           F   +G++ +  +P+++L F GGA M L  +NY +    E   CL     N AG      
Sbjct: 361 FPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCL-----NIAGSP--SA 413

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              ILG+FQ QN  + FD+   +  F    C
Sbjct: 414 DVSILGNFQQQNIQMLFDITVGQLSFMPTDC 444


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 128/397 (32%), Positives = 172/397 (43%), Gaps = 58/397 (14%)

Query: 102 GGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           G Y   +  GTP   +TP   + DTGS +VW  C    RC + +           F P+R
Sbjct: 138 GEYFTKIGVGTP---ATPALMVLDTGSDVVWLQCAPCRRCYEQSGQ--------VFDPRR 186

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           S S   +GC  P C  +          GC  R   C      Y + YG G  TAG   +E
Sbjct: 187 SRSYNAVGCAAPLCRRL-------DSGGCDLRRSAC-----LYQVAYGDGSVTAGDFATE 234

Query: 219 TLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
           TL F     V     GC   ++      AG+ G GR S S P+Q+  +    FSYCL+ R
Sbjct: 235 TLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDR 294

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                  S +  +  G G+  S T   S+TP  KNP         FYYV L  I VG   
Sbjct: 295 TSSANTASRSSTVTFGSGAVGS-TVASSFTPMVKNP-----RMETFYYVQLIGISVGGAR 348

Query: 332 V-KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           V  +  S L +  S G GGVIVDSG++ T +  P + A+   F         AA +    
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAF------RGAAAGLRLSP 402

Query: 390 G----LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
           G       C+D+SG+K V +P + + F GGA+ ALPPENY   V ++      F     G
Sbjct: 403 GGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGG 462

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +       I+G+ Q Q F + FD    R  F  + C
Sbjct: 463 VS-------IIGNIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 124/390 (31%), Positives = 168/390 (43%), Gaps = 52/390 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTP +     + DTGS +VW  C    RC    +   DP     F P++S 
Sbjct: 140 GEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRC----YSQSDP----IFDPRKSK 190

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C +P C  +          GC+ R KTC      Y + YG G FT G   +ETL
Sbjct: 191 TYATIPCSSPHCRRL-------DSAGCNTRRKTC-----LYQVSYGDGSFTVGDFSTETL 238

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F    V     GC   ++      AG+ G G+   S P Q G +   KFSYCL+ R   
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSAS 298

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-K 333
             P  S++V      S  ++     +TP   NP         FYYVGL  I VG   V  
Sbjct: 299 SKP--SSVVFGNAAVSRIAR-----FTPLLSNP-----KLDTFYYVGLLGISVGGTRVPG 346

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +  S       GNGGVI+DSG++ T +  P + A+   F        RA +    S    
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNF---SLFDT 403

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           CFD+S    V +P ++L F+  A ++LP  NY   V         F     G +      
Sbjct: 404 CFDLSNMNEVKVPTVVLHFRR-ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLS------ 456

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I+G+ Q Q F + +DLA+ R GFA   CA
Sbjct: 457 -IIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 125/426 (29%), Positives = 184/426 (43%), Gaps = 74/426 (17%)

Query: 87  YSNSLIKT--PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           +SNS  KT   L  H     + SL+ GTPPQ  T  + DTGS L W  C           
Sbjct: 48  FSNSSSKTTGKLLFHHNVTLTASLTIGTPPQNIT-MVLDTGSELSWLRCKK--------- 97

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC-PSYL 203
              +P+    F P  S +   I C +  C        ++R       + T P+ C P+ L
Sbjct: 98  ---EPNFTSIFNPLASKTYTKIPCSSQTC--------KTRTS-----DLTLPVTCDPAKL 141

Query: 204 LQYGLGF-----TAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRS 251
             + + +       G L  ET RF S T P  + GC       +   D +  G+ G  R 
Sbjct: 142 CHFIISYADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRG 201

Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
           S S  +Q+G +KFSYC+       + + S   L  G        P L+YTP     V  S
Sbjct: 202 SLSFVNQMGFRKFSYCI-------SGLDSTGFLLLGEARYSWLKP-LNYTPL----VQIS 249

Query: 312 SAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
           +    F    Y V L  I V +K + +P S  VP   G G  +VDSG+ FTF+ GP++ A
Sbjct: 250 TPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSA 309

Query: 368 VAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPP 422
           + KEF+ Q     R  +  +   +  +  C+ I    S    LP + L F+ GA+M++  
Sbjct: 310 LRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLMFR-GAEMSVSG 368

Query: 423 ENYFALVGNE------VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
           +     V  E      V C      +  G +     + ++G  Q QN ++E+DL N R G
Sbjct: 369 QRLLYRVPGEVRGKDSVWCFTFGNSDELGIS-----SFLIGHHQQQNVWMEYDLENSRIG 423

Query: 477 FAKQKC 482
           FA+ +C
Sbjct: 424 FAELRC 429


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 131/456 (28%), Positives = 194/456 (42%), Gaps = 82/456 (17%)

Query: 54  KILHSLASSSLSR-ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           ++L  +A+ S +R AR L  +      D     +Y++ +  T   VH        ++ GT
Sbjct: 45  ELLRRMAARSKARSARLLSGRAASARMDPG---SYTDGVPDTEYLVH--------MAIGT 93

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           PPQ     I DTGS L W  C     CV C   +     +P F P RS +  ++ C    
Sbjct: 94  PPQ-PVQLILDTGSDLTWTQCAP---CVSCFRQS-----LPRFNPSRSMTFSVLPCDLRI 144

Query: 173 C---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK--- 225
           C   +W       S C   S  N  C      Y   Y     T G L S+T  F S    
Sbjct: 145 CRDLTW-------SSCGEQSWGNGIC-----VYAYAYADHSITTGHLDSDTFSFASADHA 192

Query: 226 ----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
               +VP+   GC + ++        GIAGF R + S+P+QL +  FSYC  +    +  
Sbjct: 193 IGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPS 252

Query: 278 -----VSSNLVLDT-GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                V  NL  D  G G G  ++  L         +   S+  + YY+ L+ + VG+  
Sbjct: 253 PVFLGVPPNLYSDAAGGGHGVVQSTAL---------IRYHSSQLKAYYISLKGVTVGTTR 303

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP S      DG GG IVDSG+  T +   ++  V   F+ Q    ++       S L
Sbjct: 304 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ----TKLTVHNSTSSL 359

Query: 392 -RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGP 446
            + CF +       +P L+L F+ GA + LP ENY   +    G  + CL +     AG 
Sbjct: 360 SQLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAI----NAGE 414

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            L      ++G+FQ QN ++ +DLAND   F   +C
Sbjct: 415 DLS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 445


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 115/372 (30%), Positives = 165/372 (44%), Gaps = 55/372 (14%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS + W  C     C DC +   DP     F P  S+S   + C + +C  +   
Sbjct: 1   MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSASYAAVSCDSQRCRDL--- 49

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSIL 237
                    + RN T   AC  Y + YG G +T G   +ETL     T V N   GC   
Sbjct: 50  ------DTAACRNATG--AC-LYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCG-- 98

Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
            D +   +   G  +      S PSQ+    FSYCL+ R   D+P +S L    G     
Sbjct: 99  HDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAASTLQFGDGAAEAG 155

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIV 351
           + T      P  ++P  S+     FYYV L  I VG + + IP S + +  + G+GGVIV
Sbjct: 156 TVT-----APLVRSPRTST-----FYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIV 205

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DSG+  T ++   + A+   F++   +  R + V   S    C+D+S + SV +P + L+
Sbjct: 206 DSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLR 262

Query: 412 FKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
           F+GG  + LP +NY   V G    CL     NAA          I+G+ Q Q   + FD 
Sbjct: 263 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDT 314

Query: 471 ANDRFGFAKQKC 482
           A    GF   KC
Sbjct: 315 ARGAVGFTPNKC 326


>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
          Length = 378

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 103/311 (33%), Positives = 144/311 (46%), Gaps = 32/311 (10%)

Query: 198 ACPSYLLQYGLGFTAGLLLSETLRFPSKT-------VPNFLAGCSILSDRQPAGIAGFGR 250
           ACP     YG G     L    +   +         V NF   C+  +  +P G+AGFGR
Sbjct: 64  ACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGR 123

Query: 251 SSESLPSQLGLK---KFSYCLLSRKF--DDAPVSSNLVLDTGPGSGDS--KTPGLSYTPF 303
              SLP QL  +   +FSYCL+S  F  D     S L+L   P    +  +T G  YTP 
Sbjct: 124 GPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPL 183

Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
             NP         FY V L  + VG+  ++           GNGG++VDSG+TFT +   
Sbjct: 184 LHNP-----KHPYFYSVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNE 238

Query: 364 LFEAVAKEFIRQMGNYSRAA--DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP 421
           ++  VA+ F R M     A     E+++GL PC+  +      +P L L F+G A +ALP
Sbjct: 239 MYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAASDR-GVPPLALHFRGNATVALP 297

Query: 422 PENYF----------ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
             NYF              ++V CL+L     A    G GPA  LG+FQ Q F + +D+ 
Sbjct: 298 RRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVD 357

Query: 472 NDRFGFAKQKC 482
             R GFA+++C
Sbjct: 358 AGRVGFARRRC 368


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 135/475 (28%), Positives = 193/475 (40%), Gaps = 90/475 (18%)

Query: 36  LTPLSTKHYLHHSDSDPLKI--LH---------------SLASSSLSRARHLKTKTKPKT 78
           L P  T + +HH D   L +  LH               +L   S S  + L+T+ KP+ 
Sbjct: 86  LHPRETIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPED 145

Query: 79  KDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
             + + S  S             G Y   +  G P +     + DTGS + W  C     
Sbjct: 146 LSTPVTSGTS----------QGSGEYFTRVGVGNPARQFY-MVLDTGSDINWLQCQP--- 191

Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
           C DC +   DP     F P  SS+   + CQ+ +CS +         +  S R+  C   
Sbjct: 192 CTDC-YQQTDP----IFDPTASSTYAPVTCQSQQCSSL---------EMSSCRSGQCL-- 235

Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGFGRSSE--- 253
              Y + YG G +T G   +E++ F  S +V N   GC    D +   +   G       
Sbjct: 236 ---YQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGCG--HDNEGLFVGAAGLLGLGGG 290

Query: 254 --SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
             SL +QL    FSYCL++R   D+  SS L  ++     DS T      P  KN     
Sbjct: 291 PLSLTNQLKATSFSYCLVNR---DSAGSSTLDFNSAQLGVDSVT-----APLMKN----- 337

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                FYYVGL  + VG + V IP S       GNGG+IVD G+  T ++   +  +   
Sbjct: 338 RKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDA 397

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
           F+R   N    + V   +    C+D+SG+ SV +P +   F  G    LP  NY   V  
Sbjct: 398 FVRMTQNLKLTSAV---ALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLIPV-- 452

Query: 432 EVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                    D+A        P      I+G+ Q Q   + FDLAN+R GF+  KC
Sbjct: 453 ---------DSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 122/392 (31%), Positives = 177/392 (45%), Gaps = 67/392 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           GGY++++S GTP   +   + DTGS L+W  C    +C         P+  P F P  SS
Sbjct: 84  GGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQ------QPA--PPFQPASSS 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           +   + C +  C ++  PN            +TC      Y  +YG G+TAG L +ETL+
Sbjct: 135 TFSKLPCTSSFCQFL--PN----------SIRTCNATGCVYNYKYGSGYTAGYLATETLK 182

Query: 222 FPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
               + P+   GCS  +        G G+        LG+ +FSYCL  R    A  S  
Sbjct: 183 VGDASFPSVAFGCSTEN--------GLGQ------LDLGVGRFSYCL--RSGSAAGASPI 226

Query: 282 LVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
           L      GS  + T G +  TPF  NP    S    +YYV L  I VG   + +  S   
Sbjct: 227 LF-----GSLANLTDGNVQSTPFVNNPAVHPS----YYYVNLTGITVGETDLPVTTSTFG 277

Query: 341 PGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS- 398
              +G  GG IVDSG+T T++    +E V + F+ Q  + +    V    GL  CF  + 
Sbjct: 278 FTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTT---VNGTRGLDLCFKSTG 334

Query: 399 -GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPALGRG 451
            G   + +P L+L+F GGA+ A+P   YFA V  +      V CL++       PA G  
Sbjct: 335 GGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMML------PAKGDQ 386

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           P  ++G+    + +L +DL    F FA   CA
Sbjct: 387 PMSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 418


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 122/399 (30%), Positives = 170/399 (42%), Gaps = 69/399 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + ++ GTPP      I DTGS LVW  C+S          + D      F P RSS+ 
Sbjct: 103 YLMYVNVGTPP-TQLLAIADTGSDLVWVNCSSS----GGGLADADAGGNVVFQPTRSSTY 157

Query: 164 QLIGCQNPKCSWIFGP--NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
             + CQ+  C  +     + +S C+               Y   YG G  T G+L +ET 
Sbjct: 158 SQLSCQSNACQALSQASCDADSECQ---------------YQYSYGDGSRTIGVLSTETF 202

Query: 221 RFPSK------TVPNFLAGCSILSDR--QPAGIAGFGRSSESLPSQLGL-----KKFSYC 267
            F          VP    GCS  S    +  G+ G G  + SL SQLG      +K SYC
Sbjct: 203 SFVDGGGKGQVRVPRVNFGCSTASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYC 262

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
           L+     DA  SS L   +         PG + TP   + V S      +Y V L  + V
Sbjct: 263 LIPSY--DANSSSTLNFGS---RAVVSEPGAASTPLVPSDVDS------YYTVALESVAV 311

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G + V    S           +IVDSG+T TF++  L   +  E  R++    R    E+
Sbjct: 312 GGQEVATHDSR----------IIVDSGTTLTFLDPALLGPLVTELERRI-KLQRVQPPEQ 360

Query: 388 KSGLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
              L+ C+D+ GK       +P++ L+F GGA + L PEN F+L+    LCL+L      
Sbjct: 361 L--LQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLV----- 413

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            P     P  ILG+   QNF++ +DL      FA   CA
Sbjct: 414 -PVSESQPVSILGNIAQQNFHVGYDLDARTVTFAAADCA 451


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 122/400 (30%), Positives = 174/400 (43%), Gaps = 71/400 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + ++ GTPP A    I DTGS LVW  C+S       +   V       F P RS++ 
Sbjct: 100 YLMYVNVGTPP-AQMLAIADTGSDLVWVNCSSNGGGGGASDGAV------VFHPSRSTTY 152

Query: 164 QLIGCQNPKCSWIFGP--NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
            L+ CQ+  C  +     + +S C+               Y   YG G  T G+L +ET 
Sbjct: 153 SLLSCQSAACQALSQASCDADSECQ---------------YQYAYGDGSRTIGVLSTETF 197

Query: 221 RFPSKT--------VPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGL-----KKFS 265
            F +          VP    GCS  S    +  G+ G G  + SL SQLG      ++FS
Sbjct: 198 SFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFS 257

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL+   +  A  SS L         D   PG + TP   + V S      +Y V L  +
Sbjct: 258 YCLVP-PYAAANSSSTLSFGARAVVSD---PGAASTPLVPSEVDS------YYTVALESV 307

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            V  + V          S  +  +IVDSG+T TF++  L   +  E  R++    RA   
Sbjct: 308 AVAGQDV---------ASANSSRIIVDSGTTLTFLDPALLRPLVAELERRI-RLPRAQPP 357

Query: 386 EKKSGLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
           E+   L+ C+D+ GK       +P++ L+F GGA + L PEN F+L+    LCL+L    
Sbjct: 358 EQL--LQLCYDVQGKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTLCLVLV--- 412

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              P     P  ILG+   QNF++ +DL      FA   C
Sbjct: 413 ---PVSESQPVSILGNIAQQNFHVGYDLDARTVTFAAVDC 449


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 192/455 (42%), Gaps = 80/455 (17%)

Query: 54  KILHSLASSSLSR-ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           ++L  +A+ S +R AR L  +      D     +Y++ +  T   VH        ++ GT
Sbjct: 71  ELLRRMAARSKARSARLLSGRAASARMDPG---SYTDGVPDTEYLVH--------MAIGT 119

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           PPQ     I DTGS L W  C     CV C   +     +P F P RS +  ++ C    
Sbjct: 120 PPQ-PVQLILDTGSDLTWTQCAP---CVSCFRQS-----LPRFNPSRSMTFSVLPCDLRI 170

Query: 173 C---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK--- 225
           C   +W       S C   S  N  C      Y   Y     T G L S+T  F S    
Sbjct: 171 CRDLTW-------SSCGEQSWGNGIC-----VYAYAYADHSITTGHLDSDTFSFASADHA 218

Query: 226 ----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
               +VP+   GC + ++        GIAGF R + S+P+QL +  FSYC  +    +  
Sbjct: 219 IGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPS 278

Query: 278 -----VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
                V  NL  D   G G       +   ++ + + +       YY+ L+ + VG+  +
Sbjct: 279 PVFLGVPPNLYSDAA-GGGHGVVQSTALIRYHSSQLKA-------YYISLKGVTVGTTRL 330

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL- 391
            IP S      DG GG IVDSG+  T +   ++  V   F+ Q    ++       S L 
Sbjct: 331 PIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ----TKLTVHNSTSSLS 386

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGPA 447
           + CF +       +P L+L F+ GA + LP ENY   +    G  + CL +     AG  
Sbjct: 387 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAI----NAGED 441

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           L      ++G+FQ QN ++ +DLAND   F   +C
Sbjct: 442 LS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 471


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 126/390 (32%), Positives = 172/390 (44%), Gaps = 52/390 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS +VW  C    +C    +   D      F P +S 
Sbjct: 116 GEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKC----YTQTDH----VFDPTKSR 166

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C  P C  +  P       GCS +NK C      Y + YG G FT G   +ETL
Sbjct: 167 TYAGIPCGAPLCRRLDSP-------GCSNKNKVC-----QYQVSYGDGSFTFGDFSTETL 214

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F    V     GC   ++      AG+ G GR   S P Q G +   KFSYCL+ R   
Sbjct: 215 TFRRNRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSAS 274

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK- 333
             P SS +  D    S  S+T    +TP  KNP         FYY+ L  I VG   V+ 
Sbjct: 275 AKP-SSVIFGD----SAVSRTA--HFTPLIKNP-----KLDTFYYLELLGISVGGAPVRG 322

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +  S     + GNGGVI+DSG++ T +  P + A+   F     +  RA +    S    
Sbjct: 323 LSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEF---SLFDT 379

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           CFD+SG   V +P ++L F+ GA ++LP  NY   V N       F    +G +      
Sbjct: 380 CFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLS------ 432

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I+G+ Q Q F + +DL   R GFA + C 
Sbjct: 433 -IIGNIQQQGFRISYDLTGSRVGFAPRGCV 461


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 140/465 (30%), Positives = 199/465 (42%), Gaps = 88/465 (18%)

Query: 42  KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSN--IGSNYSNSLIKTPLSVH 99
           ++ LH    D L++L   +  SL  A   K+      K++N  +  ++   L ++ LS  
Sbjct: 22  RNRLHR---DELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPL-RSGLSDG 77

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFI 156
           S G Y +SL  GTPP+ +   + DTGS ++W    PC S Y   D           P F 
Sbjct: 78  S-GEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTD-----------PLFN 124

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLL 215
           P  SS+ Q I C +  C  +         +GC  R   C      Y + YG G FT G  
Sbjct: 125 PSFSSTFQSITCGSSLCQQLL-------IRGC--RRNQCL-----YQVSYGDGSFTVGEF 170

Query: 216 LSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
            +ETL F S  V +   GC   +       AG+ G G+   S PSQ+G      FSYCL 
Sbjct: 171 STETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLP 230

Query: 270 SRK--------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           +R+        F +  V+SN                  +T    NP         FYYV 
Sbjct: 231 TRESTGSVPLIFGNQAVASNA----------------QFTTLLTNP-----KLDTFYYVE 269

Query: 322 LRQIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
           +  I VG   V IP   L +  S GNGGVI+DSG+  T +    +  +   F   M    
Sbjct: 270 MVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGM---- 325

Query: 381 RAADVEKKSGLR---PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
             +D +  SG      C+D+SG+ S+ LP +   F GGA MALP +N    V N     +
Sbjct: 326 -PSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCL 384

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            F  N+   +       I+G+ Q Q+F + FD   +R G    +C
Sbjct: 385 AFAPNSENFS-------IIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 117/396 (29%), Positives = 163/396 (41%), Gaps = 49/396 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G PP  +   + DTGS L+W  C    RC    +  V     P + P+ S 
Sbjct: 90  GEYFAVIGVGDPPTHAL-VVIDTGSDLIWLQCLPCRRC----YRQV----TPLYDPRNSK 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           + + I C +P+C  +       R  GC  R   C      Y++ YG G  ++G L ++TL
Sbjct: 141 THRRIPCASPQCRGVL------RYPGCDARTGGC-----VYMVVYGDGSASSGDLATDTL 189

Query: 221 RFPSKT-VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
             P  T V N   GC   ++      AG+ G GR   S P+QL       FSYCL  R  
Sbjct: 190 VLPDDTRVHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMS 249

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                SS LV    P     + P  ++TP   NP   S      YYV +    VG + V 
Sbjct: 250 RARNSSSYLVFGRTP-----ELPSTAFTPLRTNPRRPS-----LYYVDMVGFSVGGERVA 299

Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
                S  +  + G GGV+VDSG+  +      + AV   F+             K S  
Sbjct: 300 GFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVF 359

Query: 392 RPCFDISGK---KSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNAAGPA 447
             C+D+ G      V +P ++L F   A MALP  NY   +VG +          AA   
Sbjct: 360 DTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDG 419

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           L      +LG+ Q Q F + FD+   R GF    C+
Sbjct: 420 LN-----VLGNVQQQGFGVVFDVERGRIGFTPNGCS 450


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 134/457 (29%), Positives = 199/457 (43%), Gaps = 79/457 (17%)

Query: 50  SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGS------NYSNSLIKTPLSVHSYGG 103
           +D L +   L S  LS A+ L T  K +T  S+          YS +L+           
Sbjct: 41  TDSLSLSFPLTSLPLSTAKPLNTNPKLRTLSSSSSYNIKSSFKYSMALV----------- 89

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
             ++L  GTPPQ     + DTGS L W  C ++            P+   +F P  SSS 
Sbjct: 90  --VTLPIGTPPQPQQ-MVLDTGSQLSWIQCHNK----------TPPTA--SFDPSLSSSF 134

Query: 164 QLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF 222
            ++ C +P C     P V +        +N+ C     SY    G  +  G L+ E L F
Sbjct: 135 YVLPCTHPLCK----PRVPDFTLPTTCDQNRLCHY---SYFYADGT-YAEGNLVREKLAF 186

Query: 223 -PSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
            PS+T P  + GCS  S R   GI G      S P Q  + KFSYC+ +R+    P ++N
Sbjct: 187 SPSQTTPPLILGCSSES-RDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQ----PANNN 241

Query: 282 ------LVLDTGPGSGDSK-TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                   L   P S   +    L++    + P     A    Y V ++ I +G + + I
Sbjct: 242 NFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPLA----YTVPMQGIRIGGRKLNI 297

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEK 387
           P S   P + G+G  +VDSGS FTF+    ++ V +E IR +G        Y   AD+  
Sbjct: 298 PPSVFRPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADM-- 355

Query: 388 KSGLRPCFDISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
                 CFD +  +    L ++  +F+ G ++ +P E   A VG  V C+ +      G 
Sbjct: 356 ------CFDGNAMEIGRLLGDVAFEFEKGVEIVVPKERVLADVGGGVHCVGIGRSERLGA 409

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A     + I+G+F  QN ++EFDLAN R GF    C+
Sbjct: 410 A-----SNIIGNFHQQNLWVEFDLANRRIGFGVADCS 441


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 124/441 (28%), Positives = 185/441 (41%), Gaps = 55/441 (12%)

Query: 52  PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
           P   +  L S   +RA +L ++  P  + ++     S S + + L   S G Y + +  G
Sbjct: 76  PRHAVLDLVSRDNARAEYLASRLSPAYQPTDFFG--SESKVVSGLDEGS-GEYFVRVGIG 132

Query: 112 TPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNP 171
           +PP      + D+GS ++W  C     C++C +   DP     F P  S++   + C + 
Sbjct: 133 SPPTEQY-LVVDSGSDVIWVQCKP---CLEC-YAQADP----LFDPASSATFSAVSCGSA 183

Query: 172 KCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNF 230
            C  +       R  GC             Y + YG G +T G L  ETL      V   
Sbjct: 184 ICRTL-------RTSGCGDSGGC------EYEVSYGDGSYTKGTLALETLTLGGTAVEGV 230

Query: 231 LAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDD---APVSSN 281
             GC   +       AG+ G G    SL  QLG      FSYCL SR       A  + +
Sbjct: 231 AIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGS 290

Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
           LVL    G  ++   G  + P  +NP   S     FYYVG+  I VG + + +       
Sbjct: 291 LVL----GRSEAVPEGAVWVPLVRNPQAPS-----FYYVGVSGIGVGDERLPLQDGLFQL 341

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK 401
             DG GGV++D+G+  T +    + A+   F+  +G   RA  V   S L  C+D+SG  
Sbjct: 342 TEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGV---SLLDTCYDLSGYT 398

Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQL 461
           SV +P +   F G A + LP  N    V   + CL  F  +++G +       ILG+ Q 
Sbjct: 399 SVRVPTVSFYFDGAATLTLPARNLLLEVDGGIYCLA-FAPSSSGLS-------ILGNIQQ 450

Query: 462 QNFYLEFDLANDRFGFAKQKC 482
           +   +  D AN   GF    C
Sbjct: 451 EGIQITVDSANGYIGFGPATC 471


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 139/466 (29%), Positives = 195/466 (41%), Gaps = 90/466 (19%)

Query: 42  KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV--- 98
           ++ LH    D L++L   +  SL  A   K+      K++N    +     +TPL     
Sbjct: 22  RNRLHR---DELRLLSISSRISLGVAGIPKSSLTNPLKNTNP---FLQQDFETPLRSGLS 75

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF 155
              G Y +SL  GTPP+ +   + DTGS ++W    PC S Y   D           P F
Sbjct: 76  DGSGEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTD-----------PLF 123

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGL 214
            P  SS+ Q I C +  C  +         +GC  R   C      Y + YG G FT G 
Sbjct: 124 NPSFSSTFQSITCGSSLCQQLL-------IRGC--RRNQCL-----YQVSYGDGSFTVGE 169

Query: 215 LLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
             +ETL F S  V +   GC   +       AG+ G G+   S PSQ+G      FSYCL
Sbjct: 170 FSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCL 229

Query: 269 LSRK--------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            +R+        F +  V+SN                  +T    NP         FYYV
Sbjct: 230 PTRESTGSVPLIFGNQAVASNA----------------QFTTLLTNP-----KLDTFYYV 268

Query: 321 GLRQIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            +  I VG   V IP   L +  S GNGGVI+DSG+  T +    +  +   F   M   
Sbjct: 269 EMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGM--- 325

Query: 380 SRAADVEKKSGLR---PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
              +D +  SG      C+D+SG+ S+ LP +   F GGA MALP +N    V N     
Sbjct: 326 --PSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYC 383

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           + F  N+   +       I+G+ Q Q+F + FD   +R G    +C
Sbjct: 384 LAFAPNSENFS-------IIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 123/429 (28%), Positives = 184/429 (42%), Gaps = 63/429 (14%)

Query: 63  SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFI 121
           + +RA H +++ +     + +G+  + S  ++PL + S GG Y ++ S GTPPQ  +  +
Sbjct: 41  NFTRAAH-RSRERLSILATRLGAASAGS-AQSPLQMDSGGGAYDMTFSMGTPPQTLSA-L 97

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS L+W  C +  RC         P    ++ P +SSS   + C +  C  +   ++
Sbjct: 98  ADTGSDLIWAKCGACKRCA--------PRGSASYYPTKSSSFSKLPCSSALCRTLESQSL 149

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-----FTAGLLLSETLRFPSKTVPNFLAGCSI 236
            + C G   R   C     SY   YGL      +T G + SET    S  V     GC+ 
Sbjct: 150 AT-CGGTRARGAVC-----SYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFGCTT 203

Query: 237 LSDRQPAGIAGFG---RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
           +S+      +G     R   SL  QL +  FSYCL S    D   SS L+   G  +G  
Sbjct: 204 MSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTS----DPSTSSPLLFGAGALTG-- 257

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
             PG+  TP       +      FY V L  I +G+           PG+ G  G+I DS
Sbjct: 258 --PGVQSTPLVNLKTST------FYTVNLDSISIGAAK--------TPGT-GRHGIIFDS 300

Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFK 413
           G+T TF+  P +       + Q  N +R   V    G   CF  SG      P ++L F 
Sbjct: 301 GTTLTFLAEPAYTLAEAGLLSQTTNLTR---VPGTDGYEVCFQTSG--GAVFPSMVLHFD 355

Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
           GG  MAL  ENYF  V + V C ++       P+       I+G+    ++++ +DL   
Sbjct: 356 GG-DMALKTENYFGAVNDSVSCWLV----QKSPS----EMSIVGNIMQMDYHIRYDLDKS 406

Query: 474 RFGFAKQKC 482
              F    C
Sbjct: 407 VLSFQPTNC 415


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 118/415 (28%), Positives = 176/415 (42%), Gaps = 72/415 (17%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           LS H     ++SL+ G+PPQ  T  + DTGS L W  C           PN+       F
Sbjct: 48  LSFHHNVSLTVSLTVGSPPQTVT-MVLDTGSELSWLHCKKA--------PNLHS----VF 94

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA--- 212
            P RSSS   I C +P C                 R+ + P++C    L + +   A   
Sbjct: 95  DPLRSSSYSPIPCTSPTCR-------------TRTRDFSIPVSCDKKKLCHAIISYADAS 141

Query: 213 ---GLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
              G L S+T    +  +P  + GC       +   D +  G+ G  R S S  +Q+GL+
Sbjct: 142 SIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ 201

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
           KFSYC+  +        S+ +L  G  S  S    L YTP     V  S+    F    Y
Sbjct: 202 KFSYCISGQD-------SSGILLFGESSF-SWLKALKYTPL----VQISTPLPYFDRVAY 249

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            V L  I V +  +++P S   P   G G  +VDSG+ FTF+ GP++ A+  EF+RQ   
Sbjct: 250 TVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKA 309

Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALV---- 429
             +  +      +  +  C+ +   +     LP + L F+ GA+M++  E     V    
Sbjct: 310 SLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 368

Query: 430 --GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              + V C         G       + I+G    QN ++EFDLA  R GFA+ +C
Sbjct: 369 RGSDSVYCFTFGNSELLGVE-----SYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 115/402 (28%), Positives = 171/402 (42%), Gaps = 60/402 (14%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
           V  Y  Y I    GTP         DTGS +VW  C   + C     P  D S       
Sbjct: 86  VVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSA------ 139

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLL 216
             S +   + C +P C  +              R   C L   +Y + YG    T G L 
Sbjct: 140 --SDTVHGVLCTDPICRAL--------------RPHACFLGGCTYQVNYGDNSVTIGQLA 183

Query: 217 SETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
            ++  F  K     TVP+ + GC   +         GIAGFGR   SLP QLG+  FSYC
Sbjct: 184 KDSFTFDGKGGGKVTVPDLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYC 243

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSG---DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
             +  F+    S+ + L   P  G    +  P LS TPF  N         E+YY+ L+ 
Sbjct: 244 FTT-IFESK--STPVFLGGAPADGLRAHATGPILS-TPFLPN-------HPEYYYLSLKG 292

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I VG   + +P S  V  +DG+GG I+DSG+  T     +F ++ + F+ Q+     + +
Sbjct: 293 ITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYN 352

Query: 385 VEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFT 440
              +  L+ CF    +     V +P++ L  + GA   LP ENY A    ++ LC+++  
Sbjct: 353 DTGEPTLQ-CFSTESVPDASKVPVPKMTLHLE-GADWELPRENYMAEYPDSDQLCVVVLA 410

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                   G     ++G+FQ QN ++  DLA ++      +C
Sbjct: 411 --------GDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQC 444


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 121/400 (30%), Positives = 191/400 (47%), Gaps = 55/400 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G PP+     I DTGS L W  C     C D + P  DPS+        S+
Sbjct: 85  GEYFMDVFVGNPPRHFL-LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQ--------ST 135

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S ++I C    C  +    V   C+  S  +KT P  C  Y   YG    T+G L  E+L
Sbjct: 136 SFKIIPCNAAACDLV----VHDECRDNS--SKTSPKTC-KYFYWYGDSSRTSGDLALESL 188

Query: 221 RFP------SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL----KKFSYC 267
                    S  + + + GC   +    +   G+ G G+ + S PSQL      + FSYC
Sbjct: 189 SVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYC 248

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           L+ R  ++  VSS +    G G   S+    + +TPF    V ++++   FYY+G++ I 
Sbjct: 249 LVDRT-NNLSVSSAISF--GAGFALSRHFDQMKFTPF----VRTNNSVETFYYLGIQGIK 301

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +  + + IP       ++G+GG I+DSG+T T++    + AV   F+ ++ +Y RA   +
Sbjct: 302 IDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFD 360

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL--CL-ILFTDNA 443
               L  C++ +G+ +V  P L + F+ GA++ LP ENYF     +    CL IL TD  
Sbjct: 361 I---LGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM 417

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +          I+G+FQ QN +  +D+ + R GFA   C+
Sbjct: 418 S----------IIGNFQQQNIHFLYDVQHARLGFANTDCS 447


>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 126/410 (30%), Positives = 187/410 (45%), Gaps = 65/410 (15%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS LVWFPC   + C+ C    + PS   +     ++ S      +   S +   ++
Sbjct: 100 LDTGSDLVWFPCRP-FTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSDL 158

Query: 182 ESRCKGCSPRNKTCPLA-------------CPSYLLQYGLGFTAGLLLSETLRFPSKTVP 228
                 C+  N  CPL              CP +   YG G     L S++L  PS +V 
Sbjct: 159 ------CAISN--CPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVS 210

Query: 229 NFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAPVS--S 280
           NF  GC+  +  +P G+AGFGR   SLP+QL +        FSYCL+S  FD   V   S
Sbjct: 211 NFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPS 270

Query: 281 NLVLDTGPGSGDSKT----------------PGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
            L+L       + +                     +T   +NP         FY V L+ 
Sbjct: 271 PLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP-----KHPYFYSVSLQG 325

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAA 383
           I +G +++  P        +G GGV+VDSG+TFT +    + +V +EF  ++G  + RA 
Sbjct: 326 ISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERAD 385

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGG-AKMALPPENYFALVGN---------EV 433
            VE  SG+ PC+ ++  ++V +P L+L F G  + + LP  NYF    +         ++
Sbjct: 386 RVEPSSGMSPCYYLN--QTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKI 443

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            CL+L          G G   ILG++Q Q F + +DL N R GFAK+KCA
Sbjct: 444 GCLMLMNGGDESELRG-GTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCA 492


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 174/387 (44%), Gaps = 55/387 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G PP +    + DTGS + W  C     C +C +   DP     F P  S+
Sbjct: 149 GEYFSRVGIGRPP-SPVYMVLDTGSDVSWVQCAP---CAEC-YEQTDP----XFEPTSSA 199

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C+  +C  +   +V S C     RN TC      Y + YG G +T G  ++ET+
Sbjct: 200 SFTSLSCETEQCKSL---DV-SEC-----RNGTCL-----YEVSYGDGSYTVGDFVTETV 245

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
              S ++ N   GC   ++      AG+ G G  S S PSQL    FSYCL+ R   D+ 
Sbjct: 246 TLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDR---DSD 302

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            +S L  ++        TP     P ++NP         F+Y+GL  + VG   + IP +
Sbjct: 303 STSTLDFNS------PITPDAVTAPLHRNP-----NLDTFFYLGLTGMSVGGAVLPIPET 351

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                 DGNGG+IVDSG+  T ++  ++  +   F++   +   A  V   +    C+D+
Sbjct: 352 SFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGV---ALFDTCYDL 408

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF--TDNAAGPALGRGPAII 455
           S K  V +P +   F  G ++ LP +NY   V +E      F  TD+            I
Sbjct: 409 SSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS---------I 459

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG+ Q Q   + FDLAN   GF+  KC
Sbjct: 460 LGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 118/415 (28%), Positives = 176/415 (42%), Gaps = 72/415 (17%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           LS H     ++SL+ G+PPQ  T  + DTGS L W  C           PN+       F
Sbjct: 55  LSFHHNVSLTVSLTVGSPPQTVT-MVLDTGSELSWLHCKKA--------PNLHS----VF 101

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA--- 212
            P RSSS   I C +P C                 R+ + P++C    L + +   A   
Sbjct: 102 DPLRSSSYSPIPCTSPTCR-------------TRTRDFSIPVSCDKKKLCHAIISYADAS 148

Query: 213 ---GLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
              G L S+T    +  +P  + GC       +   D +  G+ G  R S S  +Q+GL+
Sbjct: 149 SIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ 208

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
           KFSYC+  +        S+ +L  G  S  S    L YTP     V  S+    F    Y
Sbjct: 209 KFSYCISGQD-------SSGILLFGESSF-SWLKALKYTPL----VQISTPLPYFDRVAY 256

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            V L  I V +  +++P S   P   G G  +VDSG+ FTF+ GP++ A+  EF+RQ   
Sbjct: 257 TVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKA 316

Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALV---- 429
             +  +      +  +  C+ +   +     LP + L F+ GA+M++  E     V    
Sbjct: 317 SLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 375

Query: 430 --GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              + V C         G       + I+G    QN ++EFDLA  R GFA+ +C
Sbjct: 376 RGSDSVYCFTFGNSELLGVE-----SYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 167/391 (42%), Gaps = 46/391 (11%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           G+S+++    P +     I DTGS L+W  C    +              P + P  SS+
Sbjct: 15  GHSLTVGIVQPRK----LIVDTGSDLIWTQC----KLSSSTAAAARHGSPPVYDPGESST 66

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF 222
              + C +  C        +   K C+ +N+        Y   YG     G+L SET  F
Sbjct: 67  FAFLPCSDRLCQ-----EGQFSFKNCTSKNRCV------YEDVYGSAAAVGVLASETFTF 115

Query: 223 PSKTVPNFLAG--CSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
            ++   +   G  C  LS        GI G    S SL +QL +++FSYCL    F D  
Sbjct: 116 GARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCL--TPFADKK 173

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S  L       S    T  +  T    NPV +      +YYV L  I +G K + +P +
Sbjct: 174 TSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETV-----YYYVPLVGISLGHKRLAVPAA 228

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L    DG GG IVDSGST  ++    FEAV KE +  +     A    +   L  CF +
Sbjct: 229 SLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL--CFVL 285

Query: 398 ------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
                 +  ++V +P L+L F GGA M LP +NYF      ++CL      A G      
Sbjct: 286 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCL------AVGKTTDGS 339

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ Q QN ++ FD+ + +F FA  +C
Sbjct: 340 GVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 121/406 (29%), Positives = 170/406 (41%), Gaps = 60/406 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP--AFIPKR 159
           G Y + L  GTPPQ     + DTGS LVW  C++   C +C          P  AF+ + 
Sbjct: 87  GQYFVDLRLGTPPQKLL-LVADTGSDLVWVKCSA---CRNCT------RHTPGSAFLARH 136

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGF-TAGLLLS 217
           S++     C +  C  +  P    RC           L  P  Y   YG G  T+G    
Sbjct: 137 STTFSPNHCYDSACQLVPLPK-HHRCNHAR-------LHSPCRYEYSYGDGSKTSGFFSK 188

Query: 218 ETLRFPSKT-----VPNFLAGCSI---------LSDRQPAGIAGFGRSSESLPSQLGLK- 262
           ET    + +     +     GC+           S     G+ G GR   SL SQLG + 
Sbjct: 189 ETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRF 248

Query: 263 --KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEF 317
             KFSYCL+      +P S  L+  T     +   PG   + +TP + NP+  +     F
Sbjct: 249 GNKFSYCLMDHDISPSPTSYLLIGSTQ----NDVAPGKRRMRFTPLHINPLSPT-----F 299

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           YY+G+  + V    + I  S       GNGG IVDSG+T TF+  P +  +     R++ 
Sbjct: 300 YYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVR 359

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
             S A   E   G   C ++S  +   LP+L  K  G +  + PP NYF     +V CL 
Sbjct: 360 LPSPA---EPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLA 416

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           L    A     G     ++G+   Q F LEFD    R GF++  CA
Sbjct: 417 L---QAVMTPSGFS---VIGNLMQQGFLLEFDKDRTRLGFSRHGCA 456


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 174/387 (44%), Gaps = 55/387 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G PP +    + DTGS + W  C     C +C +   DP     F P  S+
Sbjct: 149 GEYFSRVGIGRPP-SPVYMVLDTGSDVSWVQCAP---CAEC-YEQTDP----IFEPTSSA 199

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C+  +C  +   +V S C     RN TC      Y + YG G +T G  ++ET+
Sbjct: 200 SFTSLSCETEQCKSL---DV-SEC-----RNGTCL-----YEVSYGDGSYTVGDFVTETV 245

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
              S ++ N   GC   ++      AG+ G G  S S PSQL    FSYCL+ R   D+ 
Sbjct: 246 TLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDR---DSD 302

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            +S L  ++        TP     P ++NP         F+Y+GL  + VG   + IP +
Sbjct: 303 STSTLDFNS------PITPDAVTAPLHRNP-----NLDTFFYLGLTGMSVGGAVLPIPET 351

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                 DGNGG+IVDSG+  T ++  ++  +   F++   +   A  V   +    C+D+
Sbjct: 352 SFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGV---ALFDTCYDL 408

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF--TDNAAGPALGRGPAII 455
           S K  V +P +   F  G ++ LP +NY   V +E      F  TD+            I
Sbjct: 409 SSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS---------I 459

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG+ Q Q   + FDLAN   GF+  KC
Sbjct: 460 LGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 165/391 (42%), Gaps = 55/391 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+P +     + DTGS + W  C+    C  C   N        F P+ SS
Sbjct: 12  GEYFVRVGIGSPTKLQY-LVMDTGSDVPWIQCSP---CKSCYKQN-----DAVFDPRASS 62

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S + + C  P+C  +         K C+  +  C      Y + YG G FT G L S++ 
Sbjct: 63  SFRRLSCSTPQCKLL-------DVKACASTDNRCL-----YQVSYGDGSFTVGDLASDSF 110

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDD 275
                     + GC    D +   +   G         S PSQL  +KFSYCL+SR  + 
Sbjct: 111 SVSRGRTSPVVFGCG--HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRD-NG 167

Query: 276 APVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
              SS L+       GDS  P     +YT   KNP         FYY GL  I +G   +
Sbjct: 168 VRASSALLF------GDSALPTSASFAYTQLLKNP-----KLDTFYYAGLSGISIGGTLL 216

Query: 333 KIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
            IP  ++ +  S G GGVI+DSG++ T +    +  +   F        RAAD    S  
Sbjct: 217 SIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADF---SLF 273

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D S   SV +P +   F+GGA + LPP NY   V         F+  +   +    
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ Q Q   +  DL + R GFA ++C
Sbjct: 330 ---IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 121/390 (31%), Positives = 176/390 (45%), Gaps = 54/390 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  G+P +  T FIFDTGS L W  C     CV   +      R   F P  S 
Sbjct: 145 GNYVVTVGLGSPKRDLT-FIFDTGSDLTWTQCEP---CVGYCYQQ----REHIFDPSTSL 196

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +P C  +   +      GCS  + TC      Y ++YG G ++ G    E L
Sbjct: 197 SYSNVSCDSPSCEKL--ESATGNSPGCS--SSTCL-----YGIRYGDGSYSIGFFAREKL 247

Query: 221 RFPSKTV-PNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRK 272
              S  V  NF  GC   ++R      AG+ G  R+  SL SQ   K    FSYCL    
Sbjct: 248 SLTSTDVFNNFQFGCG-QNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL---- 302

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
               P SS+       GSGD  +  + +TP   N     S +  FY++ +  I VG + +
Sbjct: 303 ----PSSSSSTGYLSFGSGDGDSKAVKFTPSEVN-----SDYPSFYFLDMVGISVGERKL 353

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            IP S          G I+DSG+  + +   ++ +V K F   M +Y R   V   S L 
Sbjct: 354 PIPKSVF-----STAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGV---SILD 405

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            C+D+S  K+V +P++IL F GGA+M L PE    ++    +CL  F  N+    +    
Sbjct: 406 TCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLA-FAGNSDDDEVA--- 461

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             I+G+ Q +  ++ +D A  R GFA   C
Sbjct: 462 --IIGNVQQKTIHVVYDDAEGRVGFAPSGC 489


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 122/392 (31%), Positives = 175/392 (44%), Gaps = 58/392 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT--SRYRCVDCNFPNVDPSRIPAFIPKR 159
           G Y +++  GTP +  T FIFDTGS L W  C   +RY C           + P F P +
Sbjct: 136 GNYVVTVGLGTPKRDLT-FIFDTGSDLTWTQCEPCARY-CYH--------QQEPIFNPSK 185

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S+S   I C +P C  +         K  +  + +C  +   Y +QYG   ++ G    +
Sbjct: 186 STSYTNISCSSPTCDEL---------KSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQD 236

Query: 219 TLRFPSKTV-PNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKK---FSYCLLS 270
            L   S  V  NFL GC   ++R      AG+ G GR++ SL SQ   K    FSYCL S
Sbjct: 237 KLALTSTDVFNNFLFGCG-QNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPS 295

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
                   SS   L  G G G SK   + +TP   N  G S     FY++ L  I VG +
Sbjct: 296 TS------SSTGYLTFGSGGGTSK--AVKFTPSLVNSQGPS-----FYFLNLIAISVGGR 342

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            +    S          G I+DSG+  + +    +  +   F +QM  Y +AA     S 
Sbjct: 343 KLSTSASVF-----STAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPA---SI 394

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
           L  C+D S   +V +P++ L F  GA+M L P   F ++    +CL  F  N+    +  
Sbjct: 395 LDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLA-FAGNSDATDIA- 452

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               ILG+ Q + F + +D+A  R GFA   C
Sbjct: 453 ----ILGNVQQKTFDVVYDVAGGRIGFAPGGC 480


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 165/391 (42%), Gaps = 55/391 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+P +     + DTGS + W  C+    C  C   N        F P+ SS
Sbjct: 12  GEYFVRVGIGSPTKLQY-LVMDTGSDVPWIQCSP---CKSCYKQN-----DAVFDPRASS 62

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S + + C  P+C  +         K C+  +  C      Y + YG G FT G L S++ 
Sbjct: 63  SFRRLSCSTPQCKLL-------DVKACASTDNRCL-----YQVSYGDGSFTVGDLASDSF 110

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDD 275
                     + GC    D +   +   G         S PSQL  +KFSYCL+SR  + 
Sbjct: 111 LVSRGRTSPVVFGCG--HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRD-NG 167

Query: 276 APVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
              SS L+       GDS  P     +YT   KNP         FYY GL  I +G   +
Sbjct: 168 VRASSALLF------GDSALPTSASFAYTQLLKNP-----KLDTFYYAGLSGISIGGTLL 216

Query: 333 KIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
            IP  ++ +  S G GGVI+DSG++ T +    +  +   F        RAAD    S  
Sbjct: 217 SIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADF---SLF 273

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D S   SV +P +   F+GGA + LPP NY   V         F+  +   +    
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ Q Q   +  DL + R GFA ++C
Sbjct: 330 ---IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 168/400 (42%), Gaps = 63/400 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS +VW       +C  C        R+  F P+RS 
Sbjct: 126 GEYFAQVGVGTPATTAL-MVLDTGSDVVWL------QCAPCRHCYAQSGRV--FDPRRSR 176

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C  P C  +          GC  R  +C      Y + YG G  TAG   SETL
Sbjct: 177 SYAAVDCVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 224

Query: 221 RFPS-KTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
            F     V     GC    D +   IA       GR   S PSQ+     + FSYCL+ R
Sbjct: 225 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDR 282

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                P S+     T      +   G S+TP  +NP         FYYV L    VG   
Sbjct: 283 TSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNP-----RMATFYYVHLLGFSVGGAR 337

Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
           VK +  S L +  + G GGVI+DSG++ T +  P++EAV   F        RAA V  + 
Sbjct: 338 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 389

Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
                S    C+++SG++ V +P + +   GGA +ALPPENY   V      C  +   +
Sbjct: 390 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 449

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                   G   I+G+ Q Q F + FD    R GF  + C
Sbjct: 450 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 168/400 (42%), Gaps = 63/400 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS +VW       +C  C        R+  F P+RS 
Sbjct: 120 GEYFAQVGVGTPATTAL-MVLDTGSDVVWL------QCAPCRHCYAQSGRV--FDPRRSR 170

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C  P C  +          GC  R  +C      Y + YG G  TAG   SETL
Sbjct: 171 SYAAVDCVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 218

Query: 221 RFPS-KTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
            F     V     GC    D +   IA       GR   S PSQ+     + FSYCL+ R
Sbjct: 219 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDR 276

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                P S+     T      +   G S+TP  +NP         FYYV L    VG   
Sbjct: 277 TSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNP-----RMATFYYVHLLGFSVGGAR 331

Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
           VK +  S L +  + G GGVI+DSG++ T +  P++EAV   F        RAA V  + 
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 383

Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
                S    C+++SG++ V +P + +   GGA +ALPPENY   V      C  +   +
Sbjct: 384 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 443

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                   G   I+G+ Q Q F + FD    R GF  + C
Sbjct: 444 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 121/400 (30%), Positives = 190/400 (47%), Gaps = 55/400 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G PP+     I DTGS L W  C     C D + P  DPS+        S+
Sbjct: 169 GEYFMDVFVGNPPRHFL-LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQ--------ST 219

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S ++I C    C  +    V   C+  S  +KT P  C  Y   YG    T+G L  E+L
Sbjct: 220 SFKIIPCNAAACDLV----VHDECRDNS--SKTSPKTC-KYFYWYGDSSRTSGDLALESL 272

Query: 221 RFP------SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL----KKFSYC 267
                    S  + + + GC   +    +   G+ G G+ + S PSQL      + FSYC
Sbjct: 273 SVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYC 332

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           L+ R  ++  VSS +    G G   S+    + +TPF    V ++++   FYY+G++ I 
Sbjct: 333 LVDRT-NNLSVSSAISF--GAGFALSRHFDQMRFTPF----VRTNNSVETFYYLGIQGIK 385

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +  + + IP        +G+GG I+DSG+T T++    + AV   F+ ++ +Y RA   +
Sbjct: 386 IDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFD 444

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL--CL-ILFTDNA 443
               L  C++ +G+ +V  P L + F+ GA++ LP ENYF     +    CL IL TD  
Sbjct: 445 I---LGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM 501

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +          I+G+FQ QN +  +D+ + R GFA   C+
Sbjct: 502 S----------IIGNFQQQNIHFLYDVQHARLGFANTDCS 531


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 133/403 (33%), Positives = 171/403 (42%), Gaps = 69/403 (17%)

Query: 102 GGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           G Y   +  GTP    TP   + DTGS +VW  C    RC D +    DP        + 
Sbjct: 145 GEYFTKIGVGTP---VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP--------RA 193

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           S S   + C  P C  +          GC  R K C      Y + YG G  TAG   +E
Sbjct: 194 SHSYGAVDCAAPLCRRL-------DSGGCDLRRKAC-----LYQVAYGDGSVTAGDFATE 241

Query: 219 TLRFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
           TL F S   VP    GC   ++      AG+ G GR S S PSQ+  +    FSYCL+ R
Sbjct: 242 TLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDR 301

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
               A  +S     T        +   S+TP  KNP         FYYV L  I VG   
Sbjct: 302 TSSSASATSRSSTVTFGSGAVGPSAAASFTPMVKNP-----RMETFYYVQLMGISVGGAR 356

Query: 332 V-KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           V  +  S L +  S G GGVIVDSG++ T +  P + A+   F        RAA     +
Sbjct: 357 VPGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAF--------RAA----AA 404

Query: 390 GLR----------PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           GLR           C+D+SG K V +P + + F GGA+ ALPPENY   V +       F
Sbjct: 405 GLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 464

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                G +       I+G+ Q Q F + FD    R GF  + C
Sbjct: 465 AGTDGGVS-------IIGNIQQQGFRVVFDGDGQRLGFVPKGC 500


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 117/388 (30%), Positives = 172/388 (44%), Gaps = 57/388 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G PP  +   I DTGS + W  C     C DC +   DP     F P  S+
Sbjct: 147 GEYFSRVGIGKPPSQAY-LILDTGSDVNWVQCAP---CADC-YQQADP----IFEPASSA 197

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C   +C  +   +V S C     RN TC      Y + YG G +T G  ++ET+
Sbjct: 198 SFSTLSCNTRQCRSL---DV-SEC-----RNDTC-----LYEVSYGDGSYTVGDFVTETI 243

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
              S  V N   GC   ++      AG+ G G  S S PSQ+    FSYCL+ R   D+ 
Sbjct: 244 TLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDR---DSE 300

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            +S L  ++      +  P     P  +N          FYYVGL  + VG + V IP S
Sbjct: 301 SASTLEFNS------TLPPNAVSAPLLRN-----HHLDTFYYVGLTGLSVGGELVSIPES 349

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL---RPC 394
                  GNGGVIVDSG+  T ++  ++ ++   F++      R  D+   +G+     C
Sbjct: 350 AFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVK------RTRDLPSTNGIALFDTC 403

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +D+S K +V +P +   F  G ++ LP +NY   + +E      F   A+  +       
Sbjct: 404 YDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLS------- 456

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q Q   + +DL N   GF   KC
Sbjct: 457 IIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 128/400 (32%), Positives = 177/400 (44%), Gaps = 59/400 (14%)

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
           L  TP++  + G Y I +SFG+PPQ ++  I DTGS L+W       +C+ C   N   S
Sbjct: 68  LFSTPVASGN-GEYLIDISFGSPPQKAS-VIVDTGSDLIW------TQCLPCETCNAAAS 119

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
            I  F P +SS+   + C +  CS +   +  + CK               Y   YG G 
Sbjct: 120 VI--FDPVKSSTYDTVSCASNFCSSLPFQSCTTSCK---------------YDYMYGDGS 162

Query: 211 -TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKK 263
            T+G L +ET+   + T+PN   GC   ++ S    AGI G G+   SL SQ   +  KK
Sbjct: 163 STSGALSTETVTVGTGTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKK 222

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL+         S  L+ D+    G + T  L+ T    NP         FYY  L 
Sbjct: 223 FSYCLV--PLGSTKTSPMLIGDSAAAGGVAYTALLTNT---ANPT--------FYYADLT 269

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I V  K V  P       + G GG I+DSG+T T++E   F A+      ++       
Sbjct: 270 GISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNALVAALKAEVPFPEADG 329

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDN 442
            +    GL  CF  +G  +   P +   FK GA   LPPEN F AL     +CL +    
Sbjct: 330 SLY---GLDYCFSTAGVANPTYPTMTFHFK-GADYELPPENVFVALDTGGSICLAM---- 381

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           AA          I+G+ Q QN  +  DL N R GF +  C
Sbjct: 382 AASTGFS-----IMGNIQQQNHLIVHDLVNQRVGFKEANC 416


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 115/375 (30%), Positives = 165/375 (44%), Gaps = 45/375 (12%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DT S L W  C     C D   P  DPS  P++          + C +P C  +   
Sbjct: 156 VIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAA--------VPCDSPSCDALQQQ 207

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
                  G  P +   P AC SY L Y  G ++ G+L  + L    + +  F+ GC   +
Sbjct: 208 LATGAGAGAPPCDAGRPAAC-SYALSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTSN 266

Query: 239 DRQP----AGIAGFGRSSESLPSQLGLK---KFSYCL-LSRKFDDAPVSSNLVLDTGPGS 290
              P    +G+ G GRS  SL SQ   +    FSYCL LSR+ D    S +LVL   P +
Sbjct: 267 QGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESD---ASGSLVLGDDPSA 323

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG-NGGV 349
             + TP + YT    N        G FY V L  I VG + V+         S G +   
Sbjct: 324 YRNSTP-VVYTSMVSNS--DPLLQGPFYLVNLTGITVGGQEVE---------STGFSARA 371

Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
           IVDSG+  T +   ++ AV  EF+ Q+  Y +A      S L  CF+++G K V +P L 
Sbjct: 372 IVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGF---SILDTCFNMTGLKEVQVPSLT 428

Query: 410 LKFKGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
           L F GGA++ +      YF    +  +CL      A           I+G++Q +N  + 
Sbjct: 429 LVFDGGAEVEVDSGGVLYFVSSDSSQVCL------AVASLKSEDETSIIGNYQQKNLRVV 482

Query: 468 FDLANDRFGFAKQKC 482
           FD +  + GFA++ C
Sbjct: 483 FDTSASQVGFAQETC 497


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 134/425 (31%), Positives = 182/425 (42%), Gaps = 80/425 (18%)

Query: 91  LIKTP---LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
           LI  P   LS H     ++SL+ G+PPQ  T  + DTGS L W  C              
Sbjct: 24  LISQPSNKLSFHHNVTLTVSLTVGSPPQQVT-MVLDTGSELSWLHCKK------------ 70

Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG------CSPRNKTCPLACPS 201
            P+    F P  SSS   I C +P C         +R +       C P+ K C  A  S
Sbjct: 71  SPNLTSVFNPLSSSSYSPIPCSSPVC--------RTRTRDLPNPVTCDPK-KLCH-AIVS 120

Query: 202 YLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSES 254
           Y     L    G L S+  R  S  +P  L GC       +   D +  G+ G  R S S
Sbjct: 121 YADASSL---EGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLS 177

Query: 255 LPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSS 311
             +QLGL KFSYC+  R       SS ++L      GDS       L+YTP     V  S
Sbjct: 178 FVTQLGLPKFSYCISGRD------SSGVLL-----FGDSHLSWLGNLTYTPL----VQIS 222

Query: 312 SAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
           +    F    Y V L  I VG+K + +P S   P   G G  +VDSG+ FTF+ GP++ A
Sbjct: 223 TPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 282

Query: 368 VAKEFIRQM-GNYSRAAD--VEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPE 423
           +  EF+ Q  G  +   D     +  +  C+ + +G K   LP + L F+ GA+M +  E
Sbjct: 283 LRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMFR-GAEMVVGGE 341

Query: 424 NYFALV-----GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
                V     G E V CL     +  G       A ++G    QN ++EFDL   R GF
Sbjct: 342 VLLYKVPGMMKGKEWVYCLTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLVKSRVGF 396

Query: 478 AKQKC 482
            + +C
Sbjct: 397 VETRC 401


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 120/415 (28%), Positives = 180/415 (43%), Gaps = 72/415 (17%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L  H     ++SL+ GTP Q  T  + DTGS L W  C              +P+    F
Sbjct: 59  LLFHHNVTLTVSLTAGTPLQNIT-MVLDTGSELSWLHCKK------------EPNFNSIF 105

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC-PSYLLQYGLGF---- 210
            P  S +   I C +P C        E+R      R+   P++C P+ L  + + +    
Sbjct: 106 NPLASKTYTKIPCSSPTC--------ETRT-----RDLPLPVSCDPAKLCHFIISYADAS 152

Query: 211 -TAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
              G L  ET R  S T P  + GC       +   D +  G+ G  R S S  +Q+G +
Sbjct: 153 SVEGNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFR 212

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
           KFSYC+  R        S+ VL  G  S     P L+YTP     V  S+    F    Y
Sbjct: 213 KFSYCISDR-------DSSGVLLLGEASFSWLKP-LNYTPL----VEMSTPLPYFDRVAY 260

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            V L  I V  K + +P S  VP   G G  +VDSG+ FTF+ GP++ A+ +EF+ Q   
Sbjct: 261 SVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKG 320

Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNE- 432
             R  +  +   +  +  C+ I   ++    LP + L F+ GA+M++  +     V  E 
Sbjct: 321 VLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFR-GAEMSVSGQRLLYRVPGEV 379

Query: 433 -----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                V C      ++ G       + ++G  Q QN ++E+DL   R GFA+ +C
Sbjct: 380 RGKDSVWCFTFGNSDSLGIE-----SFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 123/400 (30%), Positives = 168/400 (42%), Gaps = 63/400 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS +VW       +C  C        R+  F P+RS 
Sbjct: 120 GEYFAQVGVGTPATTAL-MVLDTGSDVVWL------QCAPCRHCYAQSGRV--FDPRRSR 170

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C  P C  +          GC  R  +C      Y + YG G  TAG   SETL
Sbjct: 171 SYAAVDCVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 218

Query: 221 RFPS-KTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
            F     V     GC    D +   IA       GR   S P+Q+     + FSYCL+ R
Sbjct: 219 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDR 276

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                P S+     T      +   G S+TP  +NP         FYYV L    VG   
Sbjct: 277 TSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNP-----RMATFYYVHLLGFSVGGAR 331

Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
           VK +  S L +  + G GGVI+DSG++ T +  P++EAV   F        RAA V  + 
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 383

Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
                S    C+++SG++ V +P + +   GGA +ALPPENY   V      C  +   +
Sbjct: 384 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 443

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                   G   I+G+ Q Q F + FD    R GF  + C
Sbjct: 444 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 107/347 (30%), Positives = 157/347 (45%), Gaps = 44/347 (12%)

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
           P F P  SS+   + C +  C ++  P +     GC             Y   YG+GFTA
Sbjct: 94  PPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCV------------YYYPYGMGFTA 141

Query: 213 GLLLSETLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
           G L +ETL     + P    GCS  +      +GI G GRS  SL SQ+G+ +FSYCL S
Sbjct: 142 GYLATETLHVGGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS 201

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
               DA    + +L    GS    T G S     +NP   SS+   +YYV L  I VG+ 
Sbjct: 202 ----DADAGDSPILF---GSLAKVTGGKSSPAILENPEMPSSS---YYYVNLTGITVGAT 251

Query: 331 HVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
            + +  +        G+   GG IVDSG+T T++    +  V + F+ QM   +    V 
Sbjct: 252 DLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVN 311

Query: 387 -KKSGLRPCFDIS---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCL 436
             + G   CFD +   G   V +P L+L+F GGA+ A+   +Y  +V  +      V CL
Sbjct: 312 GTRFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECL 371

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           ++       PA  +    I+G+    + ++ +DL    F FA   CA
Sbjct: 372 LVL------PASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 138/483 (28%), Positives = 204/483 (42%), Gaps = 74/483 (15%)

Query: 20  TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTK 79
           TT     S A++ ++ L P    H   H D   L +L  LA  S +R + + TK +    
Sbjct: 68  TTSFSPTSLASSFSLELHPRELLHGGSHKDYRAL-MLSRLARDS-ARVKAINTKLQLAVS 125

Query: 80  ----------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
                     D+ I      S   T  +    G Y + +  G P + +   + DTGS + 
Sbjct: 126 GTDKSDLVPMDTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSK-TFYMVIDTGSDVN 184

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
           W  C     C DC +  VDP     F P  SSS   +GCQ P+C  +           C 
Sbjct: 185 WLQCKP---CDDC-YQQVDP----IFDPASSSSFSRLGCQTPQCRNL-------DVFAC- 228

Query: 190 PRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAG 247
            RN +C      Y + YG G +T G   +ET+ F  S +V     GC    D +   +  
Sbjct: 229 -RNDSCL-----YQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCG--HDNEGLFVGA 280

Query: 248 FGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTP 302
            G         SL SQ+    FSYCL++R   D+  SS L  ++   S DS T      P
Sbjct: 281 AGLIGLGGGPLSLTSQIKASSFSYCLVNR---DSVDSSTLEFNSAKPS-DSVT-----AP 331

Query: 303 FYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
            +KN     S    FYYVG+  + VG + + IP S       G GG+IVD G+  T ++ 
Sbjct: 332 IFKN-----SKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQT 386

Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMA 419
             + A+   F++         D+   SG      C+++S + SV +P +   F GG  + 
Sbjct: 387 QAYNALRDTFVK------LTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLP 440

Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           LPP NY   V +     + F    A  +       I+G+ Q Q   + +DLAN +  F+ 
Sbjct: 441 LPPSNYLIPVDSAGTFCLAFAPTTASLS-------IIGNVQQQGTRVTYDLANSQVSFSS 493

Query: 480 QKC 482
           +KC
Sbjct: 494 RKC 496


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 177/400 (44%), Gaps = 67/400 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + LS GTPPQ   P + DTGS LVW       +C +C+  ++D      F    SS
Sbjct: 3   GEYMMELSIGTPPQL-IPAMIDTGSDLVWL------KCDNCDHCDLDHHGETIFFSDASS 55

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S + + C +  CS +    +  RC+      +TC      Y  +YG G  T+G + S+ +
Sbjct: 56  SYKKLPCNSTHCSGMSSAGIGPRCE------ETCK-----YKYEYGDGSRTSGDVGSDRI 104

Query: 221 RFPSKTV--------PNFLAGCS--ILSDRQ-PAGIAGFGRSSESLPSQLGLK---KFSY 266
            F S             FL GC+  +  D     G+ G G+ S SL  QLG K   KFSY
Sbjct: 105 SFRSHGAGEDHRSFFDGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSY 164

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-FYYVGLRQI 325
           CL+S    D+P S+   L  G       +  L        P+       +  YYV L+ I
Sbjct: 165 CLVSY---DSPPSAKSFLFLG------SSAALRGHDVVSTPILHGDHLDQTLYYVDLQSI 215

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGV--------IVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
            +G     +P       S  N  V        ++DSG+T+T +  P++EA+ K    Q+ 
Sbjct: 216 TIGG----VPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV- 270

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
                  +   +GL  CF+ SG  S   P +   F    ++ LP EN F +   +V+CL 
Sbjct: 271 ---ILPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLS 327

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           +   +++G     G   I+G+ Q QNF++ +DL   +  F
Sbjct: 328 M---DSSG-----GDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 125/393 (31%), Positives = 171/393 (43%), Gaps = 58/393 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +S GTPPQ  +  I DTGS L W  C    RC    F   DP     FIP  SS
Sbjct: 6   GEYVLQISLGTPPQQFSA-IVDTGSDLCWVQCAPCARC----FEQPDP----LFIPLASS 56

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S     C +  C  +  P        CS RN TC     +Y   YG G  T G    ET+
Sbjct: 57  SYSNASCTDSLCDALPRPT-------CSMRN-TC-----TYSYSYGDGSNTRGDFAFETV 103

Query: 221 RFPSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLG---LKKFSYCLLSRKFD 274
                T+     GC    +   AG   + G G+   SLPSQL       FSYCL+    D
Sbjct: 104 TLNGSTLARIGFGCGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLV----D 159

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            +   +   +  G  + +S+    S+TP  +N    S     +YYVG+  I VG++ V  
Sbjct: 160 QSTTGTFSPITFGNAAENSRA---SFTPLLQNEDNPS-----YYYVGVESISVGNRRVPT 211

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P S     ++G GGVI+DSG+T T+     F  +  E  RQ+ +Y  A       GL  C
Sbjct: 212 PPSAFRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQI-SYPEAD--PTPYGLNLC 268

Query: 395 FDIS--GKKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGPALGR 450
           +DIS     S+ LP + +         +P  N + LV N  E +C  + T +        
Sbjct: 269 YDISSVSASSLTLPSMTVHLT-NVDFEIPVSNLWVLVDNFGETVCTAMSTSDQFS----- 322

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               I+G+ Q QN  +  D+AN R GF    C+
Sbjct: 323 ----IIGNVQQQNNLIVTDVANSRVGFLATDCS 351


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 118/400 (29%), Positives = 176/400 (44%), Gaps = 67/400 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + LS GTPPQ   P + DTGS LVW       +C +C+  ++D      F    SS
Sbjct: 3   GEYMMELSIGTPPQL-IPAMIDTGSDLVWL------KCDNCDHCDLDHHGETIFFSDASS 55

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S + + C +  CS +    +  RC+      +TC      Y  +YG G  T+G + S+ +
Sbjct: 56  SYKKLPCNSTHCSGMSSAGIGPRCE------ETCK-----YKYEYGDGSRTSGDVGSDRI 104

Query: 221 RFPSKTV--------PNFLAGC--SILSDRQ-PAGIAGFGRSSESLPSQLGLK---KFSY 266
            F S             FL GC   +  D     G+ G G+ S SL  QLG K   KFSY
Sbjct: 105 SFRSHGAGEDHRSFFDGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSY 164

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-FYYVGLRQI 325
           CL+S    D+P S+   L  G       +  L        P+       +  YYV L+ I
Sbjct: 165 CLVSY---DSPPSAKSFLFLG------SSAALRGHDVVSTPILHGDHLDQTLYYVDLQSI 215

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGV--------IVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
            VG     +P       S  N  V        ++DSG+T+T +  P++EA+ K    Q+ 
Sbjct: 216 TVGG----VPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV- 270

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
                  +   +GL  CF+ SG  S   P +   F    ++ LP EN F +   +V+CL 
Sbjct: 271 ---ILPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLS 327

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           +   +++G     G   I+G+ Q QNF++ +DL   +  F
Sbjct: 328 M---DSSG-----GDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 124/415 (29%), Positives = 184/415 (44%), Gaps = 57/415 (13%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           + TPL    Y  +S+ L  G+  Q +   I DTGS  V   C SR R             
Sbjct: 90  VVTPL--EDYALFSMQLGIGSL-QKNLSAIIDTGSEAVLVQCGSRSR------------- 133

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
            P F P  S S + + C +  C  +         + C   + TC     +Y L YG    
Sbjct: 134 -PVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSATC-----TYSLSYGDSRN 187

Query: 212 AGLLLSETLRFPSKTVPNFLA--------GCS-----ILSDRQPAGIAGFGRSSESLPSQ 258
           +    S+ + F + T  +  A        GC+      L D    GI GF R + SLPSQ
Sbjct: 188 STGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQ 247

Query: 259 L----GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           L    G  KFSYC  S+ +   P ++ ++     G   SK   + YTP   NPV  + A 
Sbjct: 248 LKDRLGGSKFSYCFPSQPWQ--PRATGVIFLGDSGLSKSK---VGYTPLLDNPV--TPAR 300

Query: 315 GEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
            + YYVGL  I V  K + IP S + +  S G+GG ++DSG+TFT +    + A    F 
Sbjct: 301 SQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFA 360

Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFALV--- 429
               +  R   V   +G   C++IS   S+  +PE+ L  +   ++ L  E+ F  V   
Sbjct: 361 ASNRSGLRK-KVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAA 419

Query: 430 GNEV-LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           GNEV +CL + +   +G     G   +LG++Q  N+ +E+D    R GF +  C+
Sbjct: 420 GNEVTVCLAILSSQKSGF----GKINVLGNYQQSNYLVEYDNERSRVGFERADCS 470


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 171/403 (42%), Gaps = 66/403 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS +VW  C    RC + + P         F P+RSS
Sbjct: 127 GEYFTKIGVGTPATQAL-MVLDTGSDVVWVQCAPCRRCYEQSGP--------VFDPRRSS 177

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   +GC    C  +          GC  R   C      Y + YG G  TAG  ++ETL
Sbjct: 178 SYGAVGCGAALCRRL-------DSGGCDLRRGAC-----MYQVAYGDGSVTAGDFVTETL 225

Query: 221 RFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
            F     V     GC   ++      AG+ G GR   S P+Q+  +    FSYCL+ R  
Sbjct: 226 TFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTS 285

Query: 274 DDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
             A  +  S+       G+G       S+TP  +NP         FYYV L  I VG   
Sbjct: 286 SGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNP-----RMETFYYVQLVGISVGGAR 340

Query: 332 V-KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           V  +  S L +  S G GGVIVDSG++ T +    + A+   F        RAA      
Sbjct: 341 VPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAF--------RAA---AAG 389

Query: 390 GLR----------PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           GLR           C+D+ G++ V +P + + F GGA+ ALPPENY   V +       F
Sbjct: 390 GLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 449

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                G +       I+G+ Q Q F + FD    R GFA + C
Sbjct: 450 AGTDGGVS-------IIGNIQQQGFRVVFDGDGQRVGFAPKGC 485


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 133/454 (29%), Positives = 193/454 (42%), Gaps = 71/454 (15%)

Query: 47  HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYS--NSLIKTPLSVHSYGGY 104
           H    PL+ ++S +   +      +   +  T  S     YS  ++L   P S    G Y
Sbjct: 79  HGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSNLPLQPGSKVGTGNY 138

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
            ++  FGTP + S   I DTGS + W  C     C DC +  VDP     F P++SSS +
Sbjct: 139 IVTAGFGTPAKNSL-LIIDTGSDVTWIQCKP---CSDC-YSQVDP----IFEPQQSSSYK 189

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP 223
            + C +  C+ +   N   R  GC             Y + YG G  + G    ETL   
Sbjct: 190 HLSCLSSACTELTTMN-HCRLGGCV------------YEINYGDGSRSQGDFSQETLTLG 236

Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
           S + P+F  GC   +    +  AG+ G GR++ S PSQ   K   +FSYCL         
Sbjct: 237 SDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCL--------- 287

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
              + V  T  GS       +  T  +  P+ S+S +  FY+VGL  I VG + + IP  
Sbjct: 288 --PDFVSSTSTGSFSVGQGSIPATATFV-PLVSNSNYPSFYFVGLNGISVGGERLSIP-- 342

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
              P   G GG IVDSG+  T +    ++A+   F  +  N   A   +  S L  C+D+
Sbjct: 343 ---PAVLGRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSA---KPFSILDTCYDL 396

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI--- 454
           S    V +P +   F+  A +A+      + VG      ILFT  + G  +    A    
Sbjct: 397 SSYSQVRIPTITFHFQNNADVAV------SAVG------ILFTIQSDGSQVCLAFASASQ 444

Query: 455 -----ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                I+G+FQ Q   + FD    R GFA   CA
Sbjct: 445 SISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSCA 478


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 118/393 (30%), Positives = 164/393 (41%), Gaps = 58/393 (14%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + L+ G PP    PF+   DTGS L W  C     C   + P  DPS    F P    
Sbjct: 71  YLMELAIGKPP---VPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSP---- 123

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
               + C +  C  I+  N       C+P +  C      Y   YG G ++AG+L +ETL
Sbjct: 124 ----LPCSSATCLPIWSRN-------CTP-SSLC-----RYRYAYGDGAYSAGILGTETL 166

Query: 221 RFPSKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS--R 271
                + P  + G +               G  G GR + SL +QLG+ KFSYCL     
Sbjct: 167 TLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFN 226

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
              D+P     + +  PG    ++     TP  ++P   S      Y+V L+ I +G   
Sbjct: 227 SALDSPFLLGTLAELAPGPSTVQS-----TPLLQSPQNPSR-----YFVSLQGISLGDVR 276

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP        DG GG+IVDSG+TFT +    F  V     R +G       V   S  
Sbjct: 277 LPIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQ----PPVNASSLD 332

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGR 450
            PCF     +  Y+P+L+L F GGA M L  +NY +    +   CL     N AG     
Sbjct: 333 APCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCL-----NIAGTT--P 385

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               +LG+FQ QN  + FD    +  F    C+
Sbjct: 386 ESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDCS 418


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 120/399 (30%), Positives = 173/399 (43%), Gaps = 68/399 (17%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + L+ GTPP    PF+   DTGS L W  C     C   + P  DPS    F P    
Sbjct: 66  YLMELAIGTPP---VPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSP---- 118

Query: 162 SSQLIGCQNPKC--SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
               + C +  C  +W        R + CS  +  C      Y+  Y  G ++ G+L +E
Sbjct: 119 ----VPCSSATCLPTW--------RSRNCSNPSSPC-----RYIYSYSDGAYSVGILGTE 161

Query: 219 TLRF----PSKTVP--NFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
           TL      P +TV   +   GC   +        G  G GR + SL +QLG+ KFSYCL 
Sbjct: 162 TLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLT 221

Query: 270 S--RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
                  D+P     + +  PG G  ++     TP  ++P+  S      Y+V L+ I +
Sbjct: 222 DFFNSTMDSPFFLGTLAELAPGPGTVQS-----TPLLQSPLNPSR-----YFVNLQGISL 271

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G   + IP       +DGNGG++VDSG+TFT +    F  V     + +G       V  
Sbjct: 272 GDVRLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQ----PPVNA 327

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
            S   PCF  S     ++P+L+L F GGA M L  +NY +   ++   CL     N  G 
Sbjct: 328 SSLDSPCFP-SPDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCL-----NIVG- 380

Query: 447 ALGRGPAII--LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                P+    LG+FQ QN  + FD+   +  F    C+
Sbjct: 381 ----SPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDCS 415


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/374 (30%), Positives = 158/374 (42%), Gaps = 62/374 (16%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS + W  C     C DC +   DP     F P  SS+   + CQ+ +CS +   
Sbjct: 35  MVLDTGSDINWLQCQP---CTDC-YQQTDP----IFDPTASSTYAPVTCQSQQCSSL--- 83

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
                 +  S R+  C      Y + YG G +T G   +E++ F  S +V N   GC   
Sbjct: 84  ------EMSSCRSGQCL-----YQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGCG-- 130

Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
            D +   +   G         SL +QL    FSYCL++R   D+  SS L  ++     D
Sbjct: 131 HDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNR---DSAGSSTLDFNSAQLGVD 187

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
           S T      P  KN          FYYVGL  + VG + V IP S       GNGG+IVD
Sbjct: 188 SVTA-----PLMKN-----RKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVD 237

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
            G+  T ++   +  +   F+R   N    + V        C+D+SG+ SV +P +   F
Sbjct: 238 CGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL---FDTCYDLSGQASVRVPTVSFHF 294

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEF 468
             G    LP  NY   V           D+A        P      I+G+ Q Q   + F
Sbjct: 295 ADGKSWNLPAANYLIPV-----------DSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTF 343

Query: 469 DLANDRFGFAKQKC 482
           DLAN+R GF+  KC
Sbjct: 344 DLANNRMGFSPNKC 357


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 120/408 (29%), Positives = 173/408 (42%), Gaps = 63/408 (15%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           LS H     ++SL+ G+PPQ  T  + DTGS L W  C           PN++ +    F
Sbjct: 52  LSFHHNVTLTVSLTVGSPPQNVT-MVLDTGSELSWLHCK--------KLPNLNST----F 98

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLG 209
            P  SSS     C +  C+        +R +       C P NK C +     ++ Y   
Sbjct: 99  NPLLSSSYTPTPCNSSICT--------TRTRDLTIPASCDPNNKLCHV-----IVSYADA 145

Query: 210 FTA-GLLLSETLRFPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLG 260
            +A G L +ET        P  L GC         I  D +  G+ G  R S SL +Q+ 
Sbjct: 146 SSAEGTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMS 205

Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           L KFSYC+     +DA      VL  G G+ D+ +P L YTP       S       Y V
Sbjct: 206 LPKFSYCI---SGEDALG----VLLLGDGT-DAPSP-LQYTPLVTATTSSPYFNRVAYTV 256

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNY 379
            L  I V  K +++P S  VP   G G  +VDSG+ FTF+ G ++ ++  EF+ Q  G  
Sbjct: 257 QLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVL 316

Query: 380 SRAAD--VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV---GNEVL 434
           +R  D     +  +  C+      +  +P + L F  GA+M +  E     V    + V 
Sbjct: 317 TRIEDPNFVFEGAMDLCYHAPASFAA-VPAVTLVFS-GAEMRVSGERLLYRVSKGSDWVY 374

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           C      +  G       A ++G    QN ++EFDL   R GF +  C
Sbjct: 375 CFTFGNSDLLGIE-----AYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 125/445 (28%), Positives = 194/445 (43%), Gaps = 77/445 (17%)

Query: 52  PLKILHSLASSSL----SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
           P +   SL S  +    +R R LK  ++   +D+N            P+   S G Y I 
Sbjct: 69  PNRTWESLMSEKIRGDANRLRFLKRTSRSSKQDANA---------NVPVRSGS-GEYIIQ 118

Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
           + FGTP Q+    I DTGS + W PC    +C  C+      S  P F P +SSS +   
Sbjct: 119 VDFGTPKQSMYTLI-DTGSDVAWIPCK---QCQGCH------STAPIFDPAKSSSYKPFA 168

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKT 226
           C +  C  I G      C G    N  C      + + YG G    G L S+ +   S+ 
Sbjct: 169 CDSQPCQEISG-----NCGG----NSKC-----QFEVSYGDGTQVDGTLASDAITLGSQY 214

Query: 227 VPNFLAGC--SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
           +PNF  GC  S+  D  P+      G       +++  ++L    FSYCL S        
Sbjct: 215 LPNFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSST----S 270

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
           S +LVL        S    L +T   K+P     +   FY+V L+ I VG+  + +P + 
Sbjct: 271 SGSLVLGKEAAVSSSS---LKFTTLIKDP-----SIPTFYFVTLKAISVGNTRISVPGTN 322

Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
           +  G    GG I+DSG+T T +    + A+   F +Q+ +  +   VE    +  C+D+S
Sbjct: 323 IASG----GGTIIDSGTTITHLVPSAYTALRDAFRQQLSSL-QPTPVED---MDTCYDLS 374

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
              SV +P + L       + LP EN      + + CL   + ++           I+G+
Sbjct: 375 -SSSVDVPTITLHLDRNVDLVLPKENILITQESGLACLAFSSTDSRS---------IIGN 424

Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
            Q QN+ + FD+ N + GFA+++CA
Sbjct: 425 VQQQNWRIVFDVPNSQVGFAQEQCA 449


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/401 (26%), Positives = 168/401 (41%), Gaps = 44/401 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+  G+PPQ +   + DTGS L W  C++       + P         F+ + S+
Sbjct: 81  GQYFVSIRLGSPPQ-TLLLVADTGSDLTWVRCSACKTNCSIHPPGS------TFLARHST 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +     C +  C  +  PN        +P N T   +   Y   Y  G  T+G    ET 
Sbjct: 134 TFSPTHCFSSLCQLVPQPNP-------NPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETT 186

Query: 221 RFPSKT-----VPNFLAGCSILSD---------RQPAGIAGFGRSSESLPSQLGLK---K 263
              + +     + +   GC   +             +G+ G GR   S  SQLG +    
Sbjct: 187 TLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRS 246

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCLL       P S  ++ D      D+K+  +S+TP   NP   +     FYY+ ++
Sbjct: 247 FSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKS-MMSFTPLLINPEAPT-----FYYISIK 300

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RA 382
            + V    + I  S       GNGG ++DSG+T TF+  P +  +   F R++   S   
Sbjct: 301 GVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTP 360

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
                +SG   C +++G      P L L+  G +  + PP NYF  +   + CL +    
Sbjct: 361 GGASTRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVE 420

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A       G   ++G+   Q F LEFD    R GF+++ CA
Sbjct: 421 AES-----GRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGCA 456


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 164/373 (43%), Gaps = 50/373 (13%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P +    + DTGS + W  C     C DC +   DP     F P  S+S   + C   +C
Sbjct: 153 PSSPVYMVLDTGSDVNWIQCAP---CADC-YHQADP----IFEPASSTSYSPLSCDTKQC 204

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA 232
             +   +V S C     RN TC      Y + YG G +T G  ++ET+   S +V N   
Sbjct: 205 QSL---DV-SEC-----RNNTC-----LYEVSYGDGSYTVGDFVTETITLGSASVDNVAI 250

Query: 233 GCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
           GC   ++      AG+ G G    S PSQ+    FSYCL+ R  D A   S L  ++   
Sbjct: 251 GCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSA---STLEFNS--- 304

Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
              +  P     P  +N          FYYVG+  + VG + + IP S       GNGG+
Sbjct: 305 ---ALLPHAITAPLLRN-----RELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGI 356

Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
           I+DSG+  T ++   + A+   F++   +    ++V   +    C+D+S K SV +P + 
Sbjct: 357 IIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEV---ALFDTCYDLSRKTSVEVPTVT 413

Query: 410 LKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
               GG  + LP  NY   V ++      F   ++  +       I+G+ Q Q   + FD
Sbjct: 414 FHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALS-------IIGNVQQQGTRVGFD 466

Query: 470 LANDRFGFAKQKC 482
           LAN   GF  ++C
Sbjct: 467 LANSLVGFEPRQC 479


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 120/394 (30%), Positives = 172/394 (43%), Gaps = 63/394 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS +VW  C    +C    +  VDP     F P  S+
Sbjct: 195 GEYFTRIGVGTPMREQY-MVLDTGSDVVWIQCEPCSKC----YSQVDP----IFNPSLSA 245

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   +GC +  CS++   N      GC             Y + YG G +T G   +E L
Sbjct: 246 SFSTLGCNSAVCSYLDAYNCHG--GGCL------------YKVSYGDGSYTIGSFATEML 291

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGL---KKFSYCLLS 270
            F + +V N   GC        AG+        G G    S PSQLG    + FSYCL+ 
Sbjct: 292 TFGTTSVRNVAIGCG----HDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLVD 347

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R F +    S+  L+ GP   +S   G   TP   NP     +   FYYV L  I VG  
Sbjct: 348 R-FSE----SSGTLEFGP---ESVPLGSILTPLLTNP-----SLPTFYYVPLISISVGGA 394

Query: 331 HVKI--PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
            +    P  + +  + G GG IVDSG+  T ++ P+++AV   F+       +A   E  
Sbjct: 395 LLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKA---EGV 451

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           S    C+D+SG   V +P ++  F  GA + LP +NY  ++  + +    F    A   L
Sbjct: 452 SIFDTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNY--MIPMDFMGTFCFAFAPATSDL 509

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 I+G+ Q Q   + FD AN   GFA ++C
Sbjct: 510 S-----IMGNIQQQGIRVSFDTANSLVGFALRQC 538


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 119/393 (30%), Positives = 168/393 (42%), Gaps = 63/393 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP + S   + DTGS + W  C+   +C           + P F P  SS
Sbjct: 12  GDYFARIGVGTPAR-SVYMVADTGSDVSWLQCSPCRKCYR--------QQDPIFNPSLSS 62

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S + + C +  C  +       + KGCS +NK        Y + YG G FT G   +ETL
Sbjct: 63  SFKPLACASSICGKL-------KIKGCSRKNKCM------YQVSYGDGSFTVGDFSTETL 109

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGLKK---FSYCLLS 270
            F    V +   GC     R   G+        G GR   S PSQ G      FSYCL  
Sbjct: 110 SFGEHAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR 165

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R   ++ ++++LV   GP +   K     +T    N          +YYVGL +I V   
Sbjct: 166 R---ESAIAASLVF--GPSAVPEKA---RFTKLLPN-----RRLDTYYYVGLARIRVAGS 212

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            V IP      GS G GGVIVDSG+  + +  P + A+   F R +  +  A  +   S 
Sbjct: 213 PVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGI---SL 268

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALG 449
              C+D+S  K+  LP ++L F GGA M LP +     V +E   CL    +  A     
Sbjct: 269 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 325

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I+G+ Q Q F +  D   ++ G A  +C
Sbjct: 326 -----IIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 127/408 (31%), Positives = 181/408 (44%), Gaps = 50/408 (12%)

Query: 81  SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
           SN G+    S  +TPL   S G Y++S   GTP    +    DTGS L+W  C +  RC 
Sbjct: 71  SNAGAAPGES-AQTPLKKGS-GDYAMSFGIGTPATGLSGEA-DTGSDLIWTKCGACARC- 126

Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
                   P   P++ P  SSS+  + C +  C  +  P + S   G    +  C     
Sbjct: 127 -------SPRGSPSYYPTSSSSAAFVACGDRTCGELPRP-LCSNVAGGGSGSGNC----- 173

Query: 201 SYLLQYGLG-----FTAGLLLSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGR 250
           SY   YG       +T G+L++ET  F   +   P    GC++ S+      +G+ G GR
Sbjct: 174 SYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGR 233

Query: 251 SSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
              SL +QL ++ F Y L S     +P+S   + D   G+GDS       TP   NPV  
Sbjct: 234 GKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQ 289

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
                 FYYVGL  I VG K V+IP  ++    S G GGVI DSG+T T +  P +  V 
Sbjct: 290 DL---PFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346

Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            E + QMG + +         L  CF   G  +   P ++L F GGA M L  ENY   +
Sbjct: 347 DELLSQMG-FQKPPPAANDDDLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403

Query: 430 ----GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
               G    C  +   + A          I+G+    +F++ FDL+ +
Sbjct: 404 QGQNGETARCWSVVKSSQA--------LTIIGNIMQMDFHVVFDLSGN 443


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 117/393 (29%), Positives = 166/393 (42%), Gaps = 51/393 (12%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +ISL+ G+PPQ  T  + DTGS L W  C           PN++ +    F P  SSS  
Sbjct: 60  TISLTIGSPPQNVT-MVLDTGSELSWLHCK--------KLPNLNST----FNPLLSSSYT 106

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP 223
              C +  C  +      +    C P NK C +     ++ Y    +A G L +ET    
Sbjct: 107 PTPCNSSVC--MTRTRDLTIPASCDPNNKLCHV-----IVSYADASSAEGTLAAETFSLA 159

Query: 224 SKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
               P  L GC         I  D +  G+ G  R S SL +Q+ L KFSYC+     +D
Sbjct: 160 GAAQPGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCI---SGED 216

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           A     L+L  GP    S    L YTP       S       Y V L  I V  K +++P
Sbjct: 217 A--FGVLLLGDGP----SAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLP 270

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD--VEKKSGLR 392
            S  VP   G G  +VDSG+ FTF+ GP++ ++  EF+ Q  G  +R  D     +  + 
Sbjct: 271 KSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMD 330

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG---NEVLCLILFTDNAAGPALG 449
            C+      +  +P + L F  GA+M +  E     V    + V C      +  G    
Sbjct: 331 LCYHAPASLAA-VPAVTLVFS-GAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIE-- 386

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              A ++G    QN ++EFDL   R GF +  C
Sbjct: 387 ---AYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 114/386 (29%), Positives = 164/386 (42%), Gaps = 58/386 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTP Q +     DT +   W PC+    C    F NV           +S++ 
Sbjct: 96  YIVRAKIGTPAQ-TMLLAMDTSNDAAWIPCSGCVGCSSTVFNNV-----------KSTTF 143

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + +GC+ P+C  +  PN  S+C G          AC ++ + YG    A  L  + +   
Sbjct: 144 KTVGCEAPQCKQV--PN--SKCGGS---------AC-AFNMTYGSSSIAANLSQDVVTLA 189

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           + ++P++  GC   +  S   P G+ G GR   SL SQ   L    FSYCL S  F    
Sbjct: 190 TDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPS--FRSLN 247

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP     +   +  TP  KNP  SS      YYV L  I VG + V IP S
Sbjct: 248 FSGSLRL--GPVGQPKR---IKTTPLLKNPRRSS-----LYYVNLMAIRVGRRVVDIPPS 297

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P + AV   F +++GN    A V    G   C+  
Sbjct: 298 ALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGN----ATVTSLGGFDTCY-- 351

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                +  P +   F  G  + LPP+N       + + CL +    AA P        ++
Sbjct: 352 --TSPIVAPTITFMFS-GMNVTLPPDNLLIHSTASSITCLAM----AAAPDNVNSVLNVI 404

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
            + Q QN  + FD+ N R G A++ C
Sbjct: 405 ANMQQQNHRILFDVPNSRLGVAREPC 430


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 127/408 (31%), Positives = 181/408 (44%), Gaps = 50/408 (12%)

Query: 81  SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
           SN G+    S  +TPL   S G Y++S   GTP    +    DTGS L+W  C +  RC 
Sbjct: 71  SNAGAAPGES-AQTPLKKGS-GDYAMSFGIGTPATGLSGEA-DTGSDLIWTKCGACARC- 126

Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
                   P   P++ P  SSS+  + C +  C  +  P + S   G    +  C     
Sbjct: 127 -------SPRGSPSYYPTSSSSAAFVACGDRTCGELPRP-LCSNVAGGGSGSGNC----- 173

Query: 201 SYLLQYGLG-----FTAGLLLSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGR 250
           SY   YG       +T G+L++ET  F   +   P    GC++ S+      +G+ G GR
Sbjct: 174 SYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGR 233

Query: 251 SSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
              SL +QL ++ F Y L S     +P+S   + D   G+GDS       TP   NPV  
Sbjct: 234 GKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQ 289

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
                 FYYVGL  I VG K V+IP  ++    S G GGVI DSG+T T +  P +  V 
Sbjct: 290 DL---PFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346

Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            E + QMG + +         L  CF   G  +   P ++L F GGA M L  ENY   +
Sbjct: 347 DELLSQMG-FQKPPPAANDDDLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403

Query: 430 ----GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
               G    C  +   + A          I+G+    +F++ FDL+ +
Sbjct: 404 QGQNGETARCWSVVKSSQA--------LTIIGNIMQMDFHVVFDLSGN 443


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 175/389 (44%), Gaps = 45/389 (11%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           ++L  GTPPQ     + DTGS L W  C ++            P    +F P  SSS  +
Sbjct: 84  VTLPIGTPPQLQQ-MVLDTGSQLSWIQCHNKKTP-----QKKQPPTTSSFDPSLSSSFFV 137

Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-P 223
           + C +P C   +   ++ + C   S       L   SY    G  +  G L+ E + F P
Sbjct: 138 LPCNHPLCKPRVPDFSLPTDCDANS-------LCHYSYFYADGT-YAEGNLVREKIAFSP 189

Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
           S+T P  + GC+  SD    GI G        PSQ  + KFSYC+ +++    P S +  
Sbjct: 190 SQTTPPIILGCATQSD-DARGILGMNLGRLGFPSQAKITKFSYCVPTKQAQ--PASGSFY 246

Query: 284 LDTGPGSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
           L   P S   +   L ++    + P     A    Y + L+ I +G K + IP S   P 
Sbjct: 247 LGNNPASSSFRYVNLLTFGQSQRMPNLDPLA----YTLPLQGISIGGKKLNIPPSVFKPN 302

Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-------NYSRAADVEKKSGLRPCF 395
           + G+G  ++DSGS FT++    +  + +E ++++G        Y   AD+        CF
Sbjct: 303 AGGSGQTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADI--------CF 354

Query: 396 DISG-KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           D    +    + +++ +F+ G ++ +P E   A V   V CL +         LG G  I
Sbjct: 355 DGDAIEIGRLVGDMVFEFEKGVQIVIPKERVLATVDGGVHCLGM----GRSERLGAGGNI 410

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I G+F  QN ++EFDLAN R GF +  C+
Sbjct: 411 I-GNFHQQNLWVEFDLANRRVGFGEADCS 438


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 131/449 (29%), Positives = 188/449 (41%), Gaps = 74/449 (16%)

Query: 47  HSDSDPLKILHSLASSSLS-RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
           +S   P K   S A + L  +AR L   +    + S++      +++++P        Y 
Sbjct: 37  NSQCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPT-------YI 89

Query: 106 ISLSFGTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           +  + GTP Q   P +   DT +   W PC+    CV C       S    F P +SSSS
Sbjct: 90  VRANIGTPAQ---PMLVALDTSNDAAWIPCSG---CVGC-------SSSVLFDPSKSSSS 136

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C+ P+C     P+           +K+C      + + YG       L  +TL   
Sbjct: 137 RTLQCEAPQCKQAPNPSCTV--------SKSC-----GFNMTYGGSTIEAYLTQDTLTLA 183

Query: 224 SKTVPNFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           S  +PN+  GC         PA G+ G GR   SL SQ   L    FSYCL + K     
Sbjct: 184 SDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSK----- 238

Query: 278 VSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            SSN    L  GP +   +   +  TP  KNP  SS      YYV L  I VG+K V IP
Sbjct: 239 -SSNFSGSLRLGPKNQPIR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVDIP 289

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S L        G I DSG+ +T +  P + AV  EF R++ N    A+     G   C+
Sbjct: 290 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKN----ANATSLGGFDTCY 345

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPA 453
                 SV  P +   F  G  + LPP+N    +  GN + CL +    AA P       
Sbjct: 346 S----GSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGN-LSCLAM----AAAPVNVNSVL 395

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ++   Q QN  +  D+ N R G +++ C
Sbjct: 396 NVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 118/393 (30%), Positives = 168/393 (42%), Gaps = 63/393 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP + S   + DTGS + W  C+   +C           + P F P  SS
Sbjct: 79  GDYFARIGVGTPAR-SVYMVADTGSDVSWLQCSPCRKCYR--------QQDPIFNPSLSS 129

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S + + C +  C  +       + KGCS +N+        Y + YG G FT G   +ETL
Sbjct: 130 SFKPLACASSICGKL-------KIKGCSRKNECM------YQVSYGDGSFTVGDFSTETL 176

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGLK---KFSYCLLS 270
            F    V +   GC     R   G+        G GR   S PSQ G      FSYCL  
Sbjct: 177 SFGEHAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR 232

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R   ++ ++++LV   GP +   K     +T    N          +YYVGL +I V   
Sbjct: 233 R---ESAIAASLVF--GPSAVPEKA---RFTKLLPN-----RRLDTYYYVGLARIRVAGS 279

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            V IP      GS G GGVIVDSG+  + +  P + A+   F R +  +  A  +   S 
Sbjct: 280 PVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGI---SL 335

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALG 449
              C+D+S  K+  LP ++L F GGA M LP +     V +E   CL    +  A     
Sbjct: 336 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 392

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I+G+ Q Q F +  D   ++ G A  +C
Sbjct: 393 -----IIGNVQQQTFRISIDNQKEQMGIAPDQC 420


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 127/398 (31%), Positives = 170/398 (42%), Gaps = 62/398 (15%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           S   Y I L FGTPPQ S   + DTGS++ W PC     C  C+      S+   F P +
Sbjct: 120 SSSNYIIKLGFGTPPQ-SFYTVLDTGSNIAWIPCNP---CSGCS------SKQQPFEPSK 169

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           SS+   + C + +C  +       R    S  +  C L       +YG       +L SE
Sbjct: 170 SSTYNYLTCASQQCQLL-------RVCTKSDNSVNCSLT-----QRYGDQSEVDEILSSE 217

Query: 219 TLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSR 271
           TL   S+ V NF+ GCS     L  R P+ + GFGR+  S  SQ        FSYCL S 
Sbjct: 218 TLSVGSQQVENFVFGCSNAARGLIQRTPS-LVGFGRNPLSFVSQTATLYDSTFSYCLPSL 276

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
            F  A   S L+     G       GL +TP   N     S +  FYYVGL  I VG + 
Sbjct: 277 -FSSAFTGSLLL-----GKEALSAQGLKFTPLLSN-----SRYPSFYYVGLNGISVGEEL 325

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA--DVEKKS 389
           V IP   L        G I+DSG+  T +  P + A+   F  Q+ N + A+  D+    
Sbjct: 326 VSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTC 385

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLILFTDNAAGP 446
             RP  D      V  P + L F     + LP +N     GN+   VLCL      A G 
Sbjct: 386 YNRPSGD------VEFPLITLHFDDNLDLTLPLDNIL-YPGNDDGSVLCL------AFGL 432

Query: 447 ALGRGPAII--LGDFQLQNFYLEFDLANDRFGFAKQKC 482
             G G  ++   G++Q Q   +  D+A  R G A + C
Sbjct: 433 PPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 168/391 (42%), Gaps = 46/391 (11%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +    GTPPQ  +  I D+GS L+W  C     C+ C   +      P + P  SS
Sbjct: 63  GQYFVDFFLGTPPQKFS-LIVDSGSDLLWVQCAP---CLQCYAQDT-----PLYAPSNSS 113

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   + C +P+C  I  P  E       P +   P AC +Y  +Y     + G+   E+ 
Sbjct: 114 TFNPVPCLSPECLLI--PATEGF-----PCDFHYPGAC-AYEYRYADTSLSKGVFAYESA 165

Query: 221 RFPSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
                 +     GC    D Q       G+ G G+   S  SQ+G     KF+YCL++  
Sbjct: 166 TVDDVRIDKVAFGCG--RDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY- 222

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
            D   VSS L+       GD     +    F   P+ S+S     YYV + +++VG + +
Sbjct: 223 LDPTSVSSWLIF------GDELISTIHDLQF--TPIVSNSRNPTLYYVQIEKVMVGGESL 274

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            I +S       GNGG I DSG+T T+   P +  +   F + +  Y RAA V+   GL 
Sbjct: 275 PISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNV-RYPRAASVQ---GLD 330

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            C D++G      P   +   GGA       NYF  V   V CL +     AG     G 
Sbjct: 331 LCVDVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPNVQCLAM-----AGLPSSVGG 385

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              +G+   QNF +++D   +R GFA  KC+
Sbjct: 386 FNTIGNLLQQNFLVQYDREENRIGFAPAKCS 416


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 130/444 (29%), Positives = 186/444 (41%), Gaps = 74/444 (16%)

Query: 52  PLKILHSLASSSLS-RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
           P K   S A + L  +AR L   +    + S++      +++++P        Y +  + 
Sbjct: 42  PFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPT-------YIVRANI 94

Query: 111 GTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
           GTP Q   P +   DT +   W PC+    CV C       S    F P +SSSS+ + C
Sbjct: 95  GTPAQ---PMLVALDTSNDAAWIPCSG---CVGC-------SSSVLFDPSKSSSSRTLQC 141

Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVP 228
           + P+C     P+           +K+C      + + YG       L  +TL   S  +P
Sbjct: 142 EAPQCKQAPNPSCTV--------SKSC-----GFNMTYGGSTIEAYLTQDTLTLASDVIP 188

Query: 229 NFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSSNL 282
           N+  GC         PA G+ G GR   SL SQ   L    FSYCL + K      SSN 
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSK------SSNF 242

Query: 283 V--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
              L  GP +   +   +  TP  KNP  SS      YYV L  I VG+K V IP S L 
Sbjct: 243 SGSLRLGPKNQPIR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVDIPTSALA 294

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
                  G I DSG+ +T +  P + AV  EF R++ N    A+     G   C+     
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKN----ANATSLGGFDTCYS---- 346

Query: 401 KSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
            SV  P +   F  G  + LPP+N    +  GN + CL +    AA P        ++  
Sbjct: 347 GSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGN-LSCLAM----AAAPVNVNSVLNVIAS 400

Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
            Q QN  +  D+ N R G +++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 106/337 (31%), Positives = 157/337 (46%), Gaps = 40/337 (11%)

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           SS+ + + C +P C    G +V +    C+  N  C      YL  YG    TAG +  +
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSA----CAMENFQC-----FYLCSYGDRSITAGHIFKD 52

Query: 219 TLRFPSKT-----VPNFLAGC----SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
           T  F S       V     GC    + L     +GIAGFGR  +SLPSQL + +FSYCL 
Sbjct: 53  TFTFMSPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKVGRFSYCL- 111

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
                    SS ++L T P     +    +  PF   P+  +     FYY+ L  I VG 
Sbjct: 112 --TLVTESKSSVVILGTPPDPDGLR--AHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGK 167

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ--MGNYSRAADVEK 387
             +    S      DG+GG ++DSG++ T +   +FE + +E + Q  +  Y    +V  
Sbjct: 168 TRLPFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEV-- 225

Query: 388 KSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAG 445
             G R CF    G K V +P+LIL    GA M LP +NYF    +  V+CL +   N A 
Sbjct: 226 --GDRLCFRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQI---NGAE 279

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                   +++G+FQ QN ++ +D+ N++  FA  +C
Sbjct: 280 DTT----MVLIGNFQQQNMHVVYDVENNKLLFAPAQC 312


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 176/399 (44%), Gaps = 69/399 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y ++L+ G+PPQ S   I DTGS L W       +C+ C      P   P F P +S 
Sbjct: 37  GEYLMTLTLGSPPQ-SFDVIVDTGSDLNWV------QCLPCRVCYQQPG--PKFDPSKSR 87

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPL-ACPSYLLQYGLGF-----TAGLL 215
           S +   C +  C      NV +            PL AC + + QY   +     T G L
Sbjct: 88  SFRKAACTDNLC------NVSA-----------LPLKACAANVCQYQYTYGDQSNTNGDL 130

Query: 216 LSETLRFP----SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLG---LKKFS 265
             ET+       +++VPNF  GC   ++ +    AG+ G G+   SL SQL      KFS
Sbjct: 131 AFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFS 190

Query: 266 YCLLS-RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           YCL+S      +P++   +         +    + YT    N     +    +YYV L  
Sbjct: 191 YCLVSLNSLSASPLTFGSI---------AAAANIQYTSIVVN-----ARHPTYYYVQLNS 236

Query: 325 IIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           I VG + + + P  + +  S G GG I+DSG+T T +  P + AV + +     NY R  
Sbjct: 237 IEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAY-ESFVNYPRLD 295

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
                 GL  CF+I+G  +  +P+++ KF+ GA   +  EN F LV      L L    +
Sbjct: 296 G--SAYGLDLCFNIAGVSNPSVPDMVFKFQ-GADFQMRGENLFVLVDTSATTLCLAMGGS 352

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G +       I+G+ Q QN  + +DL   + GFA   C
Sbjct: 353 QGFS-------IIGNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/418 (28%), Positives = 174/418 (41%), Gaps = 60/418 (14%)

Query: 88  SNSLIKTPLSVHSYGGYS--ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
           S S  ++P  +H +   S  +SL+ GTPPQ +   + DTGS L W  C            
Sbjct: 67  SGSFPRSPNKLHFHHNVSLTVSLTVGTPPQ-NVSMVLDTGSELSWLRC------------ 113

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
           N   +    F P RSSS   + C +  C+                R+   P +C S  L 
Sbjct: 114 NKTQTFQTTFDPNRSSSYSPVPCSSLTCT-------------DRTRDFPIPASCDSNQLC 160

Query: 206 YGL------GFTAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSS 252
           + +        + G L S+T    +  +P  + GC       +   D +  G+ G  R S
Sbjct: 161 HAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGS 220

Query: 253 ESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
            S  SQ+   KFSYC+    F         VL  G  +     P L+YTP  +       
Sbjct: 221 LSFVSQMDFPKFSYCISDSDFSG-------VLLLGDANFSWLMP-LNYTPLIQISTPLPY 272

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
                Y V L  I V SK + +P S  VP   G G  +VDSG+ FTF+ GP++ A+  EF
Sbjct: 273 FDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEF 332

Query: 373 IRQMGNYSRAADVEK---KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFA 427
           + Q     R  +      + G+  C+ +  S     +LP + L F+ GA+M +  +    
Sbjct: 333 LNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFR-GAEMKVSGDRLLY 391

Query: 428 LVGNEVL---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            V  EV     +  FT       L    A ++G    QN ++EFDL   R GFA+ +C
Sbjct: 392 RVPGEVRGSDSVYCFT--FGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 447


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 100/316 (31%), Positives = 146/316 (46%), Gaps = 44/316 (13%)

Query: 188 CSPRNKTCPLACPSYLLQYG-----------LGFTAGLLLSETLRFPSKTVPNFLAGCSI 236
           C   N+TCP     Y   YG             FT  L +S   +   + V N + GC  
Sbjct: 67  CKAENQTCP-----YYYWYGDSSNTTGDFALETFTVNLTMSSG-KPELRRVENVMFGCGH 120

Query: 237 LSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
            +       AG+ G GR   S  SQL       FSYCL+ R   DA VSS L+   G   
Sbjct: 121 WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN-SDANVSSKLIF--GEDK 177

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
                P L++T       G  +    FYYV ++ I+VG + V IP       +DG+GG I
Sbjct: 178 DLLSHPELNFTTLV---AGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTI 234

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
           +DSG+T ++   P ++ + + F+ ++  Y    D      L PC++++G +   LP+  +
Sbjct: 235 IDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPV---LEPCYNVTGVEQPDLPDFGI 291

Query: 411 KFKGGAKMALPPENYFALVG-NEVLCLILFTDNAAGPALGRGPAI--ILGDFQLQNFYLE 467
            F  GA    P ENYF  +   EV+CL +         LG  P+   I+G++Q QNF++ 
Sbjct: 292 VFSDGAVWNFPVENYFIEIEPREVVCLAI---------LGTPPSALSIIGNYQQQNFHIL 342

Query: 468 FDLANDRFGFAKQKCA 483
           +D    R GFA  KCA
Sbjct: 343 YDTKKSRLGFAPTKCA 358


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 124/440 (28%), Positives = 187/440 (42%), Gaps = 77/440 (17%)

Query: 54  KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
           + + + AS  L +AR  K  T P  +  ++G+                G Y +S+  GTP
Sbjct: 112 RKIAAAASPVLDQARGKKGVTLPAQRGISLGT----------------GNYVVSMGLGTP 155

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
            +  T  +FDTGS L W  CT    C DC        + P F P RSS+   + C +P+C
Sbjct: 156 ARDMT-VVFDTGSDLSWVQCTP---CSDCY-----EQKDPLFDPARSSTYSAVPCASPEC 206

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF-PSKTVPNFL 231
                  ++SR   CS R+K C      Y + YG    T G L  +TL    S  +P F+
Sbjct: 207 Q-----GLDSR--SCS-RDKKC-----RYEVVYGDQSQTDGALARDTLTLTQSDVLPGFV 253

Query: 232 AGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLD 285
            GC         +  G+ G GR   SL SQ   K    FSYCL S     +P ++  +  
Sbjct: 254 FGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPS-----SPSAAGYLSL 308

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
            GP   +++   +     + +P         FYYV L  + V  + V++      P    
Sbjct: 309 GGPAPANARFTAMETR--HDSP--------SFYYVRLVGVKVAGRTVRVS-----PIVFS 353

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGLRPCFDISGKKSV 403
             G ++DSG+  T +   ++ A+   F R MG   Y RA  +   S L  C+D +G  +V
Sbjct: 354 AAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPAL---SILDTCYDFTGHTTV 410

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            +P + L F GGA + L       +      CL  F  N  G   G     I+G+ Q + 
Sbjct: 411 RIPSVALVFAGGAAVGLDFSGVLYVAKVSQACLA-FAPNGDGADAG-----IIGNTQQKT 464

Query: 464 FYLEFDLANDRFGFAKQKCA 483
             + +D+A  + GF    C+
Sbjct: 465 LAVVYDVARQKIGFGANGCS 484


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/426 (27%), Positives = 175/426 (41%), Gaps = 52/426 (12%)

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD------ 148
           PL       Y  S   G PPQ +   + DTGS LVW  C++      C  P         
Sbjct: 69  PLRWSGKTQYIASYGIGDPPQPAEAVV-DTGSDLVWTQCST------CRLPAAAAAGGGG 121

Query: 149 --PSRIPAFIPKRSSSSQLIGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
             P  +P +    S +++ + C +   +     P      +G    +  C +A       
Sbjct: 122 CFPQNLPYYNFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAA-----S 176

Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQL 259
           YG G   G+L ++   FPS +      GC   +   P      +GI G GR + SL SQL
Sbjct: 177 YGAGVALGVLGTDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQL 236

Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG--------LSYTPFYKNPVGSS 311
              +FSYCL +  F D    S+L +  G  +G S   G        ++  PF KNP    
Sbjct: 237 NATEFSYCL-TPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNP--KD 293

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
           S F  FYY+ L  +  G+  V +P               GG ++DSGS FT +  P   A
Sbjct: 294 SPFSTFYYLPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRA 353

Query: 368 VAKEFIRQM-GNYSRAADVEKKSG-LRPCF----DISGKKSVYLPELILKFK----GGAK 417
           + KE  RQ+ G+ S      K  G L  C     D     +  +P L+L+F     GG +
Sbjct: 354 LTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRE 413

Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           + +P E Y+A V     C+ + +  +    L      I+G+F  Q+  + +DLAN    F
Sbjct: 414 LVIPAEKYWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSF 473

Query: 478 AKQKCA 483
               C+
Sbjct: 474 QPANCS 479


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 124/445 (27%), Positives = 196/445 (44%), Gaps = 77/445 (17%)

Query: 52  PLKILHSLASSSL----SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
           P +   SL S  +    +R R LK  ++   +D+N            P+   S G Y I 
Sbjct: 69  PNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANA---------NVPVRSGS-GEYIIQ 118

Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
           + FGTP Q+    I DTGS + W PC    +C  C+      S  P F P +SSS +   
Sbjct: 119 VDFGTPKQSMYTLI-DTGSDVAWIPCK---QCQGCH------STAPIFDPAKSSSYKPFA 168

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKT 226
           C +  C  I G      C G    N  C      + + YG G    G L S+ +   S+ 
Sbjct: 169 CDSQPCQEISG-----NCGG----NSKC-----QFEVLYGDGTQVDGTLASDAITLGSQY 214

Query: 227 VPNFLAGC--SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
           +PNF  GC  S+  D   +      G       +++  ++L    FSYCL S     +  
Sbjct: 215 LPNFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSS----STS 270

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
           S +LVL        S    L +T   K+P     +F  FY+V L+ I VG+  + +P + 
Sbjct: 271 SGSLVLGKEAAVSSS---SLKFTTLIKDP-----SFPTFYFVTLKAISVGNTRISVPATN 322

Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
           +  G    GG I+DSG+T T++    ++ +   F +Q+ +  +   VE    +  C+D+S
Sbjct: 323 IASG----GGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSL-QPTPVED---MDTCYDLS 374

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
              SV +P + L       + LP EN      + + CL   + ++           I+G+
Sbjct: 375 SS-SVDVPTITLHLDRNVDLVLPKENILITQESGLSCLAFSSTDSRS---------IIGN 424

Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
            Q QN+ + FD+ N + GFA+++CA
Sbjct: 425 VQQQNWRIVFDVPNSQVGFAQEQCA 449


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 166/391 (42%), Gaps = 40/391 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS LVW  C+   RC           R   F P+RSS
Sbjct: 84  GEYFALVGVGTPSTKAM-LVIDTGSDLVWLQCSPCRRCY--------AQRGQVFDPRRSS 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
           + + + C +P+C  +  P     C         C      Y++ YG G ++ G L ++ L
Sbjct: 135 TYRRVPCSSPQCRALRFPG----CDSGGAAGGGC-----RYMVAYGDGSSSTGDLATDKL 185

Query: 221 RFPSKT-VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKF 273
            F + T V N   GC   ++      AG+ G GR   S+ +Q+       F YCL  R  
Sbjct: 186 AFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRT- 244

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
             +  SS LV    P     + P  ++T    NP   S      YYV +    VG + V 
Sbjct: 245 SRSTRSSYLVFGRTP-----EPPSTAFTALLSNPRRPS-----LYYVDMAGFSVGGERVT 294

Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
                S  +  + G GGV+VDSG+  +      + A+   F  +           + S  
Sbjct: 295 GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVF 354

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D+ G+ +   P ++L F GGA MALPPENYF  V         +       A   G
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ++I G+ Q Q F + FD+  +R GFA + C
Sbjct: 415 LSVI-GNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 173/404 (42%), Gaps = 67/404 (16%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           +  PL+  S  GY++++  GTPPQ  T  I DT S L W           CN  N    +
Sbjct: 79  MSVPLARISDEGYTVTIGIGTPPQLHT-LIADTASDLTW---------TQCNLFNDTAKQ 128

Query: 152 I-PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
           + P F P +SSS   + C +  C+           K CS  NKTC      Y+  Y    
Sbjct: 129 VEPLFDPAKSSSFAFVTCSSKLCT-----EDNPGTKRCS--NKTC-----RYVYPYVSVE 176

Query: 211 TAGLLLSETLRFPSKT---VPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKF 264
            AG+L  E+            +F  GC  L+D      +GI G   +  S+ SQL + KF
Sbjct: 177 AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKF 236

Query: 265 SYCLL---SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           SYCL     RK      SS L        G  KT G         P+  S  F  +YYV 
Sbjct: 237 SYCLTPYTDRK------SSPLFFGAWADLGRYKTTG---------PIQKSLTF--YYYVP 279

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L  + +G++ + +P +     +   GG +VD G T   +  P F A+ KE +    N   
Sbjct: 280 LVGLSLGTRRLDVPAATF---ALKQGGTVVDLGCTVGQLAEPAFTAL-KEAVLHTLNLPL 335

Query: 382 AADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
                K    + CF +       +V  P L+L F GGA M LP +NYF      ++CL L
Sbjct: 336 TNRTVKD--YKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYFQEPTAGLMCLAL 393

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                     G G +II G+ Q QNF+L FD+ + +F FA   C
Sbjct: 394 VP--------GGGMSII-GNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 117/391 (29%), Positives = 172/391 (43%), Gaps = 54/391 (13%)

Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
           Q +   I DTGS  V   C SR R              P F P  S S + + C +  C 
Sbjct: 9   QKNLSAIIDTGSEAVLVQCGSRSR--------------PVFDPAASQSYRQVPCISQLCL 54

Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLA-- 232
            +         + C   +  C     +Y L YG    +    S+ + F + T  +  A  
Sbjct: 55  AVQQQTSNGSSQPCVNSSAAC-----TYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQ 109

Query: 233 ------GCS-----ILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKFDDAP 277
                 GC+      L D    GI GF R + SLPSQL    G  KFSYC  S+ +   P
Sbjct: 110 FRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQ--P 167

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            ++ ++     G   SK   +SYTP   NPV  + A  + YYVGL  I V  K + IP S
Sbjct: 168 RATGVIFLGDSGLSKSK---VSYTPLLDNPV--TPARSQLYYVGLTSISVDGKTLAIPES 222

Query: 338 -YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
            + +  S G+GG ++DSG+TFT +    + A    F     +  R   V   +G   C++
Sbjct: 223 AFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKK-VGAAAGFDDCYN 281

Query: 397 ISGKKSV-YLPELILKFKGGAKMALPPENYFALV---GNEV-LCLILFTDNAAGPALGRG 451
           IS   S+  +PE+ L  +   ++ L  E+ F  V   GNEV +CL + +   +G     G
Sbjct: 282 ISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSG----FG 337

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              +LG++Q  N+ +E+D    R GF +  C
Sbjct: 338 KINVLGNYQQSNYLVEYDNERSRVGFERADC 368


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 116/397 (29%), Positives = 171/397 (43%), Gaps = 62/397 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTP +       DTGS LVW  C     C D + P +DP+         SS+ 
Sbjct: 84  YLVRLAVGTP-RRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAA--------SSTY 134

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C   +C  +   +   R  G    +++C      Y   YG    T G + ++   F
Sbjct: 135 AALPCGAARCRALPFTSCGVRTLG---NHRSC-----IYAYHYGDKSLTVGEIATDRFTF 186

Query: 223 -------PSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
                   S        GC  L+         GIAGFGR   SLPSQL +  FSYC  S 
Sbjct: 187 GDSGGSGESLHTRRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTS- 245

Query: 272 KFDDAPVSSNLVLDTGPGS--GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
            F+    SS + L   P +    + +  +  TP  KNP   S      Y++ L+ I VG 
Sbjct: 246 MFESK--SSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPS-----LYFLSLKGISVGK 298

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + +P +            I+DSG++ T +   ++EAV  EF  Q+G      +    S
Sbjct: 299 TRLPVPETKF-------RSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVE---GS 348

Query: 390 GLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAG 445
            L  CF +   +  +   +P L L  + GA   LP  NY F  +G  V+C++L     A 
Sbjct: 349 ALDLCFALPVTALWRRPAVPSLTLHLE-GADWELPRSNYVFEDLGARVMCIVL----DAA 403

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           P    G   ++G+FQ QN ++ +DL NDR  FA  +C
Sbjct: 404 P----GEQTVIGNFQQQNTHVVYDLENDRLSFAPARC 436


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 164/389 (42%), Gaps = 62/389 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +  + GTP QA      DT +   W PC+    CV C       S    F P +SSSS
Sbjct: 88  YIVRANIGTPAQAML-VALDTSNDAAWIPCSG---CVGC-------SSSVLFDPSKSSSS 136

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C+ P+C     P+           +K+C      + + YG       L  +TL   
Sbjct: 137 RTLQCEAPQCKQAPNPSCTV--------SKSC-----GFNMTYGGSAIEAYLTQDTLTLA 183

Query: 224 SKTVPNFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           +  +PN+  GC         PA G+ G GR   SL SQ   L    FSYCL + K     
Sbjct: 184 TDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSK----- 238

Query: 278 VSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            SSN    L  GP +   +   +  TP  KNP  SS      YYV L  I VG+K V IP
Sbjct: 239 -SSNFSGSLRLGPKNQPIR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVDIP 289

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S L        G I DSG+ +T +  P + A+  EF R++ N    A+     G   C+
Sbjct: 290 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKN----ANATSLGGFDTCY 345

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPA 453
                 SV  P +   F  G  + LPP+N    +  GN + CL +    AA P       
Sbjct: 346 S----GSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGN-LSCLAM----AAAPTNVNSVL 395

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ++   Q QN  +  D+ N R G +++ C
Sbjct: 396 NVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 117/388 (30%), Positives = 162/388 (41%), Gaps = 49/388 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP      + D+GS ++W  C    +C    +   DP     F P  SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +  C       +     G       C      Y + YG G +T G L  ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
                 V     GC   +       AG+ G G  + SL  QLG      FSYCL SR   
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAG 288

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            A    +LVL    G  ++   G  + P  +N   SS     FYYVGL  I VG + + +
Sbjct: 289 GA---GSLVL----GRTEAVPVGAVWVPLVRNNQASS-----FYYVGLTGIGVGGERLPL 336

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
             S      DG GGV++D+G+  T +    + A+   F   MG   R+  V   S L  C
Sbjct: 337 QDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 393

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +D+SG  SV +P +   F  GA + LP  N    VG  V CL  F  +++G +       
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 445

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ILG+ Q +   +  D AN   GF    C
Sbjct: 446 ILGNIQQEGIQITVDSANGYVGFGPNTC 473


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 163/385 (42%), Gaps = 65/385 (16%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS +VW  C    RC +           P F P+RSSS   +GC    C  +   
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYE--------QSGPVFDPRRSSSYGAVGCGAALCRRL--- 49

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSIL 237
                  GC  R   C      Y + YG G  TAG  ++ETL F     V     GC   
Sbjct: 50  ----DSGGCDLRRGAC-----MYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHD 100

Query: 238 SDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVS--SNLVLDTGPG 289
           ++      AG+ G GR   S P+Q+  +    FSYCL+ R    A  +  S+       G
Sbjct: 101 NEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFG 160

Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYSYL-VPGSDGNG 347
           +G       S+TP  +NP         FYYV L  I VG   V  +  S L +  S G G
Sbjct: 161 AGSVGASSASFTPMVRNP-----RMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRG 215

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR----------PCFDI 397
           GVIVDSG++ T +    + A+   F        RAA      GLR           C+D+
Sbjct: 216 GVIVDSGTSVTRLARASYSALRDAF--------RAA---AAGGLRLSPGGFSLFDTCYDL 264

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
            G++ V +P + + F GGA+ ALPPENY   V +       F     G +       I+G
Sbjct: 265 GGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVS-------IIG 317

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           + Q Q F + FD    R GFA + C
Sbjct: 318 NIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 114/393 (29%), Positives = 165/393 (41%), Gaps = 50/393 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +    GTPPQ  +  I D+GS L+W  C+   +C             P ++P  SS
Sbjct: 62  GQYFVDFFLGTPPQKFS-LIVDSGSDLLWVQCSPCRQCY--------AQDSPLYVPSNSS 112

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           +   + C +  C  I  P  E       P +   P AC    L      + G+   E+  
Sbjct: 113 TFSPVPCLSSDCLLI--PATEGF-----PCDFRYPGACAYEYLYADTSSSKGVFAYESAT 165

Query: 222 FPSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
                +     GC   SD Q       G+ G G+   S  SQ+G     KF+YCL++   
Sbjct: 166 VDGVRIDKVAFGCG--SDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY-L 222

Query: 274 DDAPVSSNLVLDTGPGSGD---SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           D   VSS+L+       GD   S    + YTP   NP   +      YYV + ++ VG K
Sbjct: 223 DPTSVSSSLIF------GDELISTIHDMQYTPIVSNPKSPT-----LYYVQIEKVTVGGK 271

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  S       GNGG I DSG+T T+     +  +   F   + +Y RA  V+   G
Sbjct: 272 SLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQ---G 327

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
           L  C +++G      P   ++F  GA      ENYF  V   V CL +     AG A   
Sbjct: 328 LDLCVELTGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAM-----AGLASPL 382

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G    +G+   QNF++++D   +  GFA  KC+
Sbjct: 383 GGFNTIGNLLQQNFFVQYDREENLIGFAPAKCS 415


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 113/401 (28%), Positives = 171/401 (42%), Gaps = 69/401 (17%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + L+ GTPP    PF+   DTGS L W  C     C   + P  DPS    F P    
Sbjct: 77  YLMELAIGTPP---VPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSP---- 129

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL-----GFTAGLLL 216
               + C +  C  +       R + CS          PS L +YG       ++AG+L 
Sbjct: 130 ----VPCSSATCLPVL------RSRNCS---------TPSSLCRYGYSYSDGAYSAGILG 170

Query: 217 SETLRFPSK------TVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
           +ETL   S       +V +   GC   +        G  G GR + SL +QLG+ KFSYC
Sbjct: 171 TETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYC 230

Query: 268 LLS--RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           L        D+P     + +  PG G  ++     TP  ++P+  S      Y V L+ I
Sbjct: 231 LTDFFNSTLDSPFLLGTLAELAPGPGAVQS-----TPLLQSPLNPSR-----YVVSLQGI 280

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            +G   + IP       ++  GG++VDSG+TF+ +    F  V     + +G       V
Sbjct: 281 TLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQ----PPV 336

Query: 386 EKKSGLRPCFDI-SGKKSV-YLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
              S   PCF   +G++ + ++P+L+L F GGA M L  +NY +    +   CL +    
Sbjct: 337 NASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTT 396

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +           +LG+FQ QN  + FD+   +  F    C+
Sbjct: 397 STWS--------MLGNFQQQNIQMLFDMTVGQLSFLPTDCS 429


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 111/381 (29%), Positives = 172/381 (45%), Gaps = 52/381 (13%)

Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF--IPKRSSSSQLIGCQNPKC 173
           A    + DT S L W  C     C D   P  DPS  P++  +P  SSS     C   + 
Sbjct: 129 AEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSS-----CDALRV 183

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA 232
           +   G +       C+  N+  P AC SY L Y  G ++ G+L  + LR   + +  F+ 
Sbjct: 184 AMAAGTSP------CADDNEQQP-AC-SYALSYRDGSYSRGVLARDKLRLAGQDIEGFVF 235

Query: 233 GCSILSDRQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLD 285
           GC   +   P    +G+ G GRS  SL SQ   +    FSYCL  R   ++  S +LVL 
Sbjct: 236 GCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMR---ESGSSGSLVL- 291

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSA--FGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
                GD  +   + TP     + S S    G FY++ L  I VG + V+ P+       
Sbjct: 292 -----GDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWF------ 340

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
              G VI+DSG+  T +   ++ AV  EF+ Q+  Y +A      S L  CF+++G K V
Sbjct: 341 -SAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAF---SILDTCFNLTGLKEV 396

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQL 461
            +P L   F+G  ++ +  +     V ++   +CL L +  +           I+G++Q 
Sbjct: 397 QVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKS------EYDTSIIGNYQQ 450

Query: 462 QNFYLEFDLANDRFGFAKQKC 482
           +N  + FD    + GFA++ C
Sbjct: 451 KNLRVIFDTLGSQIGFAQETC 471


>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 154

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 58/151 (38%), Positives = 88/151 (58%), Gaps = 4/151 (2%)

Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           GD   P    L+YTPF  N   SSS +  FYY+ LR + +G K + +P       + GNG
Sbjct: 5   GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G I+DSG+TFT      ++ +   F  Q+G + RA++VE ++G+R C+++SG   V LP+
Sbjct: 65  GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123

Query: 408 LILKFKGGAKMALPPENYFALVGNEVLCLIL 438
               FKGG+ M LP  NYF+   ++ +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSDSICLTM 154


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 121/404 (29%), Positives = 169/404 (41%), Gaps = 73/404 (18%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +++L+ G PPQ +   + DTGS L W  C               P+    F P  SS+  
Sbjct: 66  TVTLAVGDPPQ-NISMVLDTGSELSWLHCKK------------SPNLGSVFNPVSSSTYS 112

Query: 165 LIGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            + C +P C         +R +       C P+   C +A  SY     +    G L  E
Sbjct: 113 PVPCSSPIC--------RTRTRDLPIPASCDPKTHLCHVAI-SYADATSI---EGNLAHE 160

Query: 219 TLRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           T    S T P  L GC  S LS     D +  G+ G  R S S  +QLG  KFSYC+   
Sbjct: 161 TFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI--- 217

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQIIV 327
               +   S++ L  G  S     P + YTP     V  S+    F    Y V L  I V
Sbjct: 218 ----SGSDSSVFLLLGDASYSWLGP-IQYTPL----VLQSTPLPYFDRVAYTVQLEGIRV 268

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           GSK + +P S  VP   G G  +VDSG+ FTF+ GP++ A+  EFI Q  +  R  D   
Sbjct: 269 GSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPD 328

Query: 388 ---KSGLRPCFDISGKKS---VYLPELILKFKGGAKMALPPENYFALVG-------NEVL 434
              +  +  C+ +          LP + L F+ GA+M++  +     V         EV 
Sbjct: 329 FVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVY 387

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           C      +  G       A ++G    QN ++EFDLA  R GFA
Sbjct: 388 CFTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLAKSRVGFA 426


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 111/374 (29%), Positives = 172/374 (45%), Gaps = 49/374 (13%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DT S L W  C     C D   P  DPS  P++          + C +  C  +   
Sbjct: 126 VIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAA--------VPCNSSSCDAL--- 174

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
            V +   G +  ++  P AC SY L Y  G ++ G+L  + L    + +  F+ GC   S
Sbjct: 175 RVATGMSGQACDDQ--PAAC-SYTLSYRDGSYSRGVLAHDRLSLAGEDIQGFVFGCGT-S 230

Query: 239 DRQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
           ++ P    +G+ G GRS  SL SQ   +    FSYCL  +   ++  S +LVL       
Sbjct: 231 NQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPK---ESGSSGSLVLGDDASVY 287

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP-YSYLVPGSDGNGGVI 350
            + TP + YT    +P+      G FY   L  I VG + V+ P +S     + G G  I
Sbjct: 288 RNSTP-IVYTAMVSDPLQ-----GPFYLANLTGITVGGEDVQSPGFS-----AGGGGKAI 336

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
           VDSG+  T +   ++ AV  EF+ Q+  Y +AA     S L  CFD++G + V +P L L
Sbjct: 337 VDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPF---SILDTCFDLTGLREVQVPSLKL 393

Query: 411 KFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
            F GGA++ +  +    +V  +   +CL L +  +           I+G++Q +N  + F
Sbjct: 394 VFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKS------EYDTPIIGNYQQKNLRVIF 447

Query: 469 DLANDRFGFAKQKC 482
           D    + GFA++ C
Sbjct: 448 DTVGSQIGFAQETC 461


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 122/435 (28%), Positives = 178/435 (40%), Gaps = 78/435 (17%)

Query: 70  LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
           LKT+  P++    +   ++ +L             +++L+ G+PPQ +   + DTGS L 
Sbjct: 40  LKTQKLPRSSSDKLSFRHNVTL-------------TVTLAVGSPPQ-NISMVLDTGSELS 85

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK--- 186
           W  C               P+    F P  SS+   + C +P C         +R +   
Sbjct: 86  WLHCKK------------SPNLGSVFNPVSSSTYSPVPCSSPIC--------RTRTRDLP 125

Query: 187 ---GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC--SILS--- 238
               C P+   C +A  SY     +    G L  +T    S T P  L GC  S LS   
Sbjct: 126 IPASCDPKTHFCHVAI-SYADATSI---EGNLAHDTFVIGSVTRPGTLFGCMDSGLSSDS 181

Query: 239 --DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
             D +  G+ G  R S S  +QLG  KFSYC+       +   S+ +L  G  S     P
Sbjct: 182 EEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI-------SGSDSSGILLLGDASYSWLGP 234

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
            + YTP               Y V L  I VGSK + +P S  VP   G G  +VDSG+ 
Sbjct: 235 -IQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 293

Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKS---VYLPELIL 410
           FTF+ GP++ A+  EFI Q  +  R  D      +  +  C+ +          LP + L
Sbjct: 294 FTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGLPVISL 353

Query: 411 KFKGGAKMALPPENYFALVG-------NEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            F+ GA+M++  +     V         EV C      +  G       A ++G    QN
Sbjct: 354 MFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIE-----AFVIGHHHQQN 407

Query: 464 FYLEFDLANDRFGFA 478
            ++EFDLA  R GFA
Sbjct: 408 VWMEFDLAKSRVGFA 422


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 126/396 (31%), Positives = 181/396 (45%), Gaps = 40/396 (10%)

Query: 97  SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           SVH S   Y +  + GTPP A +  + DTGS L+W  C +   C  C      P   P +
Sbjct: 92  SVHASTATYLVDFAIGTPPLALSA-VLDTGSDLIWTQCDAP--CRRCF-----PQPAPLY 143

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
            P RS +   + C +  C  +      SRC   +         C +Y   YG G  T G+
Sbjct: 144 APARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGC-TYYYSYGDGSSTDGV 202

Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
           L +ET  F + T  + LA GC   ++      +G+ G GR   SL SQLG+ KFSYC   
Sbjct: 203 LATETFTFGAGTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCFT- 261

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
             F+D   SS L L    GS  S +P    TPF  +P G   +   +YY+ L  I VG  
Sbjct: 262 -PFNDTTTSSPLFL----GSSASLSPAAKSTPFVPSPSGPRRS--SYYYLSLEGITVGDT 314

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEKKS 389
            + I  +     + G GG+I+DSG+TFT +E   F  +A+    ++    +  A +    
Sbjct: 315 LLPIDPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHL---- 370

Query: 390 GLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
           GL  CF      G ++V +P L+L F  GA M LP  +  A+V + V  +        G 
Sbjct: 371 GLSVCFAAPQGRGPEAVDVPRLVLHFD-GADMELPRSS--AVVEDRVAGVACL-----GI 422

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              RG + +LG  Q QN ++ +D+  D   F    C
Sbjct: 423 VSARGMS-VLGSMQQQNMHVRYDVGRDVLSFEPANC 457


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 165/391 (42%), Gaps = 40/391 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS LVW  C+   RC           R   F P+RSS
Sbjct: 84  GEYFALVGVGTPSTKAM-LVIDTGSDLVWLQCSPCRRCY--------AQRGQVFDPRRSS 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
           + + + C +P+C  +  P     C         C      Y++ YG G ++ G L ++ L
Sbjct: 135 TYRRVPCSSPQCRALRFPG----CDSGGAAGGGC-----RYMVAYGDGSSSTGELATDKL 185

Query: 221 RFPSKT-VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKF 273
            F + T V N   GC   ++      AG+ G  R   S+ +Q+       F YCL  R  
Sbjct: 186 AFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRT- 244

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
             +  SS LV    P     + P  ++T    NP   S      YYV +    VG + V 
Sbjct: 245 SRSTRSSYLVFGRTP-----EPPSTAFTALLSNPRRPS-----LYYVDMAGFSVGGERVT 294

Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
                S  +  + G GGV+VDSG+  +      + A+   F  +           + S  
Sbjct: 295 GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVF 354

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D+ G+ +   P ++L F GGA MALPPENYF  V         +       A   G
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ++I G+ Q Q F + FD+  +R GFA + C
Sbjct: 415 LSVI-GNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 129/434 (29%), Positives = 189/434 (43%), Gaps = 58/434 (13%)

Query: 59  LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
           LA+   +R  +L+ +  P T  + +GS   + + +        G Y + +  G+PP    
Sbjct: 94  LAARDGARVEYLQRRLSPTTMTTEVGSEVVSGISE------GSGEYFVRVGVGSPPTEQY 147

Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
             + D+GS ++W  C     C +C +   DP     F P  S+S   + C +  C  + G
Sbjct: 148 -LVVDSGSDVIWIQCRP---CAEC-YQQADP----LFDPAASASFTAVPCDSGVCRTLPG 198

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSI 236
            +      GC+        AC  Y + YG G +T G+L  ETL F   T V     GC  
Sbjct: 199 GS-----SGCADSG-----AC-RYQVSYGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGH 247

Query: 237 LSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGS 290
            +       AG+ G G    SL  QLG      FSYCL SR  D    + +LV     G 
Sbjct: 248 RNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAG--AGSLVF----GR 301

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
            D+   G  + P  +N    S     FYYVGL  + VG + + +         DG GGV+
Sbjct: 302 DDAMPVGAVWVPLLRNAQQPS-----FYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVV 356

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
           +D+G+  T +    + A+   F   + G+  RA  V   S L  C+D+SG  SV +P + 
Sbjct: 357 MDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGV---SLLDTCYDLSGYASVRVPTVA 413

Query: 410 LKF-KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           L F + GA + LP  N    +G  V CL  F  +A+G +       ILG+ Q Q   +  
Sbjct: 414 LYFGRDGAALTLPARNLLVEMGGGVYCLA-FAASASGLS-------ILGNIQQQGIQITV 465

Query: 469 DLANDRFGFAKQKC 482
           D AN   GF    C
Sbjct: 466 DSANGYVGFGPSTC 479


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 116/388 (29%), Positives = 161/388 (41%), Gaps = 49/388 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP      + D+GS ++W  C    +C    +   DP     F P  SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +  C       +     G       C      Y + YG G +T G L  ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
                 V     GC   +       AG+ G G  + SL  QLG      FSYCL SR   
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAG 288

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            A    +LVL    G  ++   G  + P  +N   SS     FYYVGL  I VG + + +
Sbjct: 289 GA---GSLVL----GRTEAVPVGAVWVPLVRNNQASS-----FYYVGLTGIGVGGERLPL 336

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
                    DG GGV++D+G+  T +    + A+   F   MG   R+  V   S L  C
Sbjct: 337 QDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 393

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +D+SG  SV +P +   F  GA + LP  N    VG  V CL  F  +++G +       
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 445

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ILG+ Q +   +  D AN   GF    C
Sbjct: 446 ILGNIQQEGIQITVDSANGYVGFGPNTC 473


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 163/387 (42%), Gaps = 56/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           + +    GTP Q +     DT +   W PC+    C+ C    V       F   +SSS 
Sbjct: 103 FVVRAKIGTPAQ-TLLLALDTSNDAAWIPCSG---CIGCPSTTV-------FSSDKSSSF 151

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + CQ+P+C+ +  P+    C G          AC  + L YG    A  L+ + L   
Sbjct: 152 RPLPCQSPQCNQVPNPS----CSGS---------AC-GFNLTYGSSTVAADLVQDNLTLA 197

Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
           + +VP++  GC      S +  +   G+     S       L    FSYCL S  F    
Sbjct: 198 TDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPS--FKSVN 255

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP +   +   + YTP  +NP  SS      YYV L  I VG K V IP S
Sbjct: 256 FSGSLRL--GPVAQPIR---IKYTPLLRNPRRSS-----LYYVNLISIRVGRKIVDIPPS 305

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L   S    G ++DSG+TFT +  P + AV  EF R++G   R   V    G   C+ +
Sbjct: 306 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVG---RNVTVSSLGGFDTCYTV 362

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                +  P +   F  G  + LPP+N+          CL +    AA P        ++
Sbjct: 363 ----PIISPTITFMF-AGMNVTLPPDNFLIHSTAGSTTCLAM----AAAPDNVNSVLNVI 413

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
              Q QN  + FD+ N R G A++ C+
Sbjct: 414 ASMQQQNHRILFDIPNSRVGVARESCS 440


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 117/394 (29%), Positives = 170/394 (43%), Gaps = 55/394 (13%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR---YRCVDCNFPNVDPSRIPAFIPKRS 160
           Y  S   G+PPQ +   I DTGS L+W  C +      C     P  + S+   F+P   
Sbjct: 86  YIASYLIGSPPQRTEALI-DTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVP--- 141

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL 220
                + C +   +     N    C      + +C     +++  YG G   G L +E+ 
Sbjct: 142 -----VPCADK--AGFCAANGVHLCG----LDGSC-----TFIASYGAGRVIGSLGTESF 185

Query: 221 RFPSKTVPNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
            F S T  +   GC  L+          +G+ G GR   SL SQ+G  +FSYC L+  F 
Sbjct: 186 AFESGTT-SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYC-LTPYFH 243

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
            +  SS+L +        S   G +  PF K+P      +  FYY+ L  I VG   +  
Sbjct: 244 SSGASSHLFVGASA----SLGGGGASMPFVKSP--KDYPYSTFYYLPLEGITVGKTRLPA 297

Query: 333 ----KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
                     L  G    GGVI+D+GS  T +    +EA+ +E   Q+GN S     E  
Sbjct: 298 VNSTTFQLRQLFKGY-WAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPE-D 355

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           SGL  C    G + V +P L+  F GGA MA+P  +Y+A V     C+++         L
Sbjct: 356 SGLELCVAREGFQKV-VPALVFHFGGGADMAVPAASYWAPVDKAAACMMI---------L 405

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             G   I+G+FQ Q+ +L +DL   RF F    C
Sbjct: 406 EGGYDSIIGNFQQQDMHLLYDLRRGRFSFQTADC 439


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 132/427 (30%), Positives = 178/427 (41%), Gaps = 78/427 (18%)

Query: 86   NYSNSLIKTP---LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC 142
            N    LI  P   LS H     ++SL+ G+PPQ  T  + DTGS L W  C         
Sbjct: 979  NTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVT-MVLDTGSELSWLHCKK------- 1030

Query: 143  NFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG------CSPRNKTCP 196
                  P+    F P  SSS   I C +P C         +R +       C P+ K C 
Sbjct: 1031 -----SPNLTSVFNPLSSSSYSPIPCSSPIC--------RTRTRDLPNPVTCDPK-KLCH 1076

Query: 197  LACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFG 249
             A  SY     L    G L S+  R  S  +P  L GC       +   D +  G+ G  
Sbjct: 1077 -AIVSYADASSL---EGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMN 1132

Query: 250  RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
            R S S  +QLGL KFSYC+  R      +  +L L        S    L+YTP     V 
Sbjct: 1133 RGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDLHL--------SWLGNLTYTPL----VQ 1180

Query: 310  SSSAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
             S+    F    Y V L  I VG+K + +P S   P   G G  +VDSG+ FTF+ GP++
Sbjct: 1181 ISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVY 1240

Query: 366  EAVAKEFIRQM-GNYSRAAD--VEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALP 421
             A+  EF+ Q  G  +   D     +  +  C+ + +G K   LP + L F+ GA+M + 
Sbjct: 1241 TALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFR-GAEMVVG 1299

Query: 422  PENYFALV-----GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
             E     V     GNE V CL     +  G       A ++G    QN ++EFDL     
Sbjct: 1300 GEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLV---- 1350

Query: 476  GFAKQKC 482
             FA   C
Sbjct: 1351 AFAADLC 1357


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 172/412 (41%), Gaps = 82/412 (19%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPP+       DTGS LVW  C     C     P +DP+         SS+ 
Sbjct: 92  YLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAA--------SSTY 142

Query: 164 QLIGCQNPKCSWI-----FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
             + C  P+C  +      G    S   G    N++C     +Y+  YG    T G + +
Sbjct: 143 AALPCGAPRCRALPFTSCGGGGRSSWGNG----NRSC-----AYIYHYGDKSVTVGEIAT 193

Query: 218 ETL-----------RFPSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK 262
           +             R P++       GC   +         GIAGFGR   SLPSQL + 
Sbjct: 194 DRFTFGGDNGDGDSRLPTR---RLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVT 250

Query: 263 KFSYCLLSR--------KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
            FSYC  S             AP ++ L       SG+ +T     TP  KNP   S   
Sbjct: 251 TFSYCFTSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRT-----TPLLKNPSQPS--- 302

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
              Y++ L+ I VG   + +P + L          I+DSG++ T +   ++EAV  EF  
Sbjct: 303 --LYFLSLKGISVGKTRLAVPEAKL-------RSTIIDSGASITTLPEAVYEAVKAEFAA 353

Query: 375 QMGNYSRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENY-FALVG 430
           Q+G       V + S L  CF +   +  +   +P L L    GA   LP  NY F  + 
Sbjct: 354 QVG--LPPTGVVEGSALDLCFALPVTALWRRPPVPSLTLHLD-GADWELPRGNYVFEDLA 410

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             V+C++L     A P    G   ++G+FQ QN ++ +DL ND   FA  +C
Sbjct: 411 ARVMCVVL----DAAP----GDQTVIGNFQQQNTHVVYDLENDWLSFAPARC 454


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 117/401 (29%), Positives = 168/401 (41%), Gaps = 64/401 (15%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           +SL+ GTPPQ +   + DTGS L W  C         ++P         F P RS+S Q 
Sbjct: 33  VSLTVGTPPQ-NVSMVIDTGSELSWLHCNKTL-----SYPT-------TFDPTRSTSYQT 79

Query: 166 IGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
           I C +P C+        +R +       C   N      C + L       + G L S+ 
Sbjct: 80  IPCSSPTCT--------NRTQDFPIPASCDSNN-----LCHATLSYADASSSDGNLASDV 126

Query: 220 LRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRK 272
               S  +   + GC  S+ S     D +  G+ G  R S S  SQLG  KFSYC+    
Sbjct: 127 FHIGSSDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCISGTD 186

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
           F      S L+L  G  +     P L+YTP  +            Y V L  I V  K +
Sbjct: 187 F------SGLLL-LGESNLTWSVP-LNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLL 238

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA---DVEKKS 389
            IP S   P   G G  +VDSG+ FTF+ GP++ A+   F+ Q  +  R     D   + 
Sbjct: 239 PIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQG 298

Query: 390 GLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYF-----ALVGNE-VLCLILFTD 441
            +  C+ +   + V   LP + L F+ GA+M +  +         L GN+ V CL     
Sbjct: 299 AMDLCYLVPLSQRVLPLLPTVTLVFR-GAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNS 357

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +  G       A ++G    QN ++EFDL   R G A+ +C
Sbjct: 358 DLLGVE-----AYVIGHHHQQNVWMEFDLEKSRIGLAQVRC 393


>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 60/152 (39%), Positives = 88/152 (57%), Gaps = 5/152 (3%)

Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           GD   P    L+YTPF  N   SSS +  FYY+ LR + +G K + +P       S GNG
Sbjct: 5   GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDSKGNG 64

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G I+DSG+TFT      ++ +   F  Q+G + RA++VE ++G+R C+++SG   V LP+
Sbjct: 65  GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123

Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
               FKGG+ M LP  NYF+  V  + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 114/379 (30%), Positives = 163/379 (43%), Gaps = 64/379 (16%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS L+W  C     C D           P F  K+S++ + + C++ +C+ +  P  
Sbjct: 1   MDTGSDLIWTQCAPCLLCAD--------QPTPYFDVKKSATYRALPCRSSRCASLSSP-- 50

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT-----VPNFLAGCS 235
                  S   K C      Y   YG    TAG+L +ET  F +         N   GC 
Sbjct: 51  -------SCFKKMC-----VYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCG 98

Query: 236 ILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP------VSSNLVLDT 286
            L+    A   G+ GFGR   SL SQLG  +FSYCL S      P      V +NL   T
Sbjct: 99  SLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSY-LSATPSRLYFGVYANLS-ST 156

Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
              SG      +  TPF  NP     A    Y++ L+ I +G+K + I         DG 
Sbjct: 157 NTSSGSP----VQSTPFVINP-----ALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGT 207

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI--SGKKSVY 404
           GGVI+DSG++ T+++   +EAV +  +  +      A  +   GL  CF        +V 
Sbjct: 208 GGVIIDSGTSITWLQQDAYEAVRRGLVSAI---PLPAMNDTDIGLDTCFQWPPPPNVTVT 264

Query: 405 LPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
           +P+L+  F   A M L PENY  +      LCL++       P    G   I+G++Q QN
Sbjct: 265 VPDLVFHFD-SANMTLLPENYMLIASTTGYLCLVM------APT---GVGTIIGNYQQQN 314

Query: 464 FYLEFDLANDRFGFAKQKC 482
            +L +D+ N    F    C
Sbjct: 315 LHLLYDIGNSFLSFVPAPC 333


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  118 bits (296), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 171/394 (43%), Gaps = 55/394 (13%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP---AFIPKRSSS 162
           +SL  GTPPQ     I DTGS L W  C  +            P + P    F P  SSS
Sbjct: 84  VSLPIGTPPQTQQ-MILDTGSQLSWIQCHKKV-----------PRKPPPSSVFDPSLSSS 131

Query: 163 SQLIGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
             ++ C +P C   I    + + C     +N+ C     SY    G     G L+ E + 
Sbjct: 132 FSVLPCNHPLCKPRIPDFTLPTSCD----QNRLCHY---SYFYADGT-LAEGNLVREKIT 183

Query: 222 FP-SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA--PV 278
           F  S++ P  + GC+  S     GI G      S  SQ  L KFSYC+ +R+      P 
Sbjct: 184 FSRSQSTPPLILGCAEESS-DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPT 242

Query: 279 SSNLVLDTGPGSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S   L   P SG  +   L +++   + P     A    Y V ++ I +G++ + IP S
Sbjct: 243 GS-FYLGENPNSGGFRYINLLTFSQSQRMPNLDPLA----YTVAMQGIRIGNQKLNIPIS 297

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKKSG 390
              P   G G  ++DSGS FT++    +  V +E +R +G        Y   +D+     
Sbjct: 298 AFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDM----- 352

Query: 391 LRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
              CF+ +  +    +  ++ +F  G ++ +  E   A VG  V C+ +      G A  
Sbjct: 353 ---CFNGNAIEIGRLIGNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAA-- 407

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              + I+G+F  QN ++EFDLAN R GF K  C+
Sbjct: 408 ---SNIIGNFHQQNIWVEFDLANRRVGFGKADCS 438


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 115/399 (28%), Positives = 170/399 (42%), Gaps = 62/399 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSS 162
           Y + L+ GTPPQ  +  + DTGS L+W  C     C  C + P+      P F P +S+S
Sbjct: 96  YVVDLAIGTPPQPVSALL-DTGSDLIWTQCAP---CASCLSQPD------PLFAPGQSAS 145

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLR 221
            + + C    CS I   + E        R  TC     +Y   YG G  T G+  +E   
Sbjct: 146 YEPMRCAGTLCSDILHHSCE--------RPDTC-----TYRYNYGDGTMTVGVYATERFT 192

Query: 222 FPSKTVPNFLA-------GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           F S               GC   ++ S    +GI GFGR+  SL SQL +++FSYCL S 
Sbjct: 193 FASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTS- 251

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
            +     S+ L      G     T  +  TP  ++P   +     FYYV    + VG++ 
Sbjct: 252 -YASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPT-----FYYVHFTGLTVGARR 305

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           ++IP S      DG+GGVIVDSG+  T +   +   V + F RQ      A     + G+
Sbjct: 306 LRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAF-RQQLRLPFANGGNPEDGV 364

Query: 392 RPCFDI-------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNA 443
             CF +       S    + +P ++L F+ GA + LP  NY         LCL+L     
Sbjct: 365 --CFLVPAAWRRSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGD 421

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G          +G+   Q+  + +DL  +    A  +C
Sbjct: 422 DGST--------IGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 129/500 (25%), Positives = 205/500 (41%), Gaps = 85/500 (17%)

Query: 1   MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDS----DPLKIL 56
           M AC   L  LF L IL                 P+T  + + +L H D        ++L
Sbjct: 7   MKACSCMLPYLFFLAILF--------------AWPVTSATLRAHLSHVDDGRGFTKRELL 52

Query: 57  HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
             +   S +RA +L   +    + +      +N+ + +         Y I LS G P   
Sbjct: 53  RRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSE--------YLIHLSIGAPRSQ 104

Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
                 DTGS +VW  C     C +C         +P F    S++ + + C +P C   
Sbjct: 105 PVVLTLDTGSDVVWTQCEP---CAECF-----TQPLPRFDTAASNTVRSVACSDPLC--- 153

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK------TVPN 229
              N  S           C L   +Y+  YG G  + G  L ++  F         TVP+
Sbjct: 154 ---NAHS--------EHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPD 202

Query: 230 FLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
              GC + +     +   GIAGFGR   SLPSQL +++FSYC  +R   +A  S   +  
Sbjct: 203 IGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRF--EAKSSPVFLGG 260

Query: 286 TGPGSGDSKTPGLSYTPFYKN-PVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
            G     +  P LS TPF ++ P G+ ++    Y +  + + VG   + +P       +D
Sbjct: 261 AGDLKAHATGPILS-TPFVRSLPPGTDNS---HYVLSFKGVTVGKTRLPVPEIK----AD 312

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSV 403
           G+G   +DSG+  T     +F  +   FI Q     ++ AD +       CF   GKK+ 
Sbjct: 313 GSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADEDDI-----CFSWDGKKTA 367

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
            +P+L+   + GA   LP ENY         +C+ + T        G+    ++G+FQ Q
Sbjct: 368 AMPKLVFHLE-GADWDLPRENYVTEDRESGQVCVAVSTS-------GQMDRTLIGNFQQQ 419

Query: 463 NFYLEFDLANDRFGFAKQKC 482
           N ++ +DLA  +      +C
Sbjct: 420 NTHIVYDLAAGKLLLVPAQC 439


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 127/434 (29%), Positives = 182/434 (41%), Gaps = 59/434 (13%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R R +++K K   K  +  S+   +   T   ++  G Y + L  GTP + S   + DTG
Sbjct: 16  RVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPAR-SLFMVVDTG 74

Query: 126 SSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           S L W    PC S Y+  D           P F P+ SSS Q I C +P C  +    V 
Sbjct: 75  SDLPWLQCQPCKSCYKQAD-----------PIFDPRNSSSFQRIPCLSPLCKAL---EVH 120

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA-GCSILSDR 240
           S C G       C     SY + YG G F+ G   S+     + +    +A GC   ++ 
Sbjct: 121 S-CSGSRGATSRC-----SYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEG 174

Query: 241 QPAGIAGFGRSSE---SLPSQL--------GLKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
             AG AG         S PSQ+            FSYCL+ R       SS+L+      
Sbjct: 175 LFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIF----- 229

Query: 290 SGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
            G +  P  +  +P  KNP         FYY  +  + VG   + I    L     G+GG
Sbjct: 230 -GVAAIPSTAALSPLLKNP-----KLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 283

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
           VI+DSG++ T     ++  +   F     N   A    + S    C++ SGK SV +P L
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAFRNATINLPSA---PRYSLFDTCYNFSGKASVDVPAL 340

Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           +L F+ GA + LPP NY   +       + F   +    LG     I+G+ Q Q+F + F
Sbjct: 341 VLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM--ELG-----IIGNIQQQSFRIGF 393

Query: 469 DLANDRFGFAKQKC 482
           DL      FA Q+C
Sbjct: 394 DLQKSHLAFAPQQC 407


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 120/417 (28%), Positives = 177/417 (42%), Gaps = 83/417 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR----IPAFIP 157
           G Y + L  GTP +   P I DTGS L W         + CN PN   +      P +  
Sbjct: 25  GQYFVELRVGTPAK-KFPLIIDTGSDLTW---------IQCNPPNTTANSSSPPAPWYDK 74

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TA 212
             SSS + I C + +C  +F P         +P   +C +  PS    Y  G+     T 
Sbjct: 75  SSSSSYREIPCTDDEC--LFLP---------APIGSSCSIKSPS-PCDYTYGYSDQSRTT 122

Query: 213 GLLLSETLRFPSKT---------------VPNFLAGCSILSDRQ----PAGIAGFGRSSE 253
           G+L  ET+   S+                + N   GCS  S        +G+ G G+   
Sbjct: 123 GILAYETISMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPI 182

Query: 254 SLPSQLGLKK----FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
           SL +Q         FSYCL+      +  SS LV+      G ++   L++TP  +NP  
Sbjct: 183 SLATQTRHTALGGIFSYCLVDY-LRGSNASSFLVM------GRTRWRKLAHTPIVRNPAA 235

Query: 310 SSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
            S     FYYV +  + V  K V  I  S      DGN G I DSG+T +++  P +  V
Sbjct: 236 QS-----FYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKV 290

Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
                  +    RA ++ +  G   C++++ +    +P+L ++F+GGA M LP  NY  L
Sbjct: 291 LGALNASI-YLPRAQEIPE--GFELCYNVT-RMEKGMPKLGVEFQGGAVMELPWNNYMVL 346

Query: 429 VGNEVLCLIL---FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           V   V C+ L    T N +          ILG+   Q+ ++E+DLA  R GF    C
Sbjct: 347 VAENVQCVALQKVTTTNGSN---------ILGNLLQQDHHIEYDLAKARIGFKWSPC 394


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 138/445 (31%), Positives = 195/445 (43%), Gaps = 58/445 (13%)

Query: 47  HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSI 106
             DS  +K + SLA+ S  R     TK  P+T        +S ++I + LS  S G Y +
Sbjct: 88  QRDSLRVKSITSLAAVSTGRN---ATKRTPRT-----AGGFSGAVI-SGLSQGS-GEYFM 137

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
            L  GTP   +   + DTGS +VW  C+    C  C +   D      F PK+S +   +
Sbjct: 138 RLGVGTPA-TNVYMVLDTGSDVVWLQCSP---CKAC-YNQTDA----IFDPKKSKTFATV 188

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
            C +  C  +   +  S C   + R+KTC      Y + YG G FT G   +ETL F   
Sbjct: 189 PCGSRLCRRL---DDSSEC--VTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGA 238

Query: 226 TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVS 279
            V +   GC   ++      AG+ G GR   S PSQ   +   KFSYCL+ R    +   
Sbjct: 239 RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSK 298

Query: 280 SNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYS 337
               +  G    ++  P  S +TP   NP         FYY+ L  I VG   V  +  S
Sbjct: 299 PPSTIVFG----NAAVPKTSVFTPLLTNP-----KLDTFYYLQLLGISVGGSRVPGVSES 349

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                + GNGGVI+DSG++ T +  P + A+   F        RA      S    CFD+
Sbjct: 350 QFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRA---PSYSLFDTCFDL 406

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
           SG  +V +P ++  F GG +++LP  NY   V  E      F    AG     G   I+G
Sbjct: 407 SGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAF----AGTM---GSLSIIG 458

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           + Q Q F + +DL   R GF  + C
Sbjct: 459 NIQQQGFRVAYDLVGSRVGFLSRAC 483


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 168/404 (41%), Gaps = 73/404 (18%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +++L+ G PPQ +   + DTGS L W  C               P+    F P  SS+  
Sbjct: 66  TVTLAVGDPPQ-NISMVLDTGSELSWLHCKK------------SPNLGSVFNPVSSSTYS 112

Query: 165 LIGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            + C +P C         +R +       C P+   C +A  SY     +    G L  E
Sbjct: 113 PVPCSSPIC--------RTRTRDLPIPASCDPKTHLCHVAI-SYADATSI---EGNLAHE 160

Query: 219 TLRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           T    S T P  L GC  S LS     D +  G+ G  R S S  +QLG  KFSYC+   
Sbjct: 161 TFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI--- 217

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQIIV 327
                  SS  +L  G  S     P + YTP     V  S+    F    Y V L  I V
Sbjct: 218 ---SGSDSSGFLL-LGDASYSWLGP-IQYTPL----VLQSTPLPYFDRVAYTVQLEGIRV 268

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           GSK + +P S  VP   G G  +VDSG+ FTF+ GP++ A+  EFI Q  +  R  D   
Sbjct: 269 GSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPD 328

Query: 388 ---KSGLRPCFDISGKKS---VYLPELILKFKGGAKMALPPENYFALVG-------NEVL 434
              +  +  C+ +          LP + L F+ GA+M++  +     V         EV 
Sbjct: 329 FVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVY 387

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           C      +  G       A ++G    QN ++EFDLA  R GFA
Sbjct: 388 CFTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLAKSRVGFA 426


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 126/448 (28%), Positives = 181/448 (40%), Gaps = 60/448 (13%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
            + S  D     H+       R   L  +  P+   S+       + + + ++  S G Y
Sbjct: 84  FNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFGAEVVSGMNQGS-GEY 142

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
            I +  G+PP+     + D+GS +VW    PCT  Y   D           P F P  S+
Sbjct: 143 FIRIGVGSPPREQY-VVIDSGSDIVWVQCQPCTQCYHQTD-----------PVFDPADSA 190

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +  C  I               N  C      Y + YG G +T G L  ETL
Sbjct: 191 SFMGVPCSSSVCERI--------------ENAGCHAGGCRYEVMYGDGSYTKGTLALETL 236

Query: 221 RFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
            F    V N   GC   +       AG+ G G  S SL  QLG +    FSYCL+SR  D
Sbjct: 237 TFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTD 296

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            A       L+ G G+      G ++ P  +NP   S     FYY+ L  + VG   V I
Sbjct: 297 SAGS-----LEFGRGA---MPVGAAWIPLIRNPRAPS-----FYYIRLSGVGVGGMKVPI 343

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
                     GNGGV++D+G+  T +    + A    FI Q GN  RA+ V   S    C
Sbjct: 344 SEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGV---SIFDTC 400

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           ++++G  SV +P +   F GG  + LP  N+   V +       F  + +G +       
Sbjct: 401 YNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLS------- 453

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q +   + FD AN   GF    C
Sbjct: 454 IIGNIQQEGIQISFDGANGFVGFGPNVC 481


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 118/387 (30%), Positives = 168/387 (43%), Gaps = 56/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           + +    GTP Q +     DT +   W PC+    C+ C    V       F   +SSS 
Sbjct: 26  FVVRAKIGTPAQ-TLLLALDTSNDAAWIPCSG---CIGCPSTTV-------FSSDKSSSF 74

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + CQ+P+C+ +  P+    C G          AC  + L YG    A  L+ + L   
Sbjct: 75  RPLPCQSPQCNQVPNPS----CSGS---------AC-GFNLTYGSSTVAADLVQDNLTLA 120

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           + +VP++  GC   +  S   P G+ G GR   SL  Q   L    FSYCL S  F    
Sbjct: 121 TDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPS--FKSVN 178

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP +   +   + YTP  +NP  SS      YYV L  I VG K V IP S
Sbjct: 179 FSGSLRL--GPVAQPIR---IKYTPLLRNPRRSS-----LYYVNLISIRVGRKIVDIPPS 228

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L   S    G ++DSG+TFT +  P + AV  EF R++G   R   V    G   C+ +
Sbjct: 229 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVG---RNVTVSSLGGFDTCYTV 285

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                +  P +   F  G  + LPP+N+          CL +    AA P        ++
Sbjct: 286 ----PIISPTITFMF-AGMNVTLPPDNFLIHSTSGSTTCLAM----AAAPDNVNSVLNVI 336

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
              Q QN  + FD+ N R G A++ C+
Sbjct: 337 ASMQQQNHRILFDIPNSRVGVARESCS 363


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 122/415 (29%), Positives = 171/415 (41%), Gaps = 72/415 (17%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L  H     ++SL+ G+PPQ  T  + DTGS L W        C    F N        F
Sbjct: 61  LLFHHNVSLTVSLTVGSPPQNVT-MVLDTGSELSWL------HCKKTQFLN------SVF 107

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA--- 212
            P  S +   + C +P C                 R+ T P++C +  L + +   A   
Sbjct: 108 NPLSSKTYSKVPCLSPTCK-------------TRTRDLTIPVSCDATKLCHVIVSYADAT 154

Query: 213 ---GLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
              G L  ET R  S T P  + GC       +   D +  G+ G  R S S  +Q+G  
Sbjct: 155 SIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP 214

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
           KFSYC+    FD A V     L  G  S     P LSYTP     V  S+    F    Y
Sbjct: 215 KFSYCI--SGFDSAGV-----LLLGNASFPWLKP-LSYTPL----VQISTPLPYFDRVAY 262

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            V L  I V +K + +P S  VP   G G  +VDSG+ FTF+ GP++ A+  EF+ Q   
Sbjct: 263 TVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRG 322

Query: 379 YSRAADVEK---KSGLRPCF--DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE- 432
             +  + +    +  +  C+  D S      LP + L F+ GA+M++  E     V  E 
Sbjct: 323 ILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQ-GAEMSVSGERLLYRVPGEV 381

Query: 433 -----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                V C      +  G       A ++G    QN ++EFDL   R G A  +C
Sbjct: 382 RGRDSVWCFTFGNSDLLGVE-----AFVIGHHHQQNVWMEFDLEKSRIGLADVRC 431


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 130/458 (28%), Positives = 189/458 (41%), Gaps = 63/458 (13%)

Query: 42  KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
           K  LH  +   L+ L         R R +++K +   K  +  S+   +   T   ++  
Sbjct: 71  KEKLHTHEQLLLETLQR----DEQRVRWIESKAQLAGKKKDEASSTDLNGPVTSGLLYGS 126

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y + L  GTP + S   + DTGS L W    PC S Y+  D           P F P+
Sbjct: 127 GEYFVRLGVGTPAR-SLFMVVDTGSDLPWLQCQPCKSCYKQAD-----------PIFDPR 174

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLS 217
            SSS Q I C +P C  +    + S C G       C     SY + YG G F+ G   S
Sbjct: 175 NSSSFQRIPCLSPLCKAL---EIHS-CSGSRGATSRC-----SYQVAYGDGSFSVGDFSS 225

Query: 218 ETLRFPSKTVPNFLA-GCSILSDRQPAGIAGFGRSSE---SLPSQL--------GLKKFS 265
           +     + +    +A GC   ++   AG AG         S PSQ+            FS
Sbjct: 226 DLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFS 285

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQ 324
           YCL+ R       SS+L+       G +  P  +  +P  KNP         FYY  +  
Sbjct: 286 YCLVDRSNPMTRSSSSLIF------GAAAIPSTAALSPLLKNP-----KLDTFYYAAMIG 334

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           + VG   + I    L     G+GGVI+DSG++ T     ++  +   F     N   A  
Sbjct: 335 VSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAP- 393

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
             + S    C++ SGK SV +P L+L F+ GA + LPP NY   +       + F   + 
Sbjct: 394 --RYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM 451

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              LG     I+G+ Q Q+F + FDL      FA Q+C
Sbjct: 452 --ELG-----IIGNIQQQSFRIGFDLQKSHLAFAPQQC 482


>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 5/152 (3%)

Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           GD   P    L+YTPF  N   SSS +  FYY+ LR + +G K + +P       + GNG
Sbjct: 5   GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G I+DSG+TFT      ++ +   F  Q+G + RA++VE ++G+R C+++SG   V LP+
Sbjct: 65  GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123

Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
               FKGG+ M LP  NYF+  V  + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 160/389 (41%), Gaps = 66/389 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP+  +  I DTGS L W  C     C DC F   D    P +     S
Sbjct: 168 GEYFMDVLVGSPPKHFS-LILDTGSDLNWIQCLP---CYDC-FQQNDNQSCPYYYWYGDS 222

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           S                             N T   A  ++ +       +  L +    
Sbjct: 223 S-----------------------------NTTGDFAVETFTVNLTTNGGSSELYN---- 249

Query: 222 FPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDD 275
                V N + GC   +       AG+ G GR   S  SQL       FSYCL+ R   D
Sbjct: 250 -----VENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN-SD 303

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
             VSS L+   G        P L++T F     G  +    FYYV ++ I+V  + + IP
Sbjct: 304 TNVSSKLIF--GEDKDLLSHPNLNFTSFV---AGKENLVDTFYYVQIKSILVAGEVLNIP 358

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPC 394
                  SDG GG I+DSG+T ++   P +E +  +   +  G Y    D      L PC
Sbjct: 359 EETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPI---LDPC 415

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           F++SG  +V LPEL + F  GA    P EN F  +  +++CL +           +    
Sbjct: 416 FNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAML-------GTPKSAFS 468

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I+G++Q QNF++ +D    R G+A  KCA
Sbjct: 469 IIGNYQQQNFHILYDTKRSRLGYAPTKCA 497


>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 5/152 (3%)

Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           GD   P    L+YTPF  N   SSS +  FYY+ LR + +G K + +P       + GNG
Sbjct: 5   GDKALPTAMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G I+DSG+TFT      ++ +   F  Q+G + RA++VE ++G+R C+++SG   V LP+
Sbjct: 65  GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123

Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
               FKGG+ M LP  NYF+  V  + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 132/471 (28%), Positives = 197/471 (41%), Gaps = 98/471 (20%)

Query: 35  PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK----TKTKPKTKDSNIGSNYSNS 90
           PL+P       ++S+   L+ +++    S+SR  H          PK  +S++ SN    
Sbjct: 42  PLSPF------YNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAAESDVTSNR--- 92

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
                      G Y +SLS GTPP      I DTGS L+W  C    RC    +  VD  
Sbjct: 93  -----------GEYLMSLSLGTPP-FKIMGIADTGSDLIWTQCKPCERC----YKQVD-- 134

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LG 209
             P F PK S + +   C   +CS +     +S C G       C      Y   YG   
Sbjct: 135 --PLFDPKSSKTYRDFSCDARQCSLL----DQSTCSG-----NIC-----QYQYSYGDRS 178

Query: 210 FTAGLLLSETLRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLG 260
           +T G + S+T+   S T      P  + GC   +D     + +GI G G    SL SQ+G
Sbjct: 179 YTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMG 238

Query: 261 LK---KFSYCLL---SRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
                KFSYCL+   SR  + + ++  SN V+ +GPG                 P+ SS 
Sbjct: 239 SSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVV-SGPG-------------VQSTPLLSSE 284

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FY++ L  + VG++ +K   S L     G G +I+DSG+T T +    F  ++   
Sbjct: 285 TMSSFYFLTLEAMSVGNERIKFGDSSL---GTGEGNIIIDSGTTLTIVPDDFFSNLSTA- 340

Query: 373 IRQMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
              +GN       E  SG L  C+  S    + +P +   F  GA + L P N F  V +
Sbjct: 341 ---VGNQVEGRRAEDPSGFLSVCY--SATSDLKVPAITAHFT-GADVKLKPINTFVQVSD 394

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +V+CL  F    +G +       I G+    NF +E+++      F    C
Sbjct: 395 DVVCLA-FASTTSGIS-------IYGNVAQMNFLVEYNIQGKSLSFKPTDC 437


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 120/417 (28%), Positives = 176/417 (42%), Gaps = 83/417 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR----IPAFIP 157
           G Y + L  GTP +   P I DTGS L W         + CN PN   +      P +  
Sbjct: 57  GQYFVELRVGTPAK-KFPLIVDTGSDLTW---------IQCNPPNTTANSSSPPAPWYDK 106

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TA 212
             SSS + I C + +C ++  P + S C      + T P  C      Y  G+     T 
Sbjct: 107 SSSSSYREIPCTDDECQFLPAP-IGSSC------SITSPSPC-----DYTYGYSDQSRTT 154

Query: 213 GLLLSETLRFPSKT---------------VPNFLAGCSILSDRQ----PAGIAGFGRSSE 253
           G+L  ET+   S+                + N   GCS  S        +G+ G G+   
Sbjct: 155 GILAYETISMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPI 214

Query: 254 SLPSQLGLKK----FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
           SL +Q         FSYCL+      +  SS LV+      G +    L++TP  +NP  
Sbjct: 215 SLATQTRHTALGGIFSYCLVD-YLRGSNASSFLVM------GRTHWRKLAHTPIVRNPAA 267

Query: 310 SSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
            S     FYYV +  + V  K V  I  S      DGN G I DSG+T +++  P +  V
Sbjct: 268 QS-----FYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKV 322

Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
                  +    RA ++ +  G   C++++ +    +P+L ++F+GGA M LP  NY  L
Sbjct: 323 LGALNASI-YLPRAQEIPE--GFELCYNVT-RMEKGMPKLGVEFQGGAVMELPWNNYMVL 378

Query: 429 VGNEVLCLIL---FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           V   V C+ L    T N +          ILG+   Q+ ++E+DLA  R GF    C
Sbjct: 379 VAENVQCVALQKVTTTNGSN---------ILGNLLQQDHHIEYDLAKARIGFKWSPC 426


>gi|383130052|gb|AFG45746.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 5/152 (3%)

Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           GD   P    L+YTPF  N   SSS +  FYY+ LR + +G K + +P       + GNG
Sbjct: 5   GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G I+DSG+TFT      ++ +   F  Q+G + RA++VE ++G+R C+++SG   V LP+
Sbjct: 65  GTIIDSGTTFTIFNEEFYKNITAAFSSQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123

Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
               FKGG+ M LP  NYF+  V  + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 120/400 (30%), Positives = 169/400 (42%), Gaps = 66/400 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ  +  + DTGS L+W  C     C  C     DP     F P  S+S 
Sbjct: 102 YVVDLAIGTPPQPVSALL-DTGSDLIWTQCAP---CASC-LAQPDP----LFAPGESASY 152

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
           + + C    CS I          GC       P  C +Y   YG G  T G+  +E   F
Sbjct: 153 EPMRCAGQLCSDILH-------HGCE-----MPDTC-TYRYNYGDGTMTMGVYATERFTF 199

Query: 223 PSKTVPNFLA-----GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---- 270
            S      +      GC   ++ S    +GI GFGR+  SL SQL +++FSYCL S    
Sbjct: 200 TSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYGSG 259

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           RK       S L+     GS      G +  P    P+  S     FYYV L  + VG++
Sbjct: 260 RK-------STLLF----GSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGAR 308

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            ++IP S      DG+GGVIVDSG+  T + G +   V + F RQ      A     + G
Sbjct: 309 RLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAF-RQQLRLPFANGGNPEDG 367

Query: 391 LRPCFDI-------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
           +  CF +       S    V +P ++  F+  A + LP  NY      +  LCL+L    
Sbjct: 368 V--CFLVPAAWRRSSSTSQVPVPRMVFHFQ-DADLDLPRRNYVLDDHRKGRLCLLLADSG 424

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             G          +G+   Q+  + +DL  +   FA  +C
Sbjct: 425 DDGST--------IGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/428 (26%), Positives = 195/428 (45%), Gaps = 53/428 (12%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R R ++ + +      N+ ++ +   + + +++ +   Y +++  G+    +   I DTG
Sbjct: 28  RVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLN-YIVTMGLGSK---NMTVIIDTG 83

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC-SWIFGPNVESR 184
           S L W  C     C +         + P F P  SSS Q + C +  C S  F       
Sbjct: 84  SDLTWVQCEPCMSCYN--------QQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135

Query: 185 CKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR--- 240
           C   +P   TC     +Y++ YG G +T G L  E L F   +V +F+ GC   +     
Sbjct: 136 CGSSNP--STC-----NYVVNYGDGSYTNGELGVEALSFGGVSVSDFVFGCGRNNKGLFG 188

Query: 241 QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG 297
             +G+ G GRS  SL SQ        FSYCL +    +A  S +LV+        +  P 
Sbjct: 189 GVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---EAGSSGSLVMGNESSVFKNANP- 244

Query: 298 LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTF 357
           ++YT    NP         FY + L  I VG   +K P S+      GNGG+++DSG+  
Sbjct: 245 ITYTRMLSNP-----QLSNFYILNLTGIDVGGVALKAPLSF------GNGGILIDSGTVI 293

Query: 358 TFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAK 417
           T +   +++A+  EF+++   +  A      S L  CF+++G   V +P + L+F+G A+
Sbjct: 294 TRLPSSVYKALKAEFLKKFTGFPSAPGF---SILDTCFNLTGYDEVSIPTISLRFEGNAQ 350

Query: 418 MALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
           + +     F +V  +   +CL L + + A          I+G++Q +N  + +D    + 
Sbjct: 351 LNVDATGTFYVVKEDASQVCLALASLSDA------YDTAIIGNYQQRNQRVIYDTKQSKV 404

Query: 476 GFAKQKCA 483
           GFA++ C+
Sbjct: 405 GFAEEPCS 412


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/390 (28%), Positives = 161/390 (41%), Gaps = 77/390 (19%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPPQ       DTGS L+W  C     C D          +P F P  SS+ 
Sbjct: 89  YLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFD--------QALPYFDPSTSSTL 139

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF- 222
            L  C +            + C+G                        A L  S+   F 
Sbjct: 140 SLTSCDS------------TLCQGLP---------------------VASLPRSDKFTFV 166

Query: 223 -PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
               +VP    GC + ++        GIAGFGR   SLPSQL +  FS+C  +       
Sbjct: 167 GAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTIT---GA 223

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
           + S ++LD       +    +  TP  +NP   +     FYY+ L+ I VGS  + +P S
Sbjct: 224 IPSTVLLDLPADLFSNGQGAVQTTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPES 278

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                 +G GG I+DSG+  T +   ++  V   F  Q+        V   +   P F +
Sbjct: 279 EFAL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCL 332

Query: 398 SG--KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGP 452
           S   +   Y+P+L+L F+ GA M LP ENY   V   G+ +LCL +            G 
Sbjct: 333 SAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAIIEG---------GE 382

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              +G+FQ QN ++ +DL N +  F   +C
Sbjct: 383 VTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 112/393 (28%), Positives = 167/393 (42%), Gaps = 53/393 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP +     IFDTGS L W  C     CV   +      + P F P  S 
Sbjct: 152 GNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQP---CVKSCYAQ----QQPIFDPSASK 203

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   I C +  CS +   +      GCS  N         Y +QYG   FT G    +TL
Sbjct: 204 TYSNISCTSTACSGL--KSATGNSPGCSSSNCV-------YGIQYGDSSFTVGFFAKDTL 254

Query: 221 RFPSKTV-PNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
                 V   F+ GC   ++R    + AG+ G GR   S+  Q   K    FSYCL + +
Sbjct: 255 TLTQNDVFDGFMFGCG-QNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSR 313

Query: 273 FDDAPVSSNLVLDTGPGSGDSKT--PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
             +     +L    G G   SK    G+++TPF      +SS    FY++ +  I VG K
Sbjct: 314 GSNG----HLTFGNGNGVKTSKAVKNGITFTPF------ASSQGATFYFIDVLGISVGGK 363

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I      P    N G I+DSG+  T +   ++ ++   F + M  Y  A  +   S 
Sbjct: 364 ALSIS-----PMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPAL---SL 415

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
           L  C+D+S   S+ +P++   F G A + L P       G   +CL  F  N     +G 
Sbjct: 416 LDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGILITNGASQVCLA-FAGNGDDDTIG- 473

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               I G+ Q Q   + +D+A  + GF  + C+
Sbjct: 474 ----IFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 123/404 (30%), Positives = 171/404 (42%), Gaps = 69/404 (17%)

Query: 94  TPLSVHSYGGYSISLSFGTPPQASTPFIF--DTGSSLVWFPCT-SRYRCVDCNFPNVDPS 150
           TP +    G Y   +  GTP +   P+I   DTGSSL W  C+  R  C           
Sbjct: 107 TPGTSVGVGNYVTRMGLGTPAK---PYIMVVDTGSSLTWLQCSPCRVSC--------HRQ 155

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LG 209
             P F PK SSS   + C +P+C  +    +      CSP N         Y   YG   
Sbjct: 156 SGPVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAV--CSPSNVCI------YQASYGDSS 207

Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---K 263
           F+ G L  +T+ F + +VPNF  GC   ++    + AG+ G  R+  SL  QL       
Sbjct: 208 FSVGYLSKDTVSFGANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYS 267

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           FSYCL S        SS+  L  G   PG       G SYTP   N +  S      Y++
Sbjct: 268 FSYCLPS-------TSSSGYLSIGSYNPG-------GYSYTPMVSNTLDDS-----LYFI 308

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNY 379
            L  + V  K + +  S        +   I+DSG+  T +   ++ A++K     M G+ 
Sbjct: 309 SLSGMTVAGKPLAVSSSEYT-----SLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGST 363

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            RAA     S L  CF+    K   +P + + F GGA + L   N    V     CL   
Sbjct: 364 KRAA---AYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLVDVDGATTCL--- 417

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              A  PA  R  AII G+ Q Q F + +D+ ++R GFA   C+
Sbjct: 418 ---AFAPA--RSAAII-GNTQQQTFSVVYDVKSNRIGFAAAGCS 455


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 124/412 (30%), Positives = 172/412 (41%), Gaps = 72/412 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y ++LS GTPP      I DTGS L W    PC   Y           P + P F P 
Sbjct: 78  GEYMMNLSIGTPPFPILA-IADTGSDLTWLQSKPCDQCY-----------PQKGPIFDPS 125

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
            S++   + C    C+ +            S R+ T P  C  Y   YG   +T G L S
Sbjct: 126 NSTTFHKLPCTTAPCNAL----------DESARSCTDPTTC-GYTYSYGDHSYTTGYLAS 174

Query: 218 ETLRFPSKTVP--NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
           +T+   + +V   N   GC   +    D Q +GI G G  + S  SQLG    KKFSYCL
Sbjct: 175 DTVTVGNASVQIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCL 234

Query: 269 L------SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFY-KNPVGSSSAFGEFYY 319
           L      S +  D+P +S +V    P    S T G+ +  TP   K P         +YY
Sbjct: 235 LPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEP-------STYYY 287

Query: 320 VGLRQIIVGSKHV--------KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
           + +  I VG K +           Y      S   G +I+DSG+T TF+E   + A+   
Sbjct: 288 LTIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAA 347

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
            + ++    R  DV K S    CF  SGK+ V LP + + F+GGA + L P N F     
Sbjct: 348 LVEEI-KMERVNDV-KNSMFSLCFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAEE 404

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            ++C  +   N  G         I G+    NF + +DL      F    C+
Sbjct: 405 GLVCFTMLPTNDVG---------IYGNLAQMNFVVGYDLGKRTVSFLPADCS 447


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 124/435 (28%), Positives = 182/435 (41%), Gaps = 74/435 (17%)

Query: 71  KTKTKPK--TKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSL 128
           K K KPK  ++ + +       +++TP        Y      GTPPQ +     D  +  
Sbjct: 72  KPKPKPKGHSRHTFVPIAAGRQILRTP-------SYVARARLGTPPQ-TLLVAIDPSNDA 123

Query: 129 VWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGC 188
            W PC++   C+ C       +  P+F P +SS+ + + C  P+C+ +            
Sbjct: 124 AWVPCSA---CLGC----APGASSPSFDPTQSSTYRPVRCGAPQCAQV------------ 164

Query: 189 SPRNKTCPL---ACPSYLLQYGLGFTAGLLLSETLRFPSK---TVPN--FLAGCSIL--- 237
            P   +CP    A  ++ L Y       +L  + L         VP+  +  GC  +   
Sbjct: 165 PPATPSCPAGPGASCAFNLSYASSTLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTG 224

Query: 238 --SDRQPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNL--VLDTGPGS 290
                 P G+ GFGR   S  SQ        FSYCL S K      SSN    L  GP  
Sbjct: 225 SGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYK------SSNFSGTLRLGPAG 278

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL-VPGSDGNGGV 349
              +   +  TP   NP   S      YYV +  + V  K V IP S L +  + G GG 
Sbjct: 279 QPRR---IKTTPLLSNPHRPS-----LYYVAMVGVRVNGKAVPIPASALALDAATGRGGT 330

Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
           IVD+G+ FT +  P + A+   F R +     A       G   C+ ++G KSV  P + 
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGV----SAPAAPALGGFDTCYYVNGTKSV--PAVA 384

Query: 410 LKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLE 467
             F GGA++ LP EN   +     V CL +    AAGP+ G    + +L   Q QN  + 
Sbjct: 385 FVFAGGARVTLPEENVVISSTSGGVACLAM----AAGPSDGVNAGLNVLASMQQQNHRVV 440

Query: 468 FDLANDRFGFAKQKC 482
           FD+ N R GF+++ C
Sbjct: 441 FDVGNGRVGFSRELC 455


>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 87/152 (57%), Gaps = 5/152 (3%)

Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           GD   P    L+YTPF  N   SSS +  FYY+ LR + +G K + +P       + GNG
Sbjct: 5   GDKALPTAMSLNYTPFLINTKASSSGYNTFYYIDLRGVSIGRKRLNLPSKLFSFDNKGNG 64

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G I+DSG+TFT      ++ +   F  Q+G + RA++VE ++G+R C++ SG   V LP+
Sbjct: 65  GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNASGVDHVLLPD 123

Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
               FKGG+ M LP  NYF+  V  + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 113/399 (28%), Positives = 160/399 (40%), Gaps = 54/399 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   ++ GTP   +     DT S L W  C    RC         P   P F P+ S+
Sbjct: 132 GEYMAKIAVGTPAVQAL-LALDTASDLTWLQCQPCRRCY--------PQSGPVFDPRHST 182

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TAGLLL 216
           S   +    P C  +       R  G   +  TC      Y +QYG G      + G L+
Sbjct: 183 SYGEMNYDAPDCQAL------GRSGGGDAKRGTC-----IYTVQYGDGHGSTSTSVGDLV 231

Query: 217 SETLRFPSKTVPNFLA-GCSI----LSDRQPAGIAGFGRSSESLPSQLGL----KKFSYC 267
            ETL F       +L+ GC      L     AGI G GR   S+P Q+        FSYC
Sbjct: 232 EETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYC 291

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
           L+   F   P S +  L  G G+ D+  P  S+TP   N          FYYV L  + V
Sbjct: 292 LV--DFISGPGSPSSTLTFGAGAVDTSPPA-SFTPTVLN-----QNMPTFYYVRLIGVSV 343

Query: 328 GSKHVKIP----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           G   V++P        +    G GGVI+DSG+T T +  P + A    F     +  + +
Sbjct: 344 GG--VRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVS 401

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
                     C+ + G+  V +P + + F GG +++L P+NY   V +       F    
Sbjct: 402 TGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGT- 460

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                G     ++G+   Q F + +DLA  R GFA   C
Sbjct: 461 -----GDRSVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 135/483 (27%), Positives = 197/483 (40%), Gaps = 82/483 (16%)

Query: 37  TPLSTKHYLHHSDSD-PLKILHSLASSSLSRARHLKTKTKPKTK------------DSNI 83
           TP  T+     +  D  + + H  A   LSR R L  +   ++K             SN 
Sbjct: 22  TPAPTEGAFFFAGGDVRVDLTHVDAGKQLSR-RELVRRAVQRSKARAAALSVARLGGSNK 80

Query: 84  GSNYSNSLIKTP-LSVHSYGG--YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
           G+   +   + P L V   G   Y + L+ GTPPQ  +  + DTGS L+W  C     C 
Sbjct: 81  GARQQDQNQQQPGLPVRPSGDLEYLVDLAVGTPPQPVSALL-DTGSDLIWTQCAP---CA 136

Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
            C  P  DP     F P  SSS + + C    C+ I    +   C+    R  TC     
Sbjct: 137 SC-LPQPDP----IFSPGASSSYEPMRCAGELCNDI----LHHSCQ----RPDTC----- 178

Query: 201 SYLLQYGLGFTA-GLLLSETLRF--------PSKTVPNFLAGCSILSD---RQPAGIAGF 248
           +Y   YG G T  G+  +E   F         +K       GC  ++       +GI GF
Sbjct: 179 TYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGF 238

Query: 249 GRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--- 305
           GR+  SL SQL +++FSYCL    +     S+ L      G  D+ T  +  T   +   
Sbjct: 239 GRAPLSLVSQLAIRRFSYCLT--PYASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQ 296

Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
           NP         FYYV    + VG++ ++IP S      DG+GG IVDSG+  T    P+ 
Sbjct: 297 NPT--------FYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVL 348

Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS---VYLPELILKFKGGAKMALPP 422
             V + F  Q+     AA+         CF  +  +      +P ++   + GA + LP 
Sbjct: 349 AEVVRAFRSQL-RLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQ-GADLDLPR 406

Query: 423 ENYF---ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
            NY       GN  LCL+L     +G          +G+F  Q+  + +DL  D   FA 
Sbjct: 407 RNYVLDDQRKGN--LCLLLADSGDSG--------TTIGNFVQQDMRVLYDLEADTLSFAP 456

Query: 480 QKC 482
            +C
Sbjct: 457 AQC 459


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 165/392 (42%), Gaps = 75/392 (19%)

Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
            GTPP        + G+ L+W        C +  FP  +P      +P  S       C 
Sbjct: 1   MGTPPNP-VKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFAS-------CG 52

Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKT 226
           +PK    F PN            +TC      Y   YG    T G L  +   F     +
Sbjct: 53  SPK----FWPN------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGAS 91

Query: 227 VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
           VP    GC + ++        GIAGFGR   SLPSQL +  FS+C  +       + S +
Sbjct: 92  VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTIT---GAIPSTV 148

Query: 283 VLD-------TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           +LD        G G+  + TP + Y     NP          YY+ L+ I VGS  + +P
Sbjct: 149 LLDLPADLFSNGQGAVQT-TPLIQYAKNEANPT--------LYYLSLKGITVGSTRLPVP 199

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S     ++G GG I+DSG++ T +   +++ V  EF  Q+            +G   CF
Sbjct: 200 ESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVVPGNATGHYTCF 255

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGPALGRG 451
               +    +P+L+L F+ GA M LP ENY   V    GN ++CL          A+ +G
Sbjct: 256 SAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL----------AINKG 304

Query: 452 -PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               I+G+FQ QN ++ +DL N+   F   +C
Sbjct: 305 DETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 133/490 (27%), Positives = 200/490 (40%), Gaps = 80/490 (16%)

Query: 24  GAGSSAATVTV-----PLTPLSTKHYLHHSDSDPL--------KILHSLASSSLSRARHL 70
           GA SS   +T+     P +PL+  H    S  D L         I H +++++  R    
Sbjct: 79  GATSSGTRMTIVHRHGPCSPLADAHGKPPSHEDILAADQNRAESIQHRVSTTATGRGNPK 138

Query: 71  KTKTKPKTKDSNIGSNYSNSLIKTPLSVH--------SYGGYSISLSFGTPPQASTPFIF 122
           +++  P  +     +    + + +  +            G Y +++  GTP    T  +F
Sbjct: 139 RSRRAPSRRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYT-VVF 197

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           DTGS   W  C     CV   +      R   F P RSS+   I C  P CS     +++
Sbjct: 198 DTGSDTTWVQCQP---CVVVCYEQ----REKLFDPARSSTYANISCAAPACS-----DLD 245

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR 240
           +R  GCS  N         Y +QYG G ++ G    +TL   S   V  F  GC   ++ 
Sbjct: 246 TR--GCSGGNCL-------YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEG 296

Query: 241 ---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
              + AG+ G GR   SLP Q   K    F++CL +R       S    LD GPGS  + 
Sbjct: 297 LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS------SGTGYLDFGPGSPAAA 350

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
              L+      N          FYYVG+  I VG + + IP S          G IVDSG
Sbjct: 351 GARLTTPMLTDNGP-------TFYYVGMTGIRVGGQLLSIPQSVFT-----TAGTIVDSG 398

Query: 355 STFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           +  T +    + ++   F   M    Y +A  V   S L  C+D +G   V +P + L F
Sbjct: 399 TVITRLPPAAYSSLRSAFASAMAARGYKKAPAV---SLLDTCYDFTGMSQVAIPTVSLLF 455

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
           +GGA++ +             +CL  F  N  G  +G     I+G+ QL+ F + +D+  
Sbjct: 456 QGGARLDVDASGIMYAASVSQVCL-GFAANEDGGDVG-----IVGNTQLKTFGVAYDIGK 509

Query: 473 DRFGFAKQKC 482
              GF+   C
Sbjct: 510 KVVGFSPGAC 519


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 172/387 (44%), Gaps = 52/387 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS + W  C     C +C +   DP     F P  SS
Sbjct: 162 GEYFSRIGVGTPAK-EMYVVLDTGSDVNWIQC---LPCSEC-YQQSDP----IFDPTSSS 212

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           + + + C +PKC+ +           C  R+  C      Y + YG G FT G   ++T+
Sbjct: 213 TFKSLTCSDPKCASL-------DVSAC--RSNKCL-----YQVSYGDGSFTVGNYATDTV 258

Query: 221 RF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
            F  S  V +   GC   ++      AG+ G G  + S+ +Q+  K FSYCL+ R   D+
Sbjct: 259 TFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCLVDR---DS 315

Query: 277 PVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
             SS+L  ++   G+GD+  P L            +S    FYYVGL    VG + V IP
Sbjct: 316 AKSSSLDFNSVQIGAGDATAPLLR-----------NSKMDTFYYVGLSGFSVGGQQVSIP 364

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S     + G GGVI+D G+  T ++   + ++   F++   ++ +       S    C+
Sbjct: 365 SSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKG--TSPISLFDTCY 422

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           D S   +V +P +   F GG  + LP +NY   + +       F   ++  +       I
Sbjct: 423 DFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLS-------I 475

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +G+ Q Q   + +DLAN+  G +  KC
Sbjct: 476 IGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 132/434 (30%), Positives = 179/434 (41%), Gaps = 70/434 (16%)

Query: 59  LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
           LA SSL+R+    T+T+    +           + TP+S  +  G     S     Q S 
Sbjct: 119 LALSSLNRSDLYPTETELLRPED----------LSTPVSSGTAQGSGEYFSRVGVGQPSK 168

Query: 119 PF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
           PF  + DTGS + W  C     C DC +   DP     F P  SSS   + C   +C   
Sbjct: 169 PFYMVLDTGSDVNWLQCKP---CSDC-YQQSDP----IFDPTASSSYNPLTCDAQQCQ-- 218

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCS 235
                +     C  RN  C      Y + YG G FT G  ++ET+ F + +V     GC 
Sbjct: 219 -----DLEMSAC--RNGKCL-----YQVSYGDGSFTVGEYVTETVSFGAGSVNRVAIGCG 266

Query: 236 ILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
             ++      AG+ G G    SL SQ+    FSYCL+ R   D+  SS L  ++ P  GD
Sbjct: 267 HDNEGLFVGSAGLLGLGGGPLSLTSQIKATSFSYCLVDR---DSGKSSTLEFNS-PRPGD 322

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
           S        P  KN          FYYV L  + VG + V +P         G GGVIVD
Sbjct: 323 SVV-----APLLKN-----QKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVD 372

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T +    + +V   F R+  N   A   E  +    C+D+S  +SV +P +   F
Sbjct: 373 SGTAITRLRTQAYNSVRDAFKRKTSNLRPA---EGVALFDTCYDLSSLQSVRVPTVSFHF 429

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEF 468
            G    ALP +NY   V           D A        P      I+G+ Q Q   + F
Sbjct: 430 SGDRAWALPAKNYLIPV-----------DGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSF 478

Query: 469 DLANDRFGFAKQKC 482
           DLAN   GF+  KC
Sbjct: 479 DLANSLVGFSPNKC 492


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 114/418 (27%), Positives = 169/418 (40%), Gaps = 66/418 (15%)

Query: 75  KPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPC 133
           KP++  ++  SN  N     PL +   GG Y +  S GTPPQ  T  + DTGS L+W  C
Sbjct: 72  KPQSSSASQLSN--NDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTA-LADTGSDLIWTKC 128

Query: 134 TSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNK 193
            +                  ++ P  SS+   + C +  C+ +   ++      C+    
Sbjct: 129 DAG--------GGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLAR----CAAGGA 176

Query: 194 TCPLACPSYLLQYGLG----FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIA 246
            C      Y   YGLG    FT G L SET       VP    GC+   +    + AG+ 
Sbjct: 177 EC-----DYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGFGCTTALEGDYGEGAGLV 231

Query: 247 GFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS-SNLVLDTGPGSGDSKTPGLSYTPFYK 305
           G GR   SL SQL    F YCL +     +P+    L   TG G+G   T  L+ T FY 
Sbjct: 232 GLGRGPLSLVSQLDAGTFMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYA 291

Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
                         V LR I +GS                   V+ DSG+T T++  P +
Sbjct: 292 --------------VNLRSITIGSATTAGVGGPGG--------VVFDSGTTLTYLAEPAY 329

Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY 425
                 F+ Q  + +    VE + G   C++      + +P ++L F GGA MALP  NY
Sbjct: 330 TEAKAAFLSQTTSLT---PVEGRYGFEACYEKPDSARL-IPAMVLHFDGGADMALPVANY 385

Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              V + V+C +          + R P++ I+G+    N+ +  D+      F    C
Sbjct: 386 VVEVDDGVVCWV----------VQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANC 433


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 156/374 (41%), Gaps = 54/374 (14%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + +  G PPQ     IFD  +   W  C    +C D       P  I  F P +SSS  L
Sbjct: 189 VQIGVGGPPQKFY-MIFDLQTDFTWLQCQPCIKCYD------QPDSI--FDPSQSSSYTL 239

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS 224
           + C+   C+ +  PN      G    N T           Y  G  T G+L++ET+ F S
Sbjct: 240 LSCETKHCNLL--PNSSCSDDGYCRYNIT-----------YKDGTNTEGVLINETVSFES 286

Query: 225 K-TVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
              V     GCS   ++ P     G  G GR S S PS++     SYCL+  K  D   S
Sbjct: 287 SGWVDRVSLGCSN-KNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESK--DGYSS 343

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           S L  ++ P SG  K   L      +NP   +      YYVGL+ I VG + + +P S  
Sbjct: 344 STLEFNSPPCSGSVKAKLL------QNPKAEN-----LYYVGLKGIKVGGEKIDVPNSTF 392

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
                GNGG+IV S S  T +E   +  V   F+ +  +  R     +      C+++S 
Sbjct: 393 TIDPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQ---FDTCYNLSS 449

Query: 400 KKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
             +V LP L  +   G    LP E+Y +A+  N   C       A  P+  +G   ILG 
Sbjct: 450 NNTVELPILEFEVNDGKSWLLPKESYLYAVDKNGTFCF------AFAPS--KGSFSILGT 501

Query: 459 FQLQNFYLEFDLAN 472
            Q     + FDL N
Sbjct: 502 LQQYGTRVTFDLVN 515


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 157/387 (40%), Gaps = 58/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTPPQ +     D      W PC     CV C+           F   +S++ 
Sbjct: 35  YIVKAKVGTPPQ-TLLMALDNSYDAAWIPCKG---CVGCSST--------VFNTVKSTTF 82

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + +GC  P+C  +  P     C G      TC     ++   YG       L  +T+   
Sbjct: 83  KTLGCGAPQCKQVPNP----ICGG-----STC-----TWNTTYGSSTILSNLTRDTIALS 128

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
              VP +  GC   +  S   P G+ GFGR   S  SQ   L    FSYCL S  F    
Sbjct: 129 MDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPS--FRTLN 186

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP     + P +  TP  KNP  SS      YYV L  I VG K V IP S
Sbjct: 187 FSGSLRL--GP---VGQPPRIKTTPLLKNPRRSS-----LYYVKLNGIRVGRKIVDIPRS 236

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P + AV  EF +++GN    A V    G   C+ +
Sbjct: 237 ALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGN----ATVSSLGGFDTCYSV 292

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIIL 456
                  +P  I     G  + +PPEN        V  CL +    AA P        ++
Sbjct: 293 P-----IVPPTITFMFSGMNVTMPPENLLIHSTAGVTSCLAM----AAAPDNVNSVLNVI 343

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
              Q QN  + FD+ N R G A+++C+
Sbjct: 344 ASMQQQNHRILFDVPNSRLGVAREQCS 370


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 172/386 (44%), Gaps = 52/386 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP+     + DTGS + W  C     C DC +   DP     F P  SS
Sbjct: 153 GEYFSRVGIGSPPK-HVYMVVDTGSDVNWVQCAP---CADC-YQQADP----IFEPSFSS 203

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C+  +C  +   +V S C     RN +C      Y + YG G +T G   +ET+
Sbjct: 204 SYAPLTCETHQCKSL---DV-SEC-----RNDSCL-----YEVSYGDGSYTVGDFATETI 249

Query: 221 RFP-SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
               S ++ N   GC   ++      AG+ G G  S S PSQ+    FSYCL++R  D A
Sbjct: 250 TLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSA 309

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
              S L  +       S  P  S T     P+  ++    FYY+G+  I VG + + IP 
Sbjct: 310 ---STLEFN-------SPIPSHSVTA----PLLRNNQLDTFYYLGMTGIGVGGQMLSIPR 355

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
           S       GNGG+IVDSG+  T ++  ++ ++   F+R   +    + V        C+D
Sbjct: 356 SSFEVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL---FDTCYD 412

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
           +S + SV +P +   F  G  +ALP +NY   V +       F    +  +       I+
Sbjct: 413 LSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALS-------II 465

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
           G+ Q Q   + +DL+N   GF+   C
Sbjct: 466 GNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 115/406 (28%), Positives = 177/406 (43%), Gaps = 81/406 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y I +S GTPP+     + DTGS ++W  C     CV C +   D      F P +SS
Sbjct: 35  GEYFIRVSVGTPPRG-MYLVMDTGSDILWLQCAP---CVSC-YHQCDE----VFDPYKSS 85

Query: 162 SSQLIGCQNPKC-SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           +   +GC + +C +   G  V ++C                Y + YG G F+ G   ++ 
Sbjct: 86  TYSTLGCNSRQCLNLDVGGCVGNKCL---------------YQVDYGDGSFSTGEFATDA 130

Query: 220 LRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----------------SLPSQLGLK 262
           +   S +      G  ++ ++ P G    G  +E                 S P+Q+  +
Sbjct: 131 VSLNSTS-----GGGQVVLNKIPLGC---GHDNEGYFVGAAGLLGLGKGPLSFPNQINSE 182

Query: 263 ---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP--GLSYTPFYKNPVGSSSAFGEF 317
              +FSYCL  R  D    SS +        GD+  P  G+ +TP   N   S+     F
Sbjct: 183 NGGRFSYCLTGRDTDSTERSSLIF-------GDAAVPPAGVRFTPQASNLRVST-----F 230

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           YY+ +  I VG   + IP S     S GNGGVI+DSG++ T ++   + ++ + F     
Sbjct: 231 YYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAF---RA 287

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCL 436
             S      + S    C+++S   SV +P + L F+GGA + LP  NY   V N    CL
Sbjct: 288 GTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCL 347

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                  AG     GP+II G+ Q Q F + +D  +++ GF   +C
Sbjct: 348 AF-----AGTT---GPSII-GNIQQQGFRVIYDNLHNQVGFVPSQC 384


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 116/398 (29%), Positives = 167/398 (41%), Gaps = 61/398 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSS 162
           Y + L+ GTPPQ  T  + DTGS L+W  C +   C  C   P+      P F P+ SSS
Sbjct: 98  YVLDLAVGTPPQPITALL-DTGSDLIWTQCDT---CTACLRQPD------PLFSPRMSSS 147

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLR 221
            + + C    C    G  +   C     R  TC     +Y   YG G T  G   +E   
Sbjct: 148 YEPMRCAGQLC----GDILHHSCV----RPDTC-----TYRYSYGDGTTTLGYYATERFT 194

Query: 222 FPS-----KTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
           F S     ++VP    GC  +   S    +GI GFGR   SL SQL +++FSYCL     
Sbjct: 195 FASSSGETQSVPLGF-GCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL----- 248

Query: 274 DDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
              P +S+       GS  D      +  P    P+  S+    FYYV    + VG++ +
Sbjct: 249 --TPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRL 306

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
           +IP S      DG+GGVI+DSG+  T     +   V + F  Q+     A       G+ 
Sbjct: 307 RIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQL-RLPFANGSSPDDGV- 364

Query: 393 PCFDISG--------KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
            CF             + V +P ++  F+ GA + LP ENY           +L  D+  
Sbjct: 365 -CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGD 422

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             A        +G+F  Q+  + +DL  +   FA  +C
Sbjct: 423 DGA-------TIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 119/402 (29%), Positives = 167/402 (41%), Gaps = 61/402 (15%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + L+ GTPP    PF+   DTGS L W  C     C   + P  D +   +F P    
Sbjct: 95  YLMELAIGTPP---VPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSP---- 147

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLG-FTAGLLLSET 219
               + C +  C  I+           S RN T     P  Y   Y  G ++AG+L +ET
Sbjct: 148 ----VPCASATCLPIWR----------SSRNCTATTTSPCRYRYAYDDGAYSAGVLGTET 193

Query: 220 LRF---------PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
           L F         P  +V     GC + +        G  G GR S SL +QLG+ KFSYC
Sbjct: 194 LTFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYC 253

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           L    F +  + S ++  +        T G   +  TP  + P   S      YYV L  
Sbjct: 254 L--TDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSR-----YYVSLEG 306

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I +G   + IP        DG+GG+IVDSG+ FT     L E+  +  +  +        
Sbjct: 307 ISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFTV----LVESAFRVVVNHVAGVLNQPV 362

Query: 385 VEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTD 441
           V   S   PCF  +  +     +P+++L F GGA M L  +NY +    +   CL     
Sbjct: 363 VNASSLDSPCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCL----- 417

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           N AG     G   ILG+FQ QN  + FD+   +  F    C+
Sbjct: 418 NIAGAPSAYGS--ILGNFQQQNIQMLFDITVGQLSFVPTDCS 457


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 166/392 (42%), Gaps = 63/392 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+  GTP +     IFDTGS L W  C     C DC +   D    P F P  SS
Sbjct: 147 GNYVVSVGLGTPAK-QYAVIFDTGSDLSWVQCKP---CADC-YEQQD----PLFDPSLSS 197

Query: 162 SSQLIGCQNPKCSWI--FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           +   + C  P+C  +   G + +SRC+               Y +QYG    T G L+ +
Sbjct: 198 TYAAVACGAPECQELDASGCSSDSRCR---------------YEVQYGDQSQTDGNLVRD 242

Query: 219 TLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
           TL    S T+P F+ GC   +     Q  G+ G GR   SLPSQ        F+YCL S 
Sbjct: 243 TLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSS 302

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                 +S           G +      +T        +  A   FYY+ L  I VG + 
Sbjct: 303 SSGRGYLS----------LGGAPPANAQFTAL------ADGATPSFYYIDLVGIKVGGRA 346

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           ++IP +            ++DSG+  T +    +  +   F R M  Y +A  +   S L
Sbjct: 347 IRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPAL---SIL 399

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D +G ++  +P + L F GGA ++L       +      CL  F  NA   ++   
Sbjct: 400 DTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQACLA-FAPNADDSSIA-- 456

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              ILG+ Q + F + +D+AN R GF  + C+
Sbjct: 457 ---ILGNTQQKTFAVAYDVANQRIGFGAKGCS 485


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 118/435 (27%), Positives = 184/435 (42%), Gaps = 58/435 (13%)

Query: 71  KTKTKPKTKDSNIG-SNYSNSLIKTP--------LSVHSYGGYSISLSFGTPPQASTPFI 121
            T  K   ++S I  +N +N+ +K+P        LS        + L  GTPPQ   P +
Sbjct: 55  NTALKMMLRNSLIANTNNNNTQLKSPPSSPYNYKLSFKYSMALIVDLPIGTPPQVQ-PMV 113

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS-WIFGPN 180
            DTGS L W  C  +            P    +F P  SS+   + C +P C   I    
Sbjct: 114 LDTGSQLSWIQCHKK--------APAKPPPTASFDPSLSSTFSTLPCTHPVCKPRIPDFT 165

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-SKTVPNFLAGCSILSD 239
           + + C     +N+ C     SY    G  +  G L+ E   F  S   P  + GC+  S 
Sbjct: 166 LPTSCD----QNRLCHY---SYFYADGT-YAEGNLVREKFTFSRSLFTPPLILGCATEST 217

Query: 240 RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR--KFDDAPVSSNLVLDTGPGSGDSK-TP 296
             P GI G  R   S  SQ  + KFSYC+ +R  +    P  S   L   P S   +   
Sbjct: 218 -DPRGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGS-FYLGHNPNSNTFRYIE 275

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
            L++    + P     A    Y V L+ I +G + + I  +     + G+G  ++DSGS 
Sbjct: 276 MLTFARSQRMPNLDPLA----YTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSE 331

Query: 357 FTFMEGPLFEAVAKEFIRQMG-------NYSRAADVEKKSGLRPCFDISG-KKSVYLPEL 408
           FT++    ++ V  E +R +G        Y   AD+        CFD +  +    + ++
Sbjct: 332 FTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADM--------CFDGNAIEIGRLIGDM 383

Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
           + +F+ G ++ +P E   A V   V C+ +   +  G A     + I+G+F  QN ++EF
Sbjct: 384 VFEFEKGVQIVVPKERVLATVEGGVHCIGIANSDKLGAA-----SNIIGNFHQQNLWVEF 438

Query: 469 DLANDRFGFAKQKCA 483
           DL N R GF    C+
Sbjct: 439 DLVNRRMGFGTADCS 453


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 125/425 (29%), Positives = 180/425 (42%), Gaps = 71/425 (16%)

Query: 88  SNSLIKTP--LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
           SNS  ++P  L        ++SL+ GTPPQ +   + DTGS L W         + CN  
Sbjct: 13  SNSFPRSPNKLPFRHNISLTVSLTVGTPPQ-NVSMVIDTGSELSW---------LYCNKT 62

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
               S    F   RS S + I C +  C+                R+ + P +C S  L 
Sbjct: 63  TTTTSYPTTFNQTRSISYRPIPCSSSTCT-------------NQTRDFSIPASCDSNSLC 109

Query: 206 YG-LGF-----TAGLLLSETLRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSS 252
           +  L +     + G L S+T    +  +P  + GC  S+ S     D +  G+ G  R S
Sbjct: 110 HATLSYADASSSEGNLASDTFHMGASDIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGS 169

Query: 253 ESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
            S  SQ+G  KFSYC+    F     S  L+L  G  +     P L+YTP     V  S+
Sbjct: 170 LSFVSQMGFPKFSYCISGTDF-----SGMLLL--GESNFTWAVP-LNYTPL----VQIST 217

Query: 313 AFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
               F    Y V L  I V  + + IP S   P   G G  +VDSG+ FTF+ GP + A+
Sbjct: 218 PLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTAL 277

Query: 369 AKEFIRQMGNYSRAA---DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPE 423
             EF+ Q   + R     D   +  +  C+ +   + V   LP + L F  GA+M +  E
Sbjct: 278 RSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSLVFN-GAEMTVADE 336

Query: 424 NYFALV-----GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
                V     GN+ V CL     +  G       A ++G    QN ++EFDL   R G 
Sbjct: 337 RVLYRVPGEIRGNDSVHCLSFGNSDLLGVE-----AYVIGHHHQQNVWMEFDLERSRIGL 391

Query: 478 AKQKC 482
           A+ +C
Sbjct: 392 AQVRC 396


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 166/392 (42%), Gaps = 63/392 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+  GTP +     IFDTGS L W  C     C DC +   D    P F P  SS
Sbjct: 147 GNYVVSVGLGTPAK-QYAVIFDTGSDLSWVQCKP---CADC-YEQQD----PLFDPSLSS 197

Query: 162 SSQLIGCQNPKCSWI--FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           +   + C  P+C  +   G + +SRC+               Y +QYG    T G L+ +
Sbjct: 198 TYAAVACGAPECQELDASGCSSDSRCR---------------YEVQYGDQSQTDGNLVRD 242

Query: 219 TLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
           TL    S T+P F+ GC   +     Q  G+ G GR   SLPSQ        F+YCL S 
Sbjct: 243 TLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSS 302

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                 +S           G +      +T        +  A   FYY+ L  I VG + 
Sbjct: 303 SSGRGYLS----------LGGAPPANAQFTAL------ADGATPSFYYIDLVGIKVGGRA 346

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           ++IP +            ++DSG+  T +    +  +   F R M  Y +A  +   S L
Sbjct: 347 IRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPAL---SIL 399

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D +G ++  +P + L F GGA ++L       +      CL  F  NA   ++   
Sbjct: 400 DTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQACLA-FAPNADDSSIA-- 456

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              ILG+ Q + F + +D+AN R GF  + C+
Sbjct: 457 ---ILGNTQQKTFAVTYDVANQRIGFGAKGCS 485


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 122/482 (25%), Positives = 205/482 (42%), Gaps = 71/482 (14%)

Query: 14  LLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK 73
           L++L F+ +    +++ +++ PLT LS  +     D+    +  S   S+  +  + + K
Sbjct: 5   LVVLFFSINPSQQTNSLSLSFPLTSLSLSN-----DTTSKMLYTSQLFSTTKKPNNPQNK 59

Query: 74  TKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC 133
           T   + +      YS +LI             I+L  GTPPQ + P + DTGS L W  C
Sbjct: 60  TP--SYNYKFSFKYSMALI-------------INLPIGTPPQ-TQPMVLDTGSQLSWIQC 103

Query: 134 TSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS-WIFGPNVESRCKGCSPRN 192
             +            P+   +F P  SS+  ++ C +P C   I    + + C     +N
Sbjct: 104 HKKQ----------PPT--ASFDPSLSSTFSILPCTHPLCKPRIPDFTLPTSCD----QN 147

Query: 193 KTCPLACPSYLLQYGLGFTAGLLLSETLRFP-SKTVPNFLAGCSILSDRQPAGIAGFGRS 251
           + C     SY    G  +  G L+ E   F  S + P  + GC+  S   P GI G    
Sbjct: 148 RLCHY---SYFYADGT-YAEGNLVREKFTFSRSVSTPPLILGCATEST-DPRGILGMNLG 202

Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPV-SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
             S   Q  + KFSYC+  R+       + +  L   P S   K  G+  +   + P   
Sbjct: 203 RLSFAKQSKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFD 262

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
             A    Y + +  I +  K + I  +     + G+G  ++DSGS FT++    ++ V  
Sbjct: 263 PLA----YTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSEFTYLVSEAYDKVRA 318

Query: 371 EFIRQMG-------NYSRAADVEKKSGLRPCFDISGKKSV--YLPELILKFKGGAKMALP 421
           + +R +G        Y   AD+        CFD      +   + E++ +F+ G ++ +P
Sbjct: 319 QVVRAVGPRLKKGYVYGGVADM--------CFDSVKAVEIGRLIGEMVFEFERGVEVVIP 370

Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
            E   A VG  V C+ + + +  G A     + I+G+F  QN ++EFDL   R GF K  
Sbjct: 371 KERVLADVGGGVHCVGIGSSDKLGAA-----SNIIGNFHQQNLWVEFDLVRRRVGFGKAD 425

Query: 482 CA 483
           C+
Sbjct: 426 CS 427


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 118/403 (29%), Positives = 167/403 (41%), Gaps = 71/403 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y   +  GTP +     + DTGS + W    PCT+ Y+  D             F P 
Sbjct: 14  GEYFAVVGVGTP-RRDMYLVVDTGSDITWLQCAPCTNCYKQKDA-----------LFNPS 61

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLS 217
            SSS +++ C +  C       +     GC   NK        Y   YG G FT G L++
Sbjct: 62  SSSSFKVLDCSSSLC-------LNLDVMGCL-SNKCL------YQADYGDGSFTMGELVT 107

Query: 218 ETLRF-----PSKTV-PNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FS 265
           + +       P + V  N   GC   ++      AGI G GR   S P+ L       FS
Sbjct: 108 DNVVLDDAFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFS 167

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-----GLSYTPFYKNPVGSSSAFGEFYYV 320
           YCL  R+  D    S LV       GD+  P      + + P  +NP         +YYV
Sbjct: 168 YCLPDRE-SDPNHKSTLVF------GDAAIPHTATGSVKFIPQLRNP-----RVATYYYV 215

Query: 321 GLRQIIVGSKHV-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            +  I VG   +  IP S     S GNGG I DSG+T T +E   + AV   F     + 
Sbjct: 216 QITGISVGGNLLTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHL 275

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           + AAD +       C+D +G  S+ +P +   F+G   M LPP NY   V N  +    F
Sbjct: 276 TSAADFKI---FDTCYDFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAF 332

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                  A   GP++I G+ Q Q+F + +D  + + G    +C
Sbjct: 333 -------AASMGPSVI-GNVQQQSFRVIYDNVHKQIGLLPDQC 367


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 110/400 (27%), Positives = 166/400 (41%), Gaps = 51/400 (12%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVD--------PSRIPAFIPKRSSSSQLIGCQNPK 172
           + DTGS LVW  C++      C  P V         P  +P +    S +++ + C +  
Sbjct: 77  VVDTGSDLVWTQCST------CRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDD 130

Query: 173 CSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFL 231
            +     P      +G    +  C +A       YG G   G+L ++   FPS +     
Sbjct: 131 GALCGVAPETAGCARGGGSGDDACVVAA-----SYGAGVALGVLGTDAFTFPSSSSVTLA 185

Query: 232 AGCSILSDRQP------AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
            GC   +   P      +GI G GR + SL SQL   +FSYCL +  F D    S+L + 
Sbjct: 186 FGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCL-TPYFRDTVSPSHLFVG 244

Query: 286 TGPGSGDSKTPG--------LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            G  +G     G        ++  PF KNP    S F  FYY+ L  +  G+  V +P  
Sbjct: 245 DGELAGLRAAAGGGGGGGAPVTTVPFAKNP--KDSPFSTFYYLPLVGLAAGNATVALPAG 302

Query: 338 YL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSG-L 391
                        GG ++DSGS FT +  P   A+ KE  RQ+ G+ S      K  G L
Sbjct: 303 AFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGAL 362

Query: 392 RPCF----DISGKKSVYLPELILKFK----GGAKMALPPENYFALVGNEVLCLILFTDNA 443
             C     D     +  +P L+L+F     GG ++ +P E Y+A V     C+ + +  +
Sbjct: 363 ELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSAS 422

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               L      I+G+F  Q+  + +DLAN    F    C+
Sbjct: 423 GNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 462


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 118/428 (27%), Positives = 186/428 (43%), Gaps = 55/428 (12%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
             R ++   + +T  S I  +    +  T         Y +++  G+    +   I DTG
Sbjct: 84  HVRSIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYIVTMGLGSQ---NMSVIVDTG 140

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S L W  C     C + N P   PS  P++ P        I C +  C      ++E   
Sbjct: 141 SDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQP--------ILCNSTTCQ-----SLELGA 187

Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---Q 241
            G  P       A   Y++ YG G +T+G L  E L F   +V NF+ GC   +      
Sbjct: 188 CGSDPSTS----ATCDYVVNYGDGSYTSGELGIEKLGFGGISVSNFVFGCGRNNKGLFGG 243

Query: 242 PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
            +G+ G GRS  S+ SQ        FSYCL S   D A  S +LV+    G   + TP +
Sbjct: 244 ASGLMGLGRSELSMISQTNATFGGVFSYCLPST--DQAGASGSLVMGNQSGVFKNVTP-I 300

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
           +YT    N          FY + L  I VG   + +  S     S GNGGVI+DSG+  +
Sbjct: 301 AYTRMLPNL-----QLSNFYILNLTGIDVGGVSLHVQAS-----SFGNGGVILDSGTVIS 350

Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
            +   +++A+  +F+ Q   +  A      S L  CF+++G   V +P + + F+G A++
Sbjct: 351 RLAPSVYKALKAKFLEQFSGFPSAPGF---SILDTCFNLTGYDQVNIPTISMYFEGNAEL 407

Query: 419 ALPPENYFALVGNEV--LCLIL--FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
            +     F LV  +   +CL L   +D      +G     I+G++Q +N  + +D    +
Sbjct: 408 NVDATGIFYLVKEDASRVCLALASLSDEY---EMG-----IIGNYQQRNQRVLYDAKLSQ 459

Query: 475 FGFAKQKC 482
            GFAK+ C
Sbjct: 460 VGFAKEPC 467


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 169/404 (41%), Gaps = 66/404 (16%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           ++SL+ GTPPQ+ T  + DTGS L W  C  +         N++      F P  SSS  
Sbjct: 71  TVSLTVGTPPQSVT-MVLDTGSELSWLHCKKQ--------QNINS----VFNPHLSSSYT 117

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS----YLLQYGLGFTA--GLLLSE 218
            I C +P C                 R+   P++C S    ++      FT+  G L S+
Sbjct: 118 PIPCMSPICK-------------TRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASD 164

Query: 219 TLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           T        P  + G        +   D +  G+ G  R S S  +Q+G  KFSYC+  +
Sbjct: 165 TFAISGSGQPGIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFSYCISGK 224

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                   ++ VL  G  +     P L YTP  K            Y V L  I VGSK 
Sbjct: 225 D-------ASGVLLFGDATFKWLGP-LKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKP 276

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD--VEKK 388
           +++P     P   G G  +VDSG+ FTF+ G ++ A+  EF+ Q  G  +   D     +
Sbjct: 277 LQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFE 336

Query: 389 SGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFALVG---------NEVLCLIL 438
             +  CF +     V  +P + + F+ GA+M++  E     VG          +V CL  
Sbjct: 337 GAMDLCFRVRRGGVVPAVPAVTMVFE-GAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTF 395

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              +  G       A ++G    QN ++EFDL N R GFA  KC
Sbjct: 396 GNSDLLGIE-----AYVIGHHHQQNVWMEFDLVNSRVGFADTKC 434


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 138/469 (29%), Positives = 192/469 (40%), Gaps = 58/469 (12%)

Query: 35  PLTPLSTKHYLHHSDS-DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIK 93
           P T L     L HSDS   L ILH L    + R        K K   S+     S+  I+
Sbjct: 17  PKTQLQRLKELVHSDSVRQLMILHKLRGGQIPR-------RKAKEVLSSSSGRGSDDAIE 69

Query: 94  TPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
            P+   + +  G YS++   GTP Q     + DTGS L W  C  +Y C   N  N    
Sbjct: 70  VPMHPAADYGIGQYSVAFKVGTPSQKFM-LVADTGSDLTWMSC--KYHCRSRNCSNRKAR 126

Query: 151 RIP---AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
           RI     F    SSS + I C    C       +E      S  N   PL    Y  +Y 
Sbjct: 127 RIRHKRVFHANLSSSFKTIPCLTDMC------KIE-LMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 208 LGFTA-GLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPS 257
            G TA G   +ET+    K      + N L GCS      S +   G+ G G S  S   
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 258 QLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           +   K   KFSYCL+        VS+ L      GS  SK   L+   + +  +G  ++F
Sbjct: 240 KAAEKFGGKFSYCLVDH-LSHKNVSNYLTF----GSSRSKEALLNNMTYTELVLGMVNSF 294

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
              Y V +  I +G   +KIP    V    G GG I+DSGS+ TF+  P ++ V      
Sbjct: 295 ---YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRV 349

Query: 375 QMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
            +  + +   VE   G L  CF+ +G +   +P L+  F  GA+   P ++Y     + V
Sbjct: 350 SLLKFRK---VEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGV 406

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            CL   +    G +       ++G+   QN   EFDL   + GFA   C
Sbjct: 407 RCLGFVSVAWPGTS-------VVGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 115/392 (29%), Positives = 165/392 (42%), Gaps = 70/392 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+  G+P +     IFDTGS L W  C                S    F P +S+
Sbjct: 132 GNYIVSIGLGSPKK-DLMLIFDTGSDLTWARC----------------SAAETFDPTKST 174

Query: 162 SSQLIGCQNPKCSWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           S   + C  P CS +       SRC        TC      Y +QYG G ++ G L  E 
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAA-----STCV-----YGIQYGDGSYSIGFLGKER 224

Query: 220 LRFPSKTV-PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRK 272
           L   S  +  NF  GC    D    + AG+ G GR   S+ SQ   K    FSYCL    
Sbjct: 225 LTIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL---- 280

Query: 273 FDDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
               P SS+    TG  S G S++    +TP    P         FY + L  I VG + 
Sbjct: 281 ----PSSSS----TGFLSFGSSQSKSAKFTPLSSGP-------SSFYNLDLTGITVGGQK 325

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP S          G I+DSG+  T +    + A+   F + M +Y     +   S L
Sbjct: 326 LAIPLSVF-----STAGTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPL---SIL 377

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D S  K++ +P++++ F GG  + +     F   G + +CL       AG    R 
Sbjct: 378 DTCYDFSKYKTIKVPKIVISFSGGVDVDVDQAGIFVANGLKQVCLAF-----AGNTGARD 432

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            A I G+ Q +NF + +D++  + GFA   C+
Sbjct: 433 TA-IFGNTQQRNFEVVYDVSGGKVGFAPASCS 463


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 103/337 (30%), Positives = 149/337 (44%), Gaps = 39/337 (11%)

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-- 209
           +P   P  SSS+  + C +  C  +  P + S   G    +  C     SY   YG    
Sbjct: 12  LPLLYPTSSSSAAFVACGDRTCGELPRP-LCSNVAGGGSGSGNC-----SYHYAYGNARD 65

Query: 210 ---FTAGLLLSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL 261
              +T G+L++ET  F   +   P    GC++ S+      +G+ G GR   SL +QL +
Sbjct: 66  THHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNV 125

Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           + F Y L S     +P+S   + D   G+GDS       TP   NPV        FYYVG
Sbjct: 126 EAFGYRLSSDLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQDL---PFYYVG 178

Query: 322 LRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
           L  I VG K V+IP  ++    S G GGVI DSG+T T +  P +  V  E + QMG + 
Sbjct: 179 LTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQ 237

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCL 436
           +         L  CF   G  +   P ++L F GGA M L  ENY   +    G    C 
Sbjct: 238 KPPPAANDDDLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCW 295

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
            +   + A          I+G+    +F++ FDL+ +
Sbjct: 296 SVVKSSQA--------LTIIGNIMQMDFHVVFDLSGN 324


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 121/439 (27%), Positives = 181/439 (41%), Gaps = 59/439 (13%)

Query: 59  LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
           L +   +RA +L T+  P  +    G + S S + + L   S G Y + +S G+PP    
Sbjct: 129 LVARDNARAEYLATRLSPAYQPP--GFSGSESKVVSGLDEGS-GEYLVRVSVGSPPTEQY 185

Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI-F 177
             + D+GS ++W  C     C++C +   DP     F P  S++   + C +  C  +  
Sbjct: 186 -LVVDSGSDVMWVQCKP---CLEC-YVQADP----LFDPATSATFSGVSCGSAICRILPT 236

Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSI 236
               +    GC             Y + Y  G +T G L  ETL      V   + GC  
Sbjct: 237 SACGDGELGGCE------------YEVSYADGSYTKGALALETLTLGGTAVEGVVIGCGH 284

Query: 237 LSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR-------KFDDAPVSSNLV 283
            +       AG+ G G    SL  QLG +    FSYCL SR         DDA     LV
Sbjct: 285 RNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDA---GWLV 341

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
           L    G  ++   G  + P  +NP   S     FYYVGL  I VG + + +         
Sbjct: 342 L----GRSEAVPEGAVWVPLVRNPRAPS-----FYYVGLSGIEVGDERLPLQAGLFQLTE 392

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
           DG G V++D+G+T T +    + A+   F+  +      A     S L  C+D+SG  SV
Sbjct: 393 DGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASV 452

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            +P +   F G A++ L   N    V   + CL  F  +++G +       I+G+ Q   
Sbjct: 453 RVPTVSFCFDGDARLILAARNVLLEVDMGIYCLA-FAPSSSGLS-------IMGNTQQAG 504

Query: 464 FYLEFDLANDRFGFAKQKC 482
             +  D AN   GF    C
Sbjct: 505 IQITVDSANGYIGFGPANC 523


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 170/387 (43%), Gaps = 52/387 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS + W  C     C DC +   DP     F P  SS
Sbjct: 160 GEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEP---CSDC-YQQSDP----VFNPTSSS 210

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           + + + C  P+CS +           C  R+  C      Y + YG G FT G L ++T+
Sbjct: 211 TYKSLTCSAPQCSLL-------ETSAC--RSNKCL-----YQVSYGDGSFTVGELATDTV 256

Query: 221 RFPSKTVPNFLA-GCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
            F +    N +A GC   ++      AG+ G G  + S+ +Q+    FSYCL+ R   D+
Sbjct: 257 TFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCLVDR---DS 313

Query: 277 PVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
             SS+L  ++   GSGD+  P L            +     FYYVGL    VG + V +P
Sbjct: 314 GKSSSLDFNSVQLGSGDATAPLLR-----------NQKIDTFYYVGLSGFSVGGQKVMMP 362

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            +     + G+GGVI+D G+  T ++   + ++   F++   N  +       S    C+
Sbjct: 363 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKG--TSSISLFDTCY 420

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           D S   SV +P +   F GG  + LP +NY   V +       F   ++  +       I
Sbjct: 421 DFSSLSSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLS-------I 473

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +G+ Q Q   + +DLAN   G +  KC
Sbjct: 474 IGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 118/390 (30%), Positives = 162/390 (41%), Gaps = 64/390 (16%)

Query: 104 YSISLSFGTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y +  + GTP Q   P +   DT +   W PC+    CV C       +    F P +SS
Sbjct: 91  YIVRANIGTPAQ---PMLVALDTSNDAAWVPCSG---CVGC-------ASSVLFDPSKSS 137

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           SS+ + C  P+C     P   +         K+C      + + YG       L  +TL 
Sbjct: 138 SSRNLQCDAPQCKQAPNPTCTA--------GKSC-----GFNMTYGGSTIEASLTQDTLT 184

Query: 222 FPSKTVPNFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDD 275
             +  + ++  GC         PA G+ G GR   SL SQ   L +  FSYCL + K   
Sbjct: 185 LANDVIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSK--- 241

Query: 276 APVSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
              SSN    L  GP     +   +  TP  KNP  SS      YYV L  I VG+K V 
Sbjct: 242 ---SSNFSGSLRLGPKYQPVR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVD 290

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           IP S L   +    G I DSG+ FT +  P + AV  EF R++ N    A+     G   
Sbjct: 291 IPTSALAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKN----ANATSLGGFDT 346

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGP 452
           C+  SG  SV  P +   F  G  + LPP+N      +    CL +    AA P      
Sbjct: 347 CY--SG--SVVYPSVTFMF-AGMNVTLPPDNLLIHSSSGSTSCLAM----AAAPNNVNSV 397

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             ++   Q QN  +  DL N R G +++ C
Sbjct: 398 LNVIASMQQQNHRVLIDLPNSRLGISRETC 427


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 166/387 (42%), Gaps = 61/387 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P ++ T  I DTGS + W  C     C  C+    DP     F P  SS+ 
Sbjct: 133 YLITVRLGSPGKSQTMLI-DTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 183

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
               C +  C+ +          GCS  +  C      Y + YG G  T G   S+TL  
Sbjct: 184 SPFSCSSAACAQL-----GQEGNGCS--SSQC-----QYTVTYGDGSSTTGTYSSDTLAL 231

Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDA 276
            S  V  F  GCS +    + Q  G+ G G  ++SL SQ        FSYCL       A
Sbjct: 232 GSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCL------PA 285

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
             SS+  L  G G+          + F K P+  SS    FY V ++ I VG + + IP 
Sbjct: 286 TSSSSGFLTLGAGT----------SGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPT 335

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPCF 395
           S        + G I+DSG+  T +    + A++  F   M  Y  A      SG L  CF
Sbjct: 336 SVF------SAGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSA----PPSGILDTCF 385

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           D SG+ SV +P + L F GGA + +  +       N +LCL  F  N+   +LG     I
Sbjct: 386 DFSGQSSVSIPTVALVFSGGAVVDIASDGIMLQTSNSILCLA-FAANSDDSSLG-----I 439

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +G+ Q + F + +D+     GF    C
Sbjct: 440 IGNVQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 110/378 (29%), Positives = 163/378 (43%), Gaps = 56/378 (14%)

Query: 114 PQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           PQ  + F+ DTGS + W    PC  +  C +           P F P+ SSS   + C +
Sbjct: 6   PQQPSFFVLDTGSDVTWLQCLPCAGKNGCYE--------QITPIFDPELSSSYNPVSCDS 57

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVP 228
            +C  +                  C +    Y ++YG G FT G L +ETL F  S ++P
Sbjct: 58  EQCQLL--------------DEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIP 103

Query: 229 NFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
           N   GC   ++       G+ G G  + S+ SQL    FSYCL+     D+P  S L  +
Sbjct: 104 NISIGCGHDNEGLFVGADGLIGLGGGAISISSQLKASSFSYCLVDI---DSPSFSTLDFN 160

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
           T P S DS       +P  KN       F  F YV +  + VG K + I  S       G
Sbjct: 161 TDPPS-DSLI-----SPLVKN-----DRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESG 209

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
            GG+IVDSG+T T +   ++E + + F+    N   A ++   S    C+D+S + +V +
Sbjct: 210 LGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEI---SPFDTCYDLSSQSNVEV 266

Query: 406 PELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           P +     G   + LP +N    V +    CL   +           P  I+G+FQ Q  
Sbjct: 267 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFVS--------ATFPLSIIGNFQQQGI 318

Query: 465 YLEFDLANDRFGFAKQKC 482
            + +DL N   GF+  KC
Sbjct: 319 RVSYDLTNSLVGFSTNKC 336


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 107/390 (27%), Positives = 172/390 (44%), Gaps = 46/390 (11%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           +SL  GTPPQ     + DTGS L W  C  +            P    +F P  SSS  +
Sbjct: 82  VSLPIGTPPQTQQ-MVLDTGSQLSWIQCHKKSV-------PKKPPPTTSFDPSLSSSFSV 133

Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
           + C +P C   I    + + C     +N+ C     SY    G  +  G L+ E + F S
Sbjct: 134 LPCNHPLCKPRIPDFTLPTTCD----QNRLCHY---SYFYADGT-YAEGSLVREKITFSS 185

Query: 225 -KTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS-NL 282
            ++ P  + GC+  S  +  GI G      S  SQ  + KFSYC+ +R+      S+ + 
Sbjct: 186 SQSTPPLILGCAEASTDE-KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSF 244

Query: 283 VLDTGPGSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
            L   P SG  +   L ++TP  ++P     A    Y + ++ I +G+  + I  +   P
Sbjct: 245 YLGNNPNSGRFQYINLLTFTPSQRSPNLDPLA----YTIPMQGIRMGNARLNISATLFRP 300

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-------NYSRAADVEKKSGLRPC 394
              G G  I+DSGS FT++    +  V +E +R +G        Y   +D+        C
Sbjct: 301 DPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDM--------C 352

Query: 395 FDISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           FD +  +    +  ++ +F+ G ++ +      A VG  V C+ +      G A     +
Sbjct: 353 FDGNPMEIGRLIGNMVFEFEKGVEIVIDKWRVLADVGGGVHCIGIGRSEMLGAA-----S 407

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I+G+F  QN ++E+DLAN R G  K  C+
Sbjct: 408 NIIGNFHQQNLWVEYDLANRRIGLGKADCS 437


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 114/394 (28%), Positives = 170/394 (43%), Gaps = 63/394 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS + W  C     C +C +   DP     F P  S+
Sbjct: 155 GEYFTRIGVGTPTREQY-MVLDTGSDVAWIQCEP---CREC-YSQADP----IFNPSYSA 205

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   +GC +  CS +   +  S   GC             Y   YG G ++ G   +ETL
Sbjct: 206 SFSTVGCDSAVCSQLDAYDCHS--GGCL------------YEASYGDGSYSTGSFATETL 251

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGLKK---FSYCLLS 270
            F + +V N   GC      +  G+        G G  + S P+Q+G +    FSYCL+ 
Sbjct: 252 TFGTTSVANVAIGCG----HKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVD 307

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R+ D     S+  L  GP    S   G  +TP  KNP         FYY+ +  I VG  
Sbjct: 308 RESD-----SSGPLQFGP---KSVPVGSIFTPLEKNP-----HLPTFYYLSVTAISVGGA 354

Query: 331 HVKI--PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
            +    P  + +  + G+GG I+DSG+  T +    ++AV   F+   G   R   V   
Sbjct: 355 LLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAV--- 411

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           S    C+D+SG + V +P +   F  GA + LP +NY  L+  + +    F    A  ++
Sbjct: 412 SIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNY--LIPMDTVGTFCFAFAPAASSV 469

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 I+G+ Q Q+  + FD AN   GFA  +C
Sbjct: 470 S-----IMGNTQQQHIRVSFDSANSLVGFAFDQC 498


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 165/388 (42%), Gaps = 61/388 (15%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           + + GTPPQA++ FI  TG  LVW  C+   +C+ C         +P F+P  SS+ +  
Sbjct: 57  NFTIGTPPQAASAFIDLTGE-LVWTQCS---QCIHCF-----KQDLPVFVPNASSTFKPE 107

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
            C    C  I  P   S           C     +Y    GLG  T G++ ++T    + 
Sbjct: 108 PCGTDVCKSIPTPKCAS---------DVC-----AYDGVTGLGGHTVGIVATDTFAIGTA 153

Query: 226 TVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
              +   GC + SD      P+G  G GR+  SL +Q+ L +FSYCL      D   +S 
Sbjct: 154 APASLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSR 210

Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
           L L    G+      G ++TPF K     +    ++Y + L +I  G   + +P      
Sbjct: 211 LFL----GASAKLAGGGAWTPFVKT--SPNDGMSQYYPIELEEIKAGDATITMPR----- 259

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG--LRPCFDISG 399
              G   V+V +      +   L ++V +EF + +     AA      G     CF  +G
Sbjct: 260 ---GRNTVLVQTAVVRVSL---LVDSVYQEFKKAVMASVGAAPTATPVGAPFEVCFPKAG 313

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL----ILFTDNAAGPALGRGPAII 455
                 P+L+  F+ GA + +PP NY   VGN+ +CL    I   +  A   L      I
Sbjct: 314 VSGA--PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLN-----I 366

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LG FQ +N +L FDL  D   F    C+
Sbjct: 367 LGSFQQENVHLLFDLDKDMLSFEPADCS 394


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 121/426 (28%), Positives = 177/426 (41%), Gaps = 68/426 (15%)

Query: 68  RHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSS 127
           RHL    KP   +   GS+  + + +        G Y + +  G+PP+ +   + D+GS 
Sbjct: 105 RHLAAG-KPTYAEEAFGSDVVSGMEQ------GSGEYFVRIGVGSPPR-NQYVVIDSGSD 156

Query: 128 LVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV-ES 183
           ++W    PCT  Y   D           P F P  SSS   + C +  CS +      E 
Sbjct: 157 IIWVQCEPCTQCYHQSD-----------PVFNPADSSSYAGVSCASTVCSHVDNAGCHEG 205

Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ- 241
           RC+               Y + YG G +T G L  ETL F    + N   GC   +    
Sbjct: 206 RCR---------------YEVSYGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMF 250

Query: 242 --PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
              AG+ G G    S   QLG +    FSYCL+SR      + S+ +L  G    ++   
Sbjct: 251 VGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRG-----IQSSGLLQFGR---EAVPV 302

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
           G ++ P   NP   S     FYYVGL  + VG   V I          G+GGV++D+G+ 
Sbjct: 303 GAAWVPLIHNPRAQS-----FYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTA 357

Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
            T +    +EA    FI Q  N  RA+ V   S    C+D+ G  SV +P +   F GG 
Sbjct: 358 VTRLPTAAYEAFRDAFIAQTTNLPRASGV---SIFDTCYDLFGFVSVRVPTVSFYFSGGP 414

Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
            + LP  N+   V +       F  +++G +       I+G+ Q +   +  D AN   G
Sbjct: 415 ILTLPARNFLIPVDDVGSFCFAFAPSSSGLS-------IIGNIQQEGIEISVDGANGFVG 467

Query: 477 FAKQKC 482
           F    C
Sbjct: 468 FGPNVC 473


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 132/443 (29%), Positives = 184/443 (41%), Gaps = 79/443 (17%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           RAR +  +  P+                  L  H     ++SL+ GTPPQ  T  + DTG
Sbjct: 61  RARQMPARALPRQPSK--------------LRFHHNVSLTVSLAVGTPPQNVT-MVLDTG 105

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
           S L W  C           P    ++  A  F P+ SS+   + C + +C     P+  +
Sbjct: 106 SELSWLLCA----------PAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPA 155

Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILS---DR 240
            C G S R       C S  L Y  G ++   L+  + F   + P   A    +S   D 
Sbjct: 156 -CDGASSR-------C-SVSLSYADGSSSDGALATDV-FAVGSGPPLRAAFGCMSSAFDS 205

Query: 241 QPAGIA-----GFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
            P G+A     G  R + S  SQ   ++FSYC+  R  DDA V   L+L      G S  
Sbjct: 206 SPDGVASAGLLGMNRGALSFVSQASTRRFSYCISDR--DDAGV---LLL------GHSDL 254

Query: 296 PG---LSYTPFYKNPVGSSSAFGEFYY-VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
           P    L+YTP Y+ P      F    Y V L  I VG KH+ IP S L P   G G  +V
Sbjct: 255 PTFLPLNYTPMYQ-PALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMV 313

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKS---VYL 405
           DSG+ FTF+ G  + A+  EF RQ      A D      +     CF +   +S     L
Sbjct: 314 DSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARL 373

Query: 406 PELILKFKGGAKMALPPENYFALV------GNEVLCLILFTDNAAGPALGRGPAIILGDF 459
           P + L F  GA+MA+  +     V      G+ V CL  F +    P +    A ++G  
Sbjct: 374 PGVTLLFN-GAEMAVAGDRLLYKVPGERRGGDGVWCLT-FGNADMVPIM----AYVIGHH 427

Query: 460 QLQNFYLEFDLANDRFGFAKQKC 482
              N ++E+DL   R G A  +C
Sbjct: 428 HQMNVWVEYDLERGRVGLAPVRC 450


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 116/398 (29%), Positives = 167/398 (41%), Gaps = 61/398 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSS 162
           Y + L+ GTPPQ  T  + DTGS L+W  C +   C  C   P+      P F P+ SSS
Sbjct: 98  YVLDLAVGTPPQPITALL-DTGSDLIWTQCDT---CTACLRQPD------PLFSPRMSSS 147

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLR 221
            + + C    C    G  +   C     R  TC     +Y   YG G T  G   +E   
Sbjct: 148 YEPMRCAGQLC----GDILHHSCV----RPDTC-----TYRYSYGDGTTTLGYYATERFT 194

Query: 222 FPS-----KTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
           F S     ++VP    GC  +   S    +GI GFGR   SL SQL +++FSYCL     
Sbjct: 195 FASSSGETQSVPLGF-GCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL----- 248

Query: 274 DDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
              P +S+       GS  D      +  P    P+  S+    FYYV    + VG++ +
Sbjct: 249 --TPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRL 306

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
           +IP S      DG+GGVI+DSG+  T     +   V + F  Q+     A       G+ 
Sbjct: 307 RIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQL-RLPFANGSSPDDGV- 364

Query: 393 PCFDISG--------KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
            CF             + V +P ++  F+ GA + LP ENY           +L  D+  
Sbjct: 365 -CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGD 422

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             A        +G+F  Q+  + +DL  +   FA  +C
Sbjct: 423 DGA-------TIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 165/388 (42%), Gaps = 61/388 (15%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           + + GTPPQA++ FI  TG  LVW  C+   +C+ C         +P F+P  SS+ +  
Sbjct: 27  NFTIGTPPQAASAFIDLTGE-LVWTQCS---QCIHCF-----KQDLPVFVPNASSTFKPE 77

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
            C    C  I  P   S    C+    T            GLG  T G++ ++T    + 
Sbjct: 78  PCGTDVCKSIPTPKCASDV--CAFDGVT------------GLGGHTVGIVATDTFAIGTA 123

Query: 226 TVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
              +   GC + SD      P+G  G GR+  SL +Q+ L +FSYCL      D   +S 
Sbjct: 124 APASLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSR 180

Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
           L L    G+      G ++TPF K     +    ++Y + L +I  G   + +P      
Sbjct: 181 LFL----GASAKLAGGGAWTPFVKT--SPNDGMSQYYPIELEEIKAGDATITMPR----- 229

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG--LRPCFDISG 399
              G   V+V +      +   L ++V +EF + +     AA      G     CF  +G
Sbjct: 230 ---GRNTVLVQTAVVRVSL---LVDSVYQEFKKAVMASVGAAPTATPVGEPFEVCFPKAG 283

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL----ILFTDNAAGPALGRGPAII 455
                 P+L+  F+ GA + +PP NY   VGN+ +CL    I   +  A   L      I
Sbjct: 284 VSGA--PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLN-----I 336

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LG FQ +N +L FDL  D   F    C+
Sbjct: 337 LGSFQQENVHLLFDLDKDMLSFEPADCS 364


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 161/392 (41%), Gaps = 73/392 (18%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           G+S+++    P +     I DTGS L+W  C                        K SSS
Sbjct: 42  GHSLTVGIVQPRK----LIVDTGSDLIWTQC------------------------KLSSS 73

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLR 221
           +                   +   G  P ++T P    ++         A G+L SET  
Sbjct: 74  T-----------------AAAARHGSPPLSRTAPARTGAFTRTCTASAAAVGVLASETFT 116

Query: 222 FPSKTVPNFLAG--CSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
           F ++   +   G  C  LS        GI G    S SL +QL +++FSYCL    F D 
Sbjct: 117 FGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLT--PFADK 174

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
             S  L       S    T  +  T    NPV +      +YYV L  I +G K + +P 
Sbjct: 175 KTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETV-----YYYVPLVGISLGHKRLAVPA 229

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
           + L    DG GG IVDSGST  ++    FEAV KE +  +     A    +   L  CF 
Sbjct: 230 ASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL--CFV 286

Query: 397 I------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
           +      +  ++V +P L+L F GGA M LP +NYF      ++CL      A G     
Sbjct: 287 LPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCL------AVGKTTDG 340

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               I+G+ Q QN ++ FD+ + +F FA  +C
Sbjct: 341 SGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 112/407 (27%), Positives = 171/407 (42%), Gaps = 68/407 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +++  GTPP+  T  +FDTGS L W       +C+ C   +  P + P F P +SS+ 
Sbjct: 122 YVVTIGIGTPPRNFT-VLFDTGSDLTWV------QCLPCPDSSCYPQQEPLFDPSKSSTY 174

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C  P+C    G   ++RC   S         C  Y ++YG    T G L  ET   
Sbjct: 175 VDVPCSAPECH--IGGVQQTRCGATS---------C-EYSVKYGDESETHGSLAEETFTL 222

Query: 223 --PSKTVP---NFLAGCS-----ILSD--RQPAGIAGFGRSSESLPSQL------GLKKF 264
             PS   P     + GCS     + +D     AG+ G GR   S+ SQ       G   F
Sbjct: 223 SPPSPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVF 282

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYCL  R       +  L +  G  +   +   LS+TP     + + S     Y V L  
Sbjct: 283 SYCLPPR----GSSTGYLTIGGGAAAPQQQYSNLSFTPL----ITTISQLRSAYVVNLAG 334

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           + V    V IP S        + G ++DSG+  T M    +  +  EF   MG+Y    +
Sbjct: 335 VSVNGAAVDIPASAF------SLGAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPE 388

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE--------VLCL 436
              K  L  C+D++G+  V  P + L+F GGA++ +       ++  E        + CL
Sbjct: 389 GSMKL-LDTCYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACL 447

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                N+AG        +I+G+ Q + + + FD+   R GF    C+
Sbjct: 448 AFLPTNSAG-------LVIVGNMQQRAYNVVFDVDGGRIGFGPNGCS 487


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 127/449 (28%), Positives = 191/449 (42%), Gaps = 79/449 (17%)

Query: 51  DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS----VHSYGGYSI 106
           D  +I+  L S     A  L T+ KPK K      N +N  +  P++    + S   Y  
Sbjct: 38  DTARIVSMLTSG----AGPLTTRAKPKPK------NRANPPV--PIAPGRQILSIPNYIA 85

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
               GTP Q +     D  +   W PC++   C  C       +  P+F P +SS+ + +
Sbjct: 86  RAGLGTPAQ-TLLVAIDPSNDAAWVPCSA---CAGCA------ASSPSFSPTQSSTYRTV 135

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS---YLLQYGLGFTAGLLLSETLRFP 223
            C +P+C+ +  P              +CP    S   + L Y       +L  ++L   
Sbjct: 136 PCGSPQCAQVPSP--------------SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALE 181

Query: 224 SKTVPNFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
           +  V ++  GC  ++S     P G+ GFGR   S  SQ        FSYCL + +     
Sbjct: 182 NNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYR----- 236

Query: 278 VSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            SSN    L  GP  G  K   +  TP   NP   S      YYV +  I VGSK V++P
Sbjct: 237 -SSNFSGTLKLGP-IGQPKR--IKTTPLLYNPHRPS-----LYYVNMIGIRVGSKVVQVP 287

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S L        G I+D+G+ FT +  P++ AV   F  ++    R        G   C+
Sbjct: 288 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCY 343

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAI 454
           ++    +V +P +   F G   + LP EN      +  V CL +    AAGP+ G   A+
Sbjct: 344 NV----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM----AAGPSDGVNAAL 395

Query: 455 -ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +L   Q QN  + FD+AN R GF+++ C
Sbjct: 396 NVLASMQQQNQRVLFDVANGRVGFSRELC 424


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 130/490 (26%), Positives = 201/490 (41%), Gaps = 80/490 (16%)

Query: 24  GAGSSAATVTV-----PLTPLSTKHYLHHSDSDPL--------KILHSLASSSLSRARHL 70
           GA SS   +T+     P +PL+  H    S  D L         I H +++++ +R    
Sbjct: 78  GATSSGTRMTIVHRHGPCSPLAAAHGKPPSHEDILAADQNRAESIQHRVSTTATARGNPK 137

Query: 71  KTKTKPKTKDSNIGSNYSNSLIKTPLSVH--------SYGGYSISLSFGTPPQASTPFIF 122
           +++  P  +     +    + + +  +            G Y +++  GTP    T  +F
Sbjct: 138 RSRRAPSRRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYT-VVF 196

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           DTGS   W  C     CV   +   +      F P RSS+   + C  P C        +
Sbjct: 197 DTGSDTTWVQCQP---CVVVCYEQQEK----LFDPARSSTYANVSCAAPAC-------FD 242

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR 240
              +GCS  +         Y +QYG G ++ G    +TL   S   V  F  GC   ++ 
Sbjct: 243 LDTRGCSGGHCL-------YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEG 295

Query: 241 ---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
              + AG+ G GR   SLP Q   K    F++CL +R       S    LD GPGS  + 
Sbjct: 296 LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS------SGTGYLDFGPGSPAAA 349

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
              L+ TP   +   +      FYYVG+  I VG + + IP S          G IVDSG
Sbjct: 350 GARLT-TPMLTDNGPT------FYYVGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSG 397

Query: 355 STFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           +  T +  P + ++   F+  M    Y +A  V   S L  C+D +G   V +P + L F
Sbjct: 398 TVITRLPPPAYSSLRSAFVSAMAARGYKKAPAV---SLLDTCYDFTGMSQVAIPTVSLLF 454

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
           +GGA + +             +CL  F  N  G  +G     I+G+ QL+ F + +D+  
Sbjct: 455 QGGAILDVDASGIMYAASVSQVCL-GFAANEDGGDVG-----IVGNTQLKTFGVAYDIGK 508

Query: 473 DRFGFAKQKC 482
              GF+   C
Sbjct: 509 KVVGFSPGAC 518


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 127/449 (28%), Positives = 191/449 (42%), Gaps = 79/449 (17%)

Query: 51  DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS----VHSYGGYSI 106
           D  +I+  L S     A  L T+ KPK K      N +N  +  P++    + S   Y  
Sbjct: 57  DTARIVSMLTSG----AGPLTTRAKPKPK------NRANPPV--PIAPGRQILSIPNYIA 104

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
               GTP Q +     D  +   W PC++   C  C       +  P+F P +SS+ + +
Sbjct: 105 RAGLGTPAQ-TLLVAIDPSNDAAWVPCSA---CAGCA------ASSPSFSPTQSSTYRTV 154

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS---YLLQYGLGFTAGLLLSETLRFP 223
            C +P+C+ +  P              +CP    S   + L Y       +L  ++L   
Sbjct: 155 PCGSPQCAQVPSP--------------SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALE 200

Query: 224 SKTVPNFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
           +  V ++  GC  ++S     P G+ GFGR   S  SQ        FSYCL + +     
Sbjct: 201 NNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYR----- 255

Query: 278 VSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            SSN    L  GP  G  K   +  TP   NP   S      YYV +  I VGSK V++P
Sbjct: 256 -SSNFSGTLKLGP-IGQPKR--IKTTPLLYNPHRPS-----LYYVNMIGIRVGSKVVQVP 306

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S L        G I+D+G+ FT +  P++ AV   F  ++    R        G   C+
Sbjct: 307 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCY 362

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAI 454
           ++    +V +P +   F G   + LP EN      +  V CL +    AAGP+ G   A+
Sbjct: 363 NV----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM----AAGPSDGVNAAL 414

Query: 455 -ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +L   Q QN  + FD+AN R GF+++ C
Sbjct: 415 NVLASMQQQNQRVLFDVANGRVGFSRELC 443


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 136/445 (30%), Positives = 193/445 (43%), Gaps = 58/445 (13%)

Query: 47  HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSI 106
             DS  +K + SLA+ S  R     TK  P++        +S ++I + LS  S G Y +
Sbjct: 91  QRDSLRVKSITSLAAVSTGRN---ATKRTPRS-----AGGFSGAVI-SGLSQGS-GEYFM 140

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSSSQL 165
            L  GTP   +   + DTGS +VW  C+    C  C N  +V       F PK+S +   
Sbjct: 141 RLGVGTPA-TNVYMVLDTGSDVVWLQCSP---CKACYNQSDV------IFDPKKSKTFAT 190

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS 224
           + C +  C  +   +  S C   + R+KTC      Y + YG G FT G   +ETL F  
Sbjct: 191 VPCGSRLCRRL---DDSSEC--VTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHG 240

Query: 225 KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPV 278
             V +   GC   ++      AG+ G GR   S PSQ   +   KFSYCL+ R    +  
Sbjct: 241 ARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSS 300

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYS 337
                +  G    D+      +TP   NP         FYY+ L  I VG   V  +  S
Sbjct: 301 KPPSTIVFG---NDAVPKTSVFTPLLTNP-----KLDTFYYLQLLGISVGGSRVPGVSES 352

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                + GNGGVI+DSG++ T +    + A+   F        RA      S    CFD+
Sbjct: 353 QFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSY---SLFDTCFDL 409

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
           SG  +V +P ++  F GG +++LP  NY   V  E      F    AG     G   I+G
Sbjct: 410 SGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAF----AGTM---GSLSIIG 461

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           + Q Q F + +DL   R GF  + C
Sbjct: 462 NIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 133/453 (29%), Positives = 192/453 (42%), Gaps = 65/453 (14%)

Query: 47  HSDSDPLKILHS-----LASSSLSRAR-HLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHS 100
           H    PL+ ++S     L S S  R    L T    ++K+S   +  SN  +++  +V +
Sbjct: 78  HGACSPLRPINSSSWIDLVSQSFERDNARLNTI---RSKNSGPYTTMSNLPLQSGTTVGT 134

Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
            G Y ++  FGTP + S   I DTGS L W  C     C DC +  VD      F PK+S
Sbjct: 135 -GNYIVTAGFGTPAKNSL-LIIDTGSDLTWIQCKP---CADC-YSQVDA----IFEPKQS 184

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT-CPLACPSYLLQYGLGFTA-GLLLSE 218
           SS + + C +  C+ +            S  N T C L    Y + YG G ++ G    E
Sbjct: 185 SSYKTLPCLSATCTELI----------TSESNPTPCLLGGCVYEINYGDGSSSQGDFSQE 234

Query: 219 TLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
           TL   S +  NF  GC   +    +  +G+ G G++S S PSQ   K   +F+YCL    
Sbjct: 235 TLTLGSDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLP--- 291

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
            D    +S      G GS  +      +TP   N +     +  FY+VGL  I VG   +
Sbjct: 292 -DFGSSTSTGSFSVGKGSIPASA---VFTPLVSNFM-----YPTFYFVGLNGISVGGDRL 342

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            IP     P   G G  IVDSG+  T +    + A+   F  +  +   A   +  S L 
Sbjct: 343 SIP-----PAVLGRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSA---KPFSILD 394

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV--GNEVLCLILFTDNAAGPALGR 450
            C+D+S    V +P +   F+  A +A+        V  G   +CL      A   A   
Sbjct: 395 TCYDLSRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCL------AFASASQM 448

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               I+G+FQ Q   + FD    R GFA   CA
Sbjct: 449 DGFNIIGNFQQQRMRVAFDTGAGRIGFASGSCA 481


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 115/392 (29%), Positives = 165/392 (42%), Gaps = 61/392 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y + +  G+PP+ S   + D+GS +VW    PCT  Y   D           P F P 
Sbjct: 41  GEYFVRIGVGSPPR-SQYMVIDSGSDIVWVQCKPCTQCYHQTD-----------PLFDPA 88

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVES-RCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLL 216
            S+S   + C +  C  +      S RC+               Y + YG G  T G L 
Sbjct: 89  DSASFMGVSCSSAVCDQVDNAGCNSGRCR---------------YEVSYGDGSSTKGTLA 133

Query: 217 SETLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLS 270
            ETL      V N   GC  ++       AG+ G G  S S   QL  ++   FSYCL+S
Sbjct: 134 LETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVS 193

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R       +SN  L+ G    ++   G ++ P  +NP   S     +YY+GL  + VG  
Sbjct: 194 RV-----TNSNGFLEFGS---EAMPVGAAWIPLIRNPHSPS-----YYYIGLSGLGVGDM 240

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            V I          GNGGV++D+G+  T      +EA    FI Q GN  RA+ V   S 
Sbjct: 241 KVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGV---SI 297

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
              C+++ G  SV +P +   F GG  + LP  N+   V +       F  + +G +   
Sbjct: 298 FDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLS--- 354

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               ILG+ Q +   +  D AN+  GF    C
Sbjct: 355 ----ILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 123/454 (27%), Positives = 180/454 (39%), Gaps = 57/454 (12%)

Query: 40  STKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-SV 98
           S  H +H   S PL+ + +LA    +R   L +K              S  +   P+ S 
Sbjct: 24  SVYHNVHPPSSSPLESIIALAREDDARLLFLSSKAA------------STGVSSAPVASG 71

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
            S   Y +    G+P Q       DT +   W  C+    C  C      PS    F P 
Sbjct: 72  QSPPSYVVRAGLGSPAQPIL-LALDTSADATWAHCSP---CGTC------PSSGSLFAPA 121

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            S+S   + C +  C+ + G      C    P + + PL   ++   +        L S+
Sbjct: 122 NSTSYAPLPCSSTMCTVLQG----QPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASD 177

Query: 219 TLRFPSKTVPNFLAGC-SILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLS 270
            L      +PN+  GC S +S    +    G+ G GR   +L SQ+G      FSYCL S
Sbjct: 178 WLHLGKDAIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPS 237

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
            K      S +L L      G +  P G+ YTP  KNP  SS      YYV +  + VG 
Sbjct: 238 YK--SYYFSGSLRL------GAAGQPRGVRYTPMLKNPNRSS-----LYYVNVTGLSVGR 284

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             VK+P            G +VDSG+  T    P++ A+ +EF R +   S    +    
Sbjct: 285 APVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSL---G 341

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPAL 448
               CF+     +   P + +   GG  +ALP EN         + CL +    A  P  
Sbjct: 342 AFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAM----AEAPQN 397

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 +L + Q QN  + FD+AN R GFA++ C
Sbjct: 398 VNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 133/485 (27%), Positives = 203/485 (41%), Gaps = 77/485 (15%)

Query: 24  GAGSSAATVTV-----PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSRA--- 67
           GA SS   +T+     P +PL+  H         L    +    I H +++++ SR    
Sbjct: 83  GATSSTTRMTIVHRHGPCSPLAAAHSKPPSHDEILAADQNRAESIQHRVSTTATSRGQPK 142

Query: 68  RHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSS 127
           R  + +       +   S+ + SL  +P      G Y +++  GTP    T  +FDTGS 
Sbjct: 143 RSRRQQPSSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSD 201

Query: 128 LVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG 187
             W  C     CV   +      R   F P RSS+   + C  P CS     ++++R  G
Sbjct: 202 TTWVQCQP---CVVVCYEQ----REKLFDPARSSTYANVSCAAPACS-----DLDTR--G 247

Query: 188 CSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QP 242
           CS  +         Y +QYG G ++ G    +TL   S   V  F  GC   ++    + 
Sbjct: 248 CSGGHCL-------YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEA 300

Query: 243 AGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS 299
           AG+ G GR   SLP Q   K    F++CL +R       +    LD G GS  ++   L+
Sbjct: 301 AGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS------TGTGYLDFGAGSPAAR---LT 351

Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
            TP   +   +      FYYVGL  I VG + + IP S          G IVDSG+  T 
Sbjct: 352 TTPMLVDNGPT------FYYVGLTGIRVGGRLLYIPQSVFA-----TAGTIVDSGTVITR 400

Query: 360 MEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAK 417
           +    + ++   F   M    Y +A  V   S L  C+D +G   V +P + L F+GGA+
Sbjct: 401 LPPAAYSSLRSAFAAAMSARGYKKAPAV---SLLDTCYDFAGMSQVAIPTVSLLFQGGAR 457

Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           + +             +CL  F  N  G  +G     I+G+ QL+ F + +D+      F
Sbjct: 458 LDVDASGIMYAASASQVCLA-FAANEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVSF 511

Query: 478 AKQKC 482
           +   C
Sbjct: 512 SPGAC 516


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 130/447 (29%), Positives = 189/447 (42%), Gaps = 62/447 (13%)

Query: 47  HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSI 106
             DS  ++ L SLA+ S  R     TK  P++     G      ++ + LS  S G Y +
Sbjct: 89  QRDSLRVESLTSLAAVSAGRN---VTKRPPRSAGGFSG------VVISGLSQGS-GEYFM 138

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
            L  GTP   +   + DTGS +VW  C+    C +           P F P +S +   +
Sbjct: 139 RLGVGTPA-TNMYMVLDTGSDVVWLQCSPCKVCYN--------QSDPVFNPAKSKTFATV 189

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
            C +  C  +   +  S C   S R+K C      Y + YG G FT G   +ETL F   
Sbjct: 190 PCGSRLCRRL---DDSSEC--VSRRSKACL-----YQVSYGDGSFTVGDFSTETLTFHGA 239

Query: 226 TVPNFLAGCSILSDRQPAGI-----AGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
            V +   GC    D +   +      G GR   S PSQ   +   KFSYCL+ R    + 
Sbjct: 240 RVDHVALGCG--HDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSS 297

Query: 278 VSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIP 335
                 +  G G+     P  + +TP   NP         FYY+ L  I VG   V  + 
Sbjct: 298 SKPPSTIVFGNGA----VPKTAVFTPLLTNP-----KLDTFYYLQLLGISVGGSRVPGVS 348

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S     + GNGGVI+DSG++ T +    + A+   F  ++G  +R       S    CF
Sbjct: 349 ESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAF--RLGA-TRLKRAPSYSLFDTCF 405

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           D+SG  +V +P ++  F GG +++LP  NY   V N+      F           G   I
Sbjct: 406 DLSGMTTVKVPTVVFHFTGG-EVSLPASNYLIPVNNQGRFCFAFAGTM-------GSLSI 457

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +G+ Q Q F + +DL   R GF  + C
Sbjct: 458 IGNIQQQGFRVAYDLVGSRVGFLSRAC 484


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 121/430 (28%), Positives = 182/430 (42%), Gaps = 66/430 (15%)

Query: 68  RHLKTKTKPKTKDSNIGSNYSN--SLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R LK K  P     N+    +   S + + +   S G Y   +  GTP +     + DTG
Sbjct: 117 RKLKLKKDPAGSYENVAGVTAEFGSEVVSGMEQGS-GEYFTRIGIGTPTREQY-MVLDTG 174

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S +VW  C     C +C +   DP     F P  S S   +GC +  CS +   +     
Sbjct: 175 SDVVWIQCEP---CREC-YSQADP----IFNPSSSVSFSTVGCDSAVCSQLDANDCHG-- 224

Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAG 244
            GC             Y + YG G +T G   +ETL F + ++ N   GC         G
Sbjct: 225 GGCL------------YEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCG----HDNVG 268

Query: 245 I-------AGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
           +        G G  S S P+QLG    + FSYCL+ R  +     S+  L+ GP   +S 
Sbjct: 269 LFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSE-----SSGTLEFGP---ESV 320

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IP-YSYLVPGSDGNGGVIVD 352
             G  +TP   NP         FYY+ +  I VG   +  +P  ++ +  + G GG+I+D
Sbjct: 321 PIGSIFTPLVANPF-----LPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIID 375

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T ++   ++A+   FI    +  RA  +   S    C+D+S  +SV +P +   F
Sbjct: 376 SGTAVTRLQTSAYDALRDAFIAGTQHLPRADGI---SIFDTCYDLSALQSVSIPAVGFHF 432

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
             GA   LP +N   L+  + +    F    A   L      I+G+ Q Q   + FD AN
Sbjct: 433 SNGAGFILPAKN--CLIPMDSMGTFCFAFAPADSNLS-----IMGNIQQQGIRVSFDSAN 485

Query: 473 DRFGFAKQKC 482
              GFA  +C
Sbjct: 486 SLVGFAIDQC 495


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 171/372 (45%), Gaps = 50/372 (13%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I DTGS L W  C    RC +         + P F P +S S + + C +  C  +    
Sbjct: 80  IVDTGSDLSWVQCQPCNRCYN--------QQDPVFNPSKSPSYRTVLCNSLTCRSLQLAT 131

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
             S   G +P   TC     +Y++ YG G +T+G +  E L   + TV NF+ GC   + 
Sbjct: 132 GNSGVCGSNP--PTC-----NYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGCGRKNQ 184

Query: 240 ---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
                 +G+ G GR+  SL SQ+       FSYCL +    +A  S +LV+        +
Sbjct: 185 GLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTT---EAEASGSLVMGGNSSVYKN 241

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
            TP +SYT    NP+        FY++ L  I VG   V+ P       S G   +I+DS
Sbjct: 242 TTP-ISYTRMIHNPL------LPFYFLNLTGITVGGVEVQAP-------SFGKDRMIIDS 287

Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFK 413
           G+  + +   +++A+  EF++Q   Y  A        L  CF++SG + V +P++ + F+
Sbjct: 288 GTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMI---LDSCFNLSGYQEVKIPDIKMYFE 344

Query: 414 GGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           G A++ +     F  V  +   +CL +    A+ P        I+G++Q +N  + +D  
Sbjct: 345 GSAELNVDVTGVFYSVKTDASQVCLAI----ASLPY--EDEVGIIGNYQQKNQRIIYDTK 398

Query: 472 NDRFGFAKQKCA 483
               GFA++ C+
Sbjct: 399 GSMLGFAEEACS 410


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 121/436 (27%), Positives = 190/436 (43%), Gaps = 82/436 (18%)

Query: 72  TKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
           T+++ + + + + S    + + + LS+ S G Y I +S GTPP+     + DTGS ++W 
Sbjct: 27  TRSRSRDRQTKVPSQDFQAPVVSGLSLGS-GEYFIRISVGTPPR-RMYLVMDTGSDILWL 84

Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC-SWIFGPNVESRCKGCSP 190
            C     CV+C +   D      F P +SS+   +GC   +C +   G    ++C     
Sbjct: 85  QCAP---CVNC-YHQSDA----IFDPYKSSTYSTLGCSTRQCLNLDIGTCQANKCL---- 132

Query: 191 RNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
                      Y + YG G FT G   ++ +   S +         ++ ++ P G    G
Sbjct: 133 -----------YQVDYGDGSFTTGEFGTDDVSLNSTS-----GVGQVVLNKIPLGC---G 173

Query: 250 RSSE-----------------SLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPG 289
             +E                 S P+Q+  +   +FSYCL  R+ D    SS LV      
Sbjct: 174 HDNEGYFVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSS-LVF----- 227

Query: 290 SGDSKTP--GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
            G++  P  G  +TP       S+     FYY+ +  I VG   + IP S     S GNG
Sbjct: 228 -GEAAVPPAGARFTP-----QDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNG 281

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           GVI+DSG++ T ++   + ++   F       S  A     S    C+D+SG  SV +P 
Sbjct: 282 GVIIDSGTSVTRLQNAAYASLRDAF---RAGTSDLAPTAGFSLFDTCYDLSGLASVDVPT 338

Query: 408 LILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYL 466
           + L F+GG  + LP  NY   V N    CL       AG     GP+II G+ Q Q F +
Sbjct: 339 VTLHFQGGTDLKLPASNYLIPVDNSNTFCLAF-----AGTT---GPSII-GNIQQQGFRV 389

Query: 467 EFDLANDRFGFAKQKC 482
            +D  +++ GF   +C
Sbjct: 390 IYDNLHNQVGFVPSQC 405


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 107/375 (28%), Positives = 161/375 (42%), Gaps = 43/375 (11%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF--IPKRSSSSQLIGCQNPKCSWIFG 178
           I DT S L W  C     C D   P  DPS  P++  +P  SSS     C   + +    
Sbjct: 167 IVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSS-----CDALQLATGGT 221

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
               + C+G       C     SY L Y  G ++ G+L  + L    + +  F+ GC   
Sbjct: 222 SGGAAACQGQDQSAAAC-----SYTLSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTS 276

Query: 238 SDRQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
           +   P    +G+ G GRS  SL SQ   +    FSYCL  ++ D    S +LV+      
Sbjct: 277 NQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKESDS---SGSLVI------ 327

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
           GD  +   + TP     + S    G FY+V L  I VG + V+        G       I
Sbjct: 328 GDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGK---AI 384

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
           +DSG+  T +   ++ AV  EF+ Q   Y +A      S L  CF+++G + V +P L L
Sbjct: 385 IDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGF---SILDTCFNMTGLREVQVPSLKL 441

Query: 411 KFKGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
            F GG ++ +      YF    +  +CL      A  P        I+G++Q +N  + F
Sbjct: 442 VFDGGVEVEVDSGGVLYFVSSDSSQVCL------AMAPLKSEYETNIIGNYQQKNLRVIF 495

Query: 469 DLANDRFGFAKQKCA 483
           D +  + GFA++ C 
Sbjct: 496 DTSGSQVGFAQETCG 510


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  112 bits (280), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 137/469 (29%), Positives = 191/469 (40%), Gaps = 58/469 (12%)

Query: 35  PLTPLSTKHYLHHSDS-DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIK 93
           P T L     L HSDS   L ILH L    + R        K K   S+     S+  I+
Sbjct: 17  PKTQLQRLKELVHSDSVRQLMILHKLRGGQIPR-------RKAKEVLSSSSGRGSDDAIE 69

Query: 94  TPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
            P+   + +  G Y ++   GTP Q     + DTGS L W  C  +Y C   N  N    
Sbjct: 70  VPMHPAADYGIGQYFVAFKVGTPSQKFM-LVADTGSDLTWMSC--KYHCRSRNCSNRKAR 126

Query: 151 RIP---AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
           RI     F    SSS + I C    C       +E      S  N   PL    Y  +Y 
Sbjct: 127 RIRHKRVFHANLSSSFKTIPCLTDMC------KIE-LMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 208 LGFTA-GLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPS 257
            G TA G   +ET+    K      + N L GCS      S +   G+ G G S  S   
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 258 QLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           +   K   KFSYCL+        VS+ L      GS  SK   L+   + +  +G  ++F
Sbjct: 240 KAAEKFGGKFSYCLVDH-LSHKNVSNYLTF----GSSRSKEALLNNMTYTELVLGMVNSF 294

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
              Y V +  I +G   +KIP    V    G GG I+DSGS+ TF+  P ++ V      
Sbjct: 295 ---YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRV 349

Query: 375 QMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
            +  + +   VE   G L  CF+ +G +   +P L+  F  GA+   P ++Y     + V
Sbjct: 350 SLLKFRK---VEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGV 406

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            CL   +    G +       ++G+   QN   EFDL   + GFA   C
Sbjct: 407 RCLGFVSVAWPGTS-------VVGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 162/389 (41%), Gaps = 55/389 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP+ S   + D+GS +VW  C    +C   + P  DP+   +F     S
Sbjct: 138 GEYFVRIGVGSPPR-SQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 196

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           SS     +N  C          RC+               Y + YG G +T G L  ETL
Sbjct: 197 SSVCDRLENAGCH-------AGRCR---------------YEVSYGDGSYTKGTLALETL 234

Query: 221 RFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F    V +   GC   +       AG+ G G  S S   QLG +    FSYCL+SR  D
Sbjct: 235 TFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTD 294

Query: 275 DAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
               S +LV       G    P G ++ P  +NP   S     FYY+GL  + VG   V 
Sbjct: 295 S---SGSLVF------GREALPAGAAWVPLVRNPRAPS-----FYYIGLAGLGVGGIRVP 340

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           I          G+GGV++D+G+  T +    ++A    F+ Q  N  RA  V        
Sbjct: 341 ISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAI---FDT 397

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C+D+ G  SV +P +   F GG  + LP  N+   + +       F  + +G +      
Sbjct: 398 CYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLS------ 451

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ILG+ Q +   + FD AN   GF    C
Sbjct: 452 -ILGNIQQEGIQISFDGANGYVGFGPNIC 479


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 115/397 (28%), Positives = 166/397 (41%), Gaps = 91/397 (22%)

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           +S G Y+++LS GTPP  +   + DTGSSL+W  C     C +C      P+  P F P 
Sbjct: 85  NSAGAYNMNLSIGTPP-VTFSVLADTGSSLIWTQCAP---CTECA---ARPA--PPFQPA 135

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            SS+   + C +  C ++  P       GC             Y   YG+GFTAG L +E
Sbjct: 136 SSSTFSKLPCASSLCQFLTSPYRTCNATGCV------------YYYPYGMGFTAGYLATE 183

Query: 219 TLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR-KFDD 275
           TL     + P    GCS  +      +GI G GRS  SL SQ+G+ +FSYCL S     D
Sbjct: 184 TLHVGGASFPGVTFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNADAGD 243

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           +P+    +     G+  S       TP  +NP   SS+   +YYV L  I VG+  + + 
Sbjct: 244 SPILFGSLAKVTGGNVQS-------TPLLENPEMPSSS---YYYVNLTGITVGATDLPMA 293

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            + L                  T + G  F                        G   CF
Sbjct: 294 MANL------------------TTVNGTRF------------------------GFDLCF 311

Query: 396 D---ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGP 446
           D     G   V +P L+L+F GGA+ A+   +YF +V  +      V CL++       P
Sbjct: 312 DATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVL------P 365

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A  +    I+G+    + ++ +DL    F FA   CA
Sbjct: 366 ASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 402


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 119/410 (29%), Positives = 170/410 (41%), Gaps = 73/410 (17%)

Query: 89  NSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
           N+  + PL +   GG Y +  S GTPPQ  T  + DTGS L+W  C        C   + 
Sbjct: 75  NNTQRIPLRMDDSGGAYDMEFSMGTPPQKLTA-LADTGSDLIWAKCGGA-----CT-TSC 127

Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
           +P   P+++P  SS+   + C +  CS +   +V      C+     C      Y   YG
Sbjct: 128 EPQGSPSYLPNASSTFAKLPCSDRLCSLLRSDSV----AWCAAAGAEC-----DYRYSYG 178

Query: 208 LG-----FTAGLLLSETLRFPSKTVPNFLAGCSI---LSDRQPAGIAGFGRSSESLPSQL 259
           LG     +T G L  ET    +  VP+   GC+          +G+ G GR   SL SQL
Sbjct: 179 LGDDDHHYTQGFLARETFTLGADAVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQL 238

Query: 260 GLKKFSYCLLSRKFDDAPVSSNLV---LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
               F YCL S    DA  +S L+   L +  G+    T  L+ T FY            
Sbjct: 239 NASTFMYCLTS----DASKASPLLFGSLASLTGAQVQSTGLLASTTFYA----------- 283

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
              V LR I +GS           PG     GV+ DSG+T T++  P +      F+ Q 
Sbjct: 284 ---VNLRSISIGSA--------TTPGVGEPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQ- 331

Query: 377 GNYSRAADVEKKSGLRPCFD--ISGKKS-VYLPELILKFKGGAKMALPPENYFALVGNEV 433
              +    VE   G   CF    +G+ S   +P ++L F  GA MALP  NY   V + V
Sbjct: 332 ---TSLDQVEDTDGFEACFQKPANGRLSNAAVPTMVLHFD-GADMALPVANYVVEVEDGV 387

Query: 434 LCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +C I          + R P++ I+G+    N+ +  D+      F    C
Sbjct: 388 VCWI----------VQRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 113/396 (28%), Positives = 166/396 (41%), Gaps = 49/396 (12%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           ++SL+ GTPPQ  T  + DTGS L W         + CN      S    F P  SSS  
Sbjct: 74  TVSLTVGTPPQNVT-MVIDTGSELSW---------LHCNTSQNSSSSSSTFNPVWSSSYS 123

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
            I C +  C+       ++R     P   +    C + L       + G L ++T    S
Sbjct: 124 PIPCSSSTCT------DQTRDFPIRPSCDSNQF-CHATLSYADASSSEGNLATDTFYIGS 176

Query: 225 KTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
             +PN + GC  SI S     D +  G+ G  R S S  SQ+G  KFSYC+    F    
Sbjct: 177 SGIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDF---- 232

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
             S L+L  G  +     P L+YTP  +            Y V L  I V  K + IP S
Sbjct: 233 --SGLLL-LGDANFSWLAP-LNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPES 288

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPC 394
              P   G G  +VDSG+ FTF+ GP + A+   F+ +     R  +      +  +  C
Sbjct: 289 VFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLC 348

Query: 395 FDISGKKSVY--LPELILKFKGGAKMALPPENYFALV-----GNE-VLCLILFTDNAAGP 446
           + +   ++    LP + L F+ GA+M +  +     V     GN+ + C      +  G 
Sbjct: 349 YRVPTNQTRLPPLPSVTLVFR-GAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGV 407

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 A ++G    QN ++EFDL   R G A+ +C
Sbjct: 408 E-----AFVIGHLHQQNVWMEFDLKKSRIGLAEIRC 438


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 126/438 (28%), Positives = 189/438 (43%), Gaps = 62/438 (14%)

Query: 58  SLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAS 117
           S  +S LSR R+    + P T  SNI   YS +LI             +SL  GTP Q S
Sbjct: 50  SFKTSLLSR-RNPSPPSSPYTFRSNI--KYSMALI-------------LSLPIGTPSQ-S 92

Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
              + DTGS L W  C  +        P        +F P  SSS   + C +P C    
Sbjct: 93  QELVLDTGSQLSWIQCHPKKIKKPLPPPTT------SFDPSLSSSFSDLPCSHPLCK--- 143

Query: 178 GPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLSETLRFP-SKTVP 228
                       PR  + T P +C S     Y   Y  G F  G L+ E   F  S+T P
Sbjct: 144 ------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTP 191

Query: 229 NFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS-NLVLDTG 287
             + GC+  S  +  GI G      S  SQ  + KFSYC+ +R       S+ +  L   
Sbjct: 192 PLILGCAKESTDE-KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDN 250

Query: 288 PGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
           P S G      L++    + P     A    Y V L+ I +G K + IP S   P + G+
Sbjct: 251 PNSRGFKYVSLLTFPQSQRMPNLDPLA----YTVPLQGIRIGQKRLNIPGSVFRPDAGGS 306

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV--Y 404
           G  +VDSGS FT +    ++ V +E +R +G+  +   V   +    CFD +    +   
Sbjct: 307 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA-DMCFDGNHSMEIGRL 365

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           + +L+ +F  G ++ +  ++    VG  + C+ +   +  G A     + I+G+   QN 
Sbjct: 366 IGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAA-----SNIIGNVHQQNL 420

Query: 465 YLEFDLANDRFGFAKQKC 482
           ++EFD+ N R GF+K +C
Sbjct: 421 WVEFDVTNRRVGFSKAEC 438


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 114/394 (28%), Positives = 168/394 (42%), Gaps = 56/394 (14%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           ISL  GTPPQA    + DTGS L W         + C+   + P    +F P  SSS   
Sbjct: 74  ISLPIGTPPQAQQ-MVLDTGSQLSW---------IQCHRKKLPPKPKTSFDPSLSSSFST 123

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLS 217
           + C +P C                PR  + T P +C S     Y   Y  G F  G L+ 
Sbjct: 124 LPCSHPLCK---------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVK 168

Query: 218 ETLRFPSKTV-PNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCL--LSRK 272
           E + F +  + P  + GC+  S  DR   GI G  R   S  SQ  + KFSYC+   S +
Sbjct: 169 EKITFSNTEITPPLILGCATESSDDR---GILGMNRGRLSFVSQAKISKFSYCIPPKSNR 225

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
               P  S  + D     G      L++    + P     A    Y V +  I  G K +
Sbjct: 226 PGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA----YTVPMIGIRFGLKKL 281

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            I  S   P + G+G  +VDSGS FT +    ++ V  E + ++G   +   V   +   
Sbjct: 282 NISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM 341

Query: 393 PCFDISGKKSV---YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
            CFD  G  ++    + +L+  F  G ++ +P E     VG  + C+ +   +  G A  
Sbjct: 342 -CFD--GNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAA-- 396

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              + I+G+   QN ++EFD+ N R GFAK  C+
Sbjct: 397 ---SNIIGNVHQQNLWVEFDVTNRRVGFAKADCS 427


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 167/378 (44%), Gaps = 61/378 (16%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I DTGS L W  C    RC +         + P F P  S S + + C +P C  +   +
Sbjct: 149 IVDTGSDLSWVQCQPCKRCYN--------QQDPVFNPSTSPSYRTVLCSSPTCQSL--QS 198

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSILS 238
                  C     +C     +Y++ YG G +T G L +E L   + T V NF+ GC    
Sbjct: 199 ATGNLGVCGSNPPSC-----NYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCG--R 251

Query: 239 DRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
           + Q      +G+ G GRSS SL SQ        FSYCL      +   S +LV+      
Sbjct: 252 NNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCL---PITETEASGSLVMGGNSSV 308

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
             + TP +SYT    NP         FY++ L  I VGS  V+ P       S G  G++
Sbjct: 309 YKNTTP-ISYTRMIPNPQL------PFYFLNLTGITVGSVAVQAP-------SFGKDGMM 354

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
           +DSG+  T +   +++A+  EF++Q   +  A        L  CF++SG + V +P + +
Sbjct: 355 IDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMI---LDTCFNLSGYQEVEIPNIKM 411

Query: 411 KFKGGAKMALPPENYFALVGNEV--LCLILFT---DNAAGPALGRGPAIILGDFQLQNFY 465
            F+G A++ +     F  V  +   +CL + +   +N  G         I+G++Q +N  
Sbjct: 412 HFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVG---------IIGNYQQKNQR 462

Query: 466 LEFDLANDRFGFAKQKCA 483
           + +D      GFA + C 
Sbjct: 463 VIYDTKGSMLGFAAEACT 480


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 124/487 (25%), Positives = 191/487 (39%), Gaps = 75/487 (15%)

Query: 14  LLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH---- 69
           +L LLFT   G  +          P     + H  D   L++ H  +  S  R       
Sbjct: 7   VLFLLFTIAKGLHN----------PKCDATHQHDHDGSTLQVFHVFSPCSPFRPSKPMSW 56

Query: 70  ----LKTKTKPKTKD---SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIF 122
               LK + K + +    S++ +  S   I +   +     Y +    GTP Q +     
Sbjct: 57  EESVLKLQAKDQARMQYLSSLVARRSIVPIASGRQITQSPTYIVKAKIGTPAQ-TLLLAM 115

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           DT +   W PCT+   CV C       S    F P +S++ + +GC   +C  +      
Sbjct: 116 DTSNDASWVPCTA---CVGC-------STTTPFAPAKSTTFKKVGCGASQCKQV------ 159

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC------SI 236
                   RN TC  +  ++   YG    A  L+ +T+   +  VP +  GC      S 
Sbjct: 160 --------RNPTCDGSACAFNFTYGTSSVAASLVQDTVTLATDPVPAYAFGCIQKVTGSS 211

Query: 237 LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
           +  +   G+     S  +   +L    FSYCL S  F     S +L L  GP +   +  
Sbjct: 212 VPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPS--FKTLNFSGSLRL--GPVAQPKR-- 265

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
            + +TP  KNP  SS      YYV L  I VG + V IP   L   ++   G + DSG+ 
Sbjct: 266 -IKFTPLLKNPRRSS-----LYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTV 319

Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
           FT +  P + AV  EF R++  + +   V    G   C+       +  P +   F  G 
Sbjct: 320 FTRLVEPAYNAVRNEFRRRIAVHKKLT-VTSLGGFDTCY----TAPIVAPTITFMF-SGM 373

Query: 417 KMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
            + LPP+N         V CL +    A  P        ++ + Q QN  + FD+ N R 
Sbjct: 374 NVTLPPDNILIHSTAGSVTCLAM----APAPDNVNSVLNVIANMQQQNHRVLFDVPNSRL 429

Query: 476 GFAKQKC 482
           G A++ C
Sbjct: 430 GVARELC 436


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 127/457 (27%), Positives = 185/457 (40%), Gaps = 63/457 (13%)

Query: 38  PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK-TKPKTKDSNIGSNYSNSLIKTPL 96
           P ST   L H D+    +   LA++S + +R   T   KPK      G    +SL   PL
Sbjct: 66  PFST--VLTHDDARAAHLASRLATTSNAPSRRPTTSLRKPKAAAGASGGPLDDSLASVPL 123

Query: 97  SVHS---YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP 153
           +  +    G Y   L  GTP   S   + DTGSSL W  C+    CV      V P    
Sbjct: 124 TPGTSVGVGNYVTELGLGTP-ATSYAMVVDTGSSLTWLQCSP---CVVSCHRQVGP---- 175

Query: 154 AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTA 212
            + P+ SS+   + C   +C  +    +      CS RN         Y   YG   F+ 
Sbjct: 176 LYDPRASSTYATVPCSASQCDELQAATLNP--SACSVRNVCI------YQASYGDSSFSV 227

Query: 213 GLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSY 266
           G L  +T+ F S + PNF  GC   ++    + AG+ G  R+  SL  QL       FSY
Sbjct: 228 GYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           CL        P S+   L  GP +        SYTP     + SSS     Y+V L  + 
Sbjct: 288 CL------PTPASTGY-LSIGPYTSGH----YSYTP-----MASSSLDASLYFVTLSGMS 331

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           VG   + +      P    +   I+DSG+  T +   ++ A++K     M     A    
Sbjct: 332 VGGSPLAVS-----PAEYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSA---P 383

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
             S L  CF     + + +P + + F GGA + L  +N    V +   CL     ++   
Sbjct: 384 AFSILDTCFQGQASQ-LRVPAVAMAFAGGATLKLATQNVLIDVDDSTTCLAFAPTDST-- 440

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   I+G+ Q Q F + +D+A  R GFA   C+
Sbjct: 441 -------TIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 112/386 (29%), Positives = 158/386 (40%), Gaps = 56/386 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +   FGTPPQ +     DT S   W PC+    CV C       S    F P +S+S 
Sbjct: 97  YIVKAKFGTPPQ-TLLLALDTSSDAAWIPCSG---CVGC-------STSKPFAPIKSTSF 145

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C +P C  +  P     C G          AC ++   YG    A  ++ +TL   
Sbjct: 146 RNVSCGSPHCKQVPNPT----CGGS---------AC-AFNFTYGSSSIAASVVQDTLTLA 191

Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
           +  +P +  GC      S    +   G+     S  S    L    FSYCL S  F    
Sbjct: 192 ADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSIN 249

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP     +   + YTP  +NP  SS      YYV L  I VG K V IP +
Sbjct: 250 FSGSLRL--GPVYQPKR---IKYTPLLRNPRRSS-----LYYVNLVAIKVGRKIVDIPPA 299

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P++ AV  EF R++G       V    G   C+++
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG---PKLPVTTLGGFDTCYNV 356

Query: 398 SGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
                + +P +   F  G  +ALPP+N           CL +    A  P        ++
Sbjct: 357 ----PIVVPTITFLF-SGMNVALPPDNIVIHSTAGSTTCLAM----AGAPDNVNSVLNVI 407

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
            + Q QN  + FD+ N R G A++ C
Sbjct: 408 ANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 133/474 (28%), Positives = 206/474 (43%), Gaps = 75/474 (15%)

Query: 29  AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYS 88
           ++T TVP  P + + YL H           L ++  +RA  L+ + KPK   S   +  S
Sbjct: 114 SSTATVPDHPAARERYLKH-----------LLAADSARAASLQLR-KPKPASSTTTTQAS 161

Query: 89  NSLIKTPLSV---HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
            +  + PL     +    Y  +++ G     +   I DTGS L W       +C  C   
Sbjct: 162 AAAAEVPLGSGIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWV------QCEPCPGS 215

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-----GPNVESRCKGCSPRNKTCPLACP 200
           +    R P F P  S +   + C +P C+         P   +R  G S   + C     
Sbjct: 216 SCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNS--EQRC----- 268

Query: 201 SYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSILSDRQ----PAGIAGFGRSSES 254
            Y L YG G F+ G+L  +TL   + T +  F+ GC  LS+R      AG+ G GR+  S
Sbjct: 269 YYALSYGDGSFSRGVLAQDTLGLGTTTKLDGFVFGCG-LSNRGLFGGTAGLMGLGRTDLS 327

Query: 255 LPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
           L SQ   +    FSYCL       A  +S   L  GPG   S  P ++YT    +P    
Sbjct: 328 LVSQTAARFGGVFSYCL------PATTTSTGSLSLGPGP-SSSFPNMAYTRMIADPTQP- 379

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                FY++ +      +       + L     G G V+VDSG+  T +   +++AV  E
Sbjct: 380 ----PFYFINI------TGAAVGGGAALTAPGFGAGNVLVDSGTVITRLAPSVYKAVRAE 429

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-- 429
           F R+   Y  A      S L  C+D++G+  V +P L L  +GGA++ +       +V  
Sbjct: 430 FARRF-EYPAAPGF---SILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRK 485

Query: 430 -GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G++V CL +    A+ P   + P  I+G++Q +N  + +D    R GFA + C
Sbjct: 486 DGSQV-CLAM----ASLPYEDQTP--IIGNYQQRNKRVVYDTVGSRLGFADEDC 532


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 121/413 (29%), Positives = 174/413 (42%), Gaps = 95/413 (23%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA----FIPKR 159
           Y ++++ G+PP+ S   I DTGS LVW         V C   N D S   A    F P R
Sbjct: 101 YLMTVNLGSPPR-SMLAIADTGSDLVW---------VKCKKGNNDTSSAAAPTTQFDPSR 150

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
           SS+   + CQ   C        E+  +        C     +YL  YG G  T G+L +E
Sbjct: 151 SSTYGRVSCQTDAC--------EALGRATCDDGSNC-----AYLYAYGDGSNTTGVLSTE 197

Query: 219 TLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSE------------------SLPSQL 259
           T  F          G S  S RQ   G   FG S+                   SL +QL
Sbjct: 198 TFTFDD--------GGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQL 249

Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           G      ++FSYCL+        V+++  L+ G    D   PG + TP     V +    
Sbjct: 250 GGATSLGRRFSYCLVPHS-----VNASSALNFG-ALADVTEPGAASTPLVAGDVDT---- 299

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
             +Y V L  + VG+K V          S  +  +IVDSG+T TF++  L   +  E  R
Sbjct: 300 --YYTVVLDSVKVGNKTVA---------SAASSRIIVDSGTTLTFLDPSLLGPIVDELSR 348

Query: 375 QMGNYSRAADVEKKSGL-RPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVG 430
           ++        V+   GL + C++++G++      +P+L L+F GGA +AL PEN F  V 
Sbjct: 349 RI----TLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ 404

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              LCL +           + P  ILG+   QN ++ +DL      FA   CA
Sbjct: 405 EGTLCLAIVATTE------QQPVSILGNLAQQNIHVGYDLDAGTVTFAGADCA 451


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  111 bits (278), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 114/394 (28%), Positives = 168/394 (42%), Gaps = 56/394 (14%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           ISL  GTPPQA    + DTGS L W         + C+   + P    +F P  SSS   
Sbjct: 74  ISLPIGTPPQAQQ-MVLDTGSQLSW---------IQCHRKKLPPKPKTSFDPSLSSSFST 123

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLS 217
           + C +P C                PR  + T P +C S     Y   Y  G F  G L+ 
Sbjct: 124 LPCSHPLCK---------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVK 168

Query: 218 ETLRFPSKTV-PNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCL--LSRK 272
           E + F +  + P  + GC+  S  DR   GI G  R   S  SQ  + KFSYC+   S +
Sbjct: 169 EKITFSNTEITPPLILGCATESSDDR---GILGMNRGRLSFVSQAKISKFSYCIPPKSNR 225

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
               P  S  + D     G      L++    + P     A    Y V +  I  G K +
Sbjct: 226 PGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA----YTVPMIGIRFGLKKL 281

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            I  S   P + G+G  +VDSGS FT +    ++ V  E + ++G   +   V   +   
Sbjct: 282 NISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM 341

Query: 393 PCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
            CFD  G  ++    + +L+  F  G ++ +P E     VG  + C+ +   +  G A  
Sbjct: 342 -CFD--GNVAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGAA-- 396

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              + I+G+   QN ++EFD+ N R GFAK  C+
Sbjct: 397 ---SNIIGNVHQQNLWVEFDVTNRRVGFAKADCS 427


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 123/407 (30%), Positives = 173/407 (42%), Gaps = 48/407 (11%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L  H     ++SL+ GTPPQ  T  + DTGS L W  C +  +               +F
Sbjct: 55  LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCATGRQGSAAAGAAAAMGE--SF 111

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GL 214
            P+ S++   + C + +CS    P   S C G S   + C ++     L Y  G  + G 
Sbjct: 112 RPRASATFAAVPCGSTQCSSRDLPAPPS-CDGAS---RQCHVS-----LSYADGSASDGA 162

Query: 215 LLSETLRFPSKTVPNFLAGC-SILSDRQPAGIA-----GFGRSSESLPSQLGLKKFSYCL 268
           L ++              GC S   D  P G+A     G  R + S  +Q   ++FSYC+
Sbjct: 163 LATDVFAVGEAPPLRSAFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCI 222

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQIIV 327
             R  DDA V   L+L    G  D     L+YTP Y+ P      F    Y V L  I V
Sbjct: 223 SDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PTLPLPYFDRVAYSVQLLGIRV 272

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G K + IP S L P   G G  +VDSG+ FTF+ G  + A+  EF++Q     RA D   
Sbjct: 273 GGKALPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPS 332

Query: 388 ---KSGLRPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNE------VLC 435
              +  L  CF +   +   S  LP + L F  GA+M++  +     V  E      V C
Sbjct: 333 FAFQEALDTCFRVPAGRPPPSARLPPVTLLFN-GAEMSVAGDRLLYKVPGEHRGADGVWC 391

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           L  F +    P      A ++G     N ++E+DL   R G A  KC
Sbjct: 392 LT-FGNADMVPLT----AYVIGHHHQMNLWVEYDLERGRVGLAPVKC 433


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 121/414 (29%), Positives = 178/414 (42%), Gaps = 81/414 (19%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +++  GTPP      I DTGS LVW  C  +    D +  +  P  +  F+P  SS+ 
Sbjct: 110 YLMAIEVGTPP-VRVLAIADTGSDLVWVKCKGK----DNDNNSTAPPSV-YFVPSASSTY 163

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
             +GC    C  +      S    CSP + +C      YL  YG G  A G L +ET  F
Sbjct: 164 GRVGCDTKACRAL------SSAASCSP-DGSC-----EYLYSYGDGSRASGQLSTETFTF 211

Query: 223 P-----SKT-----------------VPNFLAGCSILSDR--QPAGIAGFGRSSESLPSQ 258
                 SKT                 +     GCS  +    +  G+ G G    SL SQ
Sbjct: 212 STIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQ 271

Query: 259 LGL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
           LG      +KFSYCL       A  +++  L+ G  +  S+ PG + TP     V +   
Sbjct: 272 LGATTSLGRKFSYCLAPY----ANTNASSALNFGSRAVVSE-PGAASTPLITGEVET--- 323

Query: 314 FGEFYYVGLRQI-IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
              +Y + L  I + G+K          P +     +IVDSG+T T+++  L   + K+ 
Sbjct: 324 ---YYTIALDSINVAGTKR---------PTTAAQAHIIVDSGTTLTYLDSALLTPLVKDL 371

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISG---KKSVYLPELILKFKGGAKMALPPENYFALV 429
            R++    RA   EK   L  C+DISG   + ++ +P++ L   GG ++ L P+N F +V
Sbjct: 372 TRRI-KLPRAESPEKI--LDLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVV 428

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              VLCL L   +       R    ILG+   QN ++ +DL      FA   CA
Sbjct: 429 QEGVLCLALVATSE------RQSVSILGNIAQQNLHVGYDLEKGTVTFAAADCA 476


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 113/388 (29%), Positives = 157/388 (40%), Gaps = 58/388 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP      + D+GS ++W  C    +C    +   DP     F P  SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +  C       +     G       C      Y + YG G +T G L  ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
                 V     GC   +       AG+ G G  + SL  QLG      FSYCL SR   
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR--- 285

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                         G+G + +  L  T        +SS    FYYVGL  I VG + + +
Sbjct: 286 --------------GAGGAGSLVLGRTEAVPRGRRASS----FYYVGLTGIGVGGERLPL 327

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
             S      DG GGV++D+G+  T +    + A+   F   MG   R+  V   S L  C
Sbjct: 328 QDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 384

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +D+SG  SV +P +   F  GA + LP  N    VG  V CL  F  +++G +       
Sbjct: 385 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 436

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ILG+ Q +   +  D AN   GF    C
Sbjct: 437 ILGNIQQEGIQITVDSANGYVGFGPNTC 464


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 158/383 (41%), Gaps = 46/383 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S S GTPPQ  T  + D  S  VW  C++   C  C       +  P F    SS
Sbjct: 95  GMYVLSFSVGTPPQVVTG-VLDITSDFVWMQCSA---CATCGADAPAATSAPPFYAFLSS 150

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSE 218
           + + + C N  C  +  P        CS  +  C      Y   YG G    TAGLL  +
Sbjct: 151 TIREVRCANRGCQRLV-PQT------CSADDSPC-----GYSYVYGGGAANTTAGLLAVD 198

Query: 219 TLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
              F +      + GC++ ++    G+ G GR   SL SQL + +FSY L     DDA  
Sbjct: 199 AFAFATVRADGVIFGCAVATEGDIGGVIGLGRGELSLVSQLQIGRFSYYLAP---DDAVD 255

Query: 279 SSNLVL---DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
             + +L   D  P     +T     TP   N    S      YYV L  I V  + + IP
Sbjct: 256 VGSFILFLDDAKP-----RTSRAVSTPLVANRASRS-----LYYVELAGIRVDGEDLAIP 305

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
                  +DG+GGV++      TF++   ++ V +    ++G   RAAD   + GL  C+
Sbjct: 306 RGTFDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIG--LRAAD-GSELGLDLCY 362

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAI 454
                 +  +P + L F GGA M L   NYF +     L CL +    A       G   
Sbjct: 363 TSESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPA-------GDGS 415

Query: 455 ILGDFQLQNFYLEFDLANDRFGF 477
           +LG       ++ +D++  R  F
Sbjct: 416 LLGSLIQVGTHMIYDISGSRLVF 438


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/392 (27%), Positives = 164/392 (41%), Gaps = 51/392 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP +     IFDTGS L W  C     CV   +      + P F P  S 
Sbjct: 152 GNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQP---CVKSCYAQ----QQPIFDPSTSK 203

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   I C +  CS +   +      GCS  N         Y +QYG   FT G    + L
Sbjct: 204 TYSNISCTSAACSSL--KSATGNSPGCSSSNCV-------YGIQYGDSSFTIGFFAKDKL 254

Query: 221 RFPSKTV-PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
                 V   F+ GC   +     + AG+ G GR   S+  Q   K    FSYCL + + 
Sbjct: 255 TLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRG 314

Query: 274 DDAPVSSNLVLDTGPGSGDSKT--PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
            +     +L    G G   SK    G+++TPF      +SS    +Y++ +  I VG K 
Sbjct: 315 SNG----HLTFGNGNGVKASKAVKNGITFTPF------ASSQGTAYYFIDVLGISVGGKA 364

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + I      P    N G I+DSG+  T +    + ++   F + M  Y  A  +   S L
Sbjct: 365 LSIS-----PMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPAL---SLL 416

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D+S   S+ +P++   F G A + L P       G   +CL  F  N    ++G  
Sbjct: 417 DTCYDLSNYTSISIPKISFNFNGNANVELDPNGILITNGASQVCLA-FAGNGDDDSIG-- 473

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              I G+ Q Q   + +D+A  + GF  + C+
Sbjct: 474 ---IFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 121/458 (26%), Positives = 184/458 (40%), Gaps = 94/458 (20%)

Query: 54  KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY---------SNSLIKTPLSVHSYGGY 104
           ++LH + +   +RAR L ++       +   S++          + L +TP+S  + G Y
Sbjct: 65  ELLHEVVTHDFARARALASRLVSSNSPNRSSSDHRHLAEEEEVEHDLAQTPVSFTNGGVY 124

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
             S++ G+PP+  +  + DTGS L W  C              DP               
Sbjct: 125 YSSITLGSPPKDFS-LVMDTGSDLTWVRC--------------DPC-------------- 155

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCP--LACPSYLLQYGLGFTAGLLLSETLRF 222
                +P CS  F      R    + +  TC   L  P  L  +   F +G  L +TL+ 
Sbjct: 156 -----SPDCSSTF-----DRLASNTYKALTCADDLRLPVLLRLWRRLFHSGRSLRDTLKM 205

Query: 223 PS------KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLS 270
                   +  P F+ GC  L         GI      S S PSQ+G K   KFSYCLL 
Sbjct: 206 AGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLR 265

Query: 271 RKFDDAPVSSNLVLDTG------PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           +   ++   S +V          PGSG  K   L YTP     +G SS +   Y V L  
Sbjct: 266 QTAQNSLKKSPMVFGEAAVELKEPGSG--KPQELQYTP-----IGESSIY---YTVRLDG 315

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I VG++ + +  S  + G D     I DSG+T T +   + +++ +     + +    A+
Sbjct: 316 ISVGNQRLDLSPSTFLNGQDKP--TIFDSGTTLTMLPSGVCDSIKQ----SLASMVSGAE 369

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
                GL  CF +       LP++   F GGA     P NY   +G+ + CLI    N  
Sbjct: 370 FVAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFVPTNEV 428

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                     I G+ Q Q+F++  D+ N R GF +  C
Sbjct: 429 S---------IFGNLQQQDFFVLHDMDNRRIGFKETDC 457


>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
 gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 102/404 (25%), Positives = 166/404 (41%), Gaps = 65/404 (16%)

Query: 102 GGYSISLSFGTP-PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
             Y I+   G   P+ +   + DTGS + W   T+   C                   RS
Sbjct: 108 ASYIITFYLGNQRPEDNISAVVDTGSDIFW---TTEKEC------------------SRS 146

Query: 161 SSSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLL 216
            +  ++ C +PKC          S  K  + +   C     +Y + YG      TAG++ 
Sbjct: 147 KTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKC-----TYAIIYGGNANDSTAGVMY 201

Query: 217 SETLRF---PSKTVPN------FLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGLK 262
            + L      SK VP+         GCS        D    G+ G GRS+ SLP QL   
Sbjct: 202 EDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFS 261

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           KFSYCL S +  D P  S L+L   P    +              +  +S +   Y+V L
Sbjct: 262 KFSYCLSSYQEPDLP--SYLLLTAAPDM--ATGAVGGGAAVATTALQPNSDYKTLYFVHL 317

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           + I +G         +    +   G + VD+G++FT +EG +F  +  E  R M      
Sbjct: 318 QNISIGGTR------FPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYV 371

Query: 383 ADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            +   ++  + C+     +  +S  LP+++L F   A M LP ++Y     ++ LCL ++
Sbjct: 372 KEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSK-LCLAIY 430

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             N       +G   +LG+FQ+QN ++  D  N++  F +  C+
Sbjct: 431 KSNI------KGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCS 468


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 115/398 (28%), Positives = 169/398 (42%), Gaps = 71/398 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS +VW  C     C +C +   DP     F P  S 
Sbjct: 6   GEYFTRIGIGTPTREQY-MVLDTGSDVVWIQCEP---CREC-YSQADP----IFNPSSSV 56

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   +GC +  CS +   +      GC             Y + YG G +T G   +ETL
Sbjct: 57  SFSTVGCDSAVCSQLDANDCHG--GGCL------------YEVSYGDGSYTVGSYATETL 102

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGL---KKFSYCLLS 270
            F + ++ N   GC         G+        G G  S S P+QLG    + FSYCL+ 
Sbjct: 103 TFGTTSIQNVAIGCG----HDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVD 158

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R  +     S+  L+ GP   +S   G  +TP   NP         FYY+ +  I VG  
Sbjct: 159 RDSE-----SSGTLEFGP---ESVPIGSIFTPLVANPF-----LPTFYYLSMVAISVGGV 205

Query: 331 HVK-IP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
            +  +P  ++ +  + G GG+I+DSG+  T ++   ++A+   FI    +  RA  +   
Sbjct: 206 ILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGI--- 262

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           S    C+D+S  +SV +P +   F  GA   LP +N          CLI   D+      
Sbjct: 263 SIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKN----------CLIPM-DSMGTFCF 311

Query: 449 GRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              PA     I+G+ Q Q   + FD AN   GFA  +C
Sbjct: 312 AFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 112/378 (29%), Positives = 167/378 (44%), Gaps = 59/378 (15%)

Query: 115 QASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           Q + PF  + DTGS + W  C     C DC +   DP     F P+ SSS   + C++ +
Sbjct: 163 QPAKPFYMVLDTGSDINWLQCQP---CTDC-YQQTDP----IFDPRSSSSFASLPCESQQ 214

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFL 231
           C  +          GC  R   C      Y + YG G FT G  ++ETL F +  + N +
Sbjct: 215 CQAL-------ETSGC--RASKCL-----YQVSYGDGSFTVGEFVTETLTFGNSGMINDV 260

Query: 232 A-GCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
           A GC   ++      AG+ G G    SL SQ+    FSYCL+ R    +  SS+L  ++ 
Sbjct: 261 AVGCGHDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCLVDRD---SSSSSDLEFNSA 317

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
             S     P L            S     FYYVGL  + VG + + IP +       G G
Sbjct: 318 APSDSVNAPLLK-----------SGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVY 404
           G+IVDSG+  T ++   +  +   F+      SR   ++K +G      C+D+S +  V 
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLRDAFV------SRTPYLKKTNGFALFDTCYDLSSQSRVT 420

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           +P +  +F GG  + LPP+NY   V +       F    +  +       I+G+ Q Q  
Sbjct: 421 IPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS-------IIGNVQQQGT 473

Query: 465 YLEFDLANDRFGFAKQKC 482
            + +DLAN   GF+  KC
Sbjct: 474 RVHYDLANSVVGFSPHKC 491


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 170/391 (43%), Gaps = 46/391 (11%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           +SL  GTP Q S   + DTGS L W  C  +        P        +F P  SSS   
Sbjct: 83  LSLPIGTPSQ-SQELVLDTGSQLSWIQCHPKKIKKPLPPPTT------SFDPSLSSSFSD 135

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLS 217
           + C +P C                PR  + T P +C S     Y   Y  G F  G L+ 
Sbjct: 136 LPCSHPLCK---------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVK 180

Query: 218 ETLRFP-SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
           E   F  S+T P  + GC+  S     GI G      S  SQ  + KFSYC+ +R     
Sbjct: 181 EKFTFSNSQTTPPLILGCAKEST-DVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPG 239

Query: 277 PVSS-NLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
             S+ +  L   P S G      L++    + P     A    Y V L  I +G K + I
Sbjct: 240 LASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLA----YTVPLLGIRIGQKRLNI 295

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P S   P + G+G  +VDSGS FT +    ++ V +E +R +G+  +   V   +    C
Sbjct: 296 PSSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA-DMC 354

Query: 395 FDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
           FD + +  +   + +L+ +F  G ++ +  +     VG  + C+ +   +  G A     
Sbjct: 355 FDGNHQMVIGRLIGDLVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAA----- 409

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           + I+G+   QN ++EFD+AN R GF+K +C+
Sbjct: 410 SNIIGNVHQQNLWVEFDVANRRVGFSKAECS 440


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 170/398 (42%), Gaps = 63/398 (15%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           +SL  GTPPQ S   I DTGS L W  C  +        P   P     F P  SSS  +
Sbjct: 79  VSLPIGTPPQ-SQQMILDTGSQLSWIQCHKK-------VPRKPPPST-VFDPSLSSSFSV 129

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACP-----SYLLQYGLGFTA-GLLLS 217
           + C +P C                PR  + T P +C       Y   Y  G  A G L+ 
Sbjct: 130 LPCNHPLCK---------------PRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVR 174

Query: 218 ETLRF-PSKTVPNFLAGCSI-LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
           E + F  S++ P  + GC+   SD +  GI G      S  SQ  + KFSYC+ +R+   
Sbjct: 175 EKITFSTSQSTPPLILGCAEDASDDK--GILGMNLGRLSFASQAKITKFSYCVPTRQVRP 232

Query: 276 A--PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
              P  S  + +    +G      L+++   + P     A    + V L+ I +G+K + 
Sbjct: 233 GFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLA----HTVALQGIRIGNKKLN 288

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-------NYSRAADVE 386
           IP S       G G  ++DSGS FT++    +  V +E +R  G        YS  +D+ 
Sbjct: 289 IPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDM- 347

Query: 387 KKSGLRPCFDISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
                  CFD +  +    +  ++ +F  G ++ +      A VG  V C+ +      G
Sbjct: 348 -------CFDGNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLG 400

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            A     + I+G+F  QN ++EFD+AN R GF K  C+
Sbjct: 401 AA-----SNIIGNFHQQNLWVEFDIANRRVGFGKADCS 433


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 117/399 (29%), Positives = 173/399 (43%), Gaps = 69/399 (17%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
           V + G Y +++  GTP +  T   FDTGS L W  C     C+   FP       P F P
Sbjct: 134 VPTGGAYVVTVGLGTPKKDFT-LSFDTGSDLTWTQCEP---CLGGCFPQ----NQPKFDP 185

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
             S+S + + C +  C  I         +G  P        C  Y +QYG G+T G L +
Sbjct: 186 TTSTSYKNVSCSSEFCKLI--------AEGNYPAQDCISNTC-LYGIQYGSGYTIGFLAT 236

Query: 218 ETLRFPSKTV-PNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK---FSYCLLS 270
           ETL   S  V  NFL GCS  S        G+ G GRS  +LPSQ   K    FSYCL  
Sbjct: 237 ETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCL-- 294

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKN-PVGSSSAFGEFYYVGLRQIIVGS 329
                 P S        P S    + G+  +   K+ P+  S    + Y  GL  + +  
Sbjct: 295 ------PAS--------PSSTGHLSFGVEVSQAAKSTPI--SPKLKQLY--GLNTVGISV 336

Query: 330 KHVKIPYSYLVPGSDGNGGV---IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +  ++P          NG +   I+DSG+TFTF+  P + A+   F   M NY+      
Sbjct: 337 RGRELPI---------NGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLT---N 384

Query: 387 KKSGLRPCFDIS--GKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNA 443
             S  +PC+D S  G  ++ +P + + F+GG ++ +        V G + +CL  F D  
Sbjct: 385 GTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLA-FADTG 443

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +          I G++Q + + + +D+A    GFA + C
Sbjct: 444 SDSDFA-----IFGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 121/408 (29%), Positives = 163/408 (39%), Gaps = 69/408 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y++ +  G+PP+     I DTGS LVW  C    +C   + P  DPS    F      
Sbjct: 2   GAYTMEIELGSPPKKFNA-IVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTF------ 54

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
                     K S            GCS   KTC      Y  QYG    T G    ETL
Sbjct: 55  ---------AKTSCSTSSCQSLPASGCSSSAKTCI-----YGYQYGDSSSTQGDFALETL 100

Query: 221 RF-----PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
                   SK  PNF  GC  L+       AGI G G+   SL +QLG     KFSYCL+
Sbjct: 101 TLRSSGGSSKAFPNFQFGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLV 160

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
               DD+  +S L+     GS  S   G   TP   N     S    +Y+VGL  I VG 
Sbjct: 161 DFD-DDSSKTSPLIF----GSSASTGSGAISTPIIPN-----SGRSTYYFVGLEGISVGG 210

Query: 330 KHVKIPYSYL------------VPGSDGN-GGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           K + +    +            V   + N GG I DSG+T T ++  ++  V   F   +
Sbjct: 211 KQLSLATRAIDFLSVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSV 270

Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV--GNEVL 434
              S        SG   C+D+S  K+   P L L FK G K + P +NYF +V     V 
Sbjct: 271 ---SLPTVDASSSGFDLCYDVSKSKNFKFPALTLAFK-GTKFSPPQKNYFVIVDTAETVA 326

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           CL +    +           I+G+   QN+++ +D        +  +C
Sbjct: 327 CLAMGGSGSL-------GLGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 112/385 (29%), Positives = 172/385 (44%), Gaps = 51/385 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G PP  +   + DTGS + W  C     C +C +   DP     F P  S+
Sbjct: 147 GEYFLRVGIGKPPSQAY-VVLDTGSDVSWIQCAP---CSEC-YQQSDP----IFDPISSN 197

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   I C  P+C  +      S C     RN TC      Y + YG G +T G   +ET+
Sbjct: 198 SYSPIRCDEPQCKSL----DLSEC-----RNGTC-----LYEVSYGDGSYTVGEFATETV 243

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
              S  V N   GC   ++      AG+ G G    S P+Q+    FSYCL++R   D+ 
Sbjct: 244 TLGSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNR---DSD 300

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
             S L  ++ P   ++ T      P  +NP         FYY+GL+ I VG + + IP S
Sbjct: 301 AVSTLEFNS-PLPRNAAT-----APLMRNP-----ELDTFYYLGLKGISVGGEALPIPES 349

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                + G GG+I+DSG+  T +   +++A+   F++      +A  V   S    C+D+
Sbjct: 350 SFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGV---SLFDTCYDL 406

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
           S ++SV +P +  +F  G ++ LP  NY   V +       F    +  +       I+G
Sbjct: 407 SSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLS-------IIG 459

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           + Q Q   + FD+AN   GF+   C
Sbjct: 460 NVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 110/397 (27%), Positives = 178/397 (44%), Gaps = 53/397 (13%)

Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
            GTPP+     + DT S L W   TS   C +C+     P+++P F P  SSS     C 
Sbjct: 5   IGTPPR-EVLLLVDTASELTWVQGTS---CTNCS-----PTKVPPFNPGLSSSFISEPCT 55

Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPS---- 224
           +  C    G +       C+    +C     S+ + Y  G  A G++  E     S    
Sbjct: 56  SSVC---LGRSKLGFQSACNRSTGSC-----SFQVAYLDGSEAYGVIAREIFSLQSWDGA 107

Query: 225 -KTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLK-------KFSYCLLSRK 272
             T+ + + GC+    ++P    +G  G  R S S P+Q+G +       +FSYC  +R 
Sbjct: 108 ASTLGDVIFGCASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRA 167

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
             +   SS +++      GDS  P     Y    + P  +S    +FYYVGL+ I VG +
Sbjct: 168 --EHLNSSGVII-----FGDSGIPAHHFQYLSLEQEPPIASIV--DFYYVGLQGISVGGE 218

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + IP S       GNGG   DSG+T +F+  P   A+ + F R++ + +R +  +    
Sbjct: 219 LLHIPRSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKE 278

Query: 391 LRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGP 446
           L  C+D++   +     P + L FK    M L   + +  +    +V+ + L   NA   
Sbjct: 279 L--CYDVAAGDARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAG-- 334

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A+ +G   ++G++Q Q++ +E DL   R GFA   C 
Sbjct: 335 AVAQGGVNVIGNYQQQDYLIEHDLERSRIGFAPANCV 371


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 111/386 (28%), Positives = 157/386 (40%), Gaps = 56/386 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +   FGTPPQ +     DT S   W PC+    CV C       S    F P +S+S 
Sbjct: 97  YIVKAKFGTPPQ-TLLLALDTSSDAAWIPCSG---CVGC-------STSKPFAPIKSTSF 145

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C +P C  +  P     C G          AC ++   YG    A  ++ +TL   
Sbjct: 146 RNVSCGSPHCKQVPNPT----CGGS---------AC-AFNFTYGSSSIAASVVQDTLTLA 191

Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
           +  +P +  GC      S    +   G+     S  S    L    FSYCL S  F    
Sbjct: 192 TDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSIN 249

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP     +   + YTP  +NP  SS      YYV L  I VG K V IP +
Sbjct: 250 FSGSLRL--GPVYQPKR---IKYTPLLRNPRRSS-----LYYVNLVAIKVGRKIVDIPPA 299

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P++ AV  EF R++G       V    G   C+++
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG---PKLPVTTLGGFDTCYNV 356

Query: 398 SGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
                + +P +   F  G  + LPP+N           CL +    A  P        ++
Sbjct: 357 ----PIVVPTITFLF-SGMNVTLPPDNIVIHSTAGSTTCLAM----AGAPDNVNSVLNVI 407

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
            + Q QN  + FD+ N R G A++ C
Sbjct: 408 ANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 110/410 (26%), Positives = 176/410 (42%), Gaps = 55/410 (13%)

Query: 82  NIGSNYSNSLIKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
           N  + Y    + TP+   +    G Y   +  GTP +     + DTGS + W  C     
Sbjct: 137 NEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAK-DMYLVLDTGSDVNWIQCEP--- 192

Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
           C DC +   DP     F P  SS+ + + C  P+CS +           C  R+  C   
Sbjct: 193 CADC-YQQSDP----VFNPTSSSTYKSLTCSAPQCSLL-------ETSAC--RSNKCL-- 236

Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSE 253
              Y + YG G FT G L ++T+ F  S  + N   GC   ++      AG+ G G    
Sbjct: 237 ---YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVL 293

Query: 254 SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSS 312
           S+ +Q+    FSYCL+ R   D+  SS+L  ++   G GD+  P L            + 
Sbjct: 294 SITNQMKATSFSYCLVDR---DSGKSSSLDFNSVQLGGGDATAPLLR-----------NK 339

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FYYVGL    VG + V +P +     + G+GGVI+D G+  T ++   + ++   F
Sbjct: 340 KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAF 399

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
           ++   N  + +     S    C+D S   +V +P +   F GG  + LP +NY   V + 
Sbjct: 400 LKLTVNLKKGS--SSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDS 457

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 F   ++  +       I+G+ Q Q   + +DL+ +  G +  KC
Sbjct: 458 GTFCFAFAPTSSSLS-------IIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 110/410 (26%), Positives = 176/410 (42%), Gaps = 55/410 (13%)

Query: 82  NIGSNYSNSLIKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
           N  + Y    + TP+   +    G Y   +  GTP +     + DTGS + W  C     
Sbjct: 137 NEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEP--- 192

Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
           C DC +   DP     F P  SS+ + + C  P+CS +           C  R+  C   
Sbjct: 193 CADC-YQQSDP----VFNPTSSSTYKSLTCSAPQCSLL-------ETSAC--RSNKCL-- 236

Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSE 253
              Y + YG G FT G L ++T+ F  S  + N   GC   ++      AG+ G G    
Sbjct: 237 ---YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVL 293

Query: 254 SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSS 312
           S+ +Q+    FSYCL+ R   D+  SS+L  ++   G GD+  P L            + 
Sbjct: 294 SITNQMKATSFSYCLVDR---DSGKSSSLDFNSVQLGGGDATAPLLR-----------NK 339

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FYYVGL    VG + V +P +     + G+GGVI+D G+  T ++   + ++   F
Sbjct: 340 KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAF 399

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
           ++   N  + +     S    C+D S   +V +P +   F GG  + LP +NY   V + 
Sbjct: 400 LKLTVNLKKGS--SSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDS 457

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 F   ++  +       I+G+ Q Q   + +DL+ +  G +  KC
Sbjct: 458 GTFCFAFAPTSSSLS-------IIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 113/378 (29%), Positives = 165/378 (43%), Gaps = 59/378 (15%)

Query: 115 QASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           Q + PF  + DTGS + W  C     C DC +   DP     F P+ SSS   + C++ +
Sbjct: 163 QPAKPFYMVLDTGSDINWLQCQP---CTDC-YQQTDP----IFDPRSSSSFASLPCESQQ 214

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNF 230
           C  +          GC  R   C      Y + YG G FT G  + ETL F  S  + N 
Sbjct: 215 CQAL-------ETSGC--RASKCL-----YQVSYGDGSFTVGEFVIETLTFGNSGMINNV 260

Query: 231 LAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
             GC   ++      AG+ G G  S SL SQ+    FSYCL+ R    +  SS+L  ++ 
Sbjct: 261 AVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLVDRD---SSSSSDLEFNSA 317

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
             S     P L            S     FYYVGL  + VG + + IP +       G G
Sbjct: 318 APSDSVNAPLLK-----------SGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVY 404
           G+IVDSG+  T ++   +  +   F+      SR   ++K +G      C+D+S +  V 
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLRDAFV------SRTPYLKKTNGFALFDTCYDLSSQSRVT 420

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           +P +  +F GG  + LPP+NY   V +       F    +  +       I+G+ Q Q  
Sbjct: 421 IPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS-------IIGNVQQQGT 473

Query: 465 YLEFDLANDRFGFAKQKC 482
            + +DLAN   GF+  KC
Sbjct: 474 RVHYDLANSVVGFSPHKC 491


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 119/405 (29%), Positives = 168/405 (41%), Gaps = 47/405 (11%)

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP- 153
           P + +  G YS++   GTP Q     + DTGS L W  C  +Y C   N  N    RI  
Sbjct: 3   PAADYGIGQYSVAFKVGTPSQKFM-LVADTGSDLTWMSC--KYHCRSRNCSNRKARRIRH 59

Query: 154 --AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
              F    SSS + I C    C       +E      S  N   PL    Y  +Y  G T
Sbjct: 60  KRVFHANLSSSFKTIPCLTDMC------KIE-LMDLFSLTNCPTPLTPCGYDYRYSDGST 112

Query: 212 A-GLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGL 261
           A G   +ET+    K      + N L GCS      S +   G+ G G S  S   +   
Sbjct: 113 ALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAE 172

Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
           K   KFSYCL+        VS+ L      GS  SK   L+   + +  +G  ++F   Y
Sbjct: 173 KFGGKFSYCLVDH-LSHKNVSNYLTF----GSSRSKEALLNNMTYTELVLGMVNSF---Y 224

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            V +  I +G   +KIP    V    G GG I+DSGS+ TF+  P ++ V       +  
Sbjct: 225 AVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLK 282

Query: 379 YSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
           + +   VE   G L  CF+ +G +   +P L+  F  GA+   P ++Y     + V CL 
Sbjct: 283 FRK---VEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLG 339

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             +    G +       ++G+   QN   EFDL   + GFA   C
Sbjct: 340 FVSVAWPGTS-------VVGNIMQQNHLWEFDLGLKKLGFAPSSC 377


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 119/410 (29%), Positives = 174/410 (42%), Gaps = 63/410 (15%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSSS 163
           ++ ++ G PPQ  T  + DTGS L W       RC     P+  P + PA F    SS+ 
Sbjct: 63  TVPVAVGAPPQNVT-MVLDTGSELSWL------RCNGSRVPSTPPPQAPAAFNGSASSTY 115

Query: 164 QLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
               C +P+C W  G +  V   C G  P + +C ++     L Y    +A G+L ++T 
Sbjct: 116 AAAHCSSPECQW-RGRDLPVPPFCAG--PPSNSCRVS-----LSYADASSADGILAADTF 167

Query: 221 RFPSKTVPNFLAGC---------SILSDRQPA-GIAGFGRSSESLPSQLGLKKFSYCLLS 270
                     L GC         +  SD + A G+ G  R S S  +Q    +F+YC+  
Sbjct: 168 LLGGAPPVRALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-- 225

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--NPVGSSSAFGEFYYVGLRQIIVG 328
                AP     +L  G G G +  P L+YTP  +   P+         Y V L  I VG
Sbjct: 226 -----APGDGPGLLVLG-GDGAALAPQLNYTPLIQISRPLPYFDRVA--YSVQLEGIRVG 277

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR---AADV 385
           +  + IP S L P   G G  +VDSG+ FTF+    +  +  EF+ Q          +D 
Sbjct: 278 AALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDF 337

Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYFALVGNE--------- 432
             +     CF  S  +    S  LPE+ L  + GA++A+  E     V  E         
Sbjct: 338 VFQGAFDACFRASEARVAAASQMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGEGGAEA 396

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           V CL     + AG +     A ++G    QN ++E+DL N R GFA  +C
Sbjct: 397 VWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 441


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 118/464 (25%), Positives = 177/464 (38%), Gaps = 80/464 (17%)

Query: 44  YLHHSDSDPLKILHSLASSSLSR----------ARHLKTKTKPKTKD-SNIGSNYSNSLI 92
           Y H  D   L++ H  +  S  R             L+ K + + +  SN+ +  S   I
Sbjct: 35  YQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYLSNLVARRSIVPI 94

Query: 93  KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
            +   +     Y +   FGTP Q +     DT +   W PCT+   CV C       S  
Sbjct: 95  ASGRQITQSPTYIVRAKFGTPAQ-TLLLAMDTSNDAAWVPCTA---CVGC-------STT 143

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
             F P +S++ + +GC   +C  +              RN TC  +  ++   YG    A
Sbjct: 144 TPFAPPKSTTFKKVGCGASQCKQV--------------RNPTCDGSACAFNFTYGTSSVA 189

Query: 213 GLLLSETLRFPSKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
             L+ +T+   +  VP +  GC      S L  +   G+     S  +   +L    FSY
Sbjct: 190 ASLVQDTVTLATDPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSY 249

Query: 267 CLLSRK-------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
           CL S K        D  PV+                P     P +KNP  SS      YY
Sbjct: 250 CLPSFKTLNFSGHXDLXPVAQ---------------PRDQVYPSFKNPRRSS-----LYY 289

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           V L  I VG + V IP   L        G + DSG+ FT +  P + AV  EF R++  +
Sbjct: 290 VNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVFTRLVEPAYTAVRNEFRRRVSVH 349

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLIL 438
            +   V    G   C+ +     +  P +   F  G  + LPP+N         V CL +
Sbjct: 350 KKLT-VTSLGGFDTCYTV----PIVAPTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAM 403

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               A  P        ++ + Q QN  + FD+ N R G A++ C
Sbjct: 404 ----APAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELC 443


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 119/407 (29%), Positives = 169/407 (41%), Gaps = 79/407 (19%)

Query: 94  TPLSVHSYGGYSISLSFGTPPQASTPFIF--DTGSSLVWFPCT-SRYRCVDCNFPNVDPS 150
           TP + +  G Y   +  GTP +   P+I   DTGSSL W  C+  R  C           
Sbjct: 127 TPGTSYGVGNYVTRMGLGTPAK---PYIMVVDTGSSLTWLQCSPCRVSC--------HRQ 175

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-----YLLQ 205
             P F PK SSS   + C  P+C+ +    +              P AC S     Y   
Sbjct: 176 SGPVFDPKTSSSYAAVSCSTPQCNDLSTATLN-------------PAACSSSDVCIYQAS 222

Query: 206 YG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL 261
           YG   F+ G L  +T+ F S +VPNF  GC   ++    + AG+ G  R+  SL  QL  
Sbjct: 223 YGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAP 282

Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
                FSYCL S          +      PG         SYTP       SS+     Y
Sbjct: 283 TLGYSFSYCLPSSSSSGYLSIGSY----NPGQ-------YSYTPMV-----SSTLDDSLY 326

Query: 319 YVGLRQIIVGSKHVKI---PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
           ++ L  + V  K + +    YS L          I+DSG+  T +   +++A++K     
Sbjct: 327 FIKLSGMTVAGKPLAVSSSEYSSL--------PTIIDSGTVITRLPTTVYDALSKAVAGA 378

Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
           M    RA   +  S L  CF +    S+ +P + + F GGA + L  +N    V +   C
Sbjct: 379 MKGTKRA---DAYSILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTTC 434

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           L      A  PA  R  AII G+ Q Q F + +D+ ++R GFA   C
Sbjct: 435 L------AFAPA--RSAAII-GNTQQQTFSVVYDVKSNRIGFAAGGC 472


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 120/454 (26%), Positives = 174/454 (38%), Gaps = 51/454 (11%)

Query: 39  LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV 98
           LS  H +H S   PL+ + +LA    +R   L +K       + + S    S    P   
Sbjct: 27  LSVYHNVHPSSPSPLESIIALARDDDARLLFLSSKAA----TAGVSSAPVASGQAPP--- 79

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
                Y +    G+P Q       DT +   W  C+    C   +           F P 
Sbjct: 80  ----SYVVRAGLGSPSQ-QLLLALDTSADATWAHCSPCGTCPSSSL----------FAPA 124

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            SSS   + C +  C    G    +   G         L   ++   +        L S+
Sbjct: 125 NSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASD 184

Query: 219 TLRFPSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGL---KKFSYCLL 269
           TLR     +PN+  GC + S   P       G+ G GR   +L SQ G      FSYCL 
Sbjct: 185 TLRLGKDAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLP 243

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
           S  +     S +L L    G+G  +   + YTP  +NP  SS      YYV +  + VG 
Sbjct: 244 S--YRSYYFSGSLRL----GAGGGQPRSVRYTPMLRNPHRSS-----LYYVNVTGLSVGR 292

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             VK+P       +    G +VDSG+  T    P++ A+ +EF RQ+   S         
Sbjct: 293 AWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPS---GYTSLG 349

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPAL 448
               CF+     +   P + +   GG  +ALP EN         L CL +    A  P  
Sbjct: 350 AFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAM----AEAPQN 405

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 ++ + Q QN  + FD+AN R GFAK+ C
Sbjct: 406 VNSVVNVIANLQQQNIRVVFDVANSRIGFAKESC 439


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 118/397 (29%), Positives = 162/397 (40%), Gaps = 59/397 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS--RYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y      G PPQ +   I DTGS LVW  C++  R  C     P  + S    F P    
Sbjct: 90  YVAEYLIGDPPQRAEALI-DTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPV--- 145

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLSETL 220
                            P     C         C LA   S +  YG G  AG L +E  
Sbjct: 146 -----------------PCAARICAANDDIIHFCDLAAGCSVIAGYGAGVVAGTLGTEAF 188

Query: 221 RFPSKTVPNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
            F S T      GC   +          +G+ G GR   SL SQ G  KFSYCL +  F 
Sbjct: 189 AFQSGTA-ELAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATKFSYCL-TPYFH 246

Query: 275 DAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
           +   + +L +       G GD  T     T F K P GS      FYY+ L  + VG   
Sbjct: 247 NNGATGHLFVGASASLGGHGDVMT-----TQFVKGPKGS-----PFYYLPLIGLTVGETR 296

Query: 332 VKIPYSY-----LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           + IP +      + PG   +GGVI+DSGS FT +    ++A+A E   ++     A   +
Sbjct: 297 LPIPATVFDLREVAPGLF-SGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPD 355

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
              G   C        V +P ++  F+GGA MA+P E+Y+A V             +AGP
Sbjct: 356 ADDGAL-CVARRDVGRV-VPAVVFHFRGGADMAVPAESYWAPVDKAA---ACMAIASAGP 410

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              +    ++G++Q QN  + +DLAN  F F    C+
Sbjct: 411 YRRQS---VIGNYQQQNMRVLYDLANGDFSFQPADCS 444


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 174/400 (43%), Gaps = 71/400 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +    G+PP      + DTGSSL+W  C+  + C         P   P F P +SS
Sbjct: 87  GEYLMRFYIGSPPVERLAMV-DTGSSLIWLQCSPCHNCF--------PQETPLFEPLKSS 137

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTC-PLACPSYLLQYG-LGFTAGLLLSET 219
           + +   C +  C+ +             P  + C  L    Y + YG   F+ G+L +ET
Sbjct: 138 TYKYATCDSQPCTLL------------QPSQRDCGKLGQCIYGIMYGDKSFSVGILGTET 185

Query: 220 LRFPS----KTV--PNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLK---KF 264
           L F S    +TV  PN + GC      +I +  +  GIAG G    SL SQLG +   KF
Sbjct: 186 LSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKF 245

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYCLL     D+  +S L   +        T G+  TP    P     +   +Y++ L  
Sbjct: 246 SYCLLPY---DSTSTSKLKFGS---EAIITTNGVVSTPLIIKP-----SLPTYYFLNLEA 294

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           + +G K        +V     +G +++DSG+  T++E   +          +G      D
Sbjct: 295 VTIGQK--------VVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLG-VKLLQD 345

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
           +   S L+ CF    + ++ +P++  +F  GA +AL P+N    L  + +LCL      A
Sbjct: 346 L--PSPLKTCF--PNRANLAIPDIAFQFT-GASVALRPKNVLIPLTDSNILCL------A 394

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             P+ G G + + G     +F +E+DL   +  FA   CA
Sbjct: 395 VVPSSGIGIS-LFGSIAQYDFQVEYDLEGKKVSFAPTDCA 433


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 164/391 (41%), Gaps = 61/391 (15%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y +    GTP Q   P +   DT S + W PC+    CV C      PS   AF P +S+
Sbjct: 115 YIVKALIGTPAQ---PLLLAMDTSSDVAWIPCSG---CVGC------PSNT-AFSPAKST 161

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           S + + C  P+C  +  P   +R             AC S+ L YG    A  L  +T+R
Sbjct: 162 SFKNVSCSAPQCKQVPNPTCGAR-------------AC-SFNLTYGSSSIAANLSQDTIR 207

Query: 222 FPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
             +  +  F  GC        +I   +   G+     S  S    +    FSYCL S  F
Sbjct: 208 LAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--F 265

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                S +L L  GP S   +   + YT   +NP  SS      YYV L  I VG K V 
Sbjct: 266 RSLTFSGSLRL--GPTSQPQR---VKYTQLLRNPRRSS-----LYYVNLVAIRVGRKVVD 315

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +P + +        G I DSG+ +T +  P++EAV  EF +++   +  A V    G   
Sbjct: 316 LPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTT--AVVTSLGGFDT 373

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
           C+  SG+  V +P +   FK G  M +P +N           CL +    AA P      
Sbjct: 374 CY--SGQ--VKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAM----AAAPENVNSV 424

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             ++   Q QN  +  D+ N R G A+++C+
Sbjct: 425 VNVIASMQQQNHRVLIDVPNGRLGLARERCS 455


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 119/410 (29%), Positives = 174/410 (42%), Gaps = 63/410 (15%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSSS 163
           ++ ++ G PPQ  T  + DTGS L W       RC     P+  P + PA F    SS+ 
Sbjct: 61  TVPVAVGAPPQNVT-MVLDTGSELSWL------RCNGSRVPSTPPPQAPAAFNGSASSTY 113

Query: 164 QLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
               C +P+C W  G +  V   C G  P + +C ++     L Y    +A G+L ++T 
Sbjct: 114 AAAHCSSPECQW-RGRDLPVPPFCAG--PPSXSCRVS-----LSYADASSADGILAADTF 165

Query: 221 RFPSKTVPNFLAGC---------SILSDRQPA-GIAGFGRSSESLPSQLGLKKFSYCLLS 270
                     L GC         +  SD + A G+ G  R S S  +Q    +F+YC+  
Sbjct: 166 LLGGAPPVXALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-- 223

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--NPVGSSSAFGEFYYVGLRQIIVG 328
                AP     +L  G G G +  P L+YTP  +   P+         Y V L  I VG
Sbjct: 224 -----APGDGPGLLVLG-GDGAALAPQLNYTPLIQISRPLPYFDRVA--YSVQLEGIRVG 275

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR---AADV 385
           +  + IP S L P   G G  +VDSG+ FTF+    +  +  EF+ Q          +D 
Sbjct: 276 AALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDF 335

Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYFALVGNE--------- 432
             +     CF  S  +    S  LPE+ L  + GA++A+  E     V  E         
Sbjct: 336 VFQGAFDACFRASEARVAAASXMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGEGGAEA 394

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           V CL     + AG +     A ++G    QN ++E+DL N R GFA  +C
Sbjct: 395 VWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 439


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 115/440 (26%), Positives = 199/440 (45%), Gaps = 56/440 (12%)

Query: 54  KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
           ++   L S  L R R ++ + +      N+ ++ +   + + +++ +   Y +++  G+ 
Sbjct: 17  RLQKQLISDDL-RVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLN-YIVTMGLGS- 73

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
              +   I DTGS L W  C     C +         + P F P  SSS Q + C +  C
Sbjct: 74  --TNMTVIIDTGSDLTWVQCEPCMSCYN--------QQGPIFKPSTSSSYQSVSCNSSTC 123

Query: 174 -SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFL 231
            S  F       C G +P   TC     +Y++ YG G +T G L  E L F   +V +F+
Sbjct: 124 QSLQFATGNTGAC-GSNP--STC-----NYVVNYGDGSYTNGELGVEQLSFGGVSVSDFV 175

Query: 232 AGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLD 285
            GC   +       +G+ G GRS  SL SQ        FSYCL +    ++  S +LV+ 
Sbjct: 176 FGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---ESGASGSLVMG 232

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
                  + TP ++YT    NP         FY + L  I V    +++P       S G
Sbjct: 233 NESSVFKNVTP-ITYTRMLPNP-----QLSNFYILNLTGIDVDGVALQVP-------SFG 279

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
           NGGV++DSG+  T +   +++A+   F++Q   +  A      S L  CF+++G   V +
Sbjct: 280 NGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGF---SILDTCFNLTGYDEVSI 336

Query: 406 PELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQN 463
           P + + F+G A++ +     F +V  +   +CL L + + A          I+G++Q +N
Sbjct: 337 PTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDA------YDTAIIGNYQQRN 390

Query: 464 FYLEFDLANDRFGFAKQKCA 483
             + +D    + GFA++ C+
Sbjct: 391 QRVIYDTKQSKVGFAEESCS 410


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 80/243 (32%), Positives = 115/243 (47%), Gaps = 16/243 (6%)

Query: 243 AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTP 302
           +G+ G    + SL SQL + +FSYCL    F +   S  L          + T  +  T 
Sbjct: 110 SGLMGLSPGTMSLISQLSVPRFSYCLTP--FAERKTSPMLFGAMADLRKYNTTGPIQTTA 167

Query: 303 FYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
             +NP   +     +YYV L  + +G+K +++P + L    DG GG IVDSGST   + G
Sbjct: 168 ILRNPAMDTF----YYYVPLVGLSLGTKRLRVPAASLAINPDGTGGTIVDSGSTMAHLAG 223

Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS---GKKSVYLPELILKFKGGAKMA 419
             F+AV K  +  +        VE       CF +       +V  P L+L F GGA MA
Sbjct: 224 KAFDAVKKAVLEAVKLPVFNGTVED---YELCFAVPSGVAMAAVKTPPLVLHFDGGAAMA 280

Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           LP +NYF      ++CL +    A  P     P  I+G+ Q QN ++ FD+ N +F FA 
Sbjct: 281 LPRDNYFQEPRAGLMCLAV----ARSPEDLGAPISIIGNVQQQNMHVLFDVHNQKFSFAP 336

Query: 480 QKC 482
            KC
Sbjct: 337 TKC 339


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 162/391 (41%), Gaps = 55/391 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y ++L  GTP    T  I DTGS L W       +C  CN  +  P + P F P +SS+ 
Sbjct: 125 YVVTLGIGTPAVQQTVLI-DTGSDLSWV------QCKPCNASDCYPQKDPLFDPSKSSTF 177

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
             I C +  C  +    V+    GC+      P  C  Y ++YG G  T G+  +ETL  
Sbjct: 178 ATIPCASDACKQL---PVDGYDNGCTNNTSGMPPQC-GYAIEYGNGAITEGVYSTETLAL 233

Query: 223 -PSKTVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
             S  V +F  GC   SD+        G+ G G + ESL SQ        FSYCL     
Sbjct: 234 GSSAVVKSFRFGCG--SDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCL----- 286

Query: 274 DDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
              P++S     T   P S ++   G  +TP +      S     FY V L  I VG K 
Sbjct: 287 --PPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHA----FSPKIATFYVVTLTGISVGGKA 340

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP +    G+      IVDSG+  T +    ++A+   F   M  Y      +  S L
Sbjct: 341 LDIPPAVFAKGN------IVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPAD--SAL 392

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C++ +G  +V +P++ L F GGA + L           +V   +L  D  A    G G
Sbjct: 393 DTCYNFTGHGTVTVPKVALTFVGGATVDL-----------DVPSGVLVEDCLAFADAGDG 441

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+   +   + +D      GF    C
Sbjct: 442 SFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 130/488 (26%), Positives = 202/488 (41%), Gaps = 80/488 (16%)

Query: 23  AGAGSSAATVTVPLTPLSTKHYLHHSDSDPL---KILHSLASSSLSRARHLKTKTKPKTK 79
           AGA  +A TV      L     +   D DP    + L  L ++  SRA   + + +    
Sbjct: 105 AGAARTATTVL----ELKRHSLVAIPDDDPAAHDRYLRRLLAADESRANSFQLRIR---N 157

Query: 80  DSNIGSNYSNSLIKTPLSV---HSYGGYSISLSFGT----PPQASTPFIFDTGSSLVWFP 132
           D    ++  +   + PL+         Y  +++ G      P A+   I DTGS L W  
Sbjct: 158 DRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQ 217

Query: 133 CTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRN 192
           C     C  C        R P F P  S++   + C    C+        +    C   N
Sbjct: 218 CKP---CSACY-----AQRDPLFDPAGSATYAAVRCNASACAASLKAATGTP-GSCGGGN 268

Query: 193 KTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR----QPAGIAG 247
           + C      Y L YG G F+ G+L ++T+     ++  F+ GC  LS+R      AG+ G
Sbjct: 269 ERC-----YYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCG-LSNRGLFGGTAGLMG 322

Query: 248 FGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
            GR+  SL SQ  L+    FSYCL +    DA  S +L L     S  + TP ++YT   
Sbjct: 323 LGRTELSLVSQTALRYGGVFSYCLPATTSGDA--SGSLSLGGDASSYRNTTP-VAYTRMI 379

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
            +P     A   FY++ +    VG        + L     G   V++DSG+  T +   +
Sbjct: 380 ADP-----AQPPFYFLNVTGAAVGG-------TALAAQGLGASNVLIDSGTVITRLAPSV 427

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALP 421
           +  V  EF RQ      AA      G   L  C+D++G   V +P L L+ +GGA++ + 
Sbjct: 428 YRGVRAEFTRQFA----AAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVD 483

Query: 422 PENYFALV---GNEVLCLIL----FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
                 +V   G++V CL +    + D             I+G++Q +N  + +D    R
Sbjct: 484 AAGMLFVVRKDGSQV-CLAMASLSYEDQTP----------IIGNYQQKNKRVVYDTVGSR 532

Query: 475 FGFAKQKC 482
            GFA + C
Sbjct: 533 LGFADEDC 540


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 162/391 (41%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP    T  +FDTGS   W  C     CV   +   +      F P RSS
Sbjct: 184 GNYVVTIGLGTPAGRYT-VVFDTGSDTTWVQCEP---CVVVCYEQQEK----LFDPARSS 235

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C  P CS ++        KGCS  +         Y +QYG G ++ G    +TL
Sbjct: 236 TDANISCAAPACSDLY-------TKGCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 281

Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
              S   +  F  GC   ++    + AG+ G GR   SLP Q   K    F++C  +R  
Sbjct: 282 TLSSYDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARS- 340

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                S    LD GPGS  + +  L+      N +        FYYVGL  I VG K + 
Sbjct: 341 -----SGTGYLDFGPGSSPAVSTKLTTPMLVDNGL-------TFYYVGLTGIRVGGKLLS 388

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGL 391
           IP S          G IVDSG+  T +    + ++   F   +    Y +A  +   S L
Sbjct: 389 IPPSVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPAL---SLL 440

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D +G   V +P + L F+GGA + +              CL  F  N     +G  
Sbjct: 441 DTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASVSQACL-GFAANEEDDDVG-- 497

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ QL+ F + +D+     GF+   C
Sbjct: 498 ---IVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 110/392 (28%), Positives = 169/392 (43%), Gaps = 65/392 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G PP  +   + DTGS + W  C     C +C +   DP     F P  S+
Sbjct: 147 GEYFLRVGIGKPPSQAY-VVLDTGSDVSWIQCAP---CSEC-YQQSDP----IFDPVSSN 197

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   I C  P+C  +      S C     RN TC      Y + YG G +T G   +ET+
Sbjct: 198 SYSPIRCDAPQCKSL----DLSEC-----RNGTC-----LYEVSYGDGSYTVGEFATETV 243

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD--- 274
              +  V N   GC   ++      AG+ G G    S P+Q+    FSYCL++R  D   
Sbjct: 244 TLGTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRDSDAVS 303

Query: 275 ----DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
               ++P+  N+V                  P  +NP         FYY+GL+ I VG +
Sbjct: 304 TLEFNSPLPRNVVT----------------APLRRNP-----ELDTFYYLGLKGISVGGE 342

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + IP S     + G GG+I+DSG+  T +   +++A+   F++      +A  V   S 
Sbjct: 343 ALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGV---SL 399

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
              C+D+S ++SV +P +   F  G ++ LP  NY   V +       F    +  +   
Sbjct: 400 FDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLS--- 456

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               I+G+ Q Q   + FD+AN   GF+   C
Sbjct: 457 ----IMGNVQQQGTRVGFDIANSLVGFSADSC 484


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 164/391 (41%), Gaps = 61/391 (15%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y +    GTP Q   P +   DT S + W PC+    CV C      PS   AF P +S+
Sbjct: 99  YIVKALIGTPAQ---PLLLAMDTSSDVAWIPCSG---CVGC------PSNT-AFSPAKST 145

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           S + + C  P+C  +  P   +R             AC S+ L YG    A  L  +T+R
Sbjct: 146 SFKNVSCSAPQCKQVPNPTCGAR-------------AC-SFNLTYGSSSIAANLSQDTIR 191

Query: 222 FPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
             +  +  F  GC        +I   +   G+     S  S    +    FSYCL S  F
Sbjct: 192 LAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--F 249

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                S +L L  GP S   +   + YT   +NP  SS      YYV L  I VG K V 
Sbjct: 250 RSLTFSGSLRL--GPTSQPQR---VKYTQLLRNPRRSS-----LYYVNLVAIRVGRKVVD 299

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +P + +        G I DSG+ +T +  P++EAV  EF +++   +  A V    G   
Sbjct: 300 LPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTT--AVVTSLGGFDT 357

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
           C+  SG+  V +P +   FK G  M +P +N           CL +    AA P      
Sbjct: 358 CY--SGQ--VKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAM----AAAPENVNSV 408

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             ++   Q QN  +  D+ N R G A+++C+
Sbjct: 409 VNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 124/437 (28%), Positives = 175/437 (40%), Gaps = 79/437 (18%)

Query: 12  FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
           F ++ LL        ++AATV + LT         H+D+        LA+  L +   L+
Sbjct: 6   FVIVTLLAALAISRCNAAATVRMQLT---------HADAG-----RGLAARELMQRMALR 51

Query: 72  TKTKPKTK------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           +K +   +             Y N +  T   VH        L+ GTPPQ       DTG
Sbjct: 52  SKARAARRLSSSASAPVSPGTYDNGVPTTEYLVH--------LAIGTPPQ-PVQLTLDTG 102

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG-----CQNPKCSWIFGPN 180
           S L+W  C     C D   P  DPS           S+   G     C +PK    F PN
Sbjct: 103 SDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPK----FWPN 158

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPNFLAGCSIL 237
                       +TC      Y   YG    T G L  +   F     +VP    GC + 
Sbjct: 159 ------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201

Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
           ++        GIAGFGR   SLPSQL +  FS+C  +    +    S ++LD       S
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPSTVLLDLPADLYKS 258

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
               +  TP  +NP   +     FYY+ L+ I VGS  + +P S      +G GG I+DS
Sbjct: 259 GRGAVQSTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312

Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG--KKSVYLPELILK 411
           G+  T +   ++  V   F  Q+        V   +   P F +S   +   Y+P+L+L 
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLH 367

Query: 412 FKGGAKMALPPENYFAL 428
           F+ GA M LP ENY  L
Sbjct: 368 FE-GATMDLPRENYVWL 383


>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
          Length = 415

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 166/405 (40%), Gaps = 64/405 (15%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           S GG          P+ +   + DTGS++ W   T+   C                   R
Sbjct: 51  SGGGCHYRFELTHRPKDNISAVVDTGSNIFW---TTEKEC------------------SR 89

Query: 160 SSSSQLIGCQNPKCSWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLL 215
           S +  ++ C +PKC          S  K  + +   C     +Y ++YG      TAG+L
Sbjct: 90  SKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKC-----TYAIKYGGNANDSTAGVL 144

Query: 216 LSETLRF---PSKTVP------NFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGL 261
             + L      SK VP          GCS        D    G+ G GRS+ SLP QL  
Sbjct: 145 YEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNF 204

Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
            KFSYCL S +  D P  S L+L   P                   +  +S +   Y+V 
Sbjct: 205 SKFSYCLSSYQKPDLP--SYLLLTAAPDMATGAV--GGAAAVATTALQPNSDYKTRYFVD 260

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L+ I +G    ++P       +   G + VD+G++FT +EG +F  +  E  R M     
Sbjct: 261 LQGISIGG--TRLP----AVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKY 314

Query: 382 AADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
             +   ++  + C+     +  +S  LP+++L F   A M LP ++Y     ++ LCL +
Sbjct: 315 VKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSK-LCLAI 373

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              N       +G   +LG+FQ+QN ++  D  N++  F +  C+
Sbjct: 374 DKSNI------KGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCS 412


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 127/460 (27%), Positives = 185/460 (40%), Gaps = 68/460 (14%)

Query: 35  PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSN--SLI 92
           P +PL+  H         L     + ++  +RA+ ++ +    T  S  G    N  SL 
Sbjct: 97  PCSPLADAH------DGKLPSHEEILAADQNRAKSIQRRVSTTTTVSR-GKPKRNRPSLP 149

Query: 93  KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
            +  S    G Y +++  GTP    T  +FDTGS   W  C     CV   +   +    
Sbjct: 150 ASSGSALGTGNYVVTIGLGTPAGRYT-VVFDTGSDTTWVQCEP---CVVVCYKQQEK--- 202

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FT 211
             F P RSS+   I C  P CS ++        KGCS  +         Y +QYG G ++
Sbjct: 203 -LFDPARSSTYANISCAAPACSDLY-------IKGCSGGHCL-------YGVQYGDGSYS 247

Query: 212 AGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---F 264
            G    +TL   S   +  F  GC   ++    + AG+ G GR   SLP Q   K    F
Sbjct: 248 IGFFAMDTLTLSSYDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVF 307

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           ++C  +R       S    LD GPGS  + +  L+      N          FYYVGL  
Sbjct: 308 AHCFPARS------SGTGYLDFGPGSLPAVSAKLTTPMLVDNGP-------TFYYVGLTG 354

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRA 382
           I VG K + IP S          G IVDSG+  T +    + ++   F   M    Y +A
Sbjct: 355 IRVGGKLLSIPQSVFT-----TSGTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKA 409

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
             +   S L  C+D +G   V +P + L F+GGA + +              CL  F  N
Sbjct: 410 PAL---SLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHASGIIYAASVSQACL-GFAGN 465

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                +G     I+G+ QL+ F + +D+     GF    C
Sbjct: 466 KEDDDVG-----IVGNTQLKTFGVVYDIGKKVVGFCPGAC 500


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 158/391 (40%), Gaps = 44/391 (11%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +  S GTP Q     I DTGS L +  C     C + + P   PS    F P    
Sbjct: 32  GQYFVDFSLGTPEQKFH-LIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTP---- 86

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
               + C + +C  I  P V + C    P +   P    SY  +YG    T G+   ET 
Sbjct: 87  ----VPCDSAECLLIPAP-VGAPCSSSYPESP--PQGACSYEYRYGDNSSTVGVFAYETA 139

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
                 V +   GC   +        G+ G G+ + S  SQ G     KF+YCL S    
Sbjct: 140 TVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSY-LS 198

Query: 275 DAPVSSNLVLDTGPGSGD---SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
              V S+L+       GD   S    L +TP   NP+  S      YYV + +I  G + 
Sbjct: 199 PTSVFSSLIF------GDDMMSTIHDLQFTPLVSNPLNPS-----VYYVQIVRICFGGET 247

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP S     S GNGG I DSG+T T+     +  +   F + +  Y RA       GL
Sbjct: 248 LLIPDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSV-PYPRAP--PSPQGL 304

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C ++SG      P   ++F  GA       NYF  V   + CL +   ++ G      
Sbjct: 305 PLCVNVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDG------ 358

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              ++G+   QN+ +++D    R GFA   C
Sbjct: 359 -FNVIGNIIQQNYLVQYDREEHRIGFAHANC 388


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 116/435 (26%), Positives = 180/435 (41%), Gaps = 61/435 (14%)

Query: 56  LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQ 115
           +H       S    L + +  K +  + GS+  + + +        G Y + +  G+PP+
Sbjct: 1   MHRDVKRVASLIHRLSSGSAAKYEVEDFGSDVVSGMNQGS------GEYFVRIGLGSPPR 54

Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSW 175
            S   + D+GS +VW  C    +C     P  DP+   +F+    SS+     +N  C+ 
Sbjct: 55  -SQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCN- 112

Query: 176 IFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGC 234
                   RC+               Y + YG G +T G L  ETL F    V N   GC
Sbjct: 113 ------SGRCR---------------YEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGC 151

Query: 235 SILSDR----QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTG 287
              S+R      AG+ G G  S S   QL  +    FSYCL+SR       ++N  L+ G
Sbjct: 152 G-HSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRG-----TNTNGFLEFG 205

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
               ++   G ++ P  +NP   S     FYY+ L  + VG   V +          G+G
Sbjct: 206 S---EAMPVGAAWIPLVRNPRAPS-----FYYIRLLGLGVGDTRVPVSEDVFQLNELGSG 257

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           GV++D+G+  T      +EA    FI Q  N  RA+ V   S    C+++ G  SV +P 
Sbjct: 258 GVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGV---SIFDTCYNLFGFLSVRVPT 314

Query: 408 LILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
           +   F GG  + +P  N+   V +       F  + +G +       ILG+ Q +   + 
Sbjct: 315 VSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLS-------ILGNIQQEGIQIS 367

Query: 468 FDLANDRFGFAKQKC 482
            D AN+  GF    C
Sbjct: 368 VDEANEFVGFGPNIC 382


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 159/372 (42%), Gaps = 52/372 (13%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I DT S L W  C     C D   P  DP+  P++         ++ C +  C  +    
Sbjct: 141 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYA--------VLPCNSSSCDALQVAT 192

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
             +          +C     SY L Y  G ++ G+L  + L    + +  F+ GC   S+
Sbjct: 193 GSAAGACGGGEQPSC-----SYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SN 246

Query: 240 RQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
           + P    +G+ G GRS  SL SQ   +    FSYCL      ++  S +LVL        
Sbjct: 247 QGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVYR 303

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
           + TP + YT    +PV      G FY+V L  I +G + V+             G VIVD
Sbjct: 304 NSTP-IVYTTMVSDPVQ-----GPFYFVNLTGITIGGQEVE----------SSAGKVIVD 347

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T +   ++ AV  EF+ Q   Y +A      S L  CF+++G + V +P L   F
Sbjct: 348 SGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGF---SILDTCFNLTGFREVQIPSLKFVF 404

Query: 413 KGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
           +G  ++ +      YF    +  +CL L +  +           I+G++Q +N  + FD 
Sbjct: 405 EGNVEVEVDSSGVLYFVSSDSSQVCLALASLKS------EYETSIIGNYQQKNLRVIFDT 458

Query: 471 ANDRFGFAKQKC 482
              + GFA++ C
Sbjct: 459 LGSQIGFAQETC 470


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 115/414 (27%), Positives = 165/414 (39%), Gaps = 67/414 (16%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L  H     +IS++ GTPPQ +   + DTGS L W  C +         P       P F
Sbjct: 58  LRFHHNVSLTISITVGTPPQ-NMSMVIDTGSELSWLHCNTN---TTATIP------YPFF 107

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSY-LLQYGLGF---- 210
            P  SSS   I C +P C+                R+   P +C S  L    L +    
Sbjct: 108 NPNISSSYTPISCSSPTCT-------------TRTRDFPIPASCDSNNLCHATLSYADAS 154

Query: 211 -TAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
            + G L S+T  F S   P  + GC       +  SD    G+ G    S SL SQL + 
Sbjct: 155 SSEGNLASDTFGFGSSFNPGIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP 214

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKN----PVGSSSAFGEFY 318
           KFSYC+    F     S  L+L     S  S    L+YTP  +     P    SA    Y
Sbjct: 215 KFSYCISGSDF-----SGILLLGE---SNFSWGGSLNYTPLVQISTPLPYFDRSA----Y 262

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            V L  I +  K + I  +  VP   G G  + D G+ F+++ GP++ A+  EF+ Q   
Sbjct: 263 TVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNG 322

Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVG--- 430
             RA D      +  +  C+ +   +S    LP + L F+G        +  + + G   
Sbjct: 323 TLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVW 382

Query: 431 --NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             + V C      +  G       A I+G    Q+ ++EFDL   R G A  +C
Sbjct: 383 GNDSVYCFTFGNSDLLGVE-----AFIIGHHHQQSMWMEFDLVEHRVGLAHARC 431


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 123/473 (26%), Positives = 198/473 (41%), Gaps = 76/473 (16%)

Query: 28  SAATVTVPLT----PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNI 83
           S+AT++VPL     P +   Y   SD  P          S +R  ++K++       + +
Sbjct: 51  SSATLSVPLVHRYGPCAASQY---SDM-PTPSFSETLRHSRARTNYIKSRAS-----TGM 101

Query: 84  GSNYSNSLIKTPLSVHSYGG---YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYR 138
            S   ++ +  P  +  +     Y ++L FGTP  PQ     + DTGS + W       +
Sbjct: 102 ASTPDDAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQV---LLMDTGSDVSWV------Q 152

Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
           C  CN     P + P F P +SS+   I C    C+ +     +    GC+     C   
Sbjct: 153 CAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACNKLG----DHYRNGCTSGGTQC--- 205

Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPA----GIAGFGRSS 252
              Y ++YG G  T G+  +ET+ F P  TV +F  GC     R P+    G+ G G + 
Sbjct: 206 --GYRVEYGDGSSTRGVYSNETITFAPGITVKDFHFGCG-HDQRGPSDKFDGLLGLGGAP 262

Query: 253 ESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
           ESL  Q        FSYCL +   +    +  L L   P S  + T    +TP +  P+ 
Sbjct: 263 ESLVVQTASVYGGAFSYCLPALNSE----AGFLALGVRP-SAATNTSAFVFTPMWHLPMD 317

Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
           ++S     Y V +  I VG K + IP S         GG+++DSG+  T +    + A+ 
Sbjct: 318 ATS-----YMVNMTGISVGGKPLDIPRSAF------RGGMLIDSGTIVTELPETAYNALN 366

Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
               +    Y   A  +  +    C++ +G  +V +P + L F GGA + L        V
Sbjct: 367 AALRKAFAAYPMVASEDFDT----CYNFTGYSNVTVPRVALTFSGGATIDLD-------V 415

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            N +L         +GP +G G   I+G+   +   + +D  + + GF    C
Sbjct: 416 PNGILVKDCLAFRESGPDVGLG---IIGNVNQRTLEVLYDAGHGKVGFRAGAC 465


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 153/388 (39%), Gaps = 71/388 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP      + D+GS ++W  C    +C    +   DP     F P  SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +  C       +     G       C      Y + YG G +T G L  ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
                 V     GC   +       AG+ G G  + SL  QLG      FSYCL SR   
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR--- 285

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                         G+G                 G+ S    FYYVGL  I VG + + +
Sbjct: 286 --------------GAG-----------------GAGSLASSFYYVGLTGIGVGGERLPL 314

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
             S      DG GGV++D+G+  T +    + A+   F   MG   R+  V   S L  C
Sbjct: 315 QDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 371

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +D+SG  SV +P +   F  GA + LP  N    VG  V CL  F  +++G +       
Sbjct: 372 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 423

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ILG+ Q +   +  D AN   GF    C
Sbjct: 424 ILGNIQQEGIQITVDSANGYVGFGPNTC 451


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 121/473 (25%), Positives = 188/473 (39%), Gaps = 77/473 (16%)

Query: 35  PLTPLSTKH-------YLHHSDSDPLKILHSLASSSLSRARHLKT--------KTKPKTK 79
           P +PL+  H        +  +D + ++ +    S++  R +  K         K  P   
Sbjct: 79  PCSPLADAHGKPPAHDEILAADQNRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIH 138

Query: 80  DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC 139
             +  S+ + SL  T     S G Y +++  GTP    T  +FDTGS   W  C  R   
Sbjct: 139 PGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYT-VVFDTGSDTTWVQC--RPCV 195

Query: 140 VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC 199
           V C        + P F P +SS+   + C +  C+ +          GC+  +       
Sbjct: 196 VKCY-----KQKEPLFDPAKSSTYANVSCTDSACADL-------DTNGCTGGHCL----- 238

Query: 200 PSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESL 255
             Y +QYG G +T G    +TL      +  F  GC   ++    + AG+ G GR   SL
Sbjct: 239 --YAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSL 296

Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
             Q   K    F+YCL       A  +    LD GPGS  +       TP   +   +  
Sbjct: 297 TVQAYNKYGGAFAYCL------PALTTGTGYLDFGPGSAGNNA---RLTPMLTDKGQT-- 345

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FYYVG+  I VG + V +  S          G +VDSG+  T +    + A++  F
Sbjct: 346 ----FYYVGMTGIRVGGQQVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAF 396

Query: 373 IRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            + M     A   +K  G   L  C+D +G   V LP + L F+GGA + +        +
Sbjct: 397 DKVM----LARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAI 452

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               +CL  F  N    ++      I+G+ Q + + + +DL     GFA   C
Sbjct: 453 SEAQVCLA-FASNGDDESVA-----IVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 157/387 (40%), Gaps = 55/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +++  GTP   S   + DTGS L W       +C  CN     P + P F P RSS+ 
Sbjct: 120 YVVTVGLGTP-AVSQVLLIDTGSDLSWV------QCAPCNSTTCYPQKDPLFDPSRSSTY 172

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             I C    C  +      S C   S     C      Y + YG G  T G+  +ETL  
Sbjct: 173 APIPCNTDACRDLTRDGYGSDCTSGSGGGAQC-----GYAITYGDGSQTTGVYSNETLTM 227

Query: 223 -PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDD 275
            P  TV +F  GC    D    +  G+ G G + ESL  Q        FSYCL       
Sbjct: 228 APGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCL------- 280

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P +++       G+  +   G  +TP  +           FY V +  I VG + + +P
Sbjct: 281 -PAANDQAGFLALGAPVNDASGFVFTPMVREQQ-------TFYVVNMTGITVGGEPIDVP 332

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S        +GG+I+DSG+  T ++   + A+   F + M  Y    + E    L  C+
Sbjct: 333 PSAF------SGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGE----LDTCY 382

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           + +G  +V +P + L F GGA + L        V + +L         AGP    G   I
Sbjct: 383 NFTGHSNVTVPRVALTFSGGATVDLD-------VPDGILLDNCLAFQEAGPDNQPG---I 432

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG+   +   + +D+ + R GF    C
Sbjct: 433 LGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 110/390 (28%), Positives = 168/390 (43%), Gaps = 71/390 (18%)

Query: 123 DTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
           DTGS ++W    PC+   R    N P      +  + P+ SS++ L+ C +P C  + G 
Sbjct: 47  DTGSDVLWVNCRPCSGCPRKSALNIP------LTMYDPRESSTTSLVSCSDPLC--VRGR 98

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP-------SKTVPNFL 231
                   CS     C      Y+  YG G T+ G  + + +++        + T    L
Sbjct: 99  RFAE--AQCSQTTNNC-----EYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVL 151

Query: 232 AGCSI-----LSDRQPA--GIAGFGRSSESLPSQLGLKK-----FSYCLLSRKFDDAPVS 279
            GCSI     LS  Q A  GI GFG+   S+P+QL  ++     FS+CL   K     + 
Sbjct: 152 FGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILV 211

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
              + +          PG++YTP   + V         Y V LR I V S   ++P    
Sbjct: 212 IGGIAE----------PGMTYTPLVPDSV--------HYNVVLRGISVNSN--RLPIDAE 251

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
              S  + GVI+DSG+T  +     +      F++ +   + A  V  +     CF +SG
Sbjct: 252 DFSSTNDTGVIMDSGTTLAYFPSGAYNV----FVQAIREATSATPVRVQGMDTQCFLVSG 307

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVG------NEVLCL-ILFTDNAAGPALGRGP 452
           + S   P + L F+GGA M L P+NY    G       +V C+    + ++AGP  G   
Sbjct: 308 RLSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDG-SQ 365

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             ILGD  L++  + +DL N R G+    C
Sbjct: 366 LTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 167/389 (42%), Gaps = 69/389 (17%)

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP--AFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           DTGS ++W  C     C  C  P      IP   + P+ SS++ L+ C +P C  + G  
Sbjct: 20  DTGSDVLWVNCRP---CSGC--PRKSALNIPLTMYDPRESSTTSLVSCSDPLC--VRGRR 72

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP-------SKTVPNFLA 232
                  CS     C      Y+  YG G T+ G  + + +++        + T    L 
Sbjct: 73  FAE--AQCSQATNNC-----EYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLF 125

Query: 233 GCSI-----LSDRQPA--GIAGFGRSSESLPSQLGLKK-----FSYCLLSRKFDDAPVSS 280
           GCSI     LS  Q A  GI GFG+   S+P+QL  ++     FS+CL   K     +  
Sbjct: 126 GCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVI 185

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
             + +          PG++YTP   + V         Y V LR I V S   ++P     
Sbjct: 186 GGIAE----------PGMTYTPLVPDSV--------HYNVVLRGISVNSN--RLPIDAED 225

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
             S  + GVI+DSG+T  +     +      F++ +   + A  V  +     CF +SG+
Sbjct: 226 FSSTNDTGVIMDSGTTLAYFPSGAYNV----FVQAIREATSATPVRVQGMDTQCFLVSGR 281

Query: 401 KSVYLPELILKFKGGAKMALPPENYFALVG------NEVLCL-ILFTDNAAGPALGRGPA 453
            S   P + L F+GGA M L P+NY    G       +V C+    + ++AGP  G    
Sbjct: 282 LSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDG-SQL 339

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ILGD  L++  + +DL N R G+    C
Sbjct: 340 TILGDIVLKDKLVVYDLDNSRIGWMSYNC 368


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 126/459 (27%), Positives = 187/459 (40%), Gaps = 65/459 (14%)

Query: 38  PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY--SNSLIKTP 95
           P ST   L H D+    +   LA+S     R    + + K      G ++   +SL   P
Sbjct: 65  PFST--VLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVP 122

Query: 96  LSVHS---YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
           LS  +    G Y   L  GTP   S   + DTGSSL W  C+    CV      V     
Sbjct: 123 LSPGTSVGVGNYVTQLGLGTP-STSYAMVVDTGSSLTWLQCSP---CVVSCHRQVG---- 174

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFT 211
           P F P+ SS+   + C   +C  +    +      CS  N         Y   YG   F+
Sbjct: 175 PLFDPRASSTYTSVRCSASQCDELQAATLNP--SACSASNVCI------YQASYGDSSFS 226

Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFS 265
            G L ++T+ F S + P+F  GC   ++    + AG+ G  R+  SL  QL       FS
Sbjct: 227 VGYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFS 286

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL +        +S   L  GP +        SYT     P+ SSS     Y++ L  +
Sbjct: 287 YCLPT-------AASTGYLSIGPYNTGHY---YSYT-----PMASSSLDASLYFITLSGM 331

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   + +      P    +   I+DSG+  T +   +  A++K   + M    RA   
Sbjct: 332 SVGGSPLAVS-----PSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRA--- 383

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDNAA 444
              S L  CF+    + + +P +++ F GGA M L   N    V +   CL    TD+ A
Sbjct: 384 PAFSILDTCFEGQASQ-LRVPTVVMAFAGGASMKLTTRNVLIDVDDSTTCLAFAPTDSTA 442

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                     I+G+ Q Q F + +D+A  R GF+   C+
Sbjct: 443 ----------IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 129/439 (29%), Positives = 195/439 (44%), Gaps = 72/439 (16%)

Query: 61  SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
           +SS+ R   L++K K      ++G+   +SLI  P +  S  G+ ++LS G+PP  +   
Sbjct: 68  TSSIERFDFLESKIKEL---KSVGNEARSSLI--PFNRGS--GFLVNLSIGSPP-VTQLV 119

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           + DTGSSL+W  C     C++C            F P +S S + +GC  P  ++I G  
Sbjct: 120 VVDTGSSLLWVQCLP---CINCF-----QQSTSWFDPLKSVSFKTLGCGFPGYNYINGYK 171

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTV-----PNFLAGC 234
                  C+  N+        Y L+Y G   + G+L  E+L F +         N   GC
Sbjct: 172 -------CNRFNQ------AEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGC 218

Query: 235 SILS-----DRQPAGIAGFGRSSE-SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP 288
             ++     D    G+ G G     ++ +QLG  KFSYC+      + P+ ++  L  G 
Sbjct: 219 GHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCIGDI---NNPLYTHNHLVLGQ 274

Query: 289 GS---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
           GS   GDS       TP   +       FG  YYV L+ I VGSK +KI  +     SDG
Sbjct: 275 GSYIEGDS-------TPLQIH-------FGH-YYVTLQSISVGSKTLKIDPNAFKISSDG 319

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFD-ISGKKSV 403
           +GGV++DSG T+T +    FE +  E +  M G   R     K  GL  CF  +  +  V
Sbjct: 320 SGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGL--CFKGVVSRDLV 377

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
             P +   F GGA + L   + F   G +  CL +   N+    L      ++G    QN
Sbjct: 378 GFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLS-----VIGILAQQN 432

Query: 464 FYLEFDLANDRFGFAKQKC 482
           + + FDL   +  F +  C
Sbjct: 433 YNVGFDLEQMKVFFRRIDC 451


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 126/408 (30%), Positives = 178/408 (43%), Gaps = 71/408 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
           G Y   L  GTPP+     I DTGS ++W  C S   C  C  P      IP   F P  
Sbjct: 50  GLYYTRLQLGTPPRDFYVQI-DTGSDVLWVSCGS---CNGC--PVNSGLHIPLNFFDPGS 103

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
           S ++ LI C + +CS      ++S    CS +N  C      Y  QYG G  T+G  +S+
Sbjct: 104 SPTASLISCSDQRCSL----GLQSSDSVCSAQNNLC-----GYNFQYGDGSGTSGYYVSD 154

Query: 219 TLRFPS---KTVPN-----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL-- 261
            L F +    +V N      + GCS L       SDR   GI GFG+   S+ SQL    
Sbjct: 155 LLHFDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQG 214

Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
              + FS+CL   K DD+     LVL      G+   P + YTP   +           Y
Sbjct: 215 ISPRAFSHCL---KGDDSG-GGILVL------GEIVEPNIVYTPLVPSQ--------PHY 256

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            + ++ I V  + + I  S  V G+  + G I+DSG+T  +    L EA    FI  + +
Sbjct: 257 NLNMQSISVNGQTLAIDPS--VFGTSSSQGTIIDSGTTLAY----LAEAAYDPFISAITS 310

Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLC 435
               +     S    C+ IS   +   P++ L F GGA M L P++Y    + +G   L 
Sbjct: 311 IVSPSVRPYLSKGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALW 370

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I F         G+G   ILGD  L++    +D+AN R G+A   C+
Sbjct: 371 CIGFQ-----KIQGQG-ITILGDLVLKDKIFVYDIANQRIGWANYDCS 412


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 109/396 (27%), Positives = 177/396 (44%), Gaps = 71/396 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           + +++ FGTP Q  T  IFDTGS + W       +C+ C+  +      P F P +S++ 
Sbjct: 135 FVVTVGFGTPAQTYT-VIFDTGSDVSWI------QCLPCS-GHCYKQHDPIFDPTKSATY 186

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRF 222
            ++ C +P+C+   G    S+C      N TC      Y ++YG G + AG+L  ETL  
Sbjct: 187 SVVPCGHPQCAAADG----SKCS-----NGTC-----LYKVEYGDGSSSAGVLSHETLSL 232

Query: 223 PS-KTVPNFLAGC--SILSD-RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
            S + +P F  GC  + L D     G+ G GR   SL SQ        FSYCL S   D+
Sbjct: 233 TSTRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS---DN 289

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
                  +  T P S D     + YT   +        +  FY+V L  I +G   + +P
Sbjct: 290 TTHGYLTIGPTTPASNDD----VQYTAMVQK-----QDYPSFYFVELVSIDIGGYILPVP 340

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            +        + G  +DSG+  T++    + A+   F   M  Y  A   +       C+
Sbjct: 341 PTLFT-----DDGTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDP---FDTCY 392

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG------ 449
           D +G+ ++++P +  KF  G+   L   ++F +        ++F D+ A PA+G      
Sbjct: 393 DFTGQSAIFIPAVSFKFSDGSVFDL---SFFGI--------LIFPDDTA-PAIGCLGFVA 440

Query: 450 ---RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                P  I+G+ Q +N  + +D+A ++ GFA   C
Sbjct: 441 RPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 121/473 (25%), Positives = 188/473 (39%), Gaps = 77/473 (16%)

Query: 35  PLTPLSTKH-------YLHHSDSDPLKILHSLASSSLSRARHLKT--------KTKPKTK 79
           P +PL+  H        +  +D + ++ +    S++  R +  K         K  P   
Sbjct: 79  PCSPLADAHGKPPAHDEILAADQNRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIH 138

Query: 80  DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC 139
             +  S+ + SL  T     S G Y +++  GTP    T  +FDTGS   W  C  R   
Sbjct: 139 PGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYT-VVFDTGSDTTWVQC--RPCV 195

Query: 140 VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC 199
           V C        + P F P +SS+   + C +  C+ +          GC+  +       
Sbjct: 196 VKCY-----KQKGPLFDPAKSSTYANVSCTDSACADL-------DTNGCTGGHCL----- 238

Query: 200 PSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESL 255
             Y +QYG G +T G    +TL      +  F  GC   ++    + AG+ G GR   SL
Sbjct: 239 --YAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSL 296

Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
             Q   K    F+YCL       A  +    LD GPGS  +       TP   +   +  
Sbjct: 297 TVQAYNKYGGAFAYCL------PALTTGTGYLDFGPGSAGNNA---RLTPMLTDKGQT-- 345

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FYYVG+  I VG + V +  S          G +VDSG+  T +    + A++  F
Sbjct: 346 ----FYYVGMTGIRVGGQQVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAF 396

Query: 373 IRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            + M     A   +K  G   L  C+D +G   V LP + L F+GGA + +        +
Sbjct: 397 DKVM----LARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAI 452

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               +CL  F  N    ++      I+G+ Q + + + +DL     GFA   C
Sbjct: 453 SEAQVCLA-FASNGDDESVA-----IVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 159/372 (42%), Gaps = 52/372 (13%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I DT S L W  C     C D   P  DP+  P++         ++ C +  C  +    
Sbjct: 140 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSY--------AVLPCNSSSCDALQVAT 191

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
             +          +C     SY L Y  G ++ G+L  + L    + +  F+ GC   S+
Sbjct: 192 GSAAGACGGGEQPSC-----SYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SN 245

Query: 240 RQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
           + P    +G+ G GRS  SL SQ   +    FSYCL      ++  S +LVL        
Sbjct: 246 QGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVYR 302

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
           + TP + YT    +PV      G FY+V L  I +G + V+             G VIVD
Sbjct: 303 NSTP-IVYTTMVSDPVQ-----GPFYFVNLTGITIGGQEVE----------SSAGKVIVD 346

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T +   ++ AV  EF+ Q   Y +A      S L  CF+++G + V +P L   F
Sbjct: 347 SGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGF---SILDTCFNLTGFREVQIPSLKFVF 403

Query: 413 KGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
           +G  ++ +      YF    +  +CL L +  +           I+G++Q +N  + FD 
Sbjct: 404 EGNVEVEVDSSGVLYFVSSDSSQVCLALASLKS------EYETSIIGNYQQKNLRVIFDT 457

Query: 471 ANDRFGFAKQKC 482
              + GFA++ C
Sbjct: 458 LGSQIGFAQETC 469


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 122/465 (26%), Positives = 195/465 (41%), Gaps = 95/465 (20%)

Query: 52  PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
           P ++L +   S++  A   ++  +    D+     + + L  TP        Y ++++ G
Sbjct: 57  PARVLEAARRSTVRAAALSRSYVR---VDAPSADGFVSELTSTPFE------YLMAVNIG 107

Query: 112 TPPQASTPFIFDTGSSLVWFPCT--------SRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           TPP      I DTGS L+W  C+        +  R  D   P V       F P +S++ 
Sbjct: 108 TPPTRMVA-IADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQ------FDPSKSTTF 160

Query: 164 QLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +L+ C +  CS +   +   +S+C+               Y   YG G  T+G+L +ET 
Sbjct: 161 RLVDCDSVACSELPEASCGADSKCR---------------YSYSYGDGSHTSGVLSTETF 205

Query: 221 RFP----------SKTVPNFLAGCSI--LSDRQPAGIAGFGRSSESLPSQLGL-----KK 263
            F           +  V N   GCS   +      G+ G G    SL SQLG      ++
Sbjct: 206 TFADAPGARGDGTTTRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRR 265

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL+        V ++  L+ GP +  +  PG   TP   + V +      +Y V LR
Sbjct: 266 FSYCLVPYS-----VKASSALNFGPRAAVTD-PGAVTTPLIPSQVKA------YYIVELR 313

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            + VG+K  + P             +IVDSG+T TF+     EA+    ++++    +  
Sbjct: 314 SVKVGNKTFEAP---------DRSPLIVDSGTTLTFLP----EALVDPLVKELTGRIKLP 360

Query: 384 DVEKKSGLRP-CFDISGKK----SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
             +    L P CFD+SG +    +  +P++ +   GGA + L  EN F  V    LCL  
Sbjct: 361 PAQSPERLLPLCFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCL-- 418

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               A      + PA I+G+   QN ++ +DL      FA   CA
Sbjct: 419 ----AVSAMSEQFPASIIGNIAQQNMHVGYDLDKGTVTFAPAACA 459


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 119/431 (27%), Positives = 179/431 (41%), Gaps = 61/431 (14%)

Query: 65  SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
           +R   + +K   K    ++  + S  L     S    G Y +++  GTP +     IFDT
Sbjct: 65  ARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTP-KNDLSLIFDT 123

Query: 125 GSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
           GS L W  C    R C D         + P F P +S+S   + C +  C  +   +   
Sbjct: 124 GSDLTWTQCQPCVRTCYD--------QKEPIFNPSKSTSYYNVSCSSAACGSL--SSATG 173

Query: 184 RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ 241
               CS  N         Y +QYG   F+ G L  E     +  V      GC    + Q
Sbjct: 174 NAGSCSASNCI-------YGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCG--ENNQ 224

Query: 242 -----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
                 AG+ G GR   S PSQ      K FSYCL S     A  + +L   +   +G S
Sbjct: 225 GLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGS---AGIS 277

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVD 352
           ++  + +TP      G+S     FY + +  I VG + + IP + +  PG+      ++D
Sbjct: 278 RS--VKFTPISTITDGTS-----FYGLNIVAITVGGQKLPIPSTVFSTPGA------LID 324

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T +    + A+   F  +M  Y   + V   S L  CFD+SG K+V +P++   F
Sbjct: 325 SGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSF 381

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
            GGA + L  +  F +     +CL  F  N+         A I G+ Q Q   + +D A 
Sbjct: 382 SGGAVVELGSKGIFYVFKISQVCLA-FAGNS-----DDSNAAIFGNVQQQTLEVVYDGAG 435

Query: 473 DRFGFAKQKCA 483
            R GFA   C+
Sbjct: 436 GRVGFAPNGCS 446


>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
          Length = 392

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 102/391 (26%), Positives = 164/391 (41%), Gaps = 64/391 (16%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P+ +   + DTGS++ W   T+   C                   RS +  ++ C +PKC
Sbjct: 42  PKDNISAVVDTGSNIFW---TTEKEC------------------SRSKTRSMLPCCSPKC 80

Query: 174 SWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSETLRF---PSKT 226
                     S  K  + +   C     +Y ++YG      TAG+L  + L      SK 
Sbjct: 81  EQRASCGCRRSELKAEAEKETKC-----TYAIKYGGNANDSTAGVLYEDKLTIVAVASKA 135

Query: 227 VP------NFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
           VP          GCS        D    G+ G GRS+ SLP QL   KFSYCL S +  D
Sbjct: 136 VPGSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPD 195

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P  S L+L   P    +              +  +S +   Y+V L+ I +G    ++P
Sbjct: 196 LP--SYLLLTAAPDM--ATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGG--TRLP 249

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
                 G    G + VD+G++FT +EG +F  +  E  R M       +   ++  + C+
Sbjct: 250 AVSTKSG----GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICY 305

Query: 396 ---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
                +  +S  LP+++L F   A M LP ++Y     ++ LCL +   N       +G 
Sbjct: 306 SPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSK-LCLAIDKSNI------KGG 358

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             +LG+FQ+QN ++  D  N++  F +  C+
Sbjct: 359 ISVLGNFQMQNTHMLLDTGNEKLSFVRADCS 389


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 162/386 (41%), Gaps = 57/386 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTPPQ +     DT +   W PCT+   C               F P++S++ 
Sbjct: 78  YIVRAKIGTPPQ-TLLLAMDTSNDAAWIPCTACDGCAST-----------LFAPEKSTTF 125

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C  P+C  +  P               C ++  ++ L YG    A  L+ +T+   
Sbjct: 126 KNVSCAAPECKQVPNPG--------------CGVSSCNFNLTYGSSSIAANLVQDTITLA 171

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           +  VP++  GC   +  +   P G+ G GR   SL SQ   L    FSYCL S  F    
Sbjct: 172 TDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 229

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP +   +   + YTP  KNP  SS      YYV L  I VG K V IP +
Sbjct: 230 FSGSLRL--GPVAQPKR---IKYTPLLKNPRRSS-----LYYVNLEAIRVGRKVVDIPPA 279

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P++ AV  EF R++G       V    G   C+++
Sbjct: 280 ALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVG---PKLTVTSLGGFDTCYNV 336

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                + +P +   F  G  + LP +N           CL +    A  P        ++
Sbjct: 337 ----PIVVPTITFIFT-GMNVTLPQDNILIHSTAGSTTCLAM----AGAPDNVNSVLNVI 387

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
            + Q QN  + +D+ N R G A++ C
Sbjct: 388 ANMQQQNHRVLYDVPNSRVGVARELC 413


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 119/453 (26%), Positives = 173/453 (38%), Gaps = 51/453 (11%)

Query: 40  STKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH 99
           S  H +H S   PL+ + +LA    +R   L +K       + + S    S    P    
Sbjct: 26  SVYHNVHPSSPSPLESIIALARDDDARLLFLSSKAA----TAGVSSAPVASGQAPP---- 77

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
               Y +    G+P Q       DT +   W  C+    C   +           F P  
Sbjct: 78  ---SYVVRAGLGSPSQ-QLLLALDTSADATWAHCSPCGTCPSSSL----------FAPAN 123

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
           SSS   + C +  C    G    +   G         L   ++   +        L S+T
Sbjct: 124 SSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDT 183

Query: 220 LRFPSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGL---KKFSYCLLS 270
           LR     +PN+  GC + S   P       G+ G GR   +L SQ G      FSYCL S
Sbjct: 184 LRLGKDAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPS 242

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
             +     S +L L    G+G  +   + YTP  +NP  SS      YYV +  + VG  
Sbjct: 243 --YRSYYFSGSLRL----GAGGGQPRSVRYTPMLRNPHRSS-----LYYVNVTGLSVGHA 291

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            VK+P       +    G +VDSG+  T    P++ A+ +EF RQ+   S          
Sbjct: 292 WVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPS---GYTSLGA 348

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALG 449
              CF+     +   P + +   GG  +ALP EN         L CL +    A  P   
Sbjct: 349 FDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAM----AEAPQNV 404

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                ++ + Q QN  + FD+AN R GFAK+ C
Sbjct: 405 NSVVNVIANLQQQNIRVVFDVANSRVGFAKESC 437


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 164/391 (41%), Gaps = 56/391 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G + + ++ GTP   S   I DTGS L W  C     C DC      P   P + P +SS
Sbjct: 113 GEFLMKMAIGTP-SLSFSAILDTGSDLTWTQCKP---CTDCY-----PQPTPIYDPSQSS 163

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   + C +  C  +                 +C  A   YL  YG    T G+L  E+ 
Sbjct: 164 TYSKVPCSSSMCQAL--------------PMYSCSGANCEYLYSYGDQSSTQGILSYESF 209

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLP----SQLGL---KKFSYCLLSRKF 273
              S+++P+   GC   ++       G        P    SQLG     KFSYCL+S   
Sbjct: 210 TLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVS--I 267

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            D+P  ++ +      S ++KT  +S TP  +     S +   FYY+ L  I VG + + 
Sbjct: 268 TDSPSKTSPLFIGKTASLNAKT--VSSTPLVQ-----SRSRPTFYYLSLEGISVGGQLLD 320

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLR 392
           I         DG GGVI+DSG+T T++E   ++ V K  I  +        V+  + GL 
Sbjct: 321 IADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSI----NLPQVDGSNIGLD 376

Query: 393 PCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
            CF+  SG  + + P +   F+ GA   LP ENY     + + CL +   N         
Sbjct: 377 LCFEPQSGSSTSHFPTITFHFE-GADFNLPKENYIYTDSSGIACLAMLPSNGMS------ 429

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I G+ Q QN+ + +D   +   FA   C
Sbjct: 430 ---IFGNIQQQNYQILYDNERNVLSFAPTVC 457


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  108 bits (269), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 119/431 (27%), Positives = 179/431 (41%), Gaps = 61/431 (14%)

Query: 65  SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
           +R   + +K   K    ++  + S  L     S    G Y +++  GTP +     IFDT
Sbjct: 93  ARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTP-KNDLSLIFDT 151

Query: 125 GSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
           GS L W  C    R C D         + P F P +S+S   + C +  C  +   +   
Sbjct: 152 GSDLTWTQCQPCVRTCYD--------QKEPIFNPSKSTSYYNVSCSSAACGSL--SSATG 201

Query: 184 RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ 241
               CS  N         Y +QYG   F+ G L  E     +  V      GC    + Q
Sbjct: 202 NAGSCSASNCI-------YGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCG--ENNQ 252

Query: 242 P-----AGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
                 AG+ G GR   S PSQ      K FSYCL S     A  + +L   +   +G S
Sbjct: 253 GLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGS---AGIS 305

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVD 352
           ++  + +TP      G+S     FY + +  I VG + + IP + +  PG+      ++D
Sbjct: 306 RS--VKFTPISTITDGTS-----FYGLNIVAITVGGQKLPIPSTVFSTPGA------LID 352

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T +    + A+   F  +M  Y   + V   S L  CFD+SG K+V +P++   F
Sbjct: 353 SGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSF 409

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
            GGA + L  +  F +     +CL  F  N+         A I G+ Q Q   + +D A 
Sbjct: 410 SGGAVVELGSKGIFYVFKISQVCL-AFAGNS-----DDSNAAIFGNVQQQTLEVVYDGAG 463

Query: 473 DRFGFAKQKCA 483
            R GFA   C+
Sbjct: 464 GRVGFAPNGCS 474


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 174/391 (44%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRS 160
           G Y++++  GTP +  T  IFDTGS L W  C    + C     P +DP++        S
Sbjct: 131 GDYAVTVGLGTPKKEFT-LIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTK--------S 181

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           +S + I C +  C  +     ES    CS  + TC      Y +QYG G ++ G   +ET
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGES----CS--SPTCL-----YQVQYGDGSYSIGFFATET 230

Query: 220 LRFPSKTV-PNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRK 272
           L   S  V  NFL GC   +    R  AG+ G GR+  SLPSQ   K    FSYCL    
Sbjct: 231 LTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCL---- 286

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
               P SS+       G   SKT  + +TP  ++   +      FY + + ++ VG   +
Sbjct: 287 ----PASSSSKGYLSFGGQVSKT--VKFTPLSEDFKST-----PFYGLDITELSVGGNKL 335

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            I  S          G ++DSG+  T +    + A++  F + M +Y      +  S   
Sbjct: 336 SIDASIF-----STSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDY---PSTDGYSIFD 387

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALGRG 451
            C+D S  +++ +P++ + FKGG +M +      + + G + +CL  F  N         
Sbjct: 388 TCYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLA-FAGNGDDV----- 441

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            A I G+ Q + + + +D A  R GFA   C
Sbjct: 442 KAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 118/430 (27%), Positives = 187/430 (43%), Gaps = 57/430 (13%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH---SYGGYSISLSFGTPPQASTPFIF 122
           R R ++ + + K    N  S+  +S I+ PL+         Y +++  G     +   I 
Sbjct: 94  RVRSMQNRIRAKVSGHN--SSEQSSEIQIPLASGINLETLNYIVTIGLGNQ---NMTVII 148

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           DTGS L W  C     C+ C        + P F P  SSS   + C +  C  +      
Sbjct: 149 DTGSDLTWVQCDP---CMSCY-----SQQGPVFNPSNSSSYNSLLCNSSTCQNL--QFTT 198

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR- 240
              + C   N   P +C ++ + YG G FT G L  E L F   +V NF+ GC   +   
Sbjct: 199 GNTEACESNN---PSSC-NHTVSYGDGSFTDGELGVEHLSFGGISVSNFVFGCGRNNKGL 254

Query: 241 --QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
               +GI G GRS+ S+ SQ        FSYCL +    D+  S +LV+        + T
Sbjct: 255 FGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTT---DSGASGSLVIGNESSLFKNLT 311

Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
           P ++YT    NP         FY + L  I VG   ++         S GNGG+++DSG+
Sbjct: 312 P-IAYTSMVSNP-----QLSNFYVLNLTGIDVGGVAIQ-------DTSFGNGGILIDSGT 358

Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
             T +   L+ A+  EF++Q   Y  A  +   S L  CF+++G + V +P L + F+  
Sbjct: 359 VITRLAPSLYNALKAEFLKQFSGYPIAPAL---SILDTCFNLTGIEEVSIPTLSMHFENN 415

Query: 416 AKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
             + +      Y    G++V CL      A           I+G++Q +N  + +D    
Sbjct: 416 VDLNVDAVGILYMPKDGSQV-CL------ALASLSDENDMAIIGNYQQRNQRVIYDAKQS 468

Query: 474 RFGFAKQKCA 483
           + GFA++ C+
Sbjct: 469 KIGFAREDCS 478


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 125/461 (27%), Positives = 203/461 (44%), Gaps = 68/461 (14%)

Query: 36  LTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTP 95
           + P S++   ++     L+ + ++ + S+ RA +L          +++ S   N L K  
Sbjct: 32  IHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYL----------NHVFSLSHNDLPKPT 81

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           +  ++   Y +S S GTPP      + DTGS  +WF C     C++           P F
Sbjct: 82  IIPYAGSYYVMSYSIGTPP-FQLYGVVDTGSDGIWFQCKPCKPCLN--------QTSPIF 132

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
            P +SS+ + I C +P C        ++RC   S R + C     +YL + G   + G +
Sbjct: 133 NPSKSSTYKNIRCSSPICK----RGEKTRCS--SNRKRKCEYEI-TYLDRSG---SQGDI 182

Query: 216 LSETLRFPSK-----TVPNFLAGC----SILSDRQPAGIAGFGRSSESLPSQLGLK---K 263
             +TL   S      + P  + GC    S+ ++   +GI GFGR + S+ SQLG     K
Sbjct: 183 SKDTLTLNSNDGSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGK 242

Query: 264 FSYCLLSRKFDDAPVSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           FSYCL S  F  A +SS L   D    SG     G+  TP  +      S +   Y+  L
Sbjct: 243 FSYCLASL-FSKANISSKLYFGDMAVVSGH----GVVSTPLIQ------SFYVGNYFTNL 291

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
               VG   +K+  S L+P ++GN   ++DSGST T +   ++  +    I  M    R 
Sbjct: 292 EAFSVGDHIIKLKDSSLIPDNEGNA--VIDSGSTITQLPNDVYSQLETAVI-SMVKLKRV 348

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
            D  ++  L  C+  + KK   +P +   F+ GA + L   N F  + +EV+C   F  N
Sbjct: 349 KDPTQQLSL--CYKTTLKK-YEVPIITAHFR-GADVKLNAFNTFIQMNHEVMC---FAFN 401

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           ++       P ++ G+   QNF + +D   +   F    C 
Sbjct: 402 SSA-----FPWVVYGNIAQQNFLVGYDTLKNIISFKPTNCT 437


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 119/431 (27%), Positives = 179/431 (41%), Gaps = 61/431 (14%)

Query: 65  SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
           +R   + +K   K   +++  + S  L     S    G Y +++  GTP +     IFDT
Sbjct: 94  ARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTP-KNDLSLIFDT 152

Query: 125 GSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
           GS L W  C    R C D         + P F P +S+S   + C +  C  +   +   
Sbjct: 153 GSDLTWTQCQPCVRTCYD--------QKEPIFNPSKSTSYYNVSCSSAACGSL--SSATG 202

Query: 184 RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ 241
               CS  N         Y +QYG   F+ G L  +     S  V      GC    + Q
Sbjct: 203 NAGSCSASNCI-------YGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCG--ENNQ 253

Query: 242 -----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
                 AG+ G GR   S PSQ      K FSYCL S     A  + +L   +   +G S
Sbjct: 254 GLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGS---AGIS 306

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVD 352
           ++  + +TP      G+S     FY + +  I VG + + IP + +  PG+      ++D
Sbjct: 307 RS--VKFTPISTITDGTS-----FYGLNIVAITVGGQKLPIPSTVFSTPGA------LID 353

Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
           SG+  T +    + A+   F  +M  Y   + V   S L  CFD+SG K+V +P++   F
Sbjct: 354 SGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSF 410

Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
            GGA + L  +  F       +CL  F  N+         A I G+ Q Q   + +D A 
Sbjct: 411 SGGAVVELGSKGIFYAFKISQVCL-AFAGNS-----DDSNAAIFGNVQQQTLEVVYDGAG 464

Query: 473 DRFGFAKQKCA 483
            R GFA   C+
Sbjct: 465 GRVGFAPNGCS 475


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 110/396 (27%), Positives = 160/396 (40%), Gaps = 70/396 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           + +++  GTP Q S   IFDTGS L W    PC S   C         P + P F P +S
Sbjct: 149 FVVAVGLGTPAQPSA-LIFDTGSDLSWVQCQPCGSSGHC--------HPQQDPLFDPSKS 199

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
           S+   + C  P+C+   G         CS  N TC      YL+ YG G  T G+L  +T
Sbjct: 200 STYAAVHCGEPQCAAAGGL--------CSEDNTTC-----LYLVHYGDGSSTTGVLSRDT 246

Query: 220 LRFPS-KTVPNFLAGCSILSDRQPAGIAGFGR---------SSESLPSQLGLK---KFSY 266
           L   S + +  F  GC   +      +  FGR            SLPSQ        FSY
Sbjct: 247 LALTSSRALAGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSY 300

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           CL S        +  L +   P +    T    YT   + P      F  FY+V L  I 
Sbjct: 301 CLPSSN----STTGYLTIGATPAT---DTGAAQYTAMLRKP-----QFPSFYFVELVSID 348

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +G   + +P     P     GG ++DSG+  T++    +E +   F   M  Y+ A    
Sbjct: 349 IGGYILPVP-----PAVFTRGGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPA---P 400

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
               L  C+D +G+  V +P +  +F  GA   L        +   V CL     +A G 
Sbjct: 401 PNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDAGGL 460

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                P  I+G+ Q ++  + +D+A ++ GF    C
Sbjct: 461 -----PLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 121/404 (29%), Positives = 173/404 (42%), Gaps = 68/404 (16%)

Query: 92  IKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
           I  P  +  Y G   Y I++ FGTP +  T  IFDTGS++ W  C     CV   +P  +
Sbjct: 1   ISIPARIGLYIGTANYVITVGFGTPKKNQT-VIFDTGSNVNWIQCKP---CVVSCYPQQE 56

Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
           P     F P  SS+ + I C +  C+ +         +GCS    TC      Y + YG 
Sbjct: 57  P----LFDPTLSSTYRNISCTSAACTGL-------SSRGCS--GSTCV-----YGVTYGD 98

Query: 209 GF-TAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL 261
           G  T G L +ET    +  V  NF+ GC    + Q      AG+ G GRS  SL SQL  
Sbjct: 99  GSSTVGFLATETFTLAAGNVFNNFIFGCG--QNNQGLFTGAAGLIGLGRSPYSLNSQLAT 156

Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
                FSYCL S        S+   L+ G      +TPG  YT    N     S     Y
Sbjct: 157 SLGNIFSYCLPSTS------SATGYLNIG---NPLRTPG--YTAMLTN-----SRAPTLY 200

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
           ++ L  I VG   + +  +        + G I+DSG+  T +    + A+   F   M  
Sbjct: 201 FIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGTVITRLPPTAYGALRTAFRAAMTQ 255

Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
           Y+RAA     S L  C+D S   +V  P + L +  G  + +P    F ++ +  +CL  
Sbjct: 256 YTRAA---AASILDTCYDFSRTTTVTFPTIKLHYT-GLDVTIPGAGVFYVISSSQVCL-A 310

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           F  N+    +G     I+G+ Q +   + +D A  R GFA   C
Sbjct: 311 FAGNSDSTQIG-----IIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 129/457 (28%), Positives = 191/457 (41%), Gaps = 78/457 (17%)

Query: 52  PLKILHSLASSSLSRAR--HLKTKTKPKTKDSNIGSNYSN-------SLIKTPLSVHSYG 102
           PL   H   S  +S+ +  H +T  + + + +NI +  S+        L ++ +++ +  
Sbjct: 62  PLVHRHGPCSPVMSKEKPSHEETLGRDQLRAANIHAKLSSPRNSSAKELQQSGVTIPTSS 121

Query: 103 GYS-------ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           GYS       I++S GTP       I DTGS + W       +C  C   +    +   F
Sbjct: 122 GYSLGTPEYVITVSLGTPAVTQVMSI-DTGSDVSWV------QCAPCAAQSCSSQKDKLF 174

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGL 214
            P +S++     C + +C+ + G        GC   N  C      Y+++Y     T G 
Sbjct: 175 DPAKSATYSAFSCSSAQCAQLGGEG-----NGC--LNSHC-----QYIVKYVDHSNTTGT 222

Query: 215 LLSETLRFP-SKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYC 267
             S+TL    S  V NF  GCS  ++    Q  G+ G G  +ESL SQ      K FSYC
Sbjct: 223 YGSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYC 282

Query: 268 LLSRKFDDAPVSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           L        P SS+    L  G  +G + +   S TP  +  V +      FY V L+ I
Sbjct: 283 L-------PPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPT------FYGVFLQAI 329

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            V    + +P S        +G  +VDSG+  T +    ++A+   F ++M  Y  AA V
Sbjct: 330 TVAGTKLNVPASVF------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPV 383

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
                L  CFD SG K+V +P + L F  GA M L     F        CL  FT  A  
Sbjct: 384 GI---LDTCFDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIF-----YAGCLA-FTATAQ- 433

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                G   ILG+ Q + F + FD+     GF    C
Sbjct: 434 ----DGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 138/496 (27%), Positives = 203/496 (40%), Gaps = 95/496 (19%)

Query: 11  LFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
           + +L  L+ T    A  S    ++ L P        HS   PL     +  + L ++  L
Sbjct: 5   VLTLFFLVSTMLVDASKSLMGFSIDLIP-------RHSPISPL-YNSQMTQTELVKSAAL 56

Query: 71  KTKTKPKTKDSNIGSNYSNSL--IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSL 128
           ++ T+  +K  N     S  L  I TP+  H  G Y +  S GTP       IFDTGS L
Sbjct: 57  RSITR--SKRVNFIGQISPPLSPIITPIPDH--GEYLMRFSLGTP-SVERLAIFDTGSDL 111

Query: 129 VWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGC 188
            W  CT    C         P   P F P +SS+   + C++  C+    P  +  C   
Sbjct: 112 SWLQCTPCKTCY--------PQEAPLFDPTQSSTYVDVPCESQPCTLF--PQNQRECGS- 160

Query: 189 SPRNKTCPLACPSYLLQYGL-GFTAGLLLSETLRFPSK-------TVPNFLAGCSILSD- 239
              +K C      YL QYG   FT G L  +T+ F S        T P  + GC+  S+ 
Sbjct: 161 ---SKQC-----IYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCAFYSNF 212

Query: 240 -----RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
                 +  G  G G    SL SQLG +   KFSYC++       P SS     TG    
Sbjct: 213 TFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMV-------PFSST---STGKLKF 262

Query: 292 DSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
            S  P   +  TPF  NP     ++  +Y + L  I VG K V       + G  G G +
Sbjct: 263 GSMAPTNEVVSTPFMINP-----SYPSYYVLNLEGITVGQKKV-------LTGQIG-GNI 309

Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSVYLPE 407
           I+DS    T +E    + +  +FI  +     A +VE        F+  +    ++  PE
Sbjct: 310 IIDSVPILTHLE----QGIYTDFISSV---KEAINVEVAEDAPTPFEYCVRNPTNLNFPE 362

Query: 408 LILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
            +  F  GA + L P+N F  + N ++C+ +       P+ G     I G++   NF +E
Sbjct: 363 FVFHFT-GADVVLGPKNMFIALDNNLVCMTVV------PSKGIS---IFGNWAQVNFQVE 412

Query: 468 FDLANDRFGFAKQKCA 483
           +DL   +  FA   C+
Sbjct: 413 YDLGEKKVSFAPTNCS 428


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 90/287 (31%), Positives = 134/287 (46%), Gaps = 42/287 (14%)

Query: 211 TAGLLLSETLRFPS-KTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
           T GLL  +   F +  +VP    GC + ++        GIAGFGR   SLPSQL +  FS
Sbjct: 226 TTGLLEVDKFTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS 285

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           +C  +    +    S ++LD       +    +  TP  +N     SA    YY+ L+ I
Sbjct: 286 HCFTAV---NGLKQSTVLLDLLADLYKNGRGAVQSTPLIQN-----SANPTLYYLSLKGI 337

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM------GNY 379
            VGS  + +P S     ++G GG I+DSG++ T +   +++ V  EF  Q+      GN 
Sbjct: 338 TVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN- 395

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLC 435
                    +G   CF    +    +P+L+L F+ GA M LP ENY   V    GN ++C
Sbjct: 396 --------ATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSMIC 446

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           L +         LG   A I G+FQ QN ++ +DL N+   F   +C
Sbjct: 447 LAI-------NELGDERATI-GNFQQQNMHVLYDLQNNMLSFVAAQC 485



 Score = 61.6 bits (148), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 73/152 (48%), Gaps = 32/152 (21%)

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM------GN 378
           I VGS  + +P S     ++G GG I+DSG++ T +   +++ V  EF  Q+      GN
Sbjct: 42  ITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 100

Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVL 434
                     +G   CF    +    +P+L+L F+ GA M LP ENY   V    GN ++
Sbjct: 101 ---------ATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSII 150

Query: 435 CLILFTDNAAGPALGRG-PAIILGDFQLQNFY 465
           CL          A+ +G    I+G+FQ QN +
Sbjct: 151 CL----------AINKGDETTIIGNFQQQNMH 172


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 134/471 (28%), Positives = 195/471 (41%), Gaps = 77/471 (16%)

Query: 27  SSAATVTVPLTPLSTKHYLHHSDSD-PLKILHSLASS---SLSRARHLKTK-TKPKTKDS 81
           SS+   TVPL      H+ H   S  P K + SL         RA ++K K +    KD 
Sbjct: 52  SSSGATTVPL------HHRHGPCSPLPTKKMPSLEDRLHRDQLRAAYIKRKFSGDVKKDG 105

Query: 82  NIGSNYSNSLIKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
                   S +  P ++ +      Y I++  G+P +  T  I D+GS + W  C     
Sbjct: 106 QGAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLI-DSGSDVSWVQCKP--- 161

Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
           C+ C+   VDP     F P  SS+     C +  C+ +          GCS  ++     
Sbjct: 162 CLQCH-SQVDP----LFDPSLSSTYSPFSCSSAACAQL-----GQDGNGCSSSSQC---- 207

Query: 199 CPSYLLQYGLGF-TAGLLLSETLRFPSKTVPNFLAGCSILS---DRQPAGIAGFGRSSES 254
              Y+++Y  G  T G   S+TL   S T+ NF  GCS +    +    G+ G G  + S
Sbjct: 208 --QYIVRYADGSSTTGTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPS 265

Query: 255 LPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
           L SQ        FSYCL        P SS  +           T G   + F K P+  S
Sbjct: 266 LASQTAGTFGTAFSYCL-----PPTPSSSGFL-----------TLGAGTSGFVKTPMLRS 309

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
           S    FY V L  I VG   + IP S        + G+++DSG+  T +    + A++  
Sbjct: 310 SPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGMVMDSGTIITRLPRTAYSALSSA 363

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
           F   M  Y  A     +S +  CFD SG+ SV LP + L F GGA + L       ++GN
Sbjct: 364 FKAGMKQYRPA---PPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLDANGI--ILGN 418

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              CL  F  N+   + G     I+G+ Q + F + +D+     GF    C
Sbjct: 419 ---CLA-FAANSDDSSPG-----IVGNVQQRTFEVLYDVGGGAVGFKAGAC 460


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 162/368 (44%), Gaps = 54/368 (14%)

Query: 123 DTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
           DTGS + W    PC S   C    +   DP     F PK SSS   + C + +C  +   
Sbjct: 166 DTGSDVTWLQCQPCASENTC----YKQFDP----IFDPKSSSSYSPLSCNSQQCKLLDKA 217

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
           N  S          TC      Y + YG G FT G L +ETL F  S ++PN   GC   
Sbjct: 218 NCNS---------DTCI-----YQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHD 263

Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
           ++      AG+ G G  + SL SQL    FSYCL++    D+  SS L  ++   S DS 
Sbjct: 264 NEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFNSNMPS-DSL 319

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
           T     +P  KN       F  + YV +  I VG K + I  +       G GG+IVDSG
Sbjct: 320 T-----SPLVKN-----DRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSG 369

Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
           +  + +   ++E++ + F++   + S A  +   S    C++ SG+ +V +P +      
Sbjct: 370 TIISRLPSDVYESLREAFVKLTSSLSPAPGI---SVFDTCYNFSGQSNVEVPTIAFVLSE 426

Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
           G  + LP  NY  ++       + F    +  +       I+G FQ Q   + +DL N  
Sbjct: 427 GTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS-------IIGSFQQQGIRVSYDLTNSL 479

Query: 475 FGFAKQKC 482
            GF+  KC
Sbjct: 480 VGFSTNKC 487


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 158/387 (40%), Gaps = 55/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  + + GTPPQ ++  I D    LVW  C    RC +           P F P  S++ 
Sbjct: 51  YVANFTIGTPPQPASAVI-DLAGELVWTQCKQCSRCFE--------QDTPLFDPTASNTY 101

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           +   C  P C  I  P+    C G       C     +Y      G T G + ++T  F 
Sbjct: 102 RAEPCGTPLCESI--PSDSRNCSG-----NVC-----AYQASTNAGDTGGKVGTDT--FA 147

Query: 224 SKTVPNFLA-GCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
             T    LA GC + SD      P+GI G GR+  SL +Q G+  FSYCL      DA  
Sbjct: 148 VGTAKASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGR 204

Query: 279 SSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           +S L L +     G G + +     TPF  N  G+ +    +Y V L  +  G   + +P
Sbjct: 205 NSALFLGSSAKLAGGGKAAS-----TPFV-NISGNGNDLSNYYKVQLEGLKAGDAMIPLP 258

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S    GS     V++D+ S  +F+    ++AV K     +G    A  VE       CF
Sbjct: 259 PS----GST----VLLDTFSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEP---FDLCF 307

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
             SG      P+L+  F+GGA M +P  NY     N  +CL + +              +
Sbjct: 308 PKSGASGAA-PDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELS---L 363

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG  Q +N +  FDL  +   F    C
Sbjct: 364 LGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 164/391 (41%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP    T  +FDTGS   W  C     CV   +   +      F P RSS
Sbjct: 180 GNYVVTIGLGTPASRYT-VVFDTGSDTTWVQCQP---CVVVCYKQQEK----LFDPARSS 231

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   + C  P CS ++        +GCS  +         Y +QYG G ++ G    +TL
Sbjct: 232 TYANVSCAAPACSDLY-------TRGCSGGHCL-------YSVQYGDGSYSIGFFAMDTL 277

Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
              S   V  F  GC   ++    + AG+ G GR   SLP Q   K    F++CL +R  
Sbjct: 278 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS- 336

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                S    LD GPGS  +       TP   +   +      FYYVG+  I VG + + 
Sbjct: 337 -----SGTGYLDFGPGS-PAAVGARQTTPMLTDNGPT------FYYVGMTGIRVGGQLLS 384

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGL 391
           IP S          G IVDSG+  T +    + ++   F   M    Y +A  +   S L
Sbjct: 385 IPQSVF-----STAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAL---SLL 436

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D +G   V +P++ L F+GGA + +             +CL  F  N     +G  
Sbjct: 437 DTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCL-GFAANEDDDDVG-- 493

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ QL+ F + +D+     GF+   C
Sbjct: 494 ---IVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 165/391 (42%), Gaps = 61/391 (15%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + +  GTP Q   P +   DT S + W PC+    CV C      PS   AF P +S+
Sbjct: 99  YIVKVLIGTPAQ---PLLLAMDTSSDVAWIPCSG---CVGC------PSNT-AFSPAKST 145

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           S + + C  P+C  +  P   +R             AC S+ L YG    A  L  +T+R
Sbjct: 146 SFKNVSCSAPQCKQVPNPACGAR-------------AC-SFNLTYGSSSIAANLSQDTIR 191

Query: 222 FPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
             +  +  F  GC        +I   +   G+     S  S    +    FSYCL S  F
Sbjct: 192 LAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPS--F 249

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                S +L L  GP S   +   + YT   +NP  SS      YYV L  I VG K V 
Sbjct: 250 RSLTFSGSLRL--GPTSQPQR---VKYTQLLRNPRRSS-----LYYVNLVAIRVGRKVVD 299

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +P + +        G I DSG+ +T +  P++EAV  EF +++   +  A V    G   
Sbjct: 300 LPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPT--AVVTSLGGFDT 357

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
           C+  SG+  V +P +   FK G  M +P +N           CL +    A+ P      
Sbjct: 358 CY--SGQ--VKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAM----ASAPENVNSV 408

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             ++   Q QN  +  D+ N R G A+++C+
Sbjct: 409 VNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 122/467 (26%), Positives = 193/467 (41%), Gaps = 68/467 (14%)

Query: 42  KHYLHHSDSDPL---KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV 98
           +H L     DP+   + L  L ++  SRA   + +       ++  S  +   + + + +
Sbjct: 80  RHSLTAIPEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRL 139

Query: 99  HSYGGYSISLSFGTP---PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
            +   Y  ++S G     P A+   I DTGS L W  C     C  C        R P F
Sbjct: 140 QTLN-YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKP---CSACY-----AQRDPLF 190

Query: 156 IPKRSSSSQLIGCQNPKC--SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTA 212
            P  S++   + C    C  S          C      ++ C      Y L YG G F+ 
Sbjct: 191 DPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKC-----YYALAYGDGSFSR 245

Query: 213 GLLLSETLRFPSKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLK---KFS 265
           G+L ++T+     ++  F+ GC  LS+R      AG+ G GR+  SL SQ   +    FS
Sbjct: 246 GVLATDTVALGGASLGGFVFGCG-LSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFS 304

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL +    DA  S +L       S    T  ++YT    +P     A   FY++ +   
Sbjct: 305 YCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADP-----AQPPFYFLNVTGA 359

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG        + L     G   V++DSG+  T +   ++ AV  EF+RQ G    AA  
Sbjct: 360 AVGG-------TALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFG----AAGY 408

Query: 386 EKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLIL- 438
               G   L  C+D++G   V +P L L+ +GGA + +       +V   G++V CL + 
Sbjct: 409 PAAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQV-CLAMA 467

Query: 439 ---FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              + D             I+G++Q +N  + +D    R GFA + C
Sbjct: 468 SLSYEDETP----------IIGNYQQKNKRVVYDTLGSRLGFADEDC 504


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 114/393 (29%), Positives = 166/393 (42%), Gaps = 60/393 (15%)

Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + L+ GTPP    PFI   DTGS L W  C     C   + P  D +   +F P    
Sbjct: 83  YLMELAIGTPP---VPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSP---- 135

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
               + C +  C  I+     SRC            + PS   +Y   +  G    E   
Sbjct: 136 ----LPCSSATCLPIW----SSRC------------STPSATCRYRYAYDDGAYSPECAG 175

Query: 222 FPSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS--RKFDDA 276
               +V     GC + +        G  G GR S SL +QLG+ KFSYCL         +
Sbjct: 176 I---SVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSS 232

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
           PV    + +    S  +    +  TP  ++P   S      YYV L  I +G   + IP 
Sbjct: 233 PVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSR-----YYVSLEGISLGDARLPIPN 287

Query: 337 -SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            ++ +   DG+GG+IVDSG+ FT +    F  V       +G       V   S  RPCF
Sbjct: 288 GTFDLNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAGVLGQ----PVVNASSLDRPCF 343

Query: 396 --DISGKKSV-YLPELILKFKGGAKMALPPENYFALVGNE-VLCL-ILFTDNAAGPALGR 450
               +G + +  +P+++L F GGA M L  +NY +    E   CL I+ T++A+G     
Sbjct: 344 PAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGS---- 399

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               +LG+FQ QN  + FD+   +  F    C+
Sbjct: 400 ----VLGNFQQQNIQMLFDITVGQLSFMPTDCS 428


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 124/407 (30%), Positives = 180/407 (44%), Gaps = 69/407 (16%)

Query: 89  NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW---FPCTSRYRCVDCNFP 145
           + L +TP++  + G Y I +S+G PPQ ST  I DTGS L W    PC S Y  +   F 
Sbjct: 76  DQLFETPVASGN-GEYLIDISYGNPPQKSTA-IVDTGSDLNWVQCLPCKSCYETLSAKF- 132

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
             DPS+        S+S + +GC +  C  +                ++C  +C  Y   
Sbjct: 133 --DPSK--------SASYKTLGCGSNFCQDL--------------PFQSCAASC-QYDYM 167

Query: 206 YGLGF-TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLG- 260
           YG G  T+G L ++ +   +  +PN   GC   ++ +     G+ G G+   SL SQLG 
Sbjct: 168 YGDGSSTSGALSTDDVTIGTGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGG 227

Query: 261 --LKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPG-LSYTPFYKNPVGSSSAFGE 316
              KKFSYCL+       P+ S     T P   GDS   G ++YTP   N     + +  
Sbjct: 228 TATKKFSYCLV-------PLGST---KTSPLYIGDSTLAGGVAYTPMLTN-----NNYPT 272

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           FYY  L+ I V  K V  P +     + G GG+I+DSG+T T+++   F  +    ++  
Sbjct: 273 FYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAFNPMVAA-LKAA 331

Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLC 435
             Y  A       GL  CF  +G  +   P ++  F  GA +AL P+N F AL      C
Sbjct: 332 LPYPEADG--SFYGLEYCFSTAGVANPTYPTVVFHFN-GADVALAPDNTFIALDFEGTTC 388

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           L + +              I G+ Q  N  +  DL N R GF    C
Sbjct: 389 LAMASSTGFS---------IFGNIQQLNHVIVHDLVNKRIGFKSANC 426


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 118/408 (28%), Positives = 168/408 (41%), Gaps = 71/408 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTPP+     + DTGS ++W  C S   C  C   +    ++  F P  S 
Sbjct: 79  GLYYTKLRLGTPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  I C + +CSW     ++S   GCS +N  C     +Y  QYG G  T+G  +S+ L
Sbjct: 135 TASPISCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185

Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
           +F     S  VPN  A    GCS       + SDR   GI GFG+   S+ SQL      
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL             LVL      G+   P + +TP   +           Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L  I V  + + I  S     S  NG G I+D+G+T  ++     EA    F+  + N 
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLC 435
              +     S    C+ I+       P + L F GGA M L P++Y      + G  V C
Sbjct: 341 VSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWC 400

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +        G         ILGD  L++    +DL   R G+A   C+
Sbjct: 401 IGFQRIQNQG-------ITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 130/446 (29%), Positives = 190/446 (42%), Gaps = 61/446 (13%)

Query: 49  DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
           D + ++ LHS       R  + ++ +   T D   G +  ++ +K+ LS+ S G Y + +
Sbjct: 60  DEERVRFLHS-------RLTNKESASNSATTDKLGGPSLVSTPLKSGLSIGS-GNYYVKI 111

Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
             GTP +  +  I DTGSSL W  C     CV      VDP     F P  S + + + C
Sbjct: 112 GVGTPAKYFS-MIVDTGSSLSWLQCQP---CVIYCHVQVDP----IFTPSVSKTYKALSC 163

Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV 227
            + +CS +    + +   GCS     C      Y   YG   F+ G L  + L       
Sbjct: 164 SSSQCSSLKSSTLNA--PGCSNATGAC-----VYKASYGDTSFSIGYLSQDVLTLTPSAA 216

Query: 228 PN--FLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
           P+  F+ GC    D Q      AGI G      S+  QL  K    FSYCL S  F   P
Sbjct: 217 PSSGFVYGCG--QDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPS-SFSAQP 273

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PY 336
            SS     +   S  S +P   +TP  KNP          Y++GL  I V  K + +   
Sbjct: 274 NSSVSGFLSIGASSLSSSP-YKFTPLVKNP-----KIPSLYFLGLTTITVAGKPLGVSAS 327

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
           SY VP        I+DSG+  T +   ++ A+ K F+  M    + A     S L  CF 
Sbjct: 328 SYNVP-------TIIDSGTVITRLPVAIYNALKKSFVMIMSK--KYAQAPGFSILDTCFK 378

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
            S K+   +PE+ + F+GGA + L   N    +     CL +        A    P  I+
Sbjct: 379 GSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIEKGTTCLAI--------AASSNPISII 430

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
           G++Q Q F + +D+AN + GFA   C
Sbjct: 431 GNYQQQTFTVAYDVANSKIGFAPGGC 456


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 122/402 (30%), Positives = 169/402 (42%), Gaps = 72/402 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIF----DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
           G Y   ++ GTP +  + F      D GS + W  C   +RC         P   P +  
Sbjct: 123 GEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYH------QPG--PVYNR 174

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLL 216
            +SSS+  +GC  P C  +          GC      C      Y ++YG G + AG   
Sbjct: 175 LKSSSASDVGCYAPACRAL------GSSGGCVQFLNEC-----QYKVEYGDGSSSAGDFG 223

Query: 217 SETLRFPSKT-VPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLK---KFSY 266
            ETL FP    VP    GC   SD Q       AGI G GR S S PSQ+  +    FSY
Sbjct: 224 VETLTFPPGVRVPGVAIGCG--SDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSY 281

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           CL  +       SS L   +G  +  + T   S+TP   N     S    FYYVGL  I 
Sbjct: 282 CLAGQG--TGGRSSTLTFGSGASATTTTTTPPSFTPMLTN-----SRMYTFYYVGLVGIS 334

Query: 327 VGSKHVK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           VG   V+ +  S L +  S G+GGVIVDSG+  T + GP + A    F        R A 
Sbjct: 335 VGGVRVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF--------RVAA 386

Query: 385 VEKKSGLRP---------CF-DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-- 432
           V++     P         C+  + G+    +P + + F GG ++ LPP+NY   V +   
Sbjct: 387 VKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKG 446

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
            +C         G +       I+G+ QLQ F + +D+   R
Sbjct: 447 TMCFAFAGSGDRGVS-------IIGNIQLQGFRVVYDVDGQR 481


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 128/431 (29%), Positives = 179/431 (41%), Gaps = 71/431 (16%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLI--KTPLSVHSYGGYSISLSFGTPPQASTPFIFD 123
           RA++++ K    +     G   S ++    T  S      Y I++S GTP       I D
Sbjct: 85  RAKYIQAKLSVNSGSGTDGVQQSAAITLPTTLGSALDTLAYVITVSIGTPAMTQAVMI-D 143

Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
           TGS + W  C +R       F          F P +SS+     C +  C+      +E 
Sbjct: 144 TGSDVSWVHCHARAGAGSSLF----------FDPGKSSTYTPFSCSSAACT-----RLEG 188

Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS-KTVPNFLAGCSILSD-- 239
           R  GCS  N TC      Y ++YG G  T G   S+TL   S + V NF  GCS  SD  
Sbjct: 189 RDNGCS-LNSTC-----QYTVRYGDGSNTTGTYGSDTLALNSTEKVENFQFGCSETSDPG 242

Query: 240 -----RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
                 Q  G+ G G  + SL SQ        FSYCL       A   S+  L  G  +G
Sbjct: 243 EGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCL------PATTRSSGFLTLGASTG 296

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
              T G   TP +++    +     FY+V L+ I VG   V I  +    GS      I+
Sbjct: 297 ---TSGFVTTPMFRSRRAPT-----FYFVILQGINVGGDPVAISPTVFAAGS------IM 342

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DSG+  T +    + A++  F   M  Y RA      S L  CFD +G+ +V +P + L 
Sbjct: 343 DSGTIITRLPPRAYSALSAAFRAGMRRYPRA---RAFSILDTCFDFTGQDNVSIPAVELV 399

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F GGA + L  +    + G+   CL      A  PA G G   I+G+ Q + F +  D+ 
Sbjct: 400 FSGGAVVDLDADGI--MYGS---CL------AFAPATG-GIGSIIGNVQQRTFEVLHDVG 447

Query: 472 NDRFGFAKQKC 482
               GF    C
Sbjct: 448 QSVLGFRPGAC 458


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 126/437 (28%), Positives = 175/437 (40%), Gaps = 74/437 (16%)

Query: 47  HSDSDPLKILHSLASSSL---SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG 103
           + D + +K ++S  S +L   S    L + T P    S IGS                G 
Sbjct: 102 NQDKERVKYINSRLSKNLGQDSSVEELDSATLPAKSGSLIGS----------------GN 145

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + +  GTP +     IFDTGS L W  C     C    +   D      F P +S+S 
Sbjct: 146 YFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEP---CARSCYKQQDV----IFDPSKSTSY 197

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             I C +  C+ +          GCS   K C      Y +QYG   F+ G    E L  
Sbjct: 198 SNITCTSALCTQL--STATGNDPGCSASTKACI-----YGIQYGDSSFSVGYFSRERLTV 250

Query: 223 -PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
             +  V NFL GC    + Q      AG+ G GR   S   Q   K    FSYCL S   
Sbjct: 251 TATDVVDNFLFGCG--QNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPSTS- 307

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                SS   L  GP +       L YTPF     GSS     FY + +  I VG   VK
Sbjct: 308 -----SSTGHLSFGPAATGRY---LKYTPFSTISRGSS-----FYGLDITAIAVGG--VK 352

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +P S     +   GG I+DSG+  T +    + A+   F + M  Y  A ++   S L  
Sbjct: 353 LPVS---SSTFSTGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGEL---SILDT 406

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C+D+SG K   +P +   F GG  + LPP+    +   + +CL  F  N     +     
Sbjct: 407 CYDLSGYKVFSIPTIEFSFAGGVTVKLPPQGILFVASTKQVCLA-FAANGDDSDV----- 460

Query: 454 IILGDFQLQNFYLEFDL 470
            I G+ Q +   + +D+
Sbjct: 461 TIYGNVQQRTIEVVYDV 477


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 106/380 (27%), Positives = 156/380 (41%), Gaps = 40/380 (10%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S S GTPPQ  T  + D  S  VW  C++   C  C       +  P F    SS
Sbjct: 95  GMYVLSFSVGTPPQVVTG-VLDITSDFVWMQCSA---CATCGADAPAATSAPPFYAFLSS 150

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSE 218
           + + + C N  C  +  P        CS  +  C      Y   YG G    TAGLL  +
Sbjct: 151 TIREVRCANRGCQRLV-PQT------CSADDSPC-----GYSYVYGGGAANTTAGLLAVD 198

Query: 219 TLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
              F +      + GC++ ++    G+ G GR   S  SQL + +FSY L     DDA  
Sbjct: 199 AFAFATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVD 255

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
             + +L       D   P  S       P+ +S A    YYV L  I V  + + IP   
Sbjct: 256 VGSFILFL-----DDAKPRTSRA--VSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGT 308

Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
               +DG+GGV++      TF++   ++ V +    ++    RAAD   + GL  C+   
Sbjct: 309 FDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI--ELRAAD-GSELGLDLCYTSE 365

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIILG 457
              +  +P + L F GGA M L   NYF +     L CL +    A       G   +LG
Sbjct: 366 SLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPA-------GDGSLLG 418

Query: 458 DFQLQNFYLEFDLANDRFGF 477
                  ++ +D++  R  F
Sbjct: 419 SLIQVGTHMIYDISGSRLVF 438


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 121/408 (29%), Positives = 171/408 (41%), Gaps = 57/408 (13%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC-TSRYRCVDCNFPNVDPSRIPA 154
           L  H     ++SL+ GTPPQ  T  + DTGS L W  C T R      +          +
Sbjct: 53  LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCATGRAAAAAAD----------S 101

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-G 213
           F P+ S++   + C + +CS    P   S    C   ++ C ++     L Y  G  + G
Sbjct: 102 FRPRASATFAAVPCGSARCSSRDLPAPPS----CDAASRRCRVS-----LSYADGSASDG 152

Query: 214 LLLSETLRFPSKTVPNFLAGC-SILSDRQP-----AGIAGFGRSSESLPSQLGLKKFSYC 267
            L ++              GC S   D  P     AG+ G  R + S  +Q   ++FSYC
Sbjct: 153 ALATDVFAVGDAPPLRSAFGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSYC 212

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQII 326
           +  R  DDA V   L+L    G  D     L+YTP Y+ P      F    Y V L  I 
Sbjct: 213 ISDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PTPPLPYFDRVAYSVQLLGIR 262

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           VG K + IP S L P   G G  +VDSG+ FTF+ G  + AV  EF++Q      A +  
Sbjct: 263 VGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDP 322

Query: 387 K---KSGLRPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNE------VL 434
               +     CF +   +   S  LP + L F  GA+M++  +     V  E      V 
Sbjct: 323 SFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFN-GAQMSVAGDRLLYKVPGERRGADGVW 381

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           CL  F +    P      A ++G     N ++E+DL   R G A  KC
Sbjct: 382 CLT-FGNADMVPLT----AYVIGHHHQMNLWVEYDLERGRVGLAPVKC 424


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 163/390 (41%), Gaps = 56/390 (14%)

Query: 104 YSISLSFGTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y  SL  GTP   +T  +   DTGS   W  C     C DC            F P +SS
Sbjct: 134 YFTSLRLGTP---ATDLLVELDTGSDQSWIQCKP---CPDCY-----EQHEALFDPSKSS 182

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL-GFTAGLLLSETL 220
           +   I C + +C  + G + +  C      +K CP     Y + Y    +T G L  +TL
Sbjct: 183 TYSDITCSSRECQEL-GSSHKHNCS----SDKKCP-----YEITYADDSYTVGNLARDTL 232

Query: 221 RF-PSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
              P+  VP F+ GC   +  S  +  G+ G GR   SL SQ+  +    FSYCL S   
Sbjct: 233 TLSPTDAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPS--- 289

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
             +P ++  +  +G  +        +     ++P         FYY+ L  I V  + +K
Sbjct: 290 --SPSATGYLSFSGAAAAAPTNAQFTEMVAGQHP--------SFYYLNLTGITVAGRAIK 339

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +P S     +    G I+DSG+ F+ +    + A+       MG Y RA      +    
Sbjct: 340 VPPSVFATAA----GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRA---PSSTIFDT 392

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C+D++G ++V +P + L F  GA + L P        N     + F  N    +LG    
Sbjct: 393 CYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLG---- 448

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +LG+ Q +   + +D+ N + GF    CA
Sbjct: 449 -VLGNTQQRTLAVIYDVDNQKVGFGANGCA 477


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 118/408 (28%), Positives = 168/408 (41%), Gaps = 71/408 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTPP+     + DTGS ++W  C S   C  C   +    ++  F P  S 
Sbjct: 79  GLYYTKLRLGTPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  I C + +CSW     ++S   GCS +N  C     +Y  QYG G  T+G  +S+ L
Sbjct: 135 TASPISCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185

Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
           +F     S  VPN  A    GCS       + SDR   GI GFG+   S+ SQL      
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL             LVL      G+   P + +TP   +           Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L  I V  + + I  S     S  NG G I+D+G+T  ++     EA    F+  + N 
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLC 435
              +     S    C+ I+       P + L F GGA M L P++Y      + G  V C
Sbjct: 341 VSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWC 400

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +        G         ILGD  L++    +DL   R G+A   C+
Sbjct: 401 IGFQRIQNQG-------ITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 158/391 (40%), Gaps = 49/391 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  S+  GTPP  +   + DTGS +VW  C     CV C          P + P+ SS
Sbjct: 97  GEYFASVGVGTPPTPAL-LVIDTGSDVVWLQCKP---CVHCYR-----QLSPLYDPRGSS 147

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +     C  P+C            + C      C      Y + YG    T+G L ++ L
Sbjct: 148 TYAQTPCSPPQCR---------NPQTCDGTTGGC-----GYRIVYGDASSTSGNLATDRL 193

Query: 221 RFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
            F + T V N   GC   ++      AG+ G  R + S  +Q+     + F+YCL  R  
Sbjct: 194 VFSNDTSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTR 253

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
             +  S  +   T P     + P   +TP   NP   S      YYV +    VG + V 
Sbjct: 254 SGSSSSYLVFGRTAP-----EPPSSVFTPLRSNPRRPS-----LYYVDMVGFSVGGEPVT 303

Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
                S  +  + G GGV+VDSG++ T      + A+   F  +             S  
Sbjct: 304 GFSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVF 363

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D+ G      P ++L F GGA +ALPPENY  LV  E      F   AA    G  
Sbjct: 364 DACYDLRGVAVADAPGVVLHFAGGADVALPPENY--LVPEESGRYHCFALEAA----GHD 417

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              ++G+   Q F + FD+ N+R GF    C
Sbjct: 418 GLSVIGNVLQQRFRVVFDVENERVGFEPNGC 448


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 119/407 (29%), Positives = 163/407 (40%), Gaps = 76/407 (18%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I L+ GTPPQ  +  + DTGS L+W  C     C  C     DP     F P  SSS 
Sbjct: 103 YLIDLAIGTPPQPVSALL-DTGSDLIWTQCAP---CASC-LAQPDP----LFAPAASSSY 153

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
             + C    C+ I    +   C+    R  TC     +Y   YG G T  G+  +E   F
Sbjct: 154 VPMRCSGQLCNDI----LHHSCQ----RPDTC-----TYRYNYGDGTTTLGVYATERFTF 200

Query: 223 PSKTVPNFLA----GCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL----SR 271
            S +          GC  +   S    +GI GFGR   SL SQL +++FSYCL     +R
Sbjct: 201 ASSSGEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTR 260

Query: 272 KFDDAPVSSNLV---LDTGPGSGDSKTPGLSYTPFY----KNPVGSSSAFGEFYYVGLRQ 324
           K       S L+   L  G   GD    G   T       +NP         FYYV    
Sbjct: 261 K-------STLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPT--------FYYVPFTG 305

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           + VG++ ++IP S      DG+GGVIVDSG+  T     +   V + F  Q+      + 
Sbjct: 306 VTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQL-RLPFTSS 364

Query: 385 VEKKSGLRPCFDI---------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
                G+  CF           S    V +P +   F+ GA + LP  NY          
Sbjct: 365 SSPDDGV--CFATPMAAGGRRASAATVVSVPRMAFHFQ-GADLELPRRNYVLDDPRRGSL 421

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            IL  D+    A        +G+F  Q+  + +DL  +   FA  +C
Sbjct: 422 CILLADSGDSGA-------TIGNFVQQDMRVLYDLEAETLSFAPAQC 461


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 86/276 (31%), Positives = 131/276 (47%), Gaps = 33/276 (11%)

Query: 211 TAGLLLSETLRFPS-KTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
           T GL+  +   F +  +VP    GC + ++        GIAGFGR   SLPSQL +  FS
Sbjct: 74  TTGLIEVDKFTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS 133

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           +C  +    +    S ++LD       +    +  TP  +N     SA   FYY+ L+ I
Sbjct: 134 HCFTAV---NGLKQSTVLLDLPADLYKNGRGAVQSTPLIQN-----SANPTFYYLSLKGI 185

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VGS  + +P S     ++G GG I+DSG++ T +   +++ V  EF  Q+         
Sbjct: 186 TVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVVP 241

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTD 441
              +G   CF    +    +P+L+L F+ GA M LP ENY   V    GN ++CL     
Sbjct: 242 GNATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL----- 295

Query: 442 NAAGPALGRG-PAIILGDFQLQNFYLEFDLANDRFG 476
                A+ +G    I+G+FQ QN ++ +DL N   G
Sbjct: 296 -----AINKGDETTIIGNFQQQNMHVLYDLQNMHRG 326


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 161/368 (43%), Gaps = 54/368 (14%)

Query: 123 DTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
           DTGS + W    PC S   C    +   DP     F PK SSS   + C + +C  +   
Sbjct: 166 DTGSDVTWLQCQPCASENTC----YKQFDP----IFDPKSSSSYSPLSCNSQQCKLLDKA 217

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
           N  S          TC      Y + YG G FT G L +ETL F  S ++PN   GC   
Sbjct: 218 NCNS---------DTCI-----YQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHD 263

Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
           ++      AG+ G G  + SL SQL    FSYCL++    D+  SS L  +       S 
Sbjct: 264 NEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFN-------SY 313

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
            P  S T    +P+  +  F  + YV +  I VG K + I  +       G GG+IVDSG
Sbjct: 314 MPSDSLT----SPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSG 369

Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
           +  + +   ++E++ + F++   + S A  +   S    C++ SG+ +V +P +      
Sbjct: 370 TIISRLPSDVYESLREAFVKLTSSLSPAPGI---SVFDTCYNFSGQSNVEVPTIAFVLSE 426

Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
           G  + LP  NY  ++       + F    +  +       I+G FQ Q   + +DL N  
Sbjct: 427 GTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS-------IIGSFQQQGIRVSYDLTNSI 479

Query: 475 FGFAKQKC 482
            GF+  KC
Sbjct: 480 VGFSTNKC 487


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 130/473 (27%), Positives = 190/473 (40%), Gaps = 79/473 (16%)

Query: 35  PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSR-----ARHLKTKTKPKTKDS 81
           P +PL+  H         L    S    I H +++++  R     +RH + +       +
Sbjct: 97  PCSPLAAAHGEPPSHGEILAADQSRAESIQHRVSTTTTDRVNPKRSRHRQQQPPSAPAPA 156

Query: 82  NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
              S+ + SL  +P      G Y +++  GTP    T  +FDTGS   W  C     CV 
Sbjct: 157 ASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVV 212

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
             +      R   F P  SS+   + C  P CS +          GCS  +         
Sbjct: 213 ACYEQ----REKLFDPASSSTYANVSCAAPACSDL-------DVSGCSGGHCL------- 254

Query: 202 YLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
           Y +QYG G ++ G    +TL   S   V  F  GC   +D    + AG+ G GR   SLP
Sbjct: 255 YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLP 314

Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSS 311
            Q   K    F++CL +R       +    LD G GS     P  + TP      P    
Sbjct: 315 VQTYGKYGGVFAHCLPARS------TGTGYLDFGAGS----PPATTTTPMLTGNGPT--- 361

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                FYYVG+  I VG + + I  S          G IVDSG+  T +    + ++   
Sbjct: 362 -----FYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSA 411

Query: 372 FIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
           F   M    Y +AA V   S L  C+D +G   V +P + L F+GGA + +        V
Sbjct: 412 FAAAMAARGYRKAAAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 468

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               +CL  F  N  G  +G     I+G+ QL+ F + +D+     GF+   C
Sbjct: 469 SASQVCLA-FAGNEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 122/415 (29%), Positives = 174/415 (41%), Gaps = 83/415 (20%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y + L  GTP    +  I DT S LVW    PC S YR +D           P F PK
Sbjct: 90  GEYLVKLGTGTPQHFFSAAI-DTASDLVWMQCQPCVSCYRQLD-----------PVFNPK 137

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLS 217
            SSS  ++ C +  C+ + G        G          AC  Y  +Y G G T G L  
Sbjct: 138 LSSSYAVVPCTSDTCAQLDGHRCHEDDDG----------AC-QYTYKYSGHGVTKGTLAI 186

Query: 218 ETLRFPSKTVPNFLAGCSILSDRQPA----GIAGFGRSSESLPSQLGLKKFSYCL---LS 270
           + L          + GCS  S   PA    G+ G GR   SL SQL + +F YCL   +S
Sbjct: 187 DKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPPMS 246

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R       S  LVL  G  +  + +  ++ T      + SS+ +  +YY+ L  + VG +
Sbjct: 247 R------TSGKLVLGAGADAVRNMSDRVTVT------MSSSTRYPSYYYLNLDGLAVGDQ 294

Query: 331 HVKIPYSYLVPGSDGNG-------------------GVIVDSGSTFTFMEGPLFEAVAKE 371
                 +   P S G G                   G+IVD  ST +F+E  L++ +A +
Sbjct: 295 TPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADD 354

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFAL 428
              ++    RA     + GL  CF +    G   VY+P + L F  G  + L  +  F  
Sbjct: 355 LEEEI-RLPRATP-SLRLGLDLCFILPEGVGMDRVYVPTVSLSFD-GRWLELDRDRLFVT 411

Query: 429 VGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G  ++CL+          +GR   + ILG+FQLQN  + F+L   +  FAK  C
Sbjct: 412 DG-RMMCLM----------IGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 455


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 126/459 (27%), Positives = 185/459 (40%), Gaps = 65/459 (14%)

Query: 38  PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY--SNSLIKTP 95
           P ST   L H D+    +   LA+S     R    + + K      G ++   +SL   P
Sbjct: 65  PFST--VLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVP 122

Query: 96  LSVHS---YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
           LS  +    G Y   L  GTP   S   + DTGSSL W  C+    CV      V     
Sbjct: 123 LSPGTSVGVGNYVTQLGLGTP-STSYAMVVDTGSSLTWLQCSP---CVVSCHRQVG---- 174

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFT 211
           P F P+ SS+   + C   +C  +    +      CS  N         Y   YG   F+
Sbjct: 175 PLFDPRASSTYASVRCSASQCDELQAATLNP--SACSASNVCI------YQASYGDSSFS 226

Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFS 265
            G L ++T+ F S   P+F  GC   ++    + AG+ G  R+  SL  QL       FS
Sbjct: 227 VGSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFS 286

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL +        +S   L  GP +        SYT     P+ SSS     Y++ L  +
Sbjct: 287 YCLPT-------AASTGYLSIGPYNTGHY---YSYT-----PMASSSLDASLYFITLSGM 331

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   + +      P    +   I+DSG+  T +   +  A++K   + M    RA   
Sbjct: 332 SVGGSPLAVS-----PSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRA--- 383

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDNAA 444
              S L  CF+    + + +P + + F GGA M L   N    V +   CL    TD+ A
Sbjct: 384 PAFSILDTCFEGQASQ-LRVPTVAMAFAGGASMKLTTRNVLIDVDDSTTCLAFAPTDSTA 442

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                     I+G+ Q Q F + +D+A  R GF+   C+
Sbjct: 443 ----------IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 130/429 (30%), Positives = 185/429 (43%), Gaps = 55/429 (12%)

Query: 63  SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPF 120
           SL+R   L  K   +      GS+ +NSL     S  S G   Y   +  G P Q S  F
Sbjct: 141 SLNRKLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQ-SYFF 199

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI-PAFIPKRSSSSQLIGCQNPKCSWIFGP 179
           + DTGS + W       +C  C+  N    +I P F PK SSS   + C + +C  +   
Sbjct: 200 VPDTGSDVSWL------QCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLL--- 250

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
             E+ C        +C      Y ++YG G FT G L +ET  F  S ++PN   GC   
Sbjct: 251 -DEAACDA-----NSC-----IYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHD 299

Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
           ++      AG+ G G  + SL SQL    FSYCL+     D+  SS L  +    S DS 
Sbjct: 300 NEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDL---DSESSSTLDFNADQPS-DSL 355

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
           T     +P  KN       F  F YV +  + VG K + I  S       G+GG+IVDSG
Sbjct: 356 T-----SPLVKN-----DRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSG 405

Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
           +T T +   +++ +   F+    N   A  V   S    C+D+S + +V +P +     G
Sbjct: 406 TTITEIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPG 462

Query: 415 GAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
              + LP +N  F +      CL               P  I+G+ Q Q   + +DLAN 
Sbjct: 463 ENSLQLPAKNCLFQVDSAGTFCLAFLPSTF--------PLSIIGNVQQQGIRVSYDLANS 514

Query: 474 RFGFAKQKC 482
             GF+  KC
Sbjct: 515 LVGFSTDKC 523


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 117/408 (28%), Positives = 171/408 (41%), Gaps = 71/408 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP+     + DTGS ++W  C S   C  C   +    ++  F P  S 
Sbjct: 79  GLYYTKIRLGSPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  + C + +CSW     ++S   GCS +N  C     +Y  QYG G  T+G  +S+ L
Sbjct: 135 TATPVSCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185

Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL- 261
           +F     S  VPN  A    GCS       + SDR   GI GFG+   S+ SQL   GL 
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLA 245

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL             LVL      G+   P + +TP   +           Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L  I V  + + I  S     S  NG G I+D+G+T  ++     EA    F+  + N 
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLC 435
              +     S    C+ I+   +   P + L F GGA M L P++Y      + G  V C
Sbjct: 341 VSQSVRPVVSKGNQCYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWC 400

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +        G         ILGD  L++    +DL   R G+A   C+
Sbjct: 401 IGFQRIQNQG-------ITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 130/473 (27%), Positives = 189/473 (39%), Gaps = 79/473 (16%)

Query: 35  PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSRA-----RHLKTKTKPKTKDS 81
           P +PL+  H         L    S    I H +++++  R      RH + +       +
Sbjct: 101 PCSPLAAAHGEPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRRRHRQQQPPSAPAPA 160

Query: 82  NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
              S+ + SL  +P      G Y +++  GTP    T  +FDTGS   W  C     CV 
Sbjct: 161 ASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVV 216

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
             +      R   F P  SS+   + C  P CS       +    GCS  +         
Sbjct: 217 ACYEQ----REKLFDPASSSTYANVSCAAPACS-------DLDVSGCSGGHCL------- 258

Query: 202 YLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
           Y +QYG G ++ G    +TL   S   V  F  GC   +D    + AG+ G GR   SLP
Sbjct: 259 YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLP 318

Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSS 311
            Q   K    F++CL +R       +    LD G GS     P  + TP      P    
Sbjct: 319 VQTYGKYGGVFAHCLPARS------TGTGYLDFGAGS----PPATTTTPMLTGNGPT--- 365

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                FYYVG+  I VG + + I  S          G IVDSG+  T +    + ++   
Sbjct: 366 -----FYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSA 415

Query: 372 FIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
           F   M    Y +AA V   S L  C+D +G   V +P + L F+GGA + +        V
Sbjct: 416 FAAAMAARGYRKAAAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 472

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               +CL  F  N  G  +G     I+G+ QL+ F + +D+     GF+   C
Sbjct: 473 SASQVCLA-FAGNEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 105/396 (26%), Positives = 155/396 (39%), Gaps = 51/396 (12%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +  S GTPPQ       DT +   W PC   + C         P+  P+F P  S++ 
Sbjct: 94  YLVRASLGTPPQ-RLLLAVDTSNDAAWVPCAGCHGC---------PTTAPSFNPASSATF 143

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C  P CS    P+    C   +    +C      + L YG       L  + L   
Sbjct: 144 RPVPCGAPPCSQAPNPS----CTSLAKSKNSC-----GFSLSYGDSSLDATLSQDNLAVT 194

Query: 224 SK--TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKK------FSYCLLSRKFDD 275
           +    +  +  GC   S+   A   G           +   K      FSYCL S     
Sbjct: 195 ANGGVIKGYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSA 254

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           A  S +L L      G      +  TP   +P   S      YYV +  + +G K V IP
Sbjct: 255 ANFSGSLTLGR---KGQPAPEKMKTTPLLASPHRPS-----LYYVAMTGVRIGKKSVPIP 306

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKK 388
            S L   +    G ++DSG+ F  +  P + AV  E  R++            +  V   
Sbjct: 307 PSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSL 366

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPA 447
            G   C+++S   +V  P + L F GG ++ LP EN           CL +    AA PA
Sbjct: 367 GGFDTCYNVS---TVAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAM----AASPA 419

Query: 448 LGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G   A+ ++G  Q QN  + FD+ N R GFA+++C
Sbjct: 420 DGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERC 455


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 115/411 (27%), Positives = 180/411 (43%), Gaps = 80/411 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     I DTGS ++W  C+S   C +C   +    ++  F    SS
Sbjct: 79  GLYFTRVKLGTPPREFNVQI-DTGSDVLWVTCSS---CSNCPQTSGLGIQLNYFDTTSSS 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +++L+ C +P C+      +++    C P++  C     SY  QYG G  T+G  +S+T 
Sbjct: 135 TARLVPCSHPICT----SQIQTTATQCPPQSNQC-----SYAFQYGDGSGTSGYYVSDTF 185

Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQL---GL- 261
            F +    + +A        GCS         +D+   GI GFG+   S+ SQL   G+ 
Sbjct: 186 YFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGIT 245

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL   K +D+     LVL      G+   PG+ Y+P   +           Y +
Sbjct: 246 PRVFSHCL---KGEDSG-GGILVL------GEILEPGIVYSPLVPSQ--------PHYNL 287

Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GN 378
            L+ I V  + + I P ++    +  N G I+D+G+T  +    L E     F+  +   
Sbjct: 288 DLQSIAVSGQLLPIDPAAF---ATSSNRGTIIDTGTTLAY----LVEEAYDPFVSAITAA 340

Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
            S+ A      G   C+ +S   S   P +   F GGA M L PE Y           ++
Sbjct: 341 VSQLATPTINKG-NQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEY-----------LM 388

Query: 439 FTDNAAGPALG-------RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +  N AG AL        +G   ILGD  L++    +DLA+ R G+A   C
Sbjct: 389 YLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 115/393 (29%), Positives = 160/393 (40%), Gaps = 50/393 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   ++ GTP         DT S L W  C    RC         P   P F P+ S+
Sbjct: 136 GEYIAKIAVGTP-GVEALLALDTASDLTWLQCQPCRRCY--------PQSGPVFDPRHST 186

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S + +      C  +       R  G   +  TC      Y + YG G  T G  + ETL
Sbjct: 187 SYREMSFNAADCQAL------GRSGGGDAKRGTC-----VYTVGYGDGSTTVGDFIEETL 235

Query: 221 RFPSKT-VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLK-KFSYCLLSRKFD 274
            F     +P    GC      L     AGI G GR   S P+Q+     FSYCL+   F 
Sbjct: 236 TFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLV--DFL 293

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
             P S +  L  G G+ D+  P +S+TP   N          FYYV L  I VG   V++
Sbjct: 294 SGPGSLSSTLTFGAGAVDTSPP-VSFTPTVLN-----LNMPTFYYVRLTGISVGG--VRV 345

Query: 335 P----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
           P        +    G GGVIVDSG+  T +  P + A    F R +        +   SG
Sbjct: 346 PGVTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAF-RAVAVDLGQVSIGGPSG 404

Query: 391 -LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
               C+ + G+    +P + + F G  ++ L P+NY  L+  + +  + F    A  A G
Sbjct: 405 FFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNY--LIPVDSMGTVCF----AFAATG 458

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                I+G+ Q Q F + +D+   R GFA   C
Sbjct: 459 DHSVSIIGNIQQQGFRIVYDIGG-RVGFAPNSC 490


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 114/390 (29%), Positives = 170/390 (43%), Gaps = 56/390 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP +  T  +FDTGS + W  C     C+   +P     +   F P +S+
Sbjct: 133 GNYVVTVGLGTPKEDFT-LVFDTGSGITWTQCQP---CLGSCYPQ----KEQKFDPTKST 184

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S   + C +  C+ +  P  E   +GCS  N TC      Y + YG   ++ G   +ETL
Sbjct: 185 SYNNVSCSSASCNLL--PTSE---RGCSASNSTCL-----YQIIYGDQSYSQGFFATETL 234

Query: 221 RFPSKTV-PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKF 273
              S  V  NFL GC   ++    Q AG+ G   SS SLPSQ      K+FSYCL S   
Sbjct: 235 TISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPS--- 291

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
              P S+  +     G   S+T G  +TP        S AF  FY + +  I V    + 
Sbjct: 292 --TPSSTGYL---NFGGKVSQTAG--FTPI-------SPAFSSFYGIDIVGISVAGSQLP 337

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           I      P      G I+DSG+  T +    ++A+ + F  +M NY +    E    L  
Sbjct: 338 ID-----PSIFTTSGAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDEL---LDT 389

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C+D S   +V  P++ + FKGG ++ +       LV    +  + F  N      G    
Sbjct: 390 CYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLAFAANKDDSEFG---- 445

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I G+ Q + + + +D A    GFA   C+
Sbjct: 446 -IFGNHQQKTYEVVYDGAKGMIGFAAGACS 474


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  105 bits (261), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 132/432 (30%), Positives = 188/432 (43%), Gaps = 61/432 (14%)

Query: 63  SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPF 120
           SL+R   L  K   +      GS+ +NSL     S  S G   Y   +  G P Q S  F
Sbjct: 141 SLNRKLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQ-SYFF 199

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI-PAFIPKRSSSSQLIGCQNPKCSWIFGP 179
           + DTGS + W       +C  C+  N    +I P F PK SSS   + C + +C  +   
Sbjct: 200 VPDTGSDVSWL------QCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLL--- 250

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
             E+ C        +C      Y ++YG G FT G L +ET  F  S ++PN   GC   
Sbjct: 251 -DEAACDA-----NSC-----IYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHD 299

Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
           ++       G+ G G  + SL SQL    FSYCL+     D+  SS L  +    S DS 
Sbjct: 300 NEGLFVGADGLIGLGGGAISLSSQLEATSFSYCLVDL---DSESSSTLDFNADQPS-DSL 355

Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
           T     +P  KN       F  F YV +  + VG K + I  S       G+GG+IVDSG
Sbjct: 356 T-----SPLVKN-----DRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSG 405

Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
           +T T +   +++ +   F+    N   A  V   S    C+D+S + +V +P +     G
Sbjct: 406 TTITEIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPG 462

Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEFDL 470
              + LP +N          CLI   D+A    L   P+     I+G+ Q Q   + +DL
Sbjct: 463 ENSLQLPAKN----------CLIQ-VDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 511

Query: 471 ANDRFGFAKQKC 482
           AN   GF+  KC
Sbjct: 512 ANSLVGFSTDKC 523


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 105/387 (27%), Positives = 164/387 (42%), Gaps = 55/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    G+PPQ +     DT +   W PCT+   C               F P++S++ 
Sbjct: 98  YIVRAKIGSPPQ-TLLLAMDTSNDAAWIPCTACDGCTST-----------LFAPEKSTTF 145

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C +P+C+ +  P              +C  +  ++ L YG    A  ++ +T+   
Sbjct: 146 KNVSCGSPQCNQVPNP--------------SCGTSACTFNLTYGSSSIAANVVQDTVTLA 191

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           +  +P++  GC   +  +   P G+ G GR   SL SQ   L    FSYCL S  F    
Sbjct: 192 TDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 249

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP +   +   + YTP  KNP  SS      YYV L  I VG K V IP  
Sbjct: 250 FSGSLRL--GPVAQPIR---IKYTPLLKNPRRSS-----LYYVNLVAIRVGRKVVDIPPE 299

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEKKSGLRPCFD 396
            L   +    G + DSG+ FT +  P + AV  EF R++   ++A   V    G   C+ 
Sbjct: 300 ALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYT 359

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAII 455
           +     +  P +   F  G  + LP +N           CL +    A+ P        +
Sbjct: 360 V----PIVAPTITFMF-SGMNVTLPEDNILIHSTAGSTTCLAM----ASAPDNVNSVLNV 410

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           + + Q QN  + +D+ N R G A++ C
Sbjct: 411 IANMQQQNHRVLYDVPNSRLGVARELC 437


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 114/446 (25%), Positives = 181/446 (40%), Gaps = 70/446 (15%)

Query: 49  DSDPLKILHSLASSSLSR---ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
           D++ +K + S  S +L R    + L + T P    S IGS                  Y 
Sbjct: 4   DNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGS----------------ANYV 47

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + +  GTP +     +FDTGS L W  C     C    +   D      F P +SSS   
Sbjct: 48  VVVGLGTPKR-DLSLVFDTGSDLTWTQCEP---CAGSCYKQQDA----IFDPSKSSSYTN 99

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF-P 223
           I C +  C+ +    ++S C   S  + +C      Y  +YG   T+ G L  E L    
Sbjct: 100 ITCTSSLCTQLTSDGIKSECS--SSTDASCI-----YDAKYGDNSTSVGFLSQERLTITA 152

Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
           +  V +FL GC   ++      AG+ G GR   S+  Q      K FSYCL        P
Sbjct: 153 TDIVDDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL--------P 204

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            +S+ +     G+  +    L YTP        S+  G+  + GL  + +     K+P  
Sbjct: 205 ATSSSLGHLTFGASAATNASLIYTPL-------STISGDNSFYGLDIVSISVGGTKLPA- 256

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFD 396
            +   +   GG I+DSG+  T +   ++ A+   F R M  Y     V  ++GL   C+D
Sbjct: 257 -VSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYP----VANEAGLLDTCYD 311

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
           +SG K + +P +  +F GG  + L       +   + +CL  F  N +   +      + 
Sbjct: 312 LSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLA-FAANGSDNDI-----TVF 365

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
           G+ Q +   + +D+   R GF    C
Sbjct: 366 GNVQQKTLEVVYDVKGGRIGFGAAGC 391


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 168/389 (43%), Gaps = 55/389 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP+ S   + D+GS +VW  C     C +C +   DP     F P  S+
Sbjct: 135 GEYFVRIGVGSPPR-SQYVVIDSGSDIVWVQCQP---CSEC-YQQSDP----VFDPAGSA 185

Query: 162 SSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           +   I C +  C  +      + RC+               Y + YG G +T G L  ET
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCR---------------YEVSYGDGSYTRGTLALET 230

Query: 220 LRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
           L F    + N   GC  ++       AG+ G G  + S   QLG +    FSYCL+SR  
Sbjct: 231 LTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRG- 289

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                 S   L+ G G+      G ++ P  +NP   S     FYYVGL  + VG   V 
Sbjct: 290 ----TESTGTLEFGRGA---MPVGAAWVPLIRNPRAPS-----FYYVGLSGLGVGGIRVP 337

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           IP         G GGV++D+G+  T +  P +EA    FI Q  N  R+   ++ S    
Sbjct: 338 IPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRS---DRVSIFDT 394

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C++++G  SV +P +   F GG  + LP  N+   V  E      F  +A+G +      
Sbjct: 395 CYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLS------ 448

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            I+G+ Q +   +  D +N   GF    C
Sbjct: 449 -IIGNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 158/387 (40%), Gaps = 55/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  + + GTPPQ ++  I D    LVW  C    RC +           P F P  S++ 
Sbjct: 51  YVANFTIGTPPQPASAVI-DLAGELVWTQCKQCGRCFE--------QGTPLFDPTASNTY 101

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           +   C  P C  I  P+    C G       C     +Y      G T G + ++T  F 
Sbjct: 102 RAEPCGTPLCESI--PSDVRNCSG-----NVC-----AYEASTNAGDTGGKVGTDT--FA 147

Query: 224 SKTVPNFLA-GCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
             T    LA GC + SD      P+GI G GR+  SL +Q G+  FSYCL      DA  
Sbjct: 148 VGTAKASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGK 204

Query: 279 SSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           +S L L +     G G + +     TPF  N  G+ +    +Y V L  +  G   + +P
Sbjct: 205 NSALFLGSSAKLAGGGKAAS-----TPFV-NISGNGNDLSNYYKVQLEGLKAGDAMIPLP 258

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S    GS     V++D+ S  +F+    ++AV K     +G    A  VE       CF
Sbjct: 259 PS----GST----VLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEP---FDLCF 307

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
             SG      P+L+  F+GGA M +P  NY     N  +CL + +              +
Sbjct: 308 PKSGASGAA-PDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELS---L 363

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG  Q +N +  FDL  +   F    C
Sbjct: 364 LGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 133/284 (46%), Gaps = 32/284 (11%)

Query: 211 TAGLLLSETLRFPSKT--VPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGLKKFS 265
           + G+L +ET  F +      N   GC  L++   AG   I G      S+  QL + KFS
Sbjct: 3   STGVLATETFTFGAHQNFSANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFS 62

Query: 266 YCLLSRKFDD---APVSSNLVLDTGPGSGDSKTPGLSYT-PFYKNPVGSSSAFGEFYYVG 321
           YCL    F D   +PV    + D G      KT G   T P  KNPV        +YYV 
Sbjct: 63  YCL--TPFTDHKTSPVMFGAMADLG----KYKTTGKVQTIPLLKNPVEDI-----YYYVP 111

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           +  I +GSK + +P + L    DG GG ++DS +T  ++  P F+ + K  +  M   + 
Sbjct: 112 MVGISIGSKRLDVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAA 171

Query: 382 AADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
              ++       CF++      + V +P L+L F G A+M+LP ++YF      ++CL  
Sbjct: 172 NRSIDDYP---VCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCL-- 226

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               A   A   G   ++G+ Q QN ++ +DL N +F +A  KC
Sbjct: 227 ----AVMQAPFEGAPNVIGNVQQQNMHVLYDLGNRKFSYAPTKC 266


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 130/473 (27%), Positives = 189/473 (39%), Gaps = 79/473 (16%)

Query: 35  PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSR-----ARHLKTKTKPKTKDS 81
           P +PL+  H         L    S    I H +++++  R     +RH + +       +
Sbjct: 98  PCSPLAAAHGEPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRSRHRQQQPPSAPAPA 157

Query: 82  NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
              S+ + SL  +P      G Y +++  GTP    T  +FDTGS   W  C     CV 
Sbjct: 158 ASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVV 213

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
             +      R   F P  SS+   + C  P CS +          GCS  +         
Sbjct: 214 ACYEQ----REKLFDPASSSTYANVSCAAPACSDL-------DVSGCSGGHCL------- 255

Query: 202 YLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
           Y +QYG G ++ G    +TL   S   V  F  GC   +D    + AG+ G GR   SLP
Sbjct: 256 YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLP 315

Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSS 311
            Q   K    F++CL  R       +    LD G GS     P  + TP      P    
Sbjct: 316 VQTYGKYGGVFAHCLPPRS------TGTGYLDFGAGS----PPATTTTPMLTGNGPT--- 362

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                FYYVG+  I VG + + I  S          G IVDSG+  T +    + ++   
Sbjct: 363 -----FYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSA 412

Query: 372 FIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
           F   M    Y +AA V   S L  C+D +G   V +P + L F+GGA + +        V
Sbjct: 413 FAAAMAARGYRKAAAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 469

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               +CL  F  N  G  +G     I+G+ QL+ F + +D+     GF+   C
Sbjct: 470 SASQVCLA-FAGNEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 100/387 (25%), Positives = 159/387 (41%), Gaps = 55/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTPPQ +     DT +   W PCT+   C               F P++S++ 
Sbjct: 97  YIVRAKIGTPPQ-TLLLAIDTSNDAAWIPCTACDGCTST-----------LFAPEKSTTF 144

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C +P+C+ +  P              +C  +  ++ L YG    A  ++ +T+   
Sbjct: 145 KNVSCGSPECNKVPSP--------------SCGTSACTFNLTYGSSSIAANVVQDTVTLA 190

Query: 224 SKTVPNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
           +  +P +  GC   +       +   G+     S  S    L    FSYCL S  F    
Sbjct: 191 TDPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 248

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP +   +   + YTP  KNP  SS      YYV L  I VG K V IP +
Sbjct: 249 FSGSLRL--GPVAQPIR---IKYTPLLKNPRRSS-----LYYVNLFAIRVGRKIVDIPPA 298

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEKKSGLRPCFD 396
            L   +    G + DSG+ FT +  P++ AV  EF R++   ++A   V    G   C+ 
Sbjct: 299 ALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYT 358

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAII 455
           +     +  P +   F  G  + LP +N           CL +    A+ P        +
Sbjct: 359 V----PIVAPTITFMF-SGMNVTLPQDNILIHSTAGSTSCLAM----ASAPDNVNSVLNV 409

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           + + Q QN  + +D+ N R G A++ C
Sbjct: 410 IANMQQQNHRVLYDVPNSRLGVARELC 436


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 170/406 (41%), Gaps = 68/406 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G + +S++ GTPP      I DTGS L W  C    +C   N P         F  K+SS
Sbjct: 83  GEFFMSITIGTPP-IKVFAIADTGSDLTWVQCKPCQQCYKENGP--------IFDKKKSS 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + +   C +  C  +      S  +GC   N  C      Y   YG   F+ G + +ET+
Sbjct: 134 TYKSEPCDSRNCQAL-----SSTERGCDESNNICK-----YRYSYGDQSFSKGDVATETV 183

Query: 221 RFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
              S +      P  + GC   +    D   +GI G G    SL SQLG    KKFSYCL
Sbjct: 184 SIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 243

Query: 269 LSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQ 324
             +    A  +   V++ G  S     SK  G+  TP   K P+        +YY+ L  
Sbjct: 244 SHKS---ATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-------TYYYLTLEA 293

Query: 325 IIVGSKHVKIPY--SYLVPGSDG-----NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           I VG K  KIPY  S   P  DG     +G +I+DSG+T T +E   F+  +      + 
Sbjct: 294 ISVGKK--KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVT 351

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
              R +D   +  L  CF  SG   + LPE+ + F  GA + L P N F  +  +++CL 
Sbjct: 352 GAKRVSD--PQGLLSHCFK-SGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLS 407

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +                I G+F   +F + +DL      F    C+
Sbjct: 408 MVPTTEVA---------IYGNFAQMDFLVGYDLETRTVSFQHMDCS 444


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/389 (27%), Positives = 164/389 (42%), Gaps = 54/389 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+P +  +  I DTGSSL W  C     CV       DP     F P  S 
Sbjct: 11  GNYYVKVGLGSPARYYS-MIVDTGSSLSWLQCKP---CVVYCHVQADP----LFDPSASK 62

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           + + + C + +CS +    + +    C   +  C          Y +G+ +  LL+    
Sbjct: 63  TYKSLSCTSSQCSSLVDATLNNPL--CETSSNVCVYTASYGDSSYSMGYLSQDLLTLA-- 118

Query: 222 FPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
            PS+T+P F+ GC   S+    + AGI G GR+  S+  Q+  K    FSYCL +R    
Sbjct: 119 -PSQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR---- 173

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
                   L  G  S         +TP   +P   S      Y++ L  I VG + + + 
Sbjct: 174 ---GGGGFLSIGKAS--LAGSAYKFTPMTTDPGNPS-----LYFLRLTAITVGGRALGVA 223

Query: 336 YS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEKKSGLRP 393
            + Y VP        I+DSG+  T +   ++    + F++ M + Y+RA      S L  
Sbjct: 224 AAQYRVP-------TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGF---SILDT 273

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           CF  + K    +PE+ L F+GGA + L P N    V   + CL    +N           
Sbjct: 274 CFKGNLKDMQSVPEVRLIFQGGADLNLRPVNVLLQVDEGLTCLAFAGNNGVA-------- 325

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            I+G+ Q Q F +  D++  R GFA   C
Sbjct: 326 -IIGNHQQQTFKVAHDISTARIGFATGGC 353


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 170/406 (41%), Gaps = 51/406 (12%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L  H     ++SL+ GTPPQ  T  + DTGS L W  C             +      +F
Sbjct: 58  LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCAPGGGGGGGGRSAL------SF 110

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GL 214
            P+ S +   + C + +C     P+  + C G S   K C ++     L Y  G ++ G 
Sbjct: 111 RPRASLTFASVPCDSAQCRSRDLPSPPA-CDGAS---KQCRVS-----LSYADGSSSDGA 161

Query: 215 LLSETLRFPSKTVPNFLAGCSILS-DRQPAGIA-----GFGRSSESLPSQLGLKKFSYCL 268
           L +E              GC   + D  P G+A     G  R + S  SQ   ++FSYC+
Sbjct: 162 LATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI 221

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQIIV 327
             R  DDA V   L+L    G  D     L+YTP Y+ P      F    Y V L  I V
Sbjct: 222 SDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PAMPLPYFDRVAYSVQLLGIRV 271

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G K + IP S L P   G G  +VDSG+ FTF+ G  + A+  EF RQ   +  A +   
Sbjct: 272 GGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPN 331

Query: 388 ---KSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALV------GNEVLCL 436
              +     CF +   ++    LP + L F  GA+M +  +     V      G+ V CL
Sbjct: 332 FAFQEAFDTCFRVPQGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCL 390

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             F +    P      A ++G     N ++E+DL   R G A  +C
Sbjct: 391 T-FGNADMVPIT----AYVIGHHHQMNVWVEYDLERGRVGLAPIRC 431


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 109/428 (25%), Positives = 183/428 (42%), Gaps = 55/428 (12%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R R L+++ K     +NI +  S   + + + + +   Y +++  G     +   I DTG
Sbjct: 30  RVRSLQSRIKSIFSGNNIDALDSQIPLSSGVRLQTLN-YIVTVEIGG---RNMTVIVDTG 85

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S L W  C     C +         + P F P  S S Q I C +  C  +         
Sbjct: 86  SDLTWVQCQPCRLCYN--------QQDPLFNPSGSPSYQTILCNSSTCQSL--QYATGNL 135

Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---Q 241
             C     TC     +Y++ YG G +T G L  E L   +  V NF+ GC   +      
Sbjct: 136 GVCGSNTPTC-----NYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFGCGRNNKGLFGG 190

Query: 242 PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
            +G+ G G+S  SL SQ        FSYCL +   D    S +L+L        + TP +
Sbjct: 191 ASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAAD---ASGSLILGGNSSVYKNTTP-I 246

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
           SYT    NP         FY++ L  I +G   ++ P       +    G+++DSG+  T
Sbjct: 247 SYTRMIANP-----QLPTFYFLNLTGISIGGVALQAP-------NYRQSGILIDSGTVIT 294

Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
            +  P++  +  EF++Q   +  A      S L  CF+++G   V +P + ++F+G A++
Sbjct: 295 RLPPPVYRDLKAEFLKQFSGFPSAPPF---SILDTCFNLNGYDEVDIPTIRMQFEGNAEL 351

Query: 419 ALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRF 475
            +     F  V  +   +CL L        +L     I I+G++Q +N  + ++    + 
Sbjct: 352 TVDVTGIFYFVKTDASQVCLAL-------ASLSFDDEIPIIGNYQQRNQRVIYNTKESKL 404

Query: 476 GFAKQKCA 483
           GFA + C+
Sbjct: 405 GFAAEACS 412


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 137/499 (27%), Positives = 209/499 (41%), Gaps = 85/499 (17%)

Query: 27  SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSN 86
           SSA+T   P       H          ++L  LA+ S +RA  L + +   +     G+ 
Sbjct: 19  SSASTPAAPAVRADLTHVDSGRGFTSRELLRRLATRSRARASRLYSSSSSSSSARPAGAG 78

Query: 87  YSNSLIKTPLSVHSYGG------YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
             +  +  PL+  + G       Y I LS GTP         DTGS LVW  C       
Sbjct: 79  --SHAVTAPLARGTVGDADIDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA------ 130

Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
            C+     P   P F    S ++  + C +P C+    P       GC+  + TC     
Sbjct: 131 -CHVCFAQP--FPTFDALASQTTLAVPCSDPICTSGKYP-----LSGCTFNDNTC----- 177

Query: 201 SYLLQYG-LGFTAGLLLSETLRFPSK------------TVPNFLAGCSILSD----RQPA 243
            YL  Y     T+G ++ +T  F S              VPN   GC   +        +
Sbjct: 178 FYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNES 237

Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSY 300
           GIAGF R   SLPSQL + +FS+C  +    DA  +S + L   PG    G   T  +  
Sbjct: 238 GIAGFSRGPMSLPSQLKVARFSHCFTA--IADAR-TSPVFLGGAPGPDNLGAHATGPVQS 294

Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV----PGSDGNGGVIVDSGST 356
           TPF       +++ G  YY+ L+ I VG    ++P + L         G+GG I+DSG+ 
Sbjct: 295 TPF-------ANSNGSLYYLTLKGITVGK--TRLPLNALAFAGKGTGSGSGGTIIDSGTG 345

Query: 357 FTFMEGPLFEAVAKEFIRQ----MGNYSRAADVEKK---SGLRPCFDISGKKSVYLPELI 409
              + GP++ ++   F+ +    + N S AAD E        R         +  LP+++
Sbjct: 346 IRTLPGPMYRSLRAAFVARVKLPVANES-AADAESTLCFEAARSASLPPEAPAPALPKVV 404

Query: 410 LKFKGGAKMALPPENY-FALVGNEV-----LCLILFTDNAAGPALGRGPAIILGDFQLQN 463
           L    GA   LP E+Y   L+ +E      LCL++   N+AG +       I+G+FQ QN
Sbjct: 405 LHV-AGADWDLPRESYVLDLLEDEDGSGSGLCLVM---NSAGDS----DLTIIGNFQQQN 456

Query: 464 FYLEFDLANDRFGFAKQKC 482
            ++ +DL  ++  F   +C
Sbjct: 457 MHVAYDLEKNKLVFVPARC 475


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 131/499 (26%), Positives = 202/499 (40%), Gaps = 86/499 (17%)

Query: 5   PFSLICL-FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSS 63
           PF   CL F  +  LF+T+A    S  TV            L H DS PL   ++ + + 
Sbjct: 3   PFVFFCLAFYSVSSLFSTEANESPSGFTVD-----------LIHRDS-PLSPFYNPSLTP 50

Query: 64  LSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFD 123
             R  +   ++  +    +   + +N L ++ L +H+ G Y +    GTPP        D
Sbjct: 51  SQRIINAALRSISRLNRVSNLLDQNNKLPQSVLILHN-GEYLMRFYIGTPPVERLA-TAD 108

Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
           TGS L+W  C+    C  C      P   P F P +SS+     C++  C+ +       
Sbjct: 109 TGSDLIWVQCSP---CASCF-----PQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQ--- 157

Query: 184 RCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLRFPSK------TVPNFLAGC- 234
             KGC    +        Y  +YG    F+ GLL +ETLRF S+        PN   GC 
Sbjct: 158 --KGCGKSGECI------YTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFGCG 209

Query: 235 -----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDT 286
                ++    +  GI G G    SL SQ+G +   KFSYCLL       P+ S      
Sbjct: 210 LYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLL-------PLGSTSTSKL 262

Query: 287 GPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
             G+    T  G+  TP    P         +Y++ L  + V  K         VP    
Sbjct: 263 KFGNESIITGEGVVSTPMIIKP-----WLPTYYFLNLEAVTVAQKT--------VPTGST 309

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
           +G VI+DSG+  T++    +   A      +       DV   S L  CF    + +   
Sbjct: 310 DGNVIIDSGTLLTYLGESFYYNFAASLQESLA-VELVQDV--LSPLPFCFPY--RDNFVF 364

Query: 406 PELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           PE+  +F  GA+++L P N F +  +   +CL++   + +G +       I G F   +F
Sbjct: 365 PEIAFQFT-GARVSLKPANLFVMTEDRNTVCLMIAPSSVSGIS-------IFGSFSQIDF 416

Query: 465 YLEFDLANDRFGFAKQKCA 483
            +E+DL   +  F    C+
Sbjct: 417 QVEYDLEGKKVSFQPTDCS 435


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 126/449 (28%), Positives = 181/449 (40%), Gaps = 64/449 (14%)

Query: 44  YLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG 103
           + H S S    +LH +ASS   R  +L +    K+K +++     N L          G 
Sbjct: 54  HTHVSASVIDTVLH-MASSDSHRFTYLSSLVAGKSKPTSVPVASGNQL--------HIGN 104

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTPPQ     + DT +  VW PC+    C  C+  N   S         S+  
Sbjct: 105 YVVRARLGTPPQLMF-MVLDTSNDAVWLPCSG---CSGCS--NASTSFNTNSSSTYST-- 156

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLR 221
             + C   +C+   G      C   +P+   C     S+   YG    F+A L+  +TL 
Sbjct: 157 --VSCSTTQCTQARGLT----CPSSTPQPSIC-----SFNQSYGGDSSFSANLV-QDTLT 204

Query: 222 FPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDD 275
                +PNF  GC   +  +   P G+ G GR   SL SQ   L    FSYCL S +   
Sbjct: 205 LSPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY 264

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
              S  L L   P S       + YTP  +NP   S      YYV L  + VGS  V + 
Sbjct: 265 FSGSLKLGLLGQPKS-------IRYTPLLRNPRRPS-----LYYVNLTGVSVGSVQVPVD 312

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPC 394
             YL   S+   G I+DSG+  T    P++EA+  EF +Q+ G++S     +       C
Sbjct: 313 PVYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGAFDT------C 366

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPA 453
           F  S       P++ L       + LP EN         L CL +        A+     
Sbjct: 367 F--SADNENVTPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN--- 420

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ++ + Q QN  + FD+ N R G A + C
Sbjct: 421 -VIANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 126/442 (28%), Positives = 180/442 (40%), Gaps = 65/442 (14%)

Query: 55  ILHSLASSSLSRARHLKTKTKPKTK-DSNIGSNYSNSLIKTPLSVHS---YGGYSISLSF 110
           + H  A  S   AR  KT +   T  D++  +  + SL   PLS  +    G Y   +  
Sbjct: 69  LTHDDARISSLAARLAKTPSARATSLDADADAGLAGSLASVPLSPGASVGVGNYVTRMGL 128

Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           GTP       + DTGSSL W  C+     V C+  +      P F PK SS+   +GC  
Sbjct: 129 GTPATQYV-MVVDTGSSLTWLQCSPCL--VSCHRQSG-----PVFNPKSSSTYASVGCSA 180

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPN 229
            +CS +  P+       CS  N         Y   YG   F+ G L  +T+ F S ++PN
Sbjct: 181 QQCSDL--PSATLNPSACSSSNVCI------YQASYGDSSFSVGYLSKDTVSFGSTSLPN 232

Query: 230 FLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
           F  GC   ++    + AG+ G  R+  SL  QL       F+YCL S            +
Sbjct: 233 FYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSS----SGYLSL 288

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV--KIPYSYLVP 341
               PG         SYTP       SSS     Y++ L  + V    +         +P
Sbjct: 289 GSYNPGQ-------YSYTPMV-----SSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP 336

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK 401
                   I+DSG+  T +   ++ A++K     M   SRA+     S L  CF     +
Sbjct: 337 -------TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRAS---AYSILDTCFKGQASR 386

Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQL 461
            V  P + + F GGA + L  +N    V +   CL      A  PA  R  AII G+ Q 
Sbjct: 387 -VSAPAVTMSFAGGAALKLSAQNLLVDVDDSTTCL------AFAPA--RSAAII-GNTQQ 436

Query: 462 QNFYLEFDLANDRFGFAKQKCA 483
           Q F + +D+ + R GFA   C+
Sbjct: 437 QTFSVVYDVKSSRIGFAAGGCS 458


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 157/387 (40%), Gaps = 55/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  + + GTPPQ ++  I D    LVW  C    RC +           P F P  S++ 
Sbjct: 51  YVANFTIGTPPQPASAVI-DLAGELVWTQCKQCSRCFE--------QDTPLFDPTASNTY 101

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           +   C  P C  I  P+    C G       C     +Y      G T G + ++T  F 
Sbjct: 102 RAEPCGTPLCESI--PSDSRNCSG-----NVC-----AYQASTNAGDTGGKVGTDT--FA 147

Query: 224 SKTVPNFLA-GCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
             T    LA GC + SD      P+GI G GR+  SL +Q G+  FSYCL      DA  
Sbjct: 148 VGTAKASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGK 204

Query: 279 SSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           +S L L +     G G + +     TPF  N  G+ +    +Y V L  +  G   + +P
Sbjct: 205 NSALFLGSSAKLAGGGKAAS-----TPFV-NISGNGNDLSNYYKVQLEGLKAGDAMIPLP 258

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S    GS     V++D+ S  +F+    ++AV K     +G    A  VE       CF
Sbjct: 259 PS----GST----VLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEP---FDLCF 307

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
             SG      P+L+  F+GGA M +   NY     N  +CL + +              +
Sbjct: 308 PKSGASGAA-PDLVFTFRGGAAMTVAASNYLLDYKNGTVCLAMLSSARLNSTTELS---L 363

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG  Q +N +  FDL  +   F    C
Sbjct: 364 LGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 118/445 (26%), Positives = 193/445 (43%), Gaps = 80/445 (17%)

Query: 54  KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
           + + +L + S +R R +  +    +  S  G+    + +++PL     GGY + +S GTP
Sbjct: 10  EAIRALVAKSHARVRWMAARANSSSWSSMAGT----TDVESPLHPDG-GGYVMDISVGTP 64

Query: 114 PQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
            +     I DTGS LVW    PCT       C+   +       F P++SS+ + + C +
Sbjct: 65  GKRFRA-IADTGSDLVWVQSEPCTG------CSGGTI-------FDPRQSSTFREMDCSS 110

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-----SK 225
             C+ + G         C P + TC     SY  +YG G T G    +T+        S+
Sbjct: 111 QLCAELPGS--------CEPGSSTC-----SYSYEYGSGETEGEFARDTISLGTTSDGSQ 157

Query: 226 TVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSS 280
             P+F  GC +++       G+ G G+   SL SQL      KFSYCL+    +    SS
Sbjct: 158 KFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESS 215

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            L+   GP +    T G+  T         S  +  +Y + +  I V  + +  P     
Sbjct: 216 PLLF--GPSAALHGT-GIQSTKITP----PSDTYPTYYLLTVNGIAVAGQTMGSP----- 263

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFDISG 399
                 G  I+DSG+T T++   ++  V    + +M +      V+  S GL  C+D S 
Sbjct: 264 ------GTTIIDSGTTLTYVPSGVYGRV----LSRMESMVTLPRVDGSSMGLDLCYDRSS 313

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGPALGRGPAIILG 457
            ++   P L ++  G A M  P  NYF +V +  + +CL      A G A G  P  I+G
Sbjct: 314 NRNYKFPALTIRLAG-ATMTPPSSNYFLVVDDSGDTVCL------AMGSASGL-PVSIIG 365

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           +   Q +++ +D  +    F + KC
Sbjct: 366 NVMQQGYHILYDRGSSELSFVQAKC 390


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 162/391 (41%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP    T  +FDTGS   W  C     CV   +      R   F P RSS
Sbjct: 178 GNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVVVCYEQ----REKLFDPARSS 229

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   + C  P CS +   N+     GCS  +         Y +QYG G ++ G    +TL
Sbjct: 230 TYANVSCAAPACSDL---NIH----GCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 275

Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
              S   V  F  GC   ++    + AG+ G GR   SLP Q   K    F++CL +R  
Sbjct: 276 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS- 334

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                +    LD G GS  +    L+     +N          FYYVG+  I VG + + 
Sbjct: 335 -----TGTGYLDFGAGSLAAARARLTTPMLTENGP-------TFYYVGMTGIRVGGQLLS 382

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV--AKEFIRQMGNYSRAADVEKKSGL 391
           IP S          G IVDSG+  T +    + ++  A         Y +A  V   S L
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAV---SLL 434

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D +G   V +P + L F+GGA++ +             +CL  F  N  G  +G  
Sbjct: 435 DTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLA-FAANEDGGDVG-- 491

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ QL+ F + +D+     GF    C
Sbjct: 492 ---IVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 129/452 (28%), Positives = 195/452 (43%), Gaps = 85/452 (18%)

Query: 61  SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
           +SS+ R   L++K K      ++G+   +SLI  P +  S  G+ ++LS G+PP  +   
Sbjct: 68  TSSIERFDFLESKIKEL---KSVGNEARSSLI--PFNRGS--GFLVNLSIGSPP-VTQLV 119

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           + DTGSSL+W  C     C++C            F P +S S + +GC  P  ++I G  
Sbjct: 120 VVDTGSSLLWVQCLP---CINCF-----QQSTSWFDPLKSVSFKTLGCGFPGYNYINGYK 171

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTV------------ 227
                  C+  N+        Y L+Y G   + G+L  E+L F +               
Sbjct: 172 -------CNRFNQ------AEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQ 218

Query: 228 ------PNFLAGCSILS-----DRQPAGIAGFGRSSE-SLPSQLGLKKFSYCLLSRKFDD 275
                  N   GC  ++     D    G+ G G     ++ +QLG  KFSYC+      +
Sbjct: 219 ISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCIGDI---N 274

Query: 276 APVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
            P+ ++  L  G GS   GDS       TP   +       FG  YYV L+ I VGSK +
Sbjct: 275 NPLYTHNHLVLGQGSYIEGDS-------TPLQIH-------FGH-YYVTLQSISVGSKTL 319

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGL 391
           KI  +     SDG+GGV++DSG T+T +    FE +  E +  M G   R     K  GL
Sbjct: 320 KIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGL 379

Query: 392 RPCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
             CF  +  +  V  P +   F GGA + L   + F   G +  CL +   N+    L  
Sbjct: 380 --CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLS- 436

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               ++G    QN+ + FDL   +  F +  C
Sbjct: 437 ----VIGILAQQNYNVGFDLEQMKVFFRRIDC 464


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 170/406 (41%), Gaps = 51/406 (12%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L  H     ++SL+ GTPPQ  T  + DTGS L W  C             +      +F
Sbjct: 57  LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCAPGGGGGGGGRSAL------SF 109

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GL 214
            P+ S +   + C + +C     P+  + C G S   K C ++     L Y  G ++ G 
Sbjct: 110 RPRASLTFASVPCGSAQCRSRDLPSPPA-CDGAS---KQCRVS-----LSYADGSSSDGA 160

Query: 215 LLSETLRFPSKTVPNFLAGCSILS-DRQPAGIA-----GFGRSSESLPSQLGLKKFSYCL 268
           L +E              GC   + D  P G+A     G  R + S  SQ   ++FSYC+
Sbjct: 161 LATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI 220

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQIIV 327
             R  DDA V   L+L    G  D     L+YTP Y+ P      F    Y V L  I V
Sbjct: 221 SDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PAMPLPYFDRVAYSVQLLGIRV 270

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G K + IP S L P   G G  +VDSG+ FTF+ G  + A+  EF RQ   +  A +   
Sbjct: 271 GGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPN 330

Query: 388 ---KSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALV------GNEVLCL 436
              +     CF +   ++    LP + L F  GA+M +  +     V      G+ V CL
Sbjct: 331 FAFQEAFDTCFRVPQGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCL 389

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             F +    P      A ++G     N ++E+DL   R G A  +C
Sbjct: 390 T-FGNADMVPIT----AYVIGHHHQMNVWVEYDLERGRVGLAPIRC 430


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 109/405 (26%), Positives = 157/405 (38%), Gaps = 60/405 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   ++ GTP   +     DT S L W  C    RC         P   P F P+ S+
Sbjct: 139 GDYIAKIAVGTPAVEAL-LALDTASDLTWLQCQPCRRCY--------PQSGPVFDPRHST 189

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-------FTAGL 214
           S   +    P C  +       R  G   +  TC      Y + YG G        + G 
Sbjct: 190 SYGEMNYDAPDCQAL------GRSGGGDAKRGTC-----IYTVLYGDGDGHGSTSTSVGD 238

Query: 215 LLSETLRFPSKTVPNFLA-GCSI----LSDRQPAGIAGFGRSSESLPSQLGL----KKFS 265
           L+ ETL F       +L+ GC      L     AGI G  R   S+P Q+        FS
Sbjct: 239 LVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFS 298

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YCL+   F   P S +  L  G G+ D+  P  S+TP   N          FYYV L  +
Sbjct: 299 YCLV--DFISGPGSPSSTLTFGAGAVDTSPPA-SFTPTVLN-----QNMPTFYYVRLIGV 350

Query: 326 IVGSKHVKIP----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
            VG   V++P        +    G+GGVI+DSG+T T +  P + A    F        +
Sbjct: 351 SVGG--VRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQ 408

Query: 382 AADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
            +          C+ + G+      V +P + + F GG +++L P+NY   V +      
Sbjct: 409 VSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCF 468

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            F         G     ++G+   Q F + +D+   R GFA   C
Sbjct: 469 AFAGT------GDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 130/483 (26%), Positives = 188/483 (38%), Gaps = 82/483 (16%)

Query: 16  ILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTK 75
           ILL  +     ++A T T PL  ++    L H DS    IL S  S   +     +T+  
Sbjct: 19  ILLSVSVTSTTTTAMTDTKPLRLVTG---LIHQDS----ILSSYQSLDRNNVERRRTRRA 71

Query: 76  PKTKDSNIGSNYSNSLIKTPLSVHSYG-GYSISLSFGTPPQASTPFIFDTGSSLVWFPCT 134
               D           I+  +     G  + ++ S G PP      I DTGS L+W  C 
Sbjct: 72  AFITDE----------IQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCR 120

Query: 135 SRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT 194
               C DC          P F P +SS+   +   +P C     PN        SP+ K 
Sbjct: 121 P---CADCF-----RQSTPIFDPSKSSTYVDLSYDSPIC-----PN--------SPQKKY 159

Query: 195 CPLACPSYLLQYGLGFTA-GLLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAG 244
             L    Y   Y  G T+ G L +E + F +      TV + + GC   +    D Q +G
Sbjct: 160 NHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSG 219

Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
           I G     +S+ S+LG  +FSYC+    FD     + LVL      GD      S TPF+
Sbjct: 220 ILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVL------GDGVKMEGSSTPFH 271

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
                    F  FYYV L  I VG   + I          G GGV++DSG+T TF+    
Sbjct: 272 --------TFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDG 323

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGL-----RPCFDISGKKSVYLPELILKFKGGAKMA 419
           F+ ++ E  R +  + +        G      R   D+ G      PEL   F  GA + 
Sbjct: 324 FDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRG-----FPELAFHFAEGADLV 378

Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           L   + F     +V CL +   N       +    ++G    Q++ + +DL   R  F +
Sbjct: 379 LDANSLFVQKNQDVFCLAVLESNL------KNIGSVIGIMAQQHYNVAYDLIGKRVYFQR 432

Query: 480 QKC 482
             C
Sbjct: 433 TDC 435


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 122/450 (27%), Positives = 181/450 (40%), Gaps = 81/450 (18%)

Query: 49  DSDPLKILHSLAS---SSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
           D   +  +HS  +    S+ R R  K    P    + IGS                G Y 
Sbjct: 89  DQSRVDFIHSKIAGELESVDRLRGSKATKIPAKSGATIGS----------------GNYI 132

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCT--SRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           +S+  GTP +  +  IFDTGS L W  C   +RY C +         + P F+P +S++ 
Sbjct: 133 VSVGLGTPKKYLS-LIFDTGSDLTWTQCQPCARY-CYN--------QKDPVFVPSQSTTY 182

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             I C +P CS +   +      GCS        AC  Y +QYG   F+ G    ETL  
Sbjct: 183 SNISCSSPDCSQL--ESGTGNQPGCSAAR-----ACI-YGIQYGDQSFSVGYFAKETLTL 234

Query: 223 PSKTV-PNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            S  V  NFL GC   ++R      AG+ G G+   S+  Q   K    FSYCL      
Sbjct: 235 TSTDVIENFLFGCG-QNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCL------ 287

Query: 275 DAPVSSNLVLDTG--PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
             P +S+    TG     G      L YTP  K     +     FY V +  + VG   +
Sbjct: 288 --PKTSS---STGYLTFGGGGGGGALKYTPITK-----AHGVANFYGVDIVGMKVGGTQI 337

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            I  S          G I+DSG+  T +    + A+   F + M  Y +A ++   S L 
Sbjct: 338 PISSSVF-----STSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPEL---SILD 389

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            C+D+S   ++ +P++   FKGG ++ L             +CL  F  N     +    
Sbjct: 390 TCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTSQVCL-AFAGNQDPSTVA--- 445

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             I+G+ Q +   + +D+   + GF    C
Sbjct: 446 --IIGNVQQKTLQVVYDVGGGKIGFGYNGC 473


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 111/405 (27%), Positives = 173/405 (42%), Gaps = 56/405 (13%)

Query: 86  NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
           N +N  +   LS+ S G Y + L  G+PP+  T  I DTGSSL W  C     CV     
Sbjct: 103 NSANIPLNPGLSIGS-GNYYLKLGLGSPPKYYT-MILDTGSSLSWLQCKP---CVVYCHS 157

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLL 204
            VDP     F P  S++ + + C + +CS +    + +  C      +  C         
Sbjct: 158 QVDP----LFEPSASNTYRPLYCSSSECSLLKAATLNDPLCTA----SGVCVYTASYGDA 209

Query: 205 QYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL 261
            Y +G+ +  LL+ T   PS+T+P+F  GC   ++    + AGI G  R   S+ +QL  
Sbjct: 210 SYSMGYLSRDLLTLT---PSQTLPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSP 266

Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
           K    FSYCL        P S      T  G G      +S + +   P+  +S     Y
Sbjct: 267 KYGYAFSYCL--------PTS------TSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLY 312

Query: 319 YVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           ++ L  I V  + V +  + Y VP        I+DSG+  T +   ++ A+ + F++ M 
Sbjct: 313 FLRLAAITVAGRPVGVAAAGYQVP-------TIIDSGTVVTRLPISIYAALREAFVKIMS 365

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
              R       S L  CF  S K     PE+ + F+GGA ++L   N        + CL 
Sbjct: 366 R--RYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEADKGIACLA 423

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             + N            I+G+ Q Q + + +D++  + GFA   C
Sbjct: 424 FASSNQIA---------IIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 154/386 (39%), Gaps = 58/386 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +++S GTP  A T    DTGS L W  CT       C  P     + P F P +SSS 
Sbjct: 140 YVVTVSLGTPGVAQT-LEVDTGSDLSWVQCT------PCAAPACYSQKDPLFDPAQSSSY 192

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             + C  P C             G      +C  A   Y++ YG G  T G+  S+TL  
Sbjct: 193 AAVPCGGPVCG------------GLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTL 240

Query: 223 -PSKTVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDA 276
            P+  V  F  GC           G+ G GR   SL  Q        FSYCL +R     
Sbjct: 241 SPNDAVRGFFFGCGHAQSGFTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLPTR----- 295

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
           P ++  +   GP    +  PG S T    +P  ++     +Y V L  I VG + + +P 
Sbjct: 296 PSTTGYLTLGGPSG--AAPPGFSTTQLLSSPNAAT-----YYVVMLTGISVGGQQLSVPS 348

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
           S         GG +VD+G+  T +    + A+   F   M +Y   +       L  C++
Sbjct: 349 SVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPS-APATGILDTCYN 401

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
            SG  +V LP + L F GGA + L  +            ++ F   A  P+   G   IL
Sbjct: 402 FSGYGTVTLPNVALTFSGGATVTLGADG-----------ILSFGCLAFAPSGSDGGMAIL 450

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
           G+ Q ++F +  D      GF    C
Sbjct: 451 GNVQQRSFEVRID--GTSVGFKPSSC 474


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 124/446 (27%), Positives = 175/446 (39%), Gaps = 61/446 (13%)

Query: 46  HHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
           H S S    +LH +ASS   R  +L +    K K +++     N L          G Y 
Sbjct: 55  HVSASVIDTVLH-MASSDSHRLTYLSSLVAGKPKPTSVPVASGNQL--------HIGNYV 105

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           +    GTPPQ     + DT +  VW PC+    C  C+  N   S         S+    
Sbjct: 106 VRAKLGTPPQLMF-MVLDTSNDAVWLPCSG---CSGCS--NASTSFNTNSSSTYST---- 155

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLRFP 223
           + C   +C+   G      C   SP+   C     S+   YG    F+A L+  +TL   
Sbjct: 156 VSCSTAQCTQARGLT----CPSSSPQPSVC-----SFNQSYGGDSSFSASLV-QDTLTLA 205

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
              +PNF  GC   +  +   P G+ G GR   SL SQ   L    FSYCL S +     
Sbjct: 206 PDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFS 265

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S  L L   P S       + YTP  +NP   S      YYV L  + VGS  V +   
Sbjct: 266 GSLKLGLLGQPKS-------IRYTPLLRNPRRPS-----LYYVNLTGVSVGSVQVPVDPV 313

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
           YL   ++   G I+DSG+  T    P++EA+  EF +Q+      +          CF  
Sbjct: 314 YLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDTCF-- 367

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIIL 456
           S       P++ L       + LP EN         L CL +        A+      ++
Sbjct: 368 SADNENVAPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN----VI 422

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
            + Q QN  + FD+ N R G A + C
Sbjct: 423 ANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 117/402 (29%), Positives = 164/402 (40%), Gaps = 73/402 (18%)

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           PPQ +   + DTGS L W  C            + +P+ +  F P RSSS   I C +P 
Sbjct: 82  PPQ-NISMVIDTGSELSWLRCNR----------SSNPNPVNNFDPTRSSSYSPIPCSSPT 130

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGF-----TAGLLLSETLRFPSKT 226
           C                 R+   P +C S  L    L +     + G L +E   F + T
Sbjct: 131 CR-------------TRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNST 177

Query: 227 VP-NFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
              N + GC           D +  G+ G  R S S  SQ+G  KFSYC+     DD P 
Sbjct: 178 NDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI--SGTDDFP- 234

Query: 279 SSNLVLDTGPGSGDSK----TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
              L+L      GDS     TP L+YTP  +            Y V L  I V  K + I
Sbjct: 235 -GFLLL------GDSNFTWLTP-LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPI 286

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG---NYSRAADVEKKSGL 391
           P S LVP   G G  +VDSG+ FTF+ GP++ A+   F+ +           D   +  +
Sbjct: 287 PKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTM 346

Query: 392 RPCFDISGKKSV-----YLPELILKFKGGAKMALPPENYF-----ALVGNE-VLCLILFT 440
             C+ IS  +        LP + L F+ GA++A+  +          VGN+ V C     
Sbjct: 347 DLCYRISPVRIRSGILHRLPTVSLVFE-GAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGN 405

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +  G       A ++G    QN ++EFDL   R G A  +C
Sbjct: 406 SDLMGME-----AYVIGHHHQQNMWIEFDLQRSRIGLAPVEC 442


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 111/392 (28%), Positives = 164/392 (41%), Gaps = 61/392 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y + +  G+PP+ +   + D+GS ++W    PCT  Y   D           P F P 
Sbjct: 134 GEYFVRIGVGSPPR-NQYVVMDSGSDIIWVQCEPCTQCYHQSD-----------PVFNPA 181

Query: 159 RSSSSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLL 216
            SSS   + C +  CS +      E RC+               Y + YG G +T G L 
Sbjct: 182 DSSSFSGVSCASTVCSHVDNAACHEGRCR---------------YEVSYGDGSYTKGTLA 226

Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE---SLPSQLGLKK---FSYCLLS 270
            ET+ F    + N   GC   +     G AG         S   QLG +    FSYCL+S
Sbjct: 227 LETITFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVS 286

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
           R      + S+ +L+ G    ++   G ++ P   NP   S     FYY+GL  + VG  
Sbjct: 287 RG-----IESSGLLEFGR---EAMPVGAAWVPLIHNPRAQS-----FYYIGLSGLGVGGL 333

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            V I          G+GGV++D+G+  T +    +EA    FI Q  N  RA+ V   S 
Sbjct: 334 RVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGV---SI 390

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
              C+D+ G  SV +P +   F GG  + LP  N+   V +       F  +++G +   
Sbjct: 391 FDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLS--- 447

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               I+G+ Q +   +  D AN   GF    C
Sbjct: 448 ----IIGNIQQEGIQISVDGANGFVGFGPNVC 475


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 108/405 (26%), Positives = 171/405 (42%), Gaps = 60/405 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           +  P+ + S G Y  + + GTPPQ  +  +  TG  LVW  CT    C + + P  DP++
Sbjct: 45  VAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGE-LVWTQCTPCQPCFEQDLPLFDPTK 103

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
                   SS+ + + C +  C  I  P     C      +  C    P+       G T
Sbjct: 104 --------SSTFRGLPCGSHLCESI--PESSRNCT-----SDVCIYEAPTKA-----GDT 143

Query: 212 AGLLLSETLRF-PSKTVPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLKKF 264
            G+  ++T     +K    F  GC +++D++      P+GI G GR+  SL +Q+ +  F
Sbjct: 144 GGMAGTDTFAIGAAKETLGF--GCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAF 201

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLR 323
           SYCL  +        S+  L  G  +        S TPF  K   GSS      YY    
Sbjct: 202 SYCLAGK--------SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYY---- 249

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
             +V    +K   + L   S     V++D+ S  +++    ++A+ K     +G    A+
Sbjct: 250 --MVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVAS 307

Query: 384 DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
             +        +D+   K+V    PEL+  F GGA + +PP NY    GN  +CL + + 
Sbjct: 308 PPKP-------YDLCFSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSS 360

Query: 442 ---NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              N  G   G   A ILG  Q +N ++ FDL  +   F    C+
Sbjct: 361 ASLNLTGELEG---ASILGSLQQENVHVLFDLKEETLSFKPADCS 402


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 114/399 (28%), Positives = 158/399 (39%), Gaps = 65/399 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+PP  S   + DTGS + W  C   ++   C  P VDP     F P  SS+ 
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQ--QCR-PQVDP----LFDPSLSSTY 192

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF--TAGLLLSETLR 221
               C +  C+ +F    E    GCS   +        Y+  YG G   T G   S+TL 
Sbjct: 193 SPFSCSSAACAQLF---QEGNANGCSSSGQC------QYIAMYGDGSVGTTGTYSSDTLA 243

Query: 222 FPSKT----VPNFLAGCSILSDRQPAGIAGFGRSS-------ESLPSQ----LGLKKFSY 266
             S +    V  F  GCS        GI G            +SL SQ     G   FSY
Sbjct: 244 LGSNSNTVVVSKFRFGCS----HAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSY 299

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           CL        P SS   L  G         G S   F K P+  SS    FY V L  I 
Sbjct: 300 CL-----PPTPSSSGF-LTLGAA-------GTSSAGFVKTPMLRSSQVPAFYGVRLEAIR 346

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           VG + + IP +        + G+I+DSG+  T +    + +++  F   M  Y  A    
Sbjct: 347 VGGRQLSIPTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSA 400

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFK--GGAKMALPPENY-FALVGNEVLCLILFTDNA 443
               L  CFD+SG+ SV +P + L F   GGA + L        +  + + CL     + 
Sbjct: 401 GGGFLDTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATS- 459

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                  G   I+G+ Q + F + +D+A    GF    C
Sbjct: 460 -----DDGSTGIIGNVQQRTFQVLYDVAGGAVGFKAGAC 493


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 120/453 (26%), Positives = 181/453 (39%), Gaps = 69/453 (15%)

Query: 48  SDSDPLKILHSL--ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG-GY 104
           +D+ PL+++  L    S LS  + L      + +     + +    I+  +     G  +
Sbjct: 2   TDTKPLRLVTGLIHQDSILSSYQSLDRNNVERRRTRR--AAFITDEIQANMVADDRGQAF 59

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
            ++ S G PP      I DTGS L+W  C     C DC   +      P F P +SS+  
Sbjct: 60  LVNFSVGRPPVPQLVGI-DTGSDLLWVQCRP---CADCFRQST-----PIFDPSKSSTYV 110

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP 223
            +   +P C     PN        SP+ K   L    Y   Y  G T+ G L +E + F 
Sbjct: 111 DLSYDSPIC-----PN--------SPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFE 157

Query: 224 SK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
           +      TV + + GC   +    D Q +GI G     +S+ S+LG  +FSYC+    FD
Sbjct: 158 TSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFD 215

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                + LVL      GD      S TPF+         F  FYYV L  I VG   + I
Sbjct: 216 PHYTHNQLVL------GDGVKMEGSSTPFH--------TFNGFYYVTLEGISVGETRLDI 261

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL--- 391
                     G GGV++DSG+T TF+    F+ ++ E  R +  + +        G    
Sbjct: 262 NPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCY 321

Query: 392 --RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
             R   D+ G      PEL   F  GA + L   + F     +V CL +   N       
Sbjct: 322 KGRVNEDLRG-----FPELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNL------ 370

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +    ++G    Q++ + +DL   R  F +  C
Sbjct: 371 KNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 119/402 (29%), Positives = 168/402 (41%), Gaps = 73/402 (18%)

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           PPQ +   + DTGS L W  C            + +P+ +  F P RSSS   I C +P 
Sbjct: 82  PPQ-NISMVIDTGSELSWLRCNR----------SSNPNPVNNFDPTRSSSYSPIPCSSPT 130

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGF-----TAGLLLSETLRFPSKT 226
           C                 R+   P +C S  L    L +     + G L +E   F + T
Sbjct: 131 CR-------------TRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNST 177

Query: 227 VP-NFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
              N + GC           D +  G+ G  R S S  SQ+G  KFSYC+     DD P 
Sbjct: 178 NDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFP- 234

Query: 279 SSNLVLDTGPGSGDSK----TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
              L+L      GDS     TP L+YTP  +            Y V L  I V  K + I
Sbjct: 235 -GFLLL------GDSNFTWLTP-LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPI 286

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVE--KKSGL 391
           P S L+P   G G  +VDSG+ FTF+ GP++ A+  +F+ Q  G  +   D E   +  +
Sbjct: 287 PKSVLLPDHTGAGQTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTM 346

Query: 392 RPCFDISGKKSV-----YLPELILKFKGGAKMALPPENYFALV-----GNE-VLCLILFT 440
             C+ IS  +        LP + L F+ GA++A+  +     V     GN+ V C     
Sbjct: 347 DLCYRISPFRIRTGILHRLPTVSLVFE-GAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGN 405

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +  G       A ++G    QN ++EFDL   R G A  +C
Sbjct: 406 SDLMGME-----AYVIGHHHQQNMWIEFDLQRSRIGLAPVQC 442


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 129/454 (28%), Positives = 191/454 (42%), Gaps = 78/454 (17%)

Query: 57  HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
           H++  S L RAR    + +   + SN   ++S      P  V   G Y   +  GTPP  
Sbjct: 33  HTVELSQL-RARD-ALRHRRMLQSSNGVVDFSVQGTFDPFQV---GLYYTKVQLGTPPVE 87

Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
               I DTGS ++W  C S   C  C   +    ++  F P  SS+S +I C + +C+  
Sbjct: 88  FNVQI-DTGSDVLWVSCNS---CSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCN-- 141

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR----FPSKTVPNFL 231
               ++S    CS +N  C     SY  QYG G  T+G  +S+ +     F      N  
Sbjct: 142 --NGIQSSDATCSSQNNQC-----SYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 194

Query: 232 A----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL--KKFSYCLLSRKFDD 275
           A    GCS         SDR   GI GFG+   S+ SQL   G+  + FS+CL      D
Sbjct: 195 APVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKG----D 250

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
           +     LVL      G+   P + YT      P          Y + L+ I V  + ++I
Sbjct: 251 SSGGGILVL------GEIVEPNIVYTSLVPAQP---------HYNLNLQSIAVNGQTLQI 295

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
             S  V  +  + G IVDSG+T  +    L E     F+  +      +     S    C
Sbjct: 296 DSS--VFATSNSRGTIVDSGTTLAY----LAEEAYDPFVSAITASIPQSVHTVVSRGNQC 349

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFTDNAAGPALGR 450
           + I+   +   P++ L F GGA M L P++Y     ++ G  V C+        G    +
Sbjct: 350 YLITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCI--------GFQKIQ 401

Query: 451 GPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G  I ILGD  L++  + +DLA  R G+A   C+
Sbjct: 402 GQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 435


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P  + T  + DTGS + W  C     C  C+    DP     F P  SS+ 
Sbjct: 198 YLITVGLGSPATSQT-MLIDTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 248

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
               C +  C+      +     GCS  ++        Y++ YG G  T G   S+TL  
Sbjct: 249 SPFSCGSADCA-----QLGQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 297

Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
            S  V +F  GCS +    + Q  G+ G G  ++SL SQ    LG + FSYCL       
Sbjct: 298 GSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 351

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P SS  +             G   + F K P+  SS    FY V L+ I VG + + IP
Sbjct: 352 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 404

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
            S        + G ++DSG+  T +    + A++  F   M  Y  A    + SG L  C
Sbjct: 405 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 454

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           FD SG+ SV +P + L F GGA ++L       ++ N   CL  F  N+   +LG     
Sbjct: 455 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAGNSDDSSLG----- 503

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q + F + +D+     GF    C
Sbjct: 504 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 531


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 125/446 (28%), Positives = 188/446 (42%), Gaps = 59/446 (13%)

Query: 49  DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
           D + ++ LHS  ++  S  R+  T  K +   S +    S + +K+ LS+ S G Y + +
Sbjct: 64  DEERVRFLHSRLTNKES-VRNSATTDKLRGGPSLV----STTPLKSGLSIGS-GNYYVKI 117

Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
             GTP +  +  I DTGSSL W  C     CV      VDP     F P  S + + + C
Sbjct: 118 GLGTPAKYFS-MIVDTGSSLSWLQCQP---CVIYCHVQVDP----IFTPSTSKTYKALPC 169

Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV 227
            + +CS +    + +   GCS     C      Y   YG   F+ G L  + L       
Sbjct: 170 SSSQCSSLKSSTLNA--PGCSNATGAC-----VYKASYGDTSFSIGYLSQDVLTLTPSEA 222

Query: 228 PN--FLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
           P+  F+ GC    D Q      +GI G      S+  QL  K    FSYCL S       
Sbjct: 223 PSSGFVYGCG--QDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNS 280

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PY 336
            S +  L  G  S       L+ +P+   P+  +      Y++ L  I V  K + +   
Sbjct: 281 SSLSGFLSIGASS-------LTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSAS 333

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
           SY VP        I+DSG+  T +   ++ A+ K F+  M    + A     S L  CF 
Sbjct: 334 SYNVP-------TIIDSGTVITRLPVAVYNALKKSFVLIMSK--KYAQAPGFSILDTCFK 384

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
            S K+   +PE+ + F+GGA + L   N    +     CL +        A    P  I+
Sbjct: 385 GSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIEKGTTCLAI--------AASSNPISII 436

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
           G++Q Q F + +D+AN + GFA   C
Sbjct: 437 GNYQQQTFKVAYDVANFKIGFAPGGC 462


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P  + T  + DTGS + W  C     C  C+    DP     F P  SS+ 
Sbjct: 128 YLITVGLGSPATSQT-MLIDTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 178

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
               C +  C+ +          GCS  ++        Y++ YG G  T G   S+TL  
Sbjct: 179 SPFSCGSAACAQL-----GQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 227

Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
            S  V +F  GCS +    + Q  G+ G G  ++SL SQ    LG + FSYCL       
Sbjct: 228 GSSAVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 281

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P SS  +             G   + F K P+  SS    FY V L+ I VG + + IP
Sbjct: 282 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 334

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
            S        + G ++DSG+  T +    + A++  F   M  Y  A    + SG L  C
Sbjct: 335 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 384

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           FD SG+ SV +P + L F GGA ++L       ++ N   CL  F  N+   +LG     
Sbjct: 385 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAANSDDSSLG----- 433

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q + F + +D+     GF    C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 120/453 (26%), Positives = 181/453 (39%), Gaps = 69/453 (15%)

Query: 48  SDSDPLKILHSL--ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG-GY 104
           +D+ PL+++  L    S LS  + L      + +     + +    I+  +     G  +
Sbjct: 2   TDTKPLRLVTGLIHQDSILSSYQSLDRNNVERRRTRR--AAFIXDEIQANMVADDRGQAF 59

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
            ++ S G PP      I DTGS L+W  C     C DC   +      P F P +SS+  
Sbjct: 60  LVNFSVGRPPVPQLVGI-DTGSDLLWVQCRP---CADCFRQST-----PIFDPSKSSTYV 110

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP 223
            +   +P C     PN        SP+ K   L    Y   Y  G T+ G L +E + F 
Sbjct: 111 DLSYDSPIC-----PN--------SPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFE 157

Query: 224 SK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
           +      TV + + GC   +    D Q +GI G     +S+ S+LG  +FSYC+    FD
Sbjct: 158 TSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFD 215

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                + LVL      GD      S TPF+         F  FYYV L  I VG   + I
Sbjct: 216 PHYTHNQLVL------GDGVKMEGSSTPFH--------TFNGFYYVTLEGISVGETRLDI 261

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL--- 391
                     G GGV++DSG+T TF+    F+ ++ E  R +  + +        G    
Sbjct: 262 NPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCY 321

Query: 392 --RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
             R   D+ G      PEL   F  GA + L   + F     +V CL +   N       
Sbjct: 322 KGRVNEDLRG-----FPELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNL------ 370

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +    ++G    Q++ + +DL   R  F +  C
Sbjct: 371 KNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 104/342 (30%), Positives = 147/342 (42%), Gaps = 49/342 (14%)

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
           P+F P RSS+ + + C  P+CS    P+    C G      +C     ++ L Y      
Sbjct: 145 PSFDPTRSSTYRPVRCGAPQCSQAPAPS----CPGG--LGSSC-----AFNLSYAASTFQ 193

Query: 213 GLLLSETLRFPSKT--VPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKF 264
            LL  + L        V  +  GC  +       P G+ GFGR   S PSQ        F
Sbjct: 194 ALLGQDALALHDDVDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVF 253

Query: 265 SYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           SYCL S K      SSN    L  GP +G  K   +  TP   NP   S      YYV +
Sbjct: 254 SYCLPSYK------SSNFSGTLRLGP-AGQPKR--IKTTPLLSNPHRPS-----LYYVNM 299

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             I VG + V +P S L        G IVD+G+ FT +  P++ AV   F  ++    RA
Sbjct: 300 VGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV----RA 355

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTD 441
                  G   C+++    ++ +P +   F G   + LP EN      +  + CL +   
Sbjct: 356 PVAGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAM--- 408

Query: 442 NAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            AAGP  G   A+ +L   Q QN  + FD+AN R GF+++ C
Sbjct: 409 -AAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 449


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 124/438 (28%), Positives = 178/438 (40%), Gaps = 77/438 (17%)

Query: 47  HSDSDPLKILHSLASSSL---SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG 103
           + D + +K ++S  S +L   S    L + T P    S IGS                G 
Sbjct: 101 NQDKERVKYINSRISKNLGQDSSVSELDSVTLPAKSGSLIGS----------------GN 144

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + +  GTP +     IFDTGS L W  C     C    +   D      F P +S+S 
Sbjct: 145 YFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEP---CARSCYKQQDA----IFDPSKSTSY 196

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             I C +  C+ +          GCS   K C      Y +QYG   F+ G    E L  
Sbjct: 197 SNITCTSTLCTQL--STATGNEPGCSASTKACI-----YGIQYGDSSFSVGYFSRERLSV 249

Query: 223 -PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
             +  V NFL GC    + Q      AG+ G GR   S   Q      K FSYCL     
Sbjct: 250 TATDIVDNFLFGCG--QNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL----- 302

Query: 274 DDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
              P +S+    TG  S G + T  + YTPF     GSS     FY + +  I VG    
Sbjct: 303 ---PATSS---STGRLSFGTTTTSYVKYTPFSTISRGSS-----FYGLDITGISVGG--A 349

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
           K+P S     +   GG I+DSG+  T +    + A+   F + M  Y  A ++   S L 
Sbjct: 350 KLPVS---SSTFSTGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGEL---SILD 403

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            C+D+SG +   +P++   F GG  + LPP+    +   + +CL  F  N     +    
Sbjct: 404 TCYDLSGYEVFSIPKIDFSFAGGVTVQLPPQGILYVASAKQVCL-AFAANGDDSDV---- 458

Query: 453 AIILGDFQLQNFYLEFDL 470
             I G+ Q +   + +D+
Sbjct: 459 -TIYGNVQQKTIEVVYDV 475


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 112/411 (27%), Positives = 171/411 (41%), Gaps = 71/411 (17%)

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           +S G Y   +  GTPP+     I DTGS ++W  C +   C +C   +     +  F   
Sbjct: 73  NSVGLYYTKVKMGTPPKEFNVQI-DTGSDILWVNCNT---CSNCPQSSQLGIELNFFDTV 128

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLS 217
            SS++ LI C +P C+      V+     CSPR   C     SY  QYG G  T+G  +S
Sbjct: 129 GSSTAALIPCSDPICT----SRVQGAAAECSPRVNQC-----SYTFQYGDGSGTSGYYVS 179

Query: 218 ETLRF--------PSKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL--- 259
           + + F           +    + GCSI        +D+   GI GFG    S+ SQL   
Sbjct: 180 DAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSR 239

Query: 260 GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
           G+  K FS+CL         +    +L+          P + Y+P   +           
Sbjct: 240 GITPKVFSHCLKGDGDGGGVLVLGEILE----------PSIVYSPLVPSQ--------PH 281

Query: 318 YYVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           Y + L+ I V  + + I P  + +  S+  GG IVD G+T  ++    ++ +       +
Sbjct: 282 YNLNLQSIAVNGQLLPINPAVFSI--SNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAV 339

Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNE 432
              +R    +  S    C+ +S       P + L F+GGA M L PE Y      L G E
Sbjct: 340 SQSAR----QTNSKGNQCYLVSTSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAE 395

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           + C I F     G       A ILGD  L++  + +D+A  R G+A   C+
Sbjct: 396 MWC-IGFQKFQEG-------ASILGDLVLKDKIVVYDIAQQRIGWANYDCS 438


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 114/409 (27%), Positives = 172/409 (42%), Gaps = 73/409 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+P +     I DTGS ++W  C +   C +C   +     +  F    SS
Sbjct: 81  GLYFTKVKLGSPAKDFYVQI-DTGSDILWINCIT---CSNCPHSSGLGIELDFFDTAGSS 136

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++ L+ C +P CS+     V++   GCS +   C     SY  QYG G  T G  +S+T+
Sbjct: 137 TAALVSCADPICSY----AVQTATSGCSSQANQC-----SYTFQYGDGSGTTGYYVSDTM 187

Query: 221 RFPS-----KTVPN----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL--- 261
            F +       V N     + GCS         +D+   GI GFG  + S+ SQL     
Sbjct: 188 YFDTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGV 247

Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
             K FS+CL   +         LVL      G+   P + Y+P   +           Y 
Sbjct: 248 TPKVFSHCLKGGENGGGV----LVL------GEILEPSIVYSPLVPSL--------PHYN 289

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           + L+ I V  +   +P    V  +  N G IVDSG+T  ++    +          +  +
Sbjct: 290 LNLQSIAVNGQ--LLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQF 347

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           S+   + K +    C+ +S       P++ L F GGA M L PE+Y    G        F
Sbjct: 348 SKPI-ISKGN---QCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYG--------F 395

Query: 440 TDNAAGPALG-----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            D+AA   +G     RG   ILGD  L++    +DLAN R G+A   C+
Sbjct: 396 LDSAAMWCIGFQKVERGFT-ILGDLVLKDKIFVYDLANQRIGWADYNCS 443


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 118/410 (28%), Positives = 173/410 (42%), Gaps = 63/410 (15%)

Query: 87  YSNSLIKTPLS-VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           + +SL  TP S V+  GG Y ++ S GTPP  +   + DTGS +VW  C    +C     
Sbjct: 68  FKDSLSNTPESTVYVNGGEYLMTYSVGTPP-FNVYGVVDTGSDIVWLQCKPCEQCY---- 122

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
                   P F P +SSS + I C +  C  +       R   C+ +N     +C   + 
Sbjct: 123 ----KQTTPIFNPSKSSSYKNIPCSSNLCQSV-------RYTSCNKQN-----SCEYTIN 166

Query: 205 QYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESL 255
                ++ G L  ETL   S T      P  + GC      +   + +GI G G    SL
Sbjct: 167 FSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSL 226

Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
            +QL      KFSYCLL    D    S     D    SGD    G+  TPF K    +  
Sbjct: 227 TTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGD----GVVSTPFVKKDPQA-- 280

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FYY+ L    VG+K  +I +  L    +GN  +I+DSG+T T +   ++  +    
Sbjct: 281 ----FYYLTLEAFSVGNK--RIEFEVLDDSEEGN--IILDSGTTLTLLPSHVYTNLESA- 331

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
           + Q+    R  D  +   L  C+ I+  +  + P +   FK GA + L P + FA V + 
Sbjct: 332 VAQLVKLDRVDDPNQL--LNLCYSITSDQYDF-PIITAHFK-GADIKLNPISTFAHVADG 387

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           V+CL  FT +  GP        I G+    N  + +DL  +   F    C
Sbjct: 388 VVCLA-FTSSQTGP--------IFGNLAQLNLLVGYDLQQNIVSFKPSDC 428


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 107/396 (27%), Positives = 159/396 (40%), Gaps = 70/396 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           + +++  GTP Q S   IFDTGS L W    PC S   C         P + P F P +S
Sbjct: 144 FVVAVGLGTPAQPSA-LIFDTGSDLSWVQCQPCGSSGHC--------HPQQDPLFDPSKS 194

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           S+   + C  P+C+        +    CS  N TC      YL++YG G  T G+L  +T
Sbjct: 195 STYAAVHCGEPQCA--------AAGDLCSEDNTTC-----LYLVRYGDGSSTTGVLSRDT 241

Query: 220 LRFP-SKTVPNFLAGCSILSDRQPAGIAGFGR---------SSESLPSQLGLK---KFSY 266
           L    S+ +  F  GC   +      +  FGR            SLPSQ        FSY
Sbjct: 242 LALTSSRALTGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSY 295

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           CL S        +  L +   P    + T    YT   + P      F  FY+V L  I 
Sbjct: 296 CLPSSN----STTGYLTIGATPA---TDTGAAQYTAMLRKP-----QFPSFYFVELVSID 343

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +G   + +P     P     GG ++DSG+  T++    +  +   F   M  Y+ A    
Sbjct: 344 IGGYVLPVP-----PAVFTRGGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPA---P 395

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
               L  C+D +G+  V +P +  +F  GA   L        +   V CL     +  G 
Sbjct: 396 PNDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDTGG- 454

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                P  I+G+ Q ++  + +D+A ++ GF    C
Sbjct: 455 ----LPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P  + T  + DTGS + W  C     C  C+    DP     F P  SS+ 
Sbjct: 128 YLITVGLGSPATSQT-MLIDTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 178

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
               C +  C+ +          GCS  ++        Y++ YG G  T G   S+TL  
Sbjct: 179 SPFSCGSADCAQL-----GQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 227

Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
            S  V +F  GCS +    + Q  G+ G G  ++SL SQ    LG + FSYCL       
Sbjct: 228 GSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 281

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P SS  +             G   + F K P+  SS    FY V L+ I VG + + IP
Sbjct: 282 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 334

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
            S        + G ++DSG+  T +    + A++  F   M  Y  A    + SG L  C
Sbjct: 335 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 384

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           FD SG+ SV +P + L F GGA ++L       ++ N   CL  F  N+   +LG     
Sbjct: 385 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAGNSDDSSLG----- 433

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q + F + +D+     GF    C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 128/498 (25%), Positives = 213/498 (42%), Gaps = 94/498 (18%)

Query: 7   SLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR 66
           SLI ++  LIL F           TV V    LS     +H +  P+ I   L++S++S 
Sbjct: 7   SLIVIYYPLILFFLD---------TVVV----LSATDIPNH-NHRPMIIPLHLSTSNISS 52

Query: 67  ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
            R   T    + +  N  S+  N+ ++    + S G Y+  L  GTPPQ     I DTGS
Sbjct: 53  HRKPFTSNYHRRQLHN--SDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFA-LIVDTGS 109

Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
           ++ + PC++   C  C        + P F P+ SS+ + + C NP C+            
Sbjct: 110 TVTYVPCST---CEQCG-----KHQDPRFQPESSSTYKPMQC-NPSCN------------ 148

Query: 187 GCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK---TVPNFLAGCSILS---- 238
            C    K C     +Y  +Y  +  ++GLL  + L F ++   T    + GC  +     
Sbjct: 149 -CDDEGKQC-----TYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVETGEL 202

Query: 239 -DRQPAGIAGFGRSSESLPSQLGLKK-----FSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
             ++  GI G GR   S+  QL +K+     FS C          V   +VL   P    
Sbjct: 203 FSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDV----VGGAMVLGNIP---- 254

Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIV 351
              P + +   + +P  S+     +Y + L+++ V  K +K+ P  +     DG  G ++
Sbjct: 255 -PPPDMVFA--HSDPYRSA-----YYNIELKELHVAGKRLKLNPRVF-----DGKHGTVL 301

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK----SVYLPE 407
           DSG+T+ ++    F A     I+++  + +       S    CF  +G+     S   PE
Sbjct: 302 DSGTTYAYLPEEAFVAFKDAIIKEI-KFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPE 360

Query: 408 LILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
           + + F  G K++L PENY       +   CL +F +       G+ P  +LG   ++N  
Sbjct: 361 VNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQN-------GKDPTTLLGGIVVRNTL 413

Query: 466 LEFDLANDRFGFAKQKCA 483
           + +D  ND+ GF K  C+
Sbjct: 414 VTYDRDNDKIGFWKTNCS 431


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 119/425 (28%), Positives = 172/425 (40%), Gaps = 58/425 (13%)

Query: 67  ARHLKTKTKPKTKDSNIG-SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
            R L        KDS    +N++  +I + +   S G Y + +  G+PP+ +   + D+G
Sbjct: 107 VRRLSHGAPAAVKDSRYKVANFATDVI-SGMEAGS-GEYFVRIGVGSPPR-NQYMVIDSG 163

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S +VW  C    RC    +   DP     F P  SSS   + C +  C  +         
Sbjct: 164 SDIVWVQCKPCSRC----YQQSDP----VFDPADSSSFAGVSCGSDVCDRL--------- 206

Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ--- 241
                 N  C      Y + YG G +T G L  ETL      + +   GC   +      
Sbjct: 207 -----ENTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQVMIRDVAIGCGHTNQGMFIG 261

Query: 242 PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
            AG+ G G  S S   QLG +    FSYCL+SR        S   L+ G G+      G 
Sbjct: 262 AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRG-----TGSTGALEFGRGA---LPVGA 313

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
           ++    +NP   S     FYY+GL  I VG   V +P         G  GV++D+G+  T
Sbjct: 314 TWISLIRNPRAPS-----FYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVT 368

Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
                 + A    F  Q  N  RA  V   S    C+D++G +SV +P +   F  G  +
Sbjct: 369 RFPTAAYVAFRDSFTAQTSNLPRAPGV---SIFDTCYDLNGFESVRVPTVSFYFSDGPVL 425

Query: 419 ALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
            LP  N+   V G    CL  F  + +G +       I+G+ Q +   + FD AN   GF
Sbjct: 426 TLPARNFLIPVDGGGTFCL-AFAPSPSGLS-------IIGNIQQEGIQISFDGANGFVGF 477

Query: 478 AKQKC 482
               C
Sbjct: 478 GPNIC 482


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 114/397 (28%), Positives = 184/397 (46%), Gaps = 70/397 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y ++LS GTPP      + DTGS+L+W  C     C DC +  VD    P F PK SS
Sbjct: 92  GEYLMNLSLGTPPSPIMA-VADTGSNLIWTQCKP---CDDC-YTQVD----PLFDPKASS 142

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           + + + C + +C+ +     E++   CS  +KTC     SYL+ Y  G +T G    +TL
Sbjct: 143 TYKDVSCSSSQCTAL-----ENQA-SCSTEDKTC-----SYLVSYADGSYTMGKFAVDTL 191

Query: 221 RFPSK-----TVPNFLAGC----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
              S       + N + GC    ++    + +G+ G G  + SL  QLG     KFSYCL
Sbjct: 192 TLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCL 251

Query: 269 LSRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           +      + ++  +N V+     SG    PG   TP          +   FYY+ L+ I 
Sbjct: 252 VPENDQTSKINFGTNAVV-----SG----PGTVSTPLV------VKSRDTFYYLTLKSIS 296

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           VGSK+++       P S+  G +++DSG+T T +    +  + +  +  + N  ++ D  
Sbjct: 297 VGSKNMQ------TPDSNIKGNMVIDSGTTLTLLPVKYYIEI-ENAVASLINADKSKDER 349

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
             S L  C++ +    + +P + + F+ GA + L P N F  V  +++CL      A G 
Sbjct: 350 IGSSL--CYNATA--DLNIPVITMHFE-GADVKLYPYNSFFKVTEDLVCL------AFGM 398

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +  R    I G+   +NF + +D A+    F    CA
Sbjct: 399 SFYRNG--IYGNVAQKNFLVGYDTASKTMSFKPTDCA 433


>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
           Group]
          Length = 260

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 85/268 (31%), Positives = 122/268 (45%), Gaps = 28/268 (10%)

Query: 216 LSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
           ++ET  F   +   P    GC++ S+      +G+ G GR   SL +QL ++ F Y L S
Sbjct: 1   MTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS 60

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
                +P+S   + D   G+GDS       TP   NPV        FYYVGL  I VG K
Sbjct: 61  DLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQDL---PFYYVGLTGISVGGK 113

Query: 331 HVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
            V+IP  ++    S G GGVI DSG+T T +  P +  V  E + QMG + +        
Sbjct: 114 LVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQKPPPAANDD 172

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAG 445
            L  CF   G  +   P ++L F GGA M L  ENY   +    G    C  +   + A 
Sbjct: 173 DLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQA- 229

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLAND 473
                    I+G+    +F++ FDL+ +
Sbjct: 230 -------LTIIGNIMQMDFHVVFDLSGN 250


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 114/427 (26%), Positives = 169/427 (39%), Gaps = 99/427 (23%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           ++ ++ GTPPQ  T  + DTGS L W  C  SR+                 F    SSS 
Sbjct: 64  TVPVAVGTPPQNVT-MVLDTGSELSWLLCNGSRHDA--------------PFDASASSSY 108

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
             + C +P C+W+ G ++        P    C  +     L Y    +A GLL ++T   
Sbjct: 109 APVPCSSPACTWL-GRDL--------PVRPFCDSSACRVSLSYADASSADGLLAADTFLL 159

Query: 223 PSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
            S  +P    GC       +  S+  P G+ G  R   S  +Q   ++F+YC+ + +   
Sbjct: 160 GSSPMPALF-GCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQ--- 215

Query: 276 APVSSNLVLDTGPG-----SGDSKTP-------GLSYTPFYKNPVGSSSAFGEFYYVGLR 323
                      GPG       D++TP        L+YTP  +            Y V L 
Sbjct: 216 -----------GPGILLLGGNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLE 264

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I VGS  + IP   L P   G G  +VDSG+ FTF+    + A+  EF  Q+   +R+ 
Sbjct: 265 GIRVGSALLAIPKHLLTPDHTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQL---TRSL 321

Query: 384 DVEKKSGLRP--------------CFDISGKKSV------YLPELILKFKGGAKMALPPE 423
           D     GL P              CF  +  +         LPE+ L  +G   +    E
Sbjct: 322 D----GGLAPLGEPGFVFQGAFDACFRGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAE 377

Query: 424 NYFALV-------GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
                V       G  V CL   + + AG +     A ++G    Q+ ++E+DL N R G
Sbjct: 378 KLLYRVPGERRGEGEGVWCLTFGSSDMAGVS-----AYVIGHHHQQDVWVEYDLRNARLG 432

Query: 477 FAKQKCA 483
           FA  +CA
Sbjct: 433 FAAARCA 439


>gi|343161843|dbj|BAK57511.1| extracellular dermal glycoprotein [Nicotiana benthamiana]
          Length = 440

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 115/413 (27%), Positives = 174/413 (42%), Gaps = 78/413 (18%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +  + T  Q  TP +      D G   +W         VDC+   V  S  PA    R  
Sbjct: 46  TFQYLTQIQQRTPLVPVSLTLDLGGQFLW---------VDCDQGYVSSSYKPA----RCR 92

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           S+Q    +   C   F P       GC+  N TC L   + + Q     T+G L S+T++
Sbjct: 93  SAQCSLARAGGCGQCFSPPKP----GCN--NDTCGLIPDNTVTQTA---TSGELASDTVQ 143

Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
             S    N           F+ G + L  R  +G+   AG GR+  SLPSQ        +
Sbjct: 144 VQSSNGKNPGRNVVDKDFLFVCGSTFLLKRLASGVKGMAGLGRTRISLPSQFSAEFSFPR 203

Query: 263 KFSYCLLSRK-------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF- 314
           KF+ CL S         F D P S    L     + D      SYTP + NPV ++SAF 
Sbjct: 204 KFAVCLSSSTKSKGVVLFGDGPYS---FLPNREFANDD----FSYTPLFINPVSTASAFS 256

Query: 315 -GE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
            GE    Y++G++ I +  K V I  + L   + G GG  + + + +T +E  ++ AV  
Sbjct: 257 SGEPSSEYFIGVKSIKINQKVVSINTTLLSIDNQGVGGTKISTVNPYTILETSIYNAVTN 316

Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFD----ISGKKSVYLPELILKFKG-GAKMALPPENY 425
            F++++ N +R A V        CFD    +S +    +P + L  +       +   N 
Sbjct: 317 FFVKELVNITRVASVAP---FGACFDSRNIVSTRVGPTVPPIDLVLQNENVFWTIFGANS 373

Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
              V   VLCL  F D    P      +I++G + +++  L+FDLA+ R GF 
Sbjct: 374 MVQVSENVLCL-GFVDGGVNPR----TSIVIGGYTIEDNLLQFDLASSRLGFT 421


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 109/378 (28%), Positives = 157/378 (41%), Gaps = 57/378 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTPPQ +     DT +   W PCT+   C               F P++S++ 
Sbjct: 93  YIVRAKIGTPPQ-TLLLAMDTSNDAAWIPCTACDGCAST-----------LFAPEKSTTF 140

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C  P+C  +  P       G S RN         + L YG    A  L+ +T+   
Sbjct: 141 KNVSCAAPECKQVPNPGC-----GVSSRN---------FNLTYGSSSIAANLVQDTITLA 186

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           +  VP++  GC   +  +   P G+ G GR   SL SQ   L    FSYCL S  F    
Sbjct: 187 TDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 244

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP +   +   + YTP  KNP  SS      YYV L  I VG K V IP +
Sbjct: 245 FSGSLRL--GPVAQPKR---IKYTPLLKNPRRSS-----LYYVNLEAIRVGRKVVDIPPA 294

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P++ AV  EF R++G       V    G   C+++
Sbjct: 295 ALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVG---PKLTVTSLGGFDTCYNV 351

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                + +P +   F  G  + LP +N           CL +    A  P        ++
Sbjct: 352 ----PIVVPTITFIFT-GMNVTLPQDNILIHSTAGSTTCLAM----AGAPDNVNSVLNVI 402

Query: 457 GDFQLQNFYLEFDLANDR 474
            + Q QN  + +D+ N R
Sbjct: 403 ANMQQQNHRVLYDVPNSR 420


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 111/399 (27%), Positives = 158/399 (39%), Gaps = 64/399 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  ++  GTP +  +  I DTGS L W  C+    C   N           FIP  S+
Sbjct: 1   GEYLATVRLGTPERVFS-VIVDTGSDLTWVQCSPCGTCYSQN--------DSLFIPNTST 51

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG------FTAGLL 215
           S   + C    C+ +  P               C      Y   YG G      F    +
Sbjct: 52  SFTKLACGTELCNGLPYP--------------MCNQTTCVYWYSYGDGSLSTGDFVYDTI 97

Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGL---KKFSYCLL 269
             + +    + VPNF  GC   ++   AG   I G G+   S PSQL      KFSYCL+
Sbjct: 98  TMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLV 157

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
              +   P  ++ +L      GD+  P   G+ Y     NP         +YYV L  I 
Sbjct: 158 --DWLAPPTQTSPLL-----FGDAAVPTFPGVKYISLLTNP-----KVPTYYYVKLNGIS 205

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           VG K + I  +     S G  G I DSG+T T + G + + V         +Y R +D  
Sbjct: 206 VGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSD-- 263

Query: 387 KKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNAA 444
             SGL  C     +  +  +P +   F+GG  M LPP NYF  L  ++  C  +     +
Sbjct: 264 DSSGLDLCLGGFAEGQLPTVPSMTFHFEGG-DMELPPSNYFIFLESSQSYCFSM----VS 318

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            P +      I+G  Q QNF + +D    + GF  + C 
Sbjct: 319 SPDV-----TIIGSIQQQNFQVYYDTVGRKIGFVPKSCV 352


>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
          Length = 191

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 58/177 (32%), Positives = 96/177 (54%), Gaps = 13/177 (7%)

Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
           G  +    FYYV ++ +IVG + + IP       ++G GG I+DSG+T ++   P +E +
Sbjct: 24  GKENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEII 83

Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-A 427
            + F+ ++  Y    D      L+PC+++SG + + LP   + F  GA    P ENYF  
Sbjct: 84  KQAFVNKVKRYPILDDFPI---LKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIK 140

Query: 428 LVGNEVLCL-ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           L   +++CL IL T ++A          I+G++Q QNF++ +D    R GFA ++CA
Sbjct: 141 LEPEDIVCLAILGTPHSAMS--------IIGNYQQQNFHILYDTKRSRLGFAPRRCA 189


>gi|2245012|emb|CAB10432.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268406|emb|CAB78698.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1046

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 118/411 (28%), Positives = 169/411 (41%), Gaps = 86/411 (20%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS----------------SSQL 165
            DTGS LVWFPC   + C+ C    + PS   +     ++                SS L
Sbjct: 130 LDTGSDLVWFPCRP-FTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSDL 188

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK 225
               N    +I           C+    T    CP +   YG G     L S++L  PS 
Sbjct: 189 CAISNCPLDFI-------ETGDCN----TSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSV 237

Query: 226 TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAPVS 279
           +V NF  GC+  +  +P G+AGFGR   SLP+QL +        FSYCL+S  FD   V 
Sbjct: 238 SVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVR 297

Query: 280 --SNLVLDTGPGSGDSKT----------------PGLSYTPFYKNPVGSSSAFGEFYYVG 321
             S L+L       + +                     +T   +NP         FY V 
Sbjct: 298 RPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP-----KHPYFYSVS 352

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YS 380
           L+ I +G +++  P        +G GGV+VDSG+TFT +    + +V +EF  ++G  + 
Sbjct: 353 LQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHE 412

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG-AKMALPPENYFALVGN-------- 431
           RA  VE  S                  L+L F G  + + LP  NYF    +        
Sbjct: 413 RADRVEPSSA-----------------LVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEK 455

Query: 432 -EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
            ++ CL+L          G G   ILG++Q Q F + +DL N R GFAK+ 
Sbjct: 456 RKIGCLMLMNGGDESELRG-GTGAILGNYQQQGFEVVYDLLNRRVGFAKRN 505


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 112/408 (27%), Positives = 169/408 (41%), Gaps = 71/408 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+P +     I DTGS ++W  C +   C +C   +     +  F    SS
Sbjct: 81  GLYFTKVKLGSPAKEFYVQI-DTGSDILWINCIT---CSNCPHSSGLGIELDFFDTAGSS 136

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++ L+ C +P CS+     V++    CS +   C     SY  QYG G  T G  +S+T+
Sbjct: 137 TAALVSCGDPICSYA----VQTATSECSSQANQC-----SYTFQYGDGSGTTGYYVSDTM 187

Query: 221 RFPS-----KTVPN----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL--- 261
            F +       V N     + GCS         +D+   GI GFG  + S+ SQL     
Sbjct: 188 YFDTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGV 247

Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
             K FS+CL   +         LVL      G+   P + Y+P   +           Y 
Sbjct: 248 TPKVFSHCLKGGENGGGV----LVL------GEILEPSIVYSPLVPSQ--------PHYN 289

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           + L+ I V  +   +P    V  +  N G IVDSG+T  ++    +    K     +  +
Sbjct: 290 LNLQSIAVNGQ--LLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQF 347

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           S+   + K +    C+ +S       P++ L F GGA M L PE+Y    G        F
Sbjct: 348 SKPI-ISKGN---QCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYG--------F 395

Query: 440 TDNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            D AA   +G         ILGD  L++    +DLAN R G+A   C+
Sbjct: 396 LDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLANQRIGWADYDCS 443


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 117/431 (27%), Positives = 180/431 (41%), Gaps = 70/431 (16%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV-HSYGG--YSISLSFGTPPQASTPFIF 122
           RA +++ K    ++ +N+      S +  P S  +S G   Y I+++ GTP       I 
Sbjct: 90  RAAYIQAKVS--SRYNNVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSI- 146

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN-- 180
           DTGS + W       +C  C   +    +   F P  S++     C + +C+ +      
Sbjct: 147 DTGSDVSWV------QCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDEGNG 200

Query: 181 -VESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS-KTVPNFLAGCSIL 237
            ++S+C+               Y+++YG G  TAG   S+TL   S   V +F  GCS  
Sbjct: 201 CLKSQCQ---------------YIVKYGDGSNTAGTYGSDTLSLTSSDAVKSFQFGCSHR 245

Query: 238 SDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
           +     +  G+ G G  +ESL SQ      K FSYCL        P SS     T   +G
Sbjct: 246 AAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCL------PPPSSSGGGFLTLGAAG 299

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
            + +   S+TP  +  V +      FY V L+ I V    + +P S        +G  +V
Sbjct: 300 GASSSRYSHTPMVRFSVPT------FYGVFLQGITVAGTMLNVPASVF------SGASVV 347

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DSG+  T +    ++A+   F ++M  Y  AA V     L  CFD SG  ++ +P + L 
Sbjct: 348 DSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGS---LDTCFDFSGFNTITVPTVTLT 404

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F  GA M L       + G      + FT  A       G   ILG+ Q + F + FD+ 
Sbjct: 405 FSRGAAMDLD------ISGILYAGCLAFTATAH-----DGDTGILGNVQQRTFEMLFDVG 453

Query: 472 NDRFGFAKQKC 482
               GF    C
Sbjct: 454 GRTIGFRSGAC 464


>gi|449432731|ref|XP_004134152.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
 gi|449527081|ref|XP_004170541.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 429

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 122/409 (29%), Positives = 177/409 (43%), Gaps = 58/409 (14%)

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
           P++ H    Y I +   TP         D G  L+W         VDC+   V  S  PA
Sbjct: 35  PVTKHPSLQYIIQIHQRTP-LVPVNLTVDLGGWLMW---------VDCDRGFVSSSYKPA 84

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTA 212
               R  S+Q    ++  C   + P       GC+  N TC L+  + ++Q   G   T+
Sbjct: 85  ----RCRSAQCSLAKSISCGKCYLP----PHPGCN--NYTCSLSARNTIIQLSSGGEVTS 134

Query: 213 GLL-LSETLRFPSK---TVPNFLAGCS---ILSDRQ--PAGIAGFGRSSESLPSQLGLKK 263
            L+ +S T  F S    +VPNFL  CS   +L        G+AGFGR+  SLPSQ     
Sbjct: 135 DLVSVSSTNGFNSTRALSVPNFLFICSSTFLLEGLAGGVTGMAGFGRTRISLPSQFA-AA 193

Query: 264 FSYCLLSRKFDDAPVSSNL---VLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFG 315
           FS+   SRKF      S     V+ +G G          T  L+YTP   NPVG +    
Sbjct: 194 FSF---SRKFTMCLSGSTGFPGVIFSGYGPYHFLPNIDLTNSLTYTPLLINPVGFAGEKS 250

Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
             Y++G++ I   SK V +  + L   S+GNGG  + + + +T +E  ++ A+ K F  +
Sbjct: 251 SEYFIGVKSIEFNSKTVPLNTTLLKIDSNGNGGTKISTVNPYTVLETSIYRALVKTFTSE 310

Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYL------PELILKFKGGAKMALPPENYFALV 429
           +GN  R A V        C+      S  L       +LIL+ K      +   N   +V
Sbjct: 311 LGNIPRVAAVAP---FEVCYSSKSFGSTELGPSVPSIDLILQNK-KVIWRMFGANSMVVV 366

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
             EVLCL                A+++G  Q+++  LEFDLA  R GF+
Sbjct: 367 TEEVLCLGFVEGGVEAET-----AMVIGGHQIEDNLLEFDLATSRLGFS 410


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 150/350 (42%), Gaps = 52/350 (14%)

Query: 93  KTPLSVHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           K P++    GG Y +  S G PP      + DTGS L+W  C+    C  CN P   PS 
Sbjct: 75  KAPVTKSQKGGKYIMQFSIGEPPLLIWAEV-DTGSDLMWVKCSP---CNGCNPP---PS- 126

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-- 209
            P + P RS SS  + C +  C  +       R +  S +    P  C  Y   YG    
Sbjct: 127 -PLYDPARSRSSGKLPCSSQLCQAL------GRGRIISDQCSDDPPLC-GYHYAYGHSGD 178

Query: 210 -FTAGLLLSETLRFPSKTVPNFLA-GCSILSDRQP----AGIAGFGRSSESLPSQLGLKK 263
             T G+L +ET  F    V N ++ G S   D       AG+ G GR   SL SQLG  +
Sbjct: 179 HSTQGVLGTETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGR 238

Query: 264 FSYCLLSRKFDDAPVSSNLV------LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
           F+YCL +    D  V S ++      LDT  G        +S TP   NP          
Sbjct: 239 FAYCLAA----DPNVYSTILFGSLAALDTSAGD-------VSSTPLVTNPKPDRDTH--- 284

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           YYV L+ I VG   + I        SDG+GGV  DSG+  T ++   ++ V +    ++ 
Sbjct: 285 YYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQ 344

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYF 426
                A      G   CF  + +++V  +P L+L F  GA M+L   NY 
Sbjct: 345 RLGYDA------GDDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYL 388


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 116/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P  + T  I DTGS + W  C     C  C+    DP     F P  SS+ 
Sbjct: 52  YLITVGLGSPATSQTMLI-DTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 102

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
               C +  C+ +          GCS  ++        Y++ YG G  T G   S+TL  
Sbjct: 103 SPFSCGSADCAQL-----GQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 151

Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
            S  V +F  GCS +    + Q  G+ G G  ++SL SQ    LG + FSYCL       
Sbjct: 152 GSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 205

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P SS  +             G   + F K P+  SS    FY V L+ I VG + + IP
Sbjct: 206 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 258

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
            S        + G ++DSG+  T +    + A++  F   M  Y  A    + SG L  C
Sbjct: 259 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 308

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           FD SG+ SV +P + L F GGA ++L       ++ N   CL  F  N+   +LG     
Sbjct: 309 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAGNSDDSSLG----- 357

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q + F + +D+     GF    C
Sbjct: 358 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 117/445 (26%), Positives = 191/445 (42%), Gaps = 80/445 (17%)

Query: 54  KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
           + +  L + S +R R +  +    +  S  G+    + +++PL     GGY + +S GTP
Sbjct: 10  EAIRGLVAKSHARVRWMAARANSSSWSSMAGT----TDVESPLHPDG-GGYVMDISVGTP 64

Query: 114 PQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
            +     I DTGS LVW    PCT       C+   +       F P++SS+ + + C +
Sbjct: 65  GKRFRA-IADTGSDLVWVQSEPCTG------CSGGTI-------FDPRQSSTFREMDCSS 110

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-----SK 225
             C+ + G         C P +  C     SY  +YG G T G    +T+        S+
Sbjct: 111 QLCTELPGS--------CEPGSSAC-----SYSYEYGSGETEGEFARDTISLGTTSGGSQ 157

Query: 226 TVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSS 280
             P+F  GC +++       G+ G G+   SL SQL      KFSYCL+    +    SS
Sbjct: 158 KFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESS 215

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            L+   GP +    T G+  T         S  +  +Y + +  I V  + +  P     
Sbjct: 216 PLLF--GPSAALHGT-GIQSTKITP----PSDTYPTYYLLTVNGIAVAGQTMGSP----- 263

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFDISG 399
                 G  I+DSG+T T++   ++  V    + +M +      V+  S GL  C+D S 
Sbjct: 264 ------GTTIIDSGTTLTYVPSGVYGRV----LSRMESMVTLPRVDGSSMGLDLCYDRSS 313

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGPALGRGPAIILG 457
            ++   P L ++  G A M  P  NYF +V +  + +CL      A G A G  P  I+G
Sbjct: 314 NRNYKFPALTIRLAG-ATMTPPSSNYFLVVDDSGDTVCL------AMGSAGGL-PVSIIG 365

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           +   Q +++ +D  +    F + KC
Sbjct: 366 NVMQQGYHILYDRGSSELSFVQAKC 390


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 155/387 (40%), Gaps = 58/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +  + GTP Q       DT +   W PC     CV C+           F    S++ 
Sbjct: 90  YIVKANVGTPAQTFL-MALDTSNDAAWIPCNG---CVGCSST--------VFNSVTSTTF 137

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + +GC  P+C  +  P     C G      TC     ++   YG       L  +T+   
Sbjct: 138 KTLGCDAPQCKQVPNPT----CGG-----STC-----TWNTTYGGSTILSNLTRDTIALS 183

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           +  VP +  GC   +  S   P G+ G GR   S  SQ   L    FSYCL S  F    
Sbjct: 184 TDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLN 241

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S  L L  GP     +   +  TP  KNP  SS      YYV L  I VG K V IP S
Sbjct: 242 FSGTLRL--GPAGQPLR---IKTTPLLKNPRRSS-----LYYVNLIGIRVGRKIVDIPAS 291

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P++ AV  EF +++GN    A V    G   C+  
Sbjct: 292 ALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGN----AIVSSLGGFDTCY-- 345

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                +  P +   F  G  + LPP+N           CL +    AA P        ++
Sbjct: 346 --TGPIVAPTMTFMFS-GMNVTLPPDNLLIRSTAGSTSCLAM----AAAPDNVNSVLNVI 398

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + Q QN  + FD+ N R G A++ C+
Sbjct: 399 ANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 166/404 (41%), Gaps = 73/404 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y   +  GTPPQ     + DTGS + W    PCT+  R  +   P      I  F P+
Sbjct: 46  GLYYTRIYLGTPPQQFYVHV-DTGSDVAWVNCVPCTNCKRASNVALP------ISIFDPE 98

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLS 217
           +S+S   I C + +C       + S  K CS  + +CP     Y   YG G  TAG L++
Sbjct: 99  KSTSKTSISCTDEECY------LASNSK-CSFNSMSCP-----YSTLYGDGSSTAGYLIN 146

Query: 218 ETLRFPSKTVPNFLA---------GC------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
           + L F      N  A         GC      + L+D    G+ GFG++  SLPSQL  +
Sbjct: 147 DVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWLTD----GLVGFGQAEVSLPSQLSKQ 202

Query: 263 KFSYCLLSRKFD-DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
             S  + +     D   S  LV+      G  + PGL YTP               Y V 
Sbjct: 203 NVSVNIFAHCLQGDNKGSGTLVI------GHIREPGLVYTPIVPKQ--------SHYNVE 248

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L  I V   +V  P ++ +  S   GGVI+DSG+T T++  P ++    +    M     
Sbjct: 249 LLNIGVSGTNVTTPTAFDLSNS---GGVIMDSGTTLTYLVQPAYDQFQAKVRDCM----- 300

Query: 382 AADVEKKSGLRP-CFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILF 439
                 +SG+ P  F        Y P + L F GGA M L P +Y +  +    L    F
Sbjct: 301 ------RSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCF 354

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +   +    G     I GD  L++  + +D  N+R G+    C 
Sbjct: 355 SWLESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCT 398


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 121/445 (27%), Positives = 174/445 (39%), Gaps = 77/445 (17%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           +  +  +AS   +R R+L + T  KT  + I S            V + G Y + +  GT
Sbjct: 53  MNTVIDMASKDPARIRYLSSLTAQKTVAAPIASGQQ---------VLNVGNYVVRVQLGT 103

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           P Q +   + DT +   W PC+    C+ C       S    F  + SS+   + C  P+
Sbjct: 104 PGQ-TMYMVLDTSNDAAWAPCSG---CIGC-------SSTTTFSAQNSSTFATLDCSKPE 152

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPS-------YLLQYGLGFT-AGLLLSETLRFPS 224
           C+   G                  L+CP+       +   YG   T +  L+ ++L    
Sbjct: 153 CTQARG------------------LSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGP 194

Query: 225 KTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPV 278
             +PNF  GC   +  S   P G+ G GR   SL SQ G      FSYCL S  F     
Sbjct: 195 NVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPS--FKSYYF 252

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
           S +L L  GP  G  K   +  TP   NP   S      YYV L  I VG   V I    
Sbjct: 253 SGSLKL--GP-VGQPK--AIRTTPLLHNPHRPS-----LYYVNLTGISVGRVLVPISPEL 302

Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
           L    +   G I+DSG+  T     ++ AV  EF +Q+G                CF  +
Sbjct: 303 LAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVG-----GSFSPLGAFDTCFATN 357

Query: 399 GKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
            + S   P + L    G  + LP EN         + CL +    AA P        ++ 
Sbjct: 358 NEVSA--PAITLHLS-GLDLKLPMENSLIHSSAGSLACLAM----AAAPNNVNSVVNVIA 410

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           + Q QN  + FD+ N + G A++ C
Sbjct: 411 NLQQQNHRILFDINNSKLGIARELC 435


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 119/404 (29%), Positives = 182/404 (45%), Gaps = 79/404 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++S GTPP      I DTGS L+W  C     C DC +  VD    P F PK SS
Sbjct: 92  GEYLMNISLGTPPFPIMA-IADTGSDLLWTQCKP---CDDC-YTQVD----PLFDPKASS 142

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + + + C + +C+ +     E++   CS  + TC     SY   YG   +T G +  +TL
Sbjct: 143 TYKDVSCSSSQCTAL-----ENQA-SCSTEDNTC-----SYSTSYGDRSYTKGNIAVDTL 191

Query: 221 RFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
              S       + N + GC   +    +++ +GI G G  + SL +QLG     KFSYCL
Sbjct: 192 TLGSTDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCL 251

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
           +    ++   S    ++ G  +  S T G+  TP        + +   FYY+ L+ I VG
Sbjct: 252 VPLTSENDRTSK---INFGTNAVVSGT-GVVSTPLI------AKSQETFYYLTLKSISVG 301

Query: 329 SKHVKIPYSYLVPGSD---GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
           SK V+       PGSD   G G +I+DSG+T T +          EF  ++ +  + + D
Sbjct: 302 SKEVQ------YPGSDSGSGEGNIIIDSGTTLTLL--------PTEFYSELEDAVASSID 347

Query: 385 VEKK----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
            EKK    +GL  C+  +G   V  P + + F  GA + L P N F  +  +++C     
Sbjct: 348 AEKKQDPQTGLSLCYSATGDLKV--PAITMHFD-GADVNLKPSNCFVQISEDLVCF---- 400

Query: 441 DNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 A    P+  I G+    NF + +D  +    F    CA
Sbjct: 401 ------AFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 438


>gi|449527083|ref|XP_004170542.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Cucumis
           sativus]
          Length = 432

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 121/419 (28%), Positives = 178/419 (42%), Gaps = 73/419 (17%)

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
           P++ H  G Y   +   TP         D G   +W         VDC+   V  S    
Sbjct: 33  PVTKHPSGQYITQIRQRTP-LVPVKLTVDLGGQFMW---------VDCDRGYVSSS---- 78

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           + P R  S+Q    ++  C   F P       GC+  N TC     + ++Q     T+G 
Sbjct: 79  YKPVRCRSAQCSLSKSTSCGDCFSPPXP----GCN--NNTCGHFPGNTIIQLS---TSGE 129

Query: 215 LLSETLRFPSK---------TVPNFLAGC--SILSDRQPAGI---AGFGRSSESLPSQLG 260
           + S+ L   S          ++PNFL  C  + L +    G+   AGFGR+  SLPSQ  
Sbjct: 130 VTSDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFS 189

Query: 261 L-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG-----SGDSKTPGLSYTPFYKNPVGS 310
                 +KF+ CL           S  V+ +G G          T  L+YTP + NPV +
Sbjct: 190 AAFSFNRKFAVCL------SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVST 243

Query: 311 S--SAFGE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
           +  S  GE    Y++G++ I+  SK V I  + L   S+GNGG  + +   +T +E  ++
Sbjct: 244 AGVSTSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIY 303

Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMA 419
            A+ K   R++ N  R A V        C+      S  L       +LIL+ K      
Sbjct: 304 NALVKTITRELRNIPRVAAVAP---FGVCYKSKSFGSTRLGPGMPSIDLILQNK-KVIWR 359

Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           +   N    V  EVLCL  F D   G    R  AI++G +Q+++  LEFDLA  R GF+
Sbjct: 360 IFGANSMVQVNEEVLCL-GFVD---GGVEAR-TAIVIGAYQMEDNLLEFDLATSRLGFS 413


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 110/393 (27%), Positives = 152/393 (38%), Gaps = 70/393 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y IS+  GTP    T  I DTGS + W       +C  C  P         F P +SS+ 
Sbjct: 127 YVISVGLGTPAVTQTVTI-DTGSDVSWV------QCNPCPNPPCHAQTGALFDPAKSSTY 179

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE---TL 220
           + + C   +C+ +     E +  GC   N  C      Y +QYG G T     S    TL
Sbjct: 180 RAVSCAAAECAQL-----EQQGNGCGATNYEC-----QYGVQYGDGSTTNGTYSRDTLTL 229

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
              S  V  F  GCS L      Q  G+ G G  ++SL SQ        FSYCL      
Sbjct: 230 SGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLP----- 284

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS----SSAFGEFYYVGLRQIIVGSK 330
                        P SG S    L         V +    S     FY   L+ I VG K
Sbjct: 285 -------------PTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGK 331

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + +  S    GS      +VDSG+  T +    + A++  F   M  Y  A     +S 
Sbjct: 332 QLGLSPSVFAAGS------VVDSGTIITRLPPTAYSALSSAFKAGMKQYRSA---PARSI 382

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG- 449
           L  CFD +G+  + +P + L F GGA + L P              I++ +  A  A G 
Sbjct: 383 LDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNG------------IMYGNCLAFAATGD 430

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G   I+G+ Q + F + +D+ +   GF    C
Sbjct: 431 DGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 108/396 (27%), Positives = 173/396 (43%), Gaps = 70/396 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           + +++ FGTP Q  T  +FDTGS + W       +C+ C+  +      P F P +S++ 
Sbjct: 120 FVVTVGFGTPAQTYT-LMFDTGSDVSWI------QCLPCSG-HCYKQHDPIFDPTKSATY 171

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             + C +P+C+   G     +C      N TC      Y +QYG G  TAG+L  ETL  
Sbjct: 172 SAVPCGHPQCAAAGG-----KCS----SNGTC-----LYKVQYGDGSSTAGVLSHETLSL 217

Query: 223 PS-KTVPNFLAGC--SILSD-RQPAGIAGFGRSSESLPSQLGLKKFS---YCLLSRKFDD 275
            S + +P F  GC  + L D     G+ G GR   SL SQ      +   YCL S     
Sbjct: 218 TSARALPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN--- 274

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
              +S+  L  G  +  S + G+ YT   +        +  FY+V L  I+VG   + +P
Sbjct: 275 ---TSHGYLTIGTTTPASGSDGVRYTAMIQK-----QDYPSFYFVDLVSIVVGGFVLPVP 326

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
                P      G ++DSG+  T++    + A+   F   M  Y  A   +       C+
Sbjct: 327 -----PILFTRDGTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDP---FDTCY 378

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG------ 449
           D +G+ ++++P +  KF  G+   L P             +++F D+ A PA G      
Sbjct: 379 DFAGQNAIFMPLVSFKFSDGSSFDLSP-----------FGVLIFPDDTA-PATGCLAFVP 426

Query: 450 ---RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                P  I+G+ Q +N  + +D+A ++ GF    C
Sbjct: 427 RPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 154/376 (40%), Gaps = 59/376 (15%)

Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
           S   + DT S + W       +C+ C  P     + P + P +SS+   I C +P C  +
Sbjct: 168 SQTVVVDTSSDIPWV------QCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKEL 221

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGC 234
                 S   GCSP    C      Y++ YG G  T G  +++TL   P+  V +F  GC
Sbjct: 222 G----SSYGNGCSPTTDEC-----KYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGC 272

Query: 235 SILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTG 287
           S         Q AGI   G    SL  Q        FSYC+        P S+  +   G
Sbjct: 273 SHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCI------PKPSSAGFLSLGG 326

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
           P     K    SYTP  KN    +     FY V L  IIV  K + +P +    G+    
Sbjct: 327 PVEASLK---FSYTPLIKNKHAPT-----FYIVHLEAIIVAGKQLAVPPTAFATGA---- 374

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RAADVEKKSGLRPCFDISGKKSVYLP 406
             ++DSG+  T +   ++ A+   F   M  Y   AA V     L  C+D +    V +P
Sbjct: 375 --VMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRN---LDTCYDFTRFPDVKVP 429

Query: 407 ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYL 466
           ++ L F GGA + L P +   L G    CL      AA P  G      +G+ Q Q + +
Sbjct: 430 KVSLVFAGGATLDLEPASII-LDG----CLAF----AATP--GEESVGFIGNVQQQTYEV 478

Query: 467 EFDLANDRFGFAKQKC 482
            +D+   + GF +  C
Sbjct: 479 LYDVGGGKVGFRRGAC 494


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 129/439 (29%), Positives = 180/439 (41%), Gaps = 63/439 (14%)

Query: 57  HSLASSSLSRARHLKTKTKPKTKDSNIGSN--YSNSLIKTPLSVHSYGGYSISLSFGTPP 114
           +SL SSSL  A   K     KT   N  S+  YS +LI             +SL  GTPP
Sbjct: 44  NSLFSSSL--ASQFKQNPNTKTTSYNYRSSFKYSMALI-------------VSLPIGTPP 88

Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP-AFIPKRSSSSQLIGCQNPKC 173
           Q     + DTGS L W  C             V P   P AF P  SSS  ++ C +  C
Sbjct: 89  QTQQ-MVLDTGSQLSWIQC------------KVPPKTPPTAFDPLLSSSFSVLPCNHSLC 135

Query: 174 SWIFGPNVESRCKGCS-PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS-KTVPNFL 231
                P V       S  +N+ C     SY    G  +  G L+ E   F S +T P  +
Sbjct: 136 K----PRVPDYTLPTSCDQNRLCHY---SYFYADGT-YAEGNLVREKFTFSSSQTTPPLI 187

Query: 232 AGCSI-LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD--APVSSNLVLDTGP 288
            GC+   SD Q  GI G      S  S   + KFSYC+  R+     +P  S   L   P
Sbjct: 188 LGCATDSSDTQ--GILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGS-FYLGPNP 244

Query: 289 GSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
            S   K   L +Y    + P     A    Y + +  I +  K + I  S       G G
Sbjct: 245 SSAGFKYVNLMTYRQSQRMPNLDPLA----YTLPMLGIRINGKKLNISTSAFRADPSGAG 300

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV---Y 404
             ++DSG+ FTF+    +  V +E ++  G   +   V   S L  CFD  G   V    
Sbjct: 301 QTLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGS-LDMCFD--GDAMVIGRM 357

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           +  +  +F+ G ++ +  E   A VG  V CL +   +  G A     + I+G+F  Q+ 
Sbjct: 358 IGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVA-----SNIIGNFHQQDL 412

Query: 465 YLEFDLANDRFGFAKQKCA 483
           ++EFDL   R GF +  C+
Sbjct: 413 WVEFDLVGRRVGFGRTDCS 431


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 108/405 (26%), Positives = 170/405 (41%), Gaps = 60/405 (14%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           +  P+ + S G Y  + + GTPPQ  +  +  TG  LVW  CT    C + + P  DP++
Sbjct: 45  VAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGE-LVWTQCTPCQPCFEQDLPLFDPTK 103

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
                   SS+ + + C +  C  I  P     C      +  C    P+       G T
Sbjct: 104 --------SSTFRGLPCGSHLCESI--PESSRNCT-----SDVCIYEAPTKA-----GDT 143

Query: 212 AGLLLSETLRF-PSKTVPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLKKF 264
            G   ++T     +K    F  GC +++D++      P+GI G GR+  SL +Q+ +  F
Sbjct: 144 GGKAGTDTFAIGAAKETLGF--GCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAF 201

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLR 323
           SYCL  +        S+  L  G  +        S TPF  K   GSS      YY    
Sbjct: 202 SYCLAGK--------SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYY---- 249

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
             +V    +K   + L   S     V++D+ S  +++    ++A+ K     +G    A+
Sbjct: 250 --MVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVAS 307

Query: 384 DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
             +        +D+   K+V    PEL+  F GGA + +PP NY    GN  +CL + + 
Sbjct: 308 PPKP-------YDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSS 360

Query: 442 ---NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              N  G   G   A ILG  Q +N ++ FDL  +   F    C+
Sbjct: 361 ASLNLTGELEG---ASILGSLQQENVHVLFDLKEETLSFKPADCS 402


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 165/387 (42%), Gaps = 65/387 (16%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS ++W  C+    C   +  N+   ++ +F P  SS++  I C + +C+  F    
Sbjct: 22  IDTGSDILWVTCSPCTGCPTSSGLNI---QLESFNPDSSSTASRITCSDDRCTAGFQTG- 77

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS--------KTVPNFLA 232
           E+ C+  + ++  C      Y   YG G  T+G  +S+T+ F +         +  + + 
Sbjct: 78  EAICQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 132

Query: 233 GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
           GCS         +DR   GI GFG+   S+ SQL       K FS+CL  +  D+     
Sbjct: 133 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNG--GG 188

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            LVL      G+   PGL YTP   +           Y + L  I V  + + I  S   
Sbjct: 189 ILVL------GEIVEPGLVYTPLVPSQ--------PHYNLNLESIAVNGQKLPIDSSLFT 234

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
             +    G IVDSG+T  ++    ++         +    R+  V K S    CF  S  
Sbjct: 235 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGS---QCFITSSS 288

Query: 401 KSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
                P + L F GG  M++ PENY    A V N VL  I +  N       +G  I IL
Sbjct: 289 VDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRN-------QGQEITIL 341

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
           GD  L++    +DLAN R G+A   C+
Sbjct: 342 GDLVLKDKIFVYDLANMRMGWADYDCS 368


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 153/384 (39%), Gaps = 44/384 (11%)

Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
            G PPQ +   I DTGS+L+W  C +      C         +P +   RSS+   + C 
Sbjct: 90  IGDPPQRAAALI-DTGSNLIWTQCGT-----TCGLKACAKQDLPYYNLSRSSTFAAVPCA 143

Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN 229
           +   + +   N    C      + +C  A       YG G   G L +E   F S     
Sbjct: 144 DS--AKLCAANGVHLCG----LDGSCTFAA-----SYGAGSVFGSLGTEAFTFQSGAA-K 191

Query: 230 FLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
              GC  L+          +G+ G GR   SL SQ G  KFSYCL     +    S   V
Sbjct: 192 LGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFV 251

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL---- 339
             +   SG      ++  PF K+P      +  FYY+ L  I VG   + IP +      
Sbjct: 252 GASASLSGGGGA--VTSIPFVKSP--EDYPYSTFYYLPLVGISVGETKLPIPSAAFELRR 307

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
           V     +GGVI+D+GS  T +    + A++ E  RQ+            +GL  C     
Sbjct: 308 VAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNR--SLVQPPADTGLDLCVARQD 365

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDF 459
              V +P L+  F GGA MA+   +Y+  V     C+++            G   ++G+F
Sbjct: 366 VDKV-VPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLIEEG---------GYETVIGNF 415

Query: 460 QLQNFYLEFDLANDRFGFAKQKCA 483
           Q Q+ +L +D+      F    C+
Sbjct: 416 QQQDVHLLYDIGKGELSFQTADCS 439


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 105/388 (27%), Positives = 153/388 (39%), Gaps = 72/388 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+PP+ S   + D+GS +VW  C    +C   + P  DP+   +F     S
Sbjct: 199 GEYFVRIGVGSPPR-SQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 257

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           SS     +N  C          RC+               Y + YG G +T G L  ETL
Sbjct: 258 SSVCDRLENAGCH-------AGRCR---------------YEVSYGDGSYTKGTLALETL 295

Query: 221 RFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
            F    V +   GC   +       AG+ G G  S S   QLG +    FSYCL+S    
Sbjct: 296 TFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS---- 351

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                                   ++ P  +NP   S     FYY+GL  + VG   V I
Sbjct: 352 -----------------------AAWVPLVRNPRAPS-----FYYIGLAGLGVGGIRVPI 383

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
                     G+GGV++D+G+  T +    ++A    F+ Q  N  RA  V        C
Sbjct: 384 SEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAI---FDTC 440

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +D+ G  SV +P +   F GG  + LP  N+   + +       F  + +G +       
Sbjct: 441 YDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLS------- 493

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ILG+ Q +   + FD AN   GF    C
Sbjct: 494 ILGNIQQEGIQISFDGANGYVGFGPNIC 521


>gi|449432733|ref|XP_004134153.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 432

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 121/419 (28%), Positives = 178/419 (42%), Gaps = 73/419 (17%)

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
           P++ H  G Y   +   TP         D G   +W         VDC+   V  S    
Sbjct: 33  PVTKHPSGQYITQIRQRTP-LVPVKLTVDLGGQFMW---------VDCDRGYVSSS---- 78

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           + P R  S+Q    ++  C   F P       GC+  N TC     + ++Q     T+G 
Sbjct: 79  YKPVRCRSAQCSLSKSTSCGDCFSPPRP----GCN--NNTCGHFPGNTIIQLS---TSGE 129

Query: 215 LLSETLRFPSK---------TVPNFLAGC--SILSDRQPAGI---AGFGRSSESLPSQLG 260
           + S+ L   S          ++PNFL  C  + L +    G+   AGFGR+  SLPSQ  
Sbjct: 130 VTSDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFS 189

Query: 261 L-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG-----SGDSKTPGLSYTPFYKNPVGS 310
                 +KF+ CL           S  V+ +G G          T  L+YTP + NPV +
Sbjct: 190 AAFSFNRKFAVCL------SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVST 243

Query: 311 S--SAFGE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
           +  S  GE    Y++G++ I+  SK V I  + L   S+GNGG  + +   +T +E  ++
Sbjct: 244 AGVSTSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIY 303

Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMA 419
            A+ K   R++ N  R A V        C+      S  L       +LIL+ K      
Sbjct: 304 NALVKTITRELRNIPRVAAVAP---FGVCYKSKSFGSTRLGPGMPSIDLILQNK-KVIWR 359

Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           +   N    V  EVLCL  F D   G    R  AI++G +Q+++  LEFDLA  R GF+
Sbjct: 360 IFGANSMVQVNEEVLCL-GFVD---GGVEAR-TAIVIGAYQMEDNLLEFDLATSRLGFS 413


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 116/399 (29%), Positives = 159/399 (39%), Gaps = 60/399 (15%)

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
           L++TP        Y +    GTPPQ       DT +   W PC     C  C       S
Sbjct: 104 LLQTPT-------YVVRARLGTPPQQLL-LAVDTSNDAAWIPCAG---CAGCPT-----S 147

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
             P F P  S+S + + C +P C+    PN       C P  K C      + L Y    
Sbjct: 148 SAPPFDPAASTSYRSVPCGSPLCAQ--APNA-----ACPPGGKAC-----GFSLTYADSS 195

Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
               L  ++L      V  +  GC   +  +   P G+ G GR   S  SQ   +    F
Sbjct: 196 LQAALSQDSLAVAGDAVKTYTFGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTF 255

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           SYCL S  F     S  L L       + + P +  TP   NP  SS      YYV +  
Sbjct: 256 SYCLPS--FKSLNFSGTLRLGR-----NGQPPRIKTTPLLANPHRSS-----LYYVNMTG 303

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I VG K V IP   L        G ++DSG+ FT +  P + AV  E  R++G     A 
Sbjct: 304 IRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVG-----AP 358

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNA 443
           V    G   CF+ +   +V  P + L F  G ++ LP EN         + CL +    A
Sbjct: 359 VSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQVTLPEENVVIHSTYGTISCLAM----A 410

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           A P        ++   Q QN  + FD+ N R GFA+++C
Sbjct: 411 AAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 449


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 137/316 (43%), Gaps = 52/316 (16%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           L   S G Y + L+ GTPP   T  I DTGS L+W  C     C D           P F
Sbjct: 81  LVTASSGEYLVDLAIGTPPLYYTA-IMDTGSDLIWTQCAPCLLCAD--------QPTPYF 131

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
             K+S++ + + C++ +C+ +  P         S   K C      Y   YG    TAG+
Sbjct: 132 DVKKSATYRALPCRSSRCASLSSP---------SCFKKMC-----VYQYYYGDTASTAGV 177

Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
           L +ET  F +         N   GC  L+    A   G+ GFGR   SL SQLG  +FSY
Sbjct: 178 LANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSY 237

Query: 267 CLLSRKFDDAP------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           CL S      P      V +NL   T   SG      +  TPF  NP     A    Y++
Sbjct: 238 CLTSY-LSATPSRLYFGVYANLS-STNTSSGSP----VQSTPFVINP-----ALPNMYFL 286

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I +G+K + I         DG GGVI+DSG++ T+++   +EAV +  +  +    
Sbjct: 287 SLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI---P 343

Query: 381 RAADVEKKSGLRPCFD 396
             A  +   GL  CF 
Sbjct: 344 LTAMNDTDIGLDTCFQ 359


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 119/447 (26%), Positives = 179/447 (40%), Gaps = 73/447 (16%)

Query: 49  DSDPLKILHSLASSSLSR---ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
           D++ +K + S  S +L R    + L + T P    S IGS                  Y 
Sbjct: 94  DNERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGS----------------ANYF 137

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + +  GTP +     +FDTGS L W  C     C    +   D      F P +SSS   
Sbjct: 138 VVVGLGTPKR-DLSLVFDTGSDLTWTQCEP---CAGSCYKQQDA----IFDPSKSSSYIN 189

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF-P 223
           I C +  C+ +    ++SRC   +        AC  Y +QYG   T+ G L  E L    
Sbjct: 190 ITCTSSLCTQLTSAGIKSRCSSSTT-------ACI-YGIQYGDKSTSVGFLSQERLTITA 241

Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
           +  V +FL GC   ++      AG+ G GR   S   Q      K FSYCL        P
Sbjct: 242 TDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL--------P 293

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            +S+ +     G+  +    L YTP        S+  G+  + GL  + +     K+P  
Sbjct: 294 STSSSLGHLTFGASAATNANLKYTPL-------STISGDNTFYGLDIVGISVGGTKLPA- 345

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFD 396
            +   +   GG I+DSG+  T +    + A+   F + M  Y     V  + GL   C+D
Sbjct: 346 -VSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYP----VANEDGLFDTCYD 400

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-I 455
            SG K + +P++  +F GG  + LP          + +CL       A  A G    I I
Sbjct: 401 FSGYKEISVPKIDFEFAGGVTVELPLVGILIGRSAQQVCL-------AFAANGNDNDITI 453

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G+ Q +   + +D+   R GF    C
Sbjct: 454 FGNVQQKTLEVVYDVEGGRIGFGAAGC 480


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 116/409 (28%), Positives = 171/409 (41%), Gaps = 73/409 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP      I DTGS ++W  C S   C  C   +    ++  F P  SS
Sbjct: 76  GLYYTKVQLGTPPVEFNVQI-DTGSDVLWVSCNS---CNGCPQTSGLQIQLNFFDPGSSS 131

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +S +I C + +C+       +S    CS +N  C     SY  QYG G  T+G  +S+ +
Sbjct: 132 TSSMIACSDQRCN----NGKQSSDATCSSQNNQC-----SYTFQYGDGSGTSGYYVSDMM 182

Query: 221 R----FPSKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL- 261
                F      N  A    GCS         SDR   GI GFG+   S+ SQL   G+ 
Sbjct: 183 HLNTIFEGSMTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIA 242

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYY 319
            + FS+CL      D+     LVL      G+   P + YT      P          Y 
Sbjct: 243 PRIFSHCLKG----DSSGGGILVL------GEIVEPNIVYTSLVPAQP---------HYN 283

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           + L+ I V  + ++I  S  V  +  + G IVDSG+T  ++    ++         +   
Sbjct: 284 LNLQSISVNGQTLQIDSS--VFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQS 341

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLC 435
            R       S    C+ I+   +   P++ L F GGA M L P++Y     ++ G  V C
Sbjct: 342 VRTV----VSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWC 397

Query: 436 LILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +        G    +G  I ILGD  L++  + +DLA  R G+A   C+
Sbjct: 398 I--------GFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 438


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 126/487 (25%), Positives = 190/487 (39%), Gaps = 75/487 (15%)

Query: 12  FSLLILLFTTDAGA---GSSAATVTVPLTPLSTK--HYLHHSDSDPLKILHSLASSSLSR 66
           F L  LLF+T        + + T  + + P+ +K   ++       +  + ++AS    R
Sbjct: 10  FFLFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPER 69

Query: 67  ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
            ++L T    KT    I              V     Y + +  GTP Q     + DT +
Sbjct: 70  LKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGTPGQQMF-MVLDTSN 119

Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
              W PC+    C  C+           F+P  S++   + C   +CS +          
Sbjct: 120 DAAWVPCSG---CTGCSSTT--------FLPNASTTLGSLDCSGAQCSQV---------- 158

Query: 187 GCSPRNKTCPLACPSY-LLQYGLGFTAGL---LLSETLRFPSKTVPNFLAGC-SILSDRQ 241
               R  +CP    S  L     G  + L   L+ + +   +  +P F  GC + +S   
Sbjct: 159 ----RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSGGS 214

Query: 242 --PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
             P G+ G GR   SL SQ G      FSYCL S  F     S +L L  GP  G  K+ 
Sbjct: 215 IPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKL--GP-VGQPKS- 268

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
            +  TP  +NP   S      YYV L  + VG   V IP   LV   +   G I+DSG+ 
Sbjct: 269 -IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTV 322

Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
            T    P++ A+  EF +Q+        +        CF  + +     P + L F+ G 
Sbjct: 323 ITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAATNEAEA--PAITLHFE-GL 374

Query: 417 KMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
            + LP EN      +  L CL +    AA P        ++ + Q QN  + FD  N R 
Sbjct: 375 NLVLPMENSLIHSSSGSLACLSM----AAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRL 430

Query: 476 GFAKQKC 482
           G A++ C
Sbjct: 431 GIARELC 437


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 118/400 (29%), Positives = 163/400 (40%), Gaps = 62/400 (15%)

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
           L++TP        Y +    GTP Q       DT +   W PC+    C  C  P   P 
Sbjct: 101 LLQTPT-------YVVRARLGTPAQ-QLLLAVDTSNDAAWIPCSG---CAGC--PTSSP- 146

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
               F P  S+S + + C +P+C  +  PN       CSP  K+C      + L Y    
Sbjct: 147 ----FNPAASASYRPVPCGSPQC--VLAPN-----PSCSPNAKSC-----GFSLSYADSS 190

Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
               L  +TL      V  +  GC   +  +   P G+ G GR   S  SQ   +    F
Sbjct: 191 LQAALSQDTLAVAGDVVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATF 250

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLR 323
           SYCL S  F     S  L L      G +  P  +  TP   NP  SS      YYV + 
Sbjct: 251 SYCLPS--FKSLNFSGTLRL------GRNGQPRRIKTTPLLANPHRSS-----LYYVNMT 297

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I VG K V IP S L        G ++DSG+ FT +  P++ A+  E  R++G  + AA
Sbjct: 298 GIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVG--AGAA 355

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDN 442
            V    G   C++     +V  P + L F  G ++ LP EN           CL +    
Sbjct: 356 AVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIHTTYGTTSCLAM---- 406

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           AA P        ++   Q QN  + FD+ N R GFA++ C
Sbjct: 407 AAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 165/387 (42%), Gaps = 65/387 (16%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS ++W  C+    C   +  N+   ++ +F P  SS++  I C + +C+  F    
Sbjct: 108 IDTGSDILWVTCSPCTGCPTSSGLNI---QLESFNPDSSSTASRITCSDDRCTAGFQTG- 163

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS--------KTVPNFLA 232
           E+ C+  + ++  C      Y   YG G  T+G  +S+T+ F +         +  + + 
Sbjct: 164 EAICQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 218

Query: 233 GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
           GCS         +DR   GI GFG+   S+ SQL       K FS+CL  +  D+     
Sbjct: 219 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNG--GG 274

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            LVL      G+   PGL YTP   +           Y + L  I V  + + I  S   
Sbjct: 275 ILVL------GEIVEPGLVYTPLVPSQ--------PHYNLNLESIAVNGQKLPIDSSLFT 320

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
             +    G IVDSG+T  ++    ++         +    R+  V K S    CF  S  
Sbjct: 321 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGS---QCFITSSS 374

Query: 401 KSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
                P + L F GG  M++ PENY    A V N VL  I +  N       +G  I IL
Sbjct: 375 VDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRN-------QGQEITIL 427

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
           GD  L++    +DLAN R G+A   C+
Sbjct: 428 GDLVLKDKIFVYDLANMRMGWADYDCS 454


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 165/387 (42%), Gaps = 65/387 (16%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS ++W  C+    C   +  N+   ++ +F P  SS++  I C + +C+  F    
Sbjct: 106 IDTGSDILWVTCSPCTGCPTSSGLNI---QLESFNPDSSSTASRITCSDDRCTAGFQTG- 161

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS--------KTVPNFLA 232
           E+ C+  + ++  C      Y   YG G  T+G  +S+T+ F +         +  + + 
Sbjct: 162 EAICQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 216

Query: 233 GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
           GCS         +DR   GI GFG+   S+ SQL       K FS+CL  +  D+     
Sbjct: 217 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNG--GG 272

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            LVL      G+   PGL YTP   +           Y + L  I V  + + I  S   
Sbjct: 273 ILVL------GEIVEPGLVYTPLVPSQ--------PHYNLNLESIAVNGQKLPIDSSLFT 318

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
             +    G IVDSG+T  ++    ++         +    R+  V K S    CF  S  
Sbjct: 319 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGS---QCFITSSS 372

Query: 401 KSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
                P + L F GG  M++ PENY    A V N VL  I +  N       +G  I IL
Sbjct: 373 VDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRN-------QGQEITIL 425

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
           GD  L++    +DLAN R G+A   C+
Sbjct: 426 GDLVLKDKIFVYDLANMRMGWADYDCS 452


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 133/484 (27%), Positives = 189/484 (39%), Gaps = 83/484 (17%)

Query: 31  TVTVPLTPLSTKHYLHHSDSD-PLKILHSLASSSLSRARH---LKTKTKPKTK------- 79
           TVT  L   +  H+   S S   L++LH     S++   H   L  + +  T        
Sbjct: 38  TVTATLPDFNNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILR 97

Query: 80  ----------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
                     DS    N   S I + +   S G Y + +  G+PP+     + D+GS +V
Sbjct: 98  RISGKVIPSSDSRYEVNDFGSDIVSGMDQGS-GEYFVRIGVGSPPRDQY-MVIDSGSDMV 155

Query: 130 WF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
           W    PC   Y+  D           P F P +S S   + C +  C  I          
Sbjct: 156 WVQCQPCKLCYKQSD-----------PVFDPAKSGSYTGVSCGSSVCDRI---------- 194

Query: 187 GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQP 242
                N  C      Y + YG G +T G L  ETL F    V N   GC   +       
Sbjct: 195 ----ENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGA 250

Query: 243 AGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GL 298
           AG+ G G  S S   QL  +    F YCL+SR  D    + +LV       G    P G 
Sbjct: 251 AGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS---TGSLVF------GREALPVGA 301

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
           S+ P  +NP   S     FYYVGL+ + VG   + +P         G+GGV++D+G+  T
Sbjct: 302 SWVPLVRNPRAPS-----FYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVT 356

Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
            +    + A    F  Q  N  RA+ V   S    C+D+SG  SV +P +   F  G  +
Sbjct: 357 RLPTAAYVAFRDGFKSQTANLPRASGV---SIFDTCYDLSGFVSVRVPTVSFYFTEGPVL 413

Query: 419 ALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            LP  N+   V +       F  +  G +       I+G+ Q +   + FD AN   GF 
Sbjct: 414 TLPARNFLMPVDDSGTYCFAFAASPTGLS-------IIGNIQQEGIQVSFDGANGFVGFG 466

Query: 479 KQKC 482
              C
Sbjct: 467 PNVC 470


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 76/224 (33%), Positives = 104/224 (46%), Gaps = 31/224 (13%)

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           KFSYCL S   DD+  S  L+     GS    T     TP   NP   S     FYY+ L
Sbjct: 5   KFSYCLTS--MDDSKASVLLL-----GSLAKATKDAISTPLLTNPSQPS-----FYYLSL 52

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             I VG   + I  S      DG+GGVI+DSG+T T++E  +F+ + KEFI Q    +  
Sbjct: 53  EGIPVGGTQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQ---SNLQ 109

Query: 383 ADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLIL 438
            D    +GL  CF + S    V +P+L+  FKGG  + LP E+Y  ++ +    V CL +
Sbjct: 110 LDKSSSTGLDVCFSLPSETTQVEVPKLVFHFKGG-DLELPAESY--MIADSKLGVACLAM 166

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              N            I G+ Q QN  +  DL  +   F   +C
Sbjct: 167 GASNGMS---------IFGNVQQQNILVNHDLEKETISFVPTQC 201


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 126/470 (26%), Positives = 185/470 (39%), Gaps = 74/470 (15%)

Query: 31  TVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNS 90
           TV+VPL          H    P ++     SS   R R  + ++K      + G    ++
Sbjct: 55  TVSVPLVH-------RHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDA 107

Query: 91  LIKTPL----SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
            +  P     SV S   Y +++  GTP   S   + DTGS L W       +C  CN   
Sbjct: 108 DVSIPTHLGGSVDSLE-YVVTVGLGTP-SVSQVLLIDTGSDLSWV------QCQPCNSTT 159

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
             P + P F P +SS+   I C    C  +     +    GC+  +     A   + + Y
Sbjct: 160 CYPQKDPLFDPSKSSTYAPIPCNTDACRDL---TDDGYGGGCASGDGA---AQCGFAITY 213

Query: 207 GLGF-TAGLLLSETLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL 261
           G G  T G+  +ETL   P   V +F  GC    D    +  G+ G G + ESL  Q   
Sbjct: 214 GDGSQTRGVYSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTAS 273

Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS------KTPGLSYTPFYKNPVGSSS 312
                FSYCL        P  +N V     G G +       T G  +TP  +       
Sbjct: 274 VYGGAFSYCL--------PALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEE---- 321

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FY V +  I VG + + +P S        +GG+I+DSG+  T ++   + A+   F
Sbjct: 322 ---TFYVVNMTGITVGGEPIDVPPSAF------SGGMIIDSGTVVTELQHTAYNALQAAF 372

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
            + M  Y    + E    L  C+D SG  +V LP++ L F GGA + L        V N 
Sbjct: 373 RKAMAAYPLVRNGE----LDTCYDFSGYSNVTLPKVALTFSGGATIDLD-------VPNG 421

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +L         +GP    G   ILG+   +   + +D    R GF    C
Sbjct: 422 ILLDDCLAFQESGPDDQPG---ILGNVNQRTLEVLYDAGRGRVGFRAAVC 468


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 127/460 (27%), Positives = 179/460 (38%), Gaps = 73/460 (15%)

Query: 38  PLSTKHYLHHSDSDPLKILHSLASSSLSRA----RHLKTKTKPKTKDSNIGSNYSNSLIK 93
           P  T    HH        LH+       R     R +  K    + DS    N   S + 
Sbjct: 70  PSVTYRNHHHR-------LHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGSDVV 122

Query: 94  TPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPS 150
           + +   S G Y + +  G+PP+     + D+GS +VW    PC   Y+  D         
Sbjct: 123 SGMDQGS-GEYFVRIGVGSPPRDQY-MVIDSGSDMVWVQCQPCKLCYKQSD--------- 171

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG- 209
             P F P +S S   + C +  C  I               N  C      Y + YG G 
Sbjct: 172 --PVFDPAKSGSYTGVSCGSSVCDRI--------------ENSGCHSGGCRYEVMYGDGS 215

Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK--- 263
           +T G L  ETL F    V N   GC   +       AG+ G G  S S   QL  +    
Sbjct: 216 YTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGA 275

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGL 322
           F YCL+SR  D    + +LV       G    P G S+ P  +NP   S     FYYVGL
Sbjct: 276 FGYCLVSRGTDS---TGSLVF------GREALPVGASWVPLVRNPRAPS-----FYYVGL 321

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           + + VG   + +P         G+GGV++D+G+  T +    + A    F  Q  N  RA
Sbjct: 322 KGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRA 381

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
           + V   S    C+D+SG  SV +P +   F  G  + LP  N+   V +       F  +
Sbjct: 382 SGV---SIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 438

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             G +       I+G+ Q +   + FD AN   GF    C
Sbjct: 439 PTGLS-------IIGNIQQEGIQVSFDGANGFVGFGPNVC 471


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 118/400 (29%), Positives = 163/400 (40%), Gaps = 62/400 (15%)

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
           L++TP        Y +    GTP Q       DT +   W PC+    C  C  P   P 
Sbjct: 48  LLQTPT-------YVVRARLGTPAQ-QLLLAVDTSNDAAWIPCSG---CAGC--PTSSP- 93

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
               F P  S+S + + C +P+C  +  PN       CSP  K+C      + L Y    
Sbjct: 94  ----FNPAASASYRPVPCGSPQC--VLAPN-----PSCSPNAKSC-----GFSLSYADSS 137

Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
               L  +TL      V  +  GC   +  +   P G+ G GR   S  SQ   +    F
Sbjct: 138 LQAALSQDTLAVAGDVVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATF 197

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLR 323
           SYCL S  F     S  L L      G +  P  +  TP   NP  SS      YYV + 
Sbjct: 198 SYCLPS--FKSLNFSGTLRL------GRNGQPRRIKTTPLLANPHRSS-----LYYVNMT 244

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I VG K V IP S L        G ++DSG+ FT +  P++ A+  E  R++G  + AA
Sbjct: 245 GIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVG--AGAA 302

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDN 442
            V    G   C++     +V  P + L F  G ++ LP EN           CL +    
Sbjct: 303 AVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIHTTYGTTSCLAM---- 353

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           AA P        ++   Q QN  + FD+ N R GFA++ C
Sbjct: 354 AAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/413 (26%), Positives = 169/413 (40%), Gaps = 64/413 (15%)

Query: 85  SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           ++ S + I++P+ +   G + +S+  GTPP  +   I DTGS L W  C     C +   
Sbjct: 72  TSVSTACIRSPI-IPDSGEFLMSIFIGTPP-VNVIAIADTGSDLTWTQCLPCRECFN--- 126

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
                   P F P+RSSS + + C +  C      ++ES    C P  ++C     SY  
Sbjct: 127 -----QSQPIFNPRRSSSYRKVSCASDTCR-----SLES--YHCGPDLQSC-----SYGY 169

Query: 205 QYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSS--------ESL 255
            YG   FT G L S+ +   S  +P  + GC   +     G+                 +
Sbjct: 170 SYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQM 229

Query: 256 PSQLGLK-KFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSS 311
            +  G+K +FSYCL +  F +A ++  +         G     TP +  +P         
Sbjct: 230 RTIAGVKPRFSYCLPTF-FSNANITGTISFGRKAVVSGRQVVSTPLVPRSP--------- 279

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                FY++ L  I VG K  K      +     +G +I+DSG+T T +   L+  V   
Sbjct: 280 ---DTFYFLTLEAISVGKKRFKAANG--ISAMTNHGNIIIDSGTTLTLLPRSLYYGVFST 334

Query: 372 FIRQMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
             R +    +A  V+  SG L  C+       + +P +   F GGA + L P N FA V 
Sbjct: 335 LARVI----KAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVA 390

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           + V CL         PA       I G+    NF + +DL N R  F  + CA
Sbjct: 391 DNVTCLTF------APAT---QVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 161/387 (41%), Gaps = 52/387 (13%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           + + GTPPQ ++  I D    LVW  C+   RC            +P F+P  SS+ +  
Sbjct: 70  NFTIGTPPQPASAII-DVAGELVWTQCSMCSRCFK--------QDLPLFVPNASSTFRPE 120

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
            C    C  I   N       CS    T      S L     G T G++ ++T    + T
Sbjct: 121 PCGTDACKSIPTSN-------CSSNMCTYEGTINSKLG----GHTLGIVATDTFAIGTAT 169

Query: 227 VPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
             +   GC + S       P+G+ G GR+  SL SQ+ + KFSYCL      D+  +S L
Sbjct: 170 A-SLGFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPH---DSGKNSRL 225

Query: 283 VLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           +L +     G G+S T     TPF K   G      ++Y + L  I  G   + +P S  
Sbjct: 226 LLGSSAKLAGGGNSTT-----TPFVKTSPGDD--MSQYYPIQLDGIKAGDAAIALPPS-- 276

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
                GN  V+V + +  +F+    ++A+ KE  + +G    A  ++       CF  +G
Sbjct: 277 -----GN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQP---FDLCFPKAG 327

Query: 400 KKSVYLPELILKF-KGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIIL 456
             +   P+L+  F +G A + +PP  Y   VG E   +C+ + + +            IL
Sbjct: 328 LSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNIL 387

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G  Q +N +   DL      F    C+
Sbjct: 388 GSLQQENTHFLLDLEKKTLSFEPADCS 414


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 120/409 (29%), Positives = 169/409 (41%), Gaps = 80/409 (19%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPA 154
           V S G Y ++L  GTPP      I DTGS L W    PCT  Y+ V           +P 
Sbjct: 86  VPSAGEYLMNLYIGTPPVPVIA-IVDTGSDLTWTQCRPCTHCYKQV-----------VPL 133

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
           F PK SS+ +   C    C  +       + + CS + K C     ++   Y  G FT G
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLAL------GKDRSCS-KEKKC-----TFRYSYADGSFTGG 181

Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQL----- 259
            L SETL   S      + P F  GC   S    D+  +GI G G    SL SQL     
Sbjct: 182 NLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTIN 241

Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
           GL  FSYCLL     D+ +SS +         G G   TP +  +P              
Sbjct: 242 GL--FSYCLLPVS-TDSSISSRINFGASGRVSGYGTVSTPLVQKSP------------DT 286

Query: 317 FYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
           FYY+ L  I VG K  ++PY  Y        G +IVDSG+T+TF+    +  + K     
Sbjct: 287 FYYLTLEGISVGKK--RLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEK----S 340

Query: 376 MGNYSRAADVEKKSGL-RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
           + N  +   V   +G+   C++ + +  +  P +   FK  A + L P N F  +  +++
Sbjct: 341 VANSIKGKRVRDPNGIFSLCYNTTAE--INAPIITAHFK-DANVELQPLNTFMRMQEDLV 397

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           C  +   +  G         +LG+    NF + FDL   R  F    C 
Sbjct: 398 CFTVAPTSDIG---------VLGNLAQVNFLVGFDLRKKRVSFKAADCT 437


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 119/397 (29%), Positives = 169/397 (42%), Gaps = 82/397 (20%)

Query: 97  SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           SVH S   Y + ++ GTPP   T  + DTGS L+W  C +   C  C FP   P+  P +
Sbjct: 84  SVHASTATYLVDIAIGTPPLPLTA-VLDTGSDLIWTQCDAP--CRRC-FPQ--PA--PLY 135

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
            P RS++   + C++P C  +  P   SR   CSP +  C     +Y   YG G  T G+
Sbjct: 136 APARSATYANVSCRSPMCQALQSP--WSR---CSPPDTGC-----AYYFSYGDGTSTDGV 185

Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
           L +ET    S T    +A GC   ++ S    +G+ G GR   SL SQLG+ +       
Sbjct: 186 LATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTR------- 238

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
                 P  S                     P   +P              L  I VG  
Sbjct: 239 ------PRRSCRA---------RAAARGGGAPTTTSP--------------LEGITVGDT 269

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  +       G+GGVI+DSG+TFT +E   F A+A+    ++     A+      G
Sbjct: 270 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRV-RLPLASGAHL--G 326

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA-LG 449
           L  CF  +  ++V +P L+L F  GA M L  E+Y            +  D +AG A LG
Sbjct: 327 LSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY------------VVEDRSAGVACLG 373

Query: 450 ----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               RG + +LG  Q QN ++ +DL      F   KC
Sbjct: 374 MVSARGMS-VLGSMQQQNTHILYDLERGILSFEPAKC 409


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 120/446 (26%), Positives = 175/446 (39%), Gaps = 74/446 (16%)

Query: 49  DSDPLKILHSLASSSL---SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
           D++ +K + S  S +L   +R + L + T P      IGS                  Y 
Sbjct: 98  DNERVKYIQSRLSKNLGGENRVKELDSTTLPAKSGRLIGS----------------ADYY 141

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + +  GTP +     IFDTGS L W  C     C    +   DP     F P +SSS   
Sbjct: 142 VVVGLGTPKR-DLSLIFDTGSYLTWTQCEP---CAGSCYKQQDP----IFDPSKSSSYTN 193

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-P 223
           I C +  C+         R  GCS       +    Y ++YG    + G L  E L    
Sbjct: 194 IKCTSSLCTQF-------RSAGCSSSTDASCI----YDVKYGDNSISRGFLSQERLTITA 242

Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           +  V +FL GC   ++   R  AG+ G  R   S   Q   +  K FSYCL        P
Sbjct: 243 TDIVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCL--------P 294

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            + + +     G+  +    L YTPF       S+  GE  + GL  + +     K+P  
Sbjct: 295 STPSSLGHLTFGASAATNANLKYTPF-------STISGENSFYGLDIVGISVGGTKLPA- 346

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            +   +   GG I+DSG+  T +    + A+   F + M  Y  A        L  C+D 
Sbjct: 347 -VSSSTFSAGGSIIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRL---LDTCYDF 402

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
           SG K + +P +  +F GG K+ LP          + LCL       A  A G G  I I 
Sbjct: 403 SGYKEISVPRIDFEFAGGVKVELPLVGILYGESAQQLCL-------AFAANGNGNDITIF 455

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
           G+ Q +   + +D+   R GF    C
Sbjct: 456 GNVQQKTLEVVYDVEGGRIGFGAAGC 481


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 168/409 (41%), Gaps = 62/409 (15%)

Query: 82  NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
            + +   ++ + T + V ++  Y +++S GTP  + T  + DTGS + W       +C  
Sbjct: 122 QLATGSRSATVPTTMGVGTFQ-YVVTVSLGTPGVSQTVEV-DTGSDVSWV------QCKP 173

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
           C+ P  +  R   F P +SS+   + C    CS +     E+ C G       C      
Sbjct: 174 CSAPACNSQRDQLFDPAKSSTYSAVPCGADACSEL--RIYEAGCSG-----SQC-----G 221

Query: 202 YLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGF---GRSSESLP 256
           Y++ YG G  T G+  S+TL   P  TV  FL GC        AGI G    GR S SL 
Sbjct: 222 YVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLK 281

Query: 257 SQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
           SQ        FSYCL S++      +  L L  GP          S + F    + ++ A
Sbjct: 282 SQAAGAYGGVFSYCLPSKQ----SAAGYLTLG-GP---------TSASGFATTGLLTAWA 327

Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
              FY V L  I VG + V +P S         GG +VD+G+  T +    + A+   F 
Sbjct: 328 APTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFR 381

Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
             +  Y   +       L  C+D S    V LP + L F GGA +AL             
Sbjct: 382 GAIAPYGYPS-APANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL-----SS 435

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            CL      A  P  G G A ILG+ Q ++F + FD      GF    C
Sbjct: 436 GCL------AFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 109/406 (26%), Positives = 161/406 (39%), Gaps = 75/406 (18%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +++  GTP +  T  +FDTGS L W  C     C D  +   +P     F P +SS+ 
Sbjct: 126 YVVTIGIGTPARNFT-VLFDTGSDLTWVQCKP---CTDSCYQQQEP----LFDPSKSSTY 177

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C  P+C    G   +  C G      TC      Y ++YG    T G L  E    
Sbjct: 178 VDVPCGTPQCK--IGGGQDLTCGG-----TTC-----EYSVKYGDQSVTRGNLAQEAFTL 225

Query: 223 PSKTVP--NFLAGCS---------ILSDRQPAGIAGFGRSSESLPSQLGLKK----FSYC 267
                P    + GCS            +   AG+ G GR   S+ SQ         FSYC
Sbjct: 226 SPSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYC 285

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
           L  R       SS   L  G  +       LS+TP     V  +S     Y V L  I V
Sbjct: 286 LPPRG------SSAGYLTIGAAA--PPQSNLSFTPL----VTDNSQLSSVYVVNLVGISV 333

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
               + I  S    G+      ++DSG+  T M    +  +  EF R MG Y+   +   
Sbjct: 334 SGAALPIDASAFYIGT------VIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHV 387

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
           +S L  C+D++G   V  P + L+F GGA++ +               L++F  +A+G +
Sbjct: 388 ES-LDTCYDVTGHDVVTAPPVALEFGGGARIDVDASG----------ILLVFAVDASGQS 436

Query: 448 LGRG---------PA-IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           L            P  +I+G+ Q + + + FD+   R GF    C+
Sbjct: 437 LTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGCS 482


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 117/414 (28%), Positives = 164/414 (39%), Gaps = 69/414 (16%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           ++ ++ GTPPQ  T  + DTGS L W  C   Y           P   PAF    SSS  
Sbjct: 56  TVPVAVGTPPQNVT-MVLDTGSELSWLLCNGSYA----------PPLTPAFNASGSSSYG 104

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCP--------------LACPSYLLQYGLGF 210
            + C +  C W  G ++       +P +  C               LA  ++LL  G   
Sbjct: 105 AVPCPSTACEW-RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPP 163

Query: 211 TA-GLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
            A G        + S T  N     + +S+    G+ G  R + S  +Q G ++F+YC+ 
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGTRRFAYCI- 221

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQI 325
                 AP     VL  G   G    P L+YTP  +     S     F    Y V L  I
Sbjct: 222 ------APGEGPGVLLLGDDGG--VAPPLNYTPLIE----ISQPLPYFDRVAYSVQLEGI 269

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   + IP S L P   G G  +VDSG+ FTF+    + A+  EF  Q      A   
Sbjct: 270 RVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG 328

Query: 386 EKKSGLRPCFDIS--------GKKSVYLPELILKFKGGAKMALPPENYFALVGNE----- 432
           E     +  FD             S  LPE+ L  + GA++A+  E    +V  E     
Sbjct: 329 EPGFVFQGAFDACFRGPEARVAAASGLLPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEG 387

Query: 433 ----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               V CL     + AG +     A ++G    QN ++E+DL N R GFA  +C
Sbjct: 388 GAEAVWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 122/390 (31%), Positives = 171/390 (43%), Gaps = 71/390 (18%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P  A T  I DTGS + W  C     C  C+    D      F P  SS+ 
Sbjct: 127 YLITVGMGSPAVAQTMLI-DTGSDVSWVQCKP---CSQCH-SQADS----LFDPSSSSTY 177

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRF 222
               C +  C+ +       R +GCS  +  C      Y ++YG G T +G   S+TL  
Sbjct: 178 SAFSCTSAACAQL-------RQRGCS--SSQC-----QYTVKYGDGSTGSGTYSSDTLAL 223

Query: 223 PSKTVPNFLAGCS------ILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
            S TV NF  GCS      +L D Q AG+ G G  +ESL +Q      K FSYCL     
Sbjct: 224 GSSTVENFQFGCSQSESGNLLQD-QTAGLMGLGGGAESLATQTAGTFGKAFSYCL----- 277

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
              P          PGS    T G S + F  K P+  S+    +Y V L+ I VG + +
Sbjct: 278 --PPT---------PGSSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQL 326

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            IP S    GS      I+DSG+  T +    + A++  F   M  Y  A   +      
Sbjct: 327 NIPASAFSAGS------IMDSGTIITRLPRTAYSALSSAFKAGMKQYPPA---QPMGIFD 377

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            CFD SG+ SV +P + L F GGA + L  +    ++G+   CL  F  N+   +LG   
Sbjct: 378 TCFDFSGQSSVSIPTVALVFSGGAVVDLASDGI--ILGS---CLA-FAANSDDTSLG--- 428

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             I+G+ Q + F + +D+     GF    C
Sbjct: 429 --IIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 170/388 (43%), Gaps = 51/388 (13%)

Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
           ++ G   Q ST  I DTGS L W  C     C +         + P F P  SSS   + 
Sbjct: 68  VTVGIGGQNST-LIVDTGSDLTWVQCLPCRLCYN--------QQEPLFNPSNSSSFLSLP 118

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT 226
           C +P C  +  P   S    CS +N T   +C  Y + YG G ++ G L  E L      
Sbjct: 119 CNSPTCVAL-QPTAGSSGL-CSNKNST---SC-DYQIDYGDGSYSRGELGFEKLTLGKTE 172

Query: 227 VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSS 280
           + NF+ GC   +       +G+ G  RS  SL SQ   L    FSYCL +        S 
Sbjct: 173 IDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGS---SG 229

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
           +L L     S       +SYT   +NP  S+     FY++ L  I +G  ++       V
Sbjct: 230 SLTLGGADFSNFKNISPISYTRMIQNPQMSN-----FYFLNLTGISIGGVNLN------V 278

Query: 341 PGSDGNGGV--IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
           P    N GV  ++DSG+  T +   +++A   EF +Q   Y         S L  CF+++
Sbjct: 279 PRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGF---SILNTCFNLT 335

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALG-RGPAII 455
           G + V +P +   F+G A+M +  E  F  V ++   +CL       A  +LG     +I
Sbjct: 336 GYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICL-------AFASLGYEDQTMI 388

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +G++Q +N  + ++    + GFA + C+
Sbjct: 389 IGNYQQKNQRVIYNSKESKVGFAGEPCS 416


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 170/388 (43%), Gaps = 51/388 (13%)

Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
           ++ G   Q ST  I DTGS L W  C     C +         + P F P  SSS   + 
Sbjct: 147 VTVGIGGQNST-LIVDTGSDLTWVQCLPCRLCYN--------QQEPLFNPSNSSSFLSLP 197

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT 226
           C +P C  +  P   S    CS +N T   +C  Y + YG G ++ G L  E L      
Sbjct: 198 CNSPTCVAL-QPTAGSSGL-CSNKNST---SC-DYQIDYGDGSYSRGELGFEKLTLGKTE 251

Query: 227 VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSS 280
           + NF+ GC   +       +G+ G  RS  SL SQ   L    FSYCL +        S 
Sbjct: 252 IDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGS---SG 308

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
           +L L     S       +SYT   +NP  S+     FY++ L  I +G  ++       V
Sbjct: 309 SLTLGGADFSNFKNISPISYTRMIQNPQMSN-----FYFLNLTGISIGGVNLN------V 357

Query: 341 PGSDGNGGV--IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
           P    N GV  ++DSG+  T +   +++A   EF +Q   Y         S L  CF+++
Sbjct: 358 PRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGF---SILNTCFNLT 414

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALG-RGPAII 455
           G + V +P +   F+G A+M +  E  F  V ++   +CL       A  +LG     +I
Sbjct: 415 GYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICL-------AFASLGYEDQTMI 467

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +G++Q +N  + ++    + GFA + C+
Sbjct: 468 IGNYQQKNQRVIYNSKESKVGFAGEPCS 495


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 115/403 (28%), Positives = 167/403 (41%), Gaps = 63/403 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+  GTP +  T  +FDTGS L W       +C  C+       + P F P  SS
Sbjct: 83  GNYVVSVGLGTPARDLT-VVFDTGSDLSWV------QCGPCSSGGCYHQQDPLFAPSSSS 135

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   + C  P+C     P     C   SP +  CP     Y + YG    T G L ++TL
Sbjct: 136 TFSAVRCGEPEC-----PRARQSCSS-SPGDDRCP-----YEVVYGDKSRTVGHLGNDTL 184

Query: 221 RF-----------PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---K 263
                         S  +P F+ GC   +     +  G+ G GR   SL SQ   K    
Sbjct: 185 TLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEG 244

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL S        SSN     G  S  +  P  ++  F   P+ + S    FYYV L 
Sbjct: 245 FSYCLPSS-------SSNA---HGYLSLGTPAPAPAHARF--TPMLNRSNTPSFYYVKLV 292

Query: 324 QIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
            I V  + +K+     L P      G+IVDSG+  T +    + A+   F+  MG Y   
Sbjct: 293 GIRVAGRAIKVSSRPALWPA-----GLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGY- 346

Query: 383 ADVEKKSGLRPCFDIS--GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
               + S L  C+D +     +V +P + L F GGA +++       +      CL  F 
Sbjct: 347 KRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLA-FA 405

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            N  G + G     ILG+ Q +   + +D+   + GFA + C+
Sbjct: 406 PNGNGRSAG-----ILGNTQQRTVAVVYDVGRQKIGFAAKGCS 443


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 161/395 (40%), Gaps = 61/395 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + L  GTPP+     I DTGSSL W  C     C        DP     + P  S 
Sbjct: 123 GNYYVKLGLGTPPKYYA-MILDTGSSLSWLQCQP---CAVYCHAQADP----LYDPSVSK 174

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + + + C + +CS +    +        P  +T   AC  Y   YG   F+ G L  + L
Sbjct: 175 TYKKLSCASVECSRLKAATLND------PLCETDSNAC-LYTASYGDTSFSIGYLSQDLL 227

Query: 221 RFPS-KTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
              S +T+P F  GC    D Q      AGI G  R   S+ +QL  K    FSYCL + 
Sbjct: 228 TLTSSQTLPQFTYGCG--QDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTA 285

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY---KNPVGSSSAFGEFYYVGLRQIIVG 328
               +      +    P S         +TP     KNP          Y++ L  I V 
Sbjct: 286 NSGSSGGGFLSIGSISPTS-------YKFTPMLTDSKNP--------SLYFLRLTAITVS 330

Query: 329 SKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
            + + +  + Y VP        ++DSG+  T +   ++ A+ + F++ M   ++ A    
Sbjct: 331 GRPLDLAAAMYRVP-------TLIDSGTVITRLPMSMYAALRQAFVKIMS--TKYAKAPA 381

Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
            S L  CF  S K    +PE+ + F+GGA + L   +        + CL       AG +
Sbjct: 382 YSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAF-----AGSS 436

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G     I+G+ Q Q + + +D++  R GFA   C
Sbjct: 437 -GTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 113/393 (28%), Positives = 166/393 (42%), Gaps = 59/393 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTP   +   + D+GSSL W       +C  C   +  P   P + P+ SS
Sbjct: 106 GNYITRLGLGTP-TTTYVMVVDSGSSLTWL------QCAPCAV-SCHPQAGPLYDPRASS 157

Query: 162 SSQLIGCQNPKCSWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           +   + C  P+C+ +    +  S C G    +  C      Y   YG G F+ G L  +T
Sbjct: 158 TYAAVPCSAPQCAELQAATLNPSSCSG----SGVC-----QYQASYGDGSFSFGYLSKDT 208

Query: 220 LRFPSK-TVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
           +   S  + P F  GC   ++    + AG+ G  R+  SL SQL       F+YCL +  
Sbjct: 209 VSLSSSGSFPGFYYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSA 268

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                 +S   L  G  S D+K PG  SYT        SSS     Y+V L  + V    
Sbjct: 269 -----AASAGYLSFGSNS-DNKNPGKYSYTSMV-----SSSLDASLYFVSLAGMSVAGSP 317

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + +P S       G+   I+DSG+  T +  P++ A++K     +G    A      S L
Sbjct: 318 LAVPSSEY-----GSLPTIIDSGTVITRLPTPVYTALSKA----VGAALAAPSAPAYSIL 368

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDNAAGPALGR 450
           + CF     K + +P + + F GGA + L P N    V     CL    TD+ A      
Sbjct: 369 QTCFKGQVAK-LPVPAVNMAFAGGATLRLTPGNVLVDVNETTTCLAFAPTDSTA------ 421

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               I+G+ Q Q F + +D+   R GFA   C+
Sbjct: 422 ----IIGNTQQQTFSVVYDVKGSRIGFAAGGCS 450


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 113/441 (25%), Positives = 177/441 (40%), Gaps = 85/441 (19%)

Query: 51  DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS----VHSYGGYSI 106
           D  +I+  L S     A  L T+ KPK K      N +N  +  P++    + S   Y  
Sbjct: 57  DTARIVSMLTSG----AGPLTTRAKPKPK------NRANPPV--PIAPGRQILSIPNYIA 104

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
               GTP Q +     D  +   W PC++   C  C       +  P+F P +SS+ + +
Sbjct: 105 RAGLGTPAQ-TLLVAIDPSNDAAWVPCSA---CAGCA------ASSPSFSPTQSSTYRTV 154

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS---YLLQYGLGFTAGLLLSETLRFP 223
            C +P+C+ +  P              +CP    S   + L Y       +L  ++L   
Sbjct: 155 PCGSPQCAQVPSP--------------SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALE 200

Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
           +  V ++  GC  + +      AG  R          L+  +  L             LV
Sbjct: 201 NNVVVSYTFGCLRVVNGNSRAAAGAHR----------LRPRAALL-------------LV 237

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
            D G      +   +  TP   NP   S      YYV +  I VGSK V++P S L    
Sbjct: 238 ADQGHLGPIGQPKRIKTTPLLYNPHRPS-----LYYVNMIGIRVGSKVVQVPQSALAFNP 292

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
               G I+D+G+ FT +  P++ AV   F  ++    R        G   C+++    +V
Sbjct: 293 VTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCYNV----TV 344

Query: 404 YLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAI-ILGDFQL 461
            +P +   F G   + LP EN      +  V CL +    AAGP+ G   A+ +L   Q 
Sbjct: 345 SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM----AAGPSDGVNAALNVLASMQQ 400

Query: 462 QNFYLEFDLANDRFGFAKQKC 482
           QN  + FD+AN R GF+++ C
Sbjct: 401 QNQRVLFDVANGRVGFSRELC 421


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 116/407 (28%), Positives = 170/407 (41%), Gaps = 75/407 (18%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKRSS 161
           Y   L  G+PP+     I DTGS ++W  C+S   C  C  P      IP   F P  S 
Sbjct: 90  YYTRLQLGSPPRDFYVQI-DTGSDVLWVSCSS---CNGC--PVSSGLHIPLNFFDPGSSP 143

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++ LI C + +CS      ++S    C+ +N  C      Y  QYG G  T+G  +S+ L
Sbjct: 144 TASLISCSDQRCSL----GLQSSDSVCAAQNNQC-----GYTFQYGDGSGTSGYYVSDLL 194

Query: 221 RFPS--------KTVPNFLAGCSILS-------DRQPAGIAGFGRSSESLPSQLGL---- 261
            F +         +    + GCS L        DR   GI GFG+   S+ SQL      
Sbjct: 195 HFDTILGGSVMKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGIT 254

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL   K DD+     LVL      G+   P + YTP   +           Y +
Sbjct: 255 PRVFSHCL---KGDDSG-GGILVL------GEIVEPNIVYTPLVPSQ--------PHYNL 296

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I V  + + I  S     S  N G I+DSG+T  ++     EA    FI  + +  
Sbjct: 297 NLQSIYVNGQTLAIDPSVFATSS--NQGTIIDSGTTLAYLT----EAAYDPFISAITSTV 350

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCL 436
             +     S    C+  S   +   P++ L F GG  M L P++Y     ++ G  + C+
Sbjct: 351 SPSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCV 410

Query: 437 ILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                   G    +G  I ILGD  L++    +D+A  R G+A   C
Sbjct: 411 --------GFQKIQGQEITILGDLVLKDKIFVYDIAGQRIGWANYDC 449


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 106/401 (26%), Positives = 165/401 (41%), Gaps = 75/401 (18%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTPP      I DTGS L+W  C    +CV        P   P F P++SS+ 
Sbjct: 92  YLMRFYIGTPPVERFA-IADTGSDLIWVQCAPCEKCV--------PQNAPLFDPRKSSTF 142

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTAGLLLSETL 220
           + + C +  C+ +             P  + C         QY  G     +G+L  E++
Sbjct: 143 KTVPCDSQPCTLL------------PPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESI 190

Query: 221 RFPSKT----VPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYC 267
            F SK      P    GC      ++   ++  G+ G G    SL SQLG    +KFSYC
Sbjct: 191 NFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYC 250

Query: 268 LLSRKFDDAPVSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
                    P+SSN    +  G  +   +  G+  TP     +G S     +YY+ L  +
Sbjct: 251 F-------PPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPS-----YYYLNLEGV 298

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL---FEAVAKEFIRQMGNYSRA 382
            +G+K VK   S     +DGN  +++DSG++FT ++      F A+ KE       Y   
Sbjct: 299 SIGNKKVKTSES----QTDGN--ILIDSGTSFTILKQSFYNKFVALVKEV------YGVE 346

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
           A          CF+  GK+  + P+++  F  GAK+ +   N F    N +LC++    +
Sbjct: 347 AVKIPPLVYNFCFENKGKRKRF-PDVVFLFT-GAKVRVDASNLFEAEDNNLLCMVALPTS 404

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               +       I G+     + +E+DL      FA   CA
Sbjct: 405 DEDDS-------IFGNHAQIGYQVEYDLQGGMVSFAPADCA 438


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 112/400 (28%), Positives = 156/400 (39%), Gaps = 87/400 (21%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  +++ G+PP+  +  + DTGS L W       RC  C+                  
Sbjct: 1   GVYYSTITLGSPPKDFS-LVMDTGSDLTWV------RCDPCS------------------ 35

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
                    P CS  F      R    + +  TC      Y   YG G FT G L  +TL
Sbjct: 36  ---------PDCSSTF-----DRLASNTYKALTC---ADDYSYGYGDGSFTQGDLSVDTL 78

Query: 221 RFPS------KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
           +         +  P F+ GC  L         GI      S S PSQ+G K   KFSYCL
Sbjct: 79  KMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCL 138

Query: 269 LSRKFDDAPVSSNLVLDTG------PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           L +   ++   S +V          PGSG  K   L YTP     +G SS +   Y V L
Sbjct: 139 LRQTAQNSLKKSPMVFGEAAVELKEPGSG--KLQELQYTP-----IGESSIY---YTVRL 188

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             I VG++ + +  S  + G D     I DSG+T T +       V     + + +    
Sbjct: 189 DGISVGNQRLDLSPSAFLNGQDKP--TIFDSGTTLTMLP----PGVCDSIKQSLASMVSG 242

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
           A+     GL  CF +       LP++   F GGA     P NY   +G+ + CLI    N
Sbjct: 243 AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFVPTN 301

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                       I G+ Q Q+F++  D+ N R GF +  C
Sbjct: 302 EVS---------IFGNLQQQDFFVLHDMDNRRIGFKETDC 332


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 115/407 (28%), Positives = 171/407 (42%), Gaps = 68/407 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP+     I DTGS ++W  C+    C   +  N+   ++  F P  SS
Sbjct: 89  GLYFTRVKLGSPPKEYFVQI-DTGSDILWVACSPCTGCPSSSGLNI---QLEFFNPDTSS 144

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +S  I C + +C+     + E+ C+  +  N  C      Y   YG G  T+G  +S+T+
Sbjct: 145 TSSKIPCSDDRCTAALQTS-EAVCQ--TSDNSPC-----GYTFTYGDGSGTSGYYVSDTM 196

Query: 221 RFPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
            F S         +  + + GCS         +DR   GI GFG+   S+ SQL      
Sbjct: 197 YFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVS 256

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL  +  D+      LVL      G+   PGL YTP   +           Y +
Sbjct: 257 PKVFSHCL--KGSDNG--GGILVL------GEIVEPGLVYTPLVPSQ--------PHYNL 298

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  I+V  + + I  S     +    G IVDSG+T  ++    ++         +    
Sbjct: 299 NLESIVVNGQKLPIDSSLFT--TSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSV 356

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLI 437
           R+  V K +    CF  S       P + L F GG  M + PENY    A + N VL  I
Sbjct: 357 RSL-VSKGN---QCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCI 412

Query: 438 LFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +  N       +G  I ILGD  L++    +DLAN R G+    C+
Sbjct: 413 GWQRN-------QGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 452


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 116/415 (27%), Positives = 172/415 (41%), Gaps = 80/415 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
           G Y   +  G+PP+     I DTGS ++W  C+S   C  C  P     +IP   F P  
Sbjct: 82  GLYFTRVQLGSPPKDFYVQI-DTGSDVLWVSCSS---CNGC--PVTSGLQIPLTFFDPGS 135

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG------FTAG 213
           S+++ L+ C + +C+      ++S    CS R   C      Y  QYG G      + A 
Sbjct: 136 STTAALVSCSDQRCT----AGIQSSDSLCSSRTNQC-----GYTFQYGDGSGTSGYYVAD 186

Query: 214 LLLSETLRFPSKTVPNFLAG--------CSIL-------SDRQPAGIAGFGRSSESLPSQ 258
           L+  +TL   S  +              CS L       SDR   GI GFG+   S+ SQ
Sbjct: 187 LMHLDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQ 246

Query: 259 LGL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
           L       + FS+CL   K DD+     LVL      G+   P + YTP   +       
Sbjct: 247 LASQGITPRVFSHCL---KGDDSG-GGVLVL------GEIVEPNIVYTPLVPSQ------ 290

Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
               Y + L+ I V  + + I  S  V G+  N G IVDSG+T  +    L E     F+
Sbjct: 291 --PHYNLYLQSISVAGQTLAIDPS--VFGASSNQGTIVDSGTTLAY----LAEGAYDPFV 342

Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALV 429
             + +          S    C+ ++   +   P++ L F GGA + L P++Y     ++ 
Sbjct: 343 SAITSVVSLNARTYLSKGNQCYLVTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVG 402

Query: 430 GNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G  V C+        G     G  I ILGD  L++    +D+AN R G+    C+
Sbjct: 403 GAAVWCV--------GFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDCS 449


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 113/387 (29%), Positives = 154/387 (39%), Gaps = 58/387 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +  + GTP Q       DT +   W PC     CV C+           F    S++ 
Sbjct: 90  YIVKANVGTPAQTFL-MALDTSNDAAWIPCNG---CVGCSST--------VFNSVTSTTF 137

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + +GC  P+C  +  P     C G      TC     ++   YG       L  +T+   
Sbjct: 138 KTLGCDAPQCKQVPNPT----CGG-----STC-----TWNTTYGGSTILSNLTRDTIALS 183

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
           +  VP +  GC   +  S   P G+ G GR   S  SQ   L    FSYCL S  F    
Sbjct: 184 TDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLN 241

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S  L L  GP     +   +  TP  KNP  SS      YYV L  I VG K V IP S
Sbjct: 242 FSGTLRL--GPAGQPLR---IKTTPLLKNPRRSS-----LYYVNLIGIRVGRKIVDIPAS 291

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
            L        G I DSG+ FT +  P++ AV  EF +++GN    A V    G   C+  
Sbjct: 292 ALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGN----AIVSSLGGFDTCY-- 345

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                +  P +   F  G  + LP +N           CL +    AA P        ++
Sbjct: 346 --TGPIVAPTMTFMFS-GMNVTLPTDNLLIRSTAGSTSCLAM----AAAPDNVNSVLNVI 398

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + Q QN  + FD+ N R G A++ C+
Sbjct: 399 ANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 117/464 (25%), Positives = 205/464 (44%), Gaps = 74/464 (15%)

Query: 35  PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKT 94
           PL+P     + +H+ +D  +I  +    S SR  +L    K         +   N +  +
Sbjct: 18  PLSP-----FYNHTMTDTARI-EATVHRSRSRLNYLYYINKLSE------NALDNDVSLS 65

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR--- 151
           P  V+  G Y +S + G P      F+ DT + L+W  C+      +CN    +P +   
Sbjct: 66  PTLVNEGGEYLMSFNIGNPSSQVMGFL-DTSNGLIWVQCS------NCN-SQCEPEKRGL 117

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-F 210
              F+  +S + ++  C +  C+ + G       + C+  +K C      Y L YG    
Sbjct: 118 TTKFLSSKSFTYEMEPCGSNFCNSLTG------FQTCNSSDKWC-----KYRLVYGDNKA 166

Query: 211 TAGLLLSETLRFPSKT-----VPNFLAGCS---ILSDRQP-AGIAGFGRSSESLPSQLGL 261
           T+G+L S++  F +       V     GCS   +  D Q   G  G  ++  SL SQLG+
Sbjct: 167 TSGILSSDSFGFDTSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGI 226

Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           KKFSYCL+   F++   +S +   + P +   +TP L     Y N         + YYV 
Sbjct: 227 KKFSYCLV--PFNNLGSTSKMYFGSLPVTSGGQTPLL-----YPN--------SDAYYVK 271

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           +  I +G+        + V   +   G I+D+G T++ +E   F+++  +F+  + ++ +
Sbjct: 272 VLGISIGNDEPHFDGVFDV--YEVRDGWIIDTGITYSSLETDAFDSLLAKFLT-LKDFPQ 328

Query: 382 AADVEKKSGLRPCFDISGKKSVY-LPELILKFKGGAKMALPPENYFALVGNE-VLCLILF 439
             D + K     CF++     +   P++ + F G A + L  E+ F  + ++ + CL L 
Sbjct: 329 RKD-DPKERFELCFELQNANDLESFPDVTVHFDG-ADLILNVESTFVKIEDDGIFCLALL 386

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              +        P  ILG+FQLQN+++ +DL      FA   CA
Sbjct: 387 RSGS--------PVSILGNFQLQNYHVGYDLEAQVISFAPVDCA 422


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 130/467 (27%), Positives = 179/467 (38%), Gaps = 93/467 (19%)

Query: 40  STKHYLHHSDSDPLKILHSLASSSLSRARHLK---TKTKPKTKDSNIGSNYSNSLIKTPL 96
           S K  L++S    L+  +     S+SR  H +       PK  +S I +N          
Sbjct: 40  SPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANG--------- 90

Query: 97  SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
                G Y +SLS GTPP      I DTGS L+W  CT   +C             P F 
Sbjct: 91  -----GEYLMSLSLGTPP-FEILAIADTGSDLIWTQCTPCDKCY--------KQIAPLFD 136

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
           PK S + + + C   +C  +           CS   + C      Y   YG   FT G L
Sbjct: 137 PKSSKTYRDLSCDTRQCQNL------GESSSCSSE-QLC-----QYSYYYGDRSFTNGNL 184

Query: 216 LSETLRFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---K 263
             +T+  PS        P  + GC   +    D++ +GI G G    SL SQ+G     K
Sbjct: 185 AVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGK 244

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
           FSYCL+    + A  SS L         GSG   TP +S     KNP         FYY+
Sbjct: 245 FSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLIS-----KNP-------DTFYYL 292

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  + VG K ++   S           +I+DSG++ T      F   A      + N  
Sbjct: 293 TLEAMSVGDKKIEFGGSSFGGSEG---NIIIDSGTSLTLFPVNFFTEFATAVENAVINGE 349

Query: 381 RAADVEKKSGL-----RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
           R  D    SGL     RP  D+       +P +   F  GA + L   N F L+ ++VLC
Sbjct: 350 RTQDA---SGLLSHCYRPTPDLK------VPVITAHFN-GADVVLQTLNTFILISDDVLC 399

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           L  F    +G         I G+    NF + +D+      F    C
Sbjct: 400 LA-FNSTQSGA--------IFGNVAQMNFLIGYDIQGKSVSFKPTDC 437


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 110/392 (28%), Positives = 160/392 (40%), Gaps = 62/392 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTP +  T  +FDTGS   W  C     CV   +   +P     F P +S+
Sbjct: 159 GNYVVPVRLGTPAERFT-VVFDTGSDTTWVQCQP---CVAYCYRQKEP----LFDPTKSA 210

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C +  CS ++         GCS  +         Y +QYG G +T G    +TL
Sbjct: 211 TYANISCSSSYCSDLY-------VSGCSGGHCL-------YGIQYGDGSYTIGFYAQDTL 256

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
                T+ NF  GC   +     + AG+ G GR   SLP Q   K    F+YCL      
Sbjct: 257 TLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL------ 310

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
            A  +    LD GPG+  +       TP    + P         FYYVG+  I VG   +
Sbjct: 311 PATSAGTGFLDLGPGAPAANA---RLTPMLVDRGPT--------FYYVGMTGIKVGGHVL 359

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            IP S          G +VDSG+  T +    +  +   F + M     +A     S L 
Sbjct: 360 PIPGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSA-APAFSILD 413

Query: 393 PCFDISGKK--SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
            C+D++G K  S+ LP + L F+GGA + +       +      CL  F  NA    +  
Sbjct: 414 TCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLA-FAPNADDTDVA- 471

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               I+G+ Q +   + +D+     GFA   C
Sbjct: 472 ----IVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 117/406 (28%), Positives = 168/406 (41%), Gaps = 68/406 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G + +S++ GTPP      I DTGS L W  C    +C   N P         F  K+SS
Sbjct: 83  GEFFMSITIGTPPMKVFA-IADTGSDLTWVQCKPCQQCYKENGP--------IFDKKKSS 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + +   C +  C  +      S  +GC      C      Y   YG   F+ G + +ET+
Sbjct: 134 TYKSEPCDSRNCHAL-----SSSERGCDESKNVC-----KYRYSYGDQSFSKGDVATETI 183

Query: 221 RFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
              S +      P  + GC   +    D   +GI G G    SL SQLG    KKFSYCL
Sbjct: 184 SIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 243

Query: 269 LSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQ 324
             +    A  +   V++ G  S     SK  G+  TP   K P         +YY+ L  
Sbjct: 244 SHKS---ATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPR-------TYYYLTLEA 293

Query: 325 IIVGSKHVKIPY--SYLVPG-----SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           I VG K  KIPY  S   P      S+ +G +I+DSG+T T ++   F+         + 
Sbjct: 294 ISVGKK--KIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVT 351

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
              R +D   +  L  CF  SG   + LPE+ + F  GA + L P N F  V  +++CL 
Sbjct: 352 GAKRVSD--PQGLLSHCFK-SGSAEIGLPEITVHFT-GADVRLSPINAFVKVSEDMVCLS 407

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +                I G+F   +F + +DL      F +  C+
Sbjct: 408 MVPTTEVA---------IYGNFAQMDFLVGYDLETRTVSFQRMDCS 444


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 120/425 (28%), Positives = 170/425 (40%), Gaps = 75/425 (17%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           ++ ++ G PPQ  T  + DTGS L W  C      V    P   P    AF    SS+  
Sbjct: 60  TVPVAVGAPPQNVT-MVLDTGSELSWLLCNGSR--VPSTPPQ--PQAPAAFNGSASSTYA 114

Query: 165 LIGCQN-PKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
              C + P+C W  G +  V   C G  P + +C ++     L Y    +A G+L ++T 
Sbjct: 115 AAHCSSSPECQW-RGRDLPVPPFCAG--PPSNSCRVS-----LSYADASSADGVLAADTF 166

Query: 221 RFPSKTVPNFLAGC--------------------SILSDRQPAGIAGFGRSSESLPSQLG 260
                     L GC                    +  S     G+ G  R S S  +Q G
Sbjct: 167 LLGGAPPVRALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTG 226

Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLD-TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF-- 317
             +F+YC+      D P    LVL   G G+  S  P L+YTP  +     S     F  
Sbjct: 227 TLRFAYCIAP---GDGP--GLLVLGGDGDGAALSAAPQLNYTPLIE----MSQPLPYFDR 277

Query: 318 --YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
             Y V L  I VG+  + IP S L P   G G  +VDSG+ FTF+    +  +  EF+ Q
Sbjct: 278 VAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ 337

Query: 376 MGNYSR---AADVEKKSGLRPCFDISGKK------SVYLPELILKFKGGAKMALPPENYF 426
                      D   +     CF  S  +      S  LPE+ L  + GA++A+  E   
Sbjct: 338 TSALLAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLR-GAEVAVGGEKLL 396

Query: 427 ALVGNE---------VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
            +V  E         V CL     + AG +     A ++G    QN ++E+DL N R GF
Sbjct: 397 YMVPGERRGEGGSEAVWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNSRVGF 451

Query: 478 AKQKC 482
           A  +C
Sbjct: 452 APARC 456


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 127/460 (27%), Positives = 195/460 (42%), Gaps = 83/460 (18%)

Query: 44  YLHHSDSDPLKILHS-LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
           Y+   D + ++  HS LA +S + A   K           +G   +   +K+ LS+ S G
Sbjct: 54  YMFAKDEERIRYFHSRLAKNSDANASSKK-----------VGPKLAGIPLKSGLSMGS-G 101

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
            Y + +  G+P +  T  I DTGSS  W    PCT     + C+         P F P  
Sbjct: 102 NYYVKMGLGSPTKYYT-MIVDTGSSFSWLQCQPCT-----IYCHI-----QEDPVFNPSA 150

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S + + + C + +CS +    +      CS ++  C      Y   YG   F+ G L  +
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPT--CSKQSNAC-----VYKASYGDSSFSLGYLSQD 203

Query: 219 TLRF-PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCL- 268
            L   PS+T+ +F+ GC    D Q       GI G   +  S+ SQL  K    FSYCL 
Sbjct: 204 VLTLTPSQTLSSFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLP 261

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQII 326
            S    ++P    L + T      S TP  SY  TP  KNP   S      Y++ L  I 
Sbjct: 262 TSFSTPNSPKEGFLSIGT-----SSLTPSSSYKFTPLLKNPNNPS-----LYFIDLESIT 311

Query: 327 VGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
           V  + + +  S Y VP        I+DSG+  T +  P++  +   ++  +   Y +A  
Sbjct: 312 VAGRPLGVAASSYKVP-------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPG 364

Query: 385 VEKKSGLRPCF--DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
           +   S L  CF   ++G   V  P++ + FKGGA + L   N    +   + CL +    
Sbjct: 365 I---SLLDTCFKGSLAGISEV-APDIRIIFKGGADLQLKGHNSLVELETGITCLAM---- 416

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 G     I+G++Q Q   + +D+ N R GFA   C
Sbjct: 417 -----AGSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 110/390 (28%), Positives = 154/390 (39%), Gaps = 52/390 (13%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +    GTPPQ     + DT +  VW PC+    C  C+  N   S         S+
Sbjct: 28  GNYVVRAKLGTPPQLMF-MVLDTSNDAVWLPCSG---CSGCS--NASTSFNTNSSSTYST 81

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSET 219
               + C   +C+   G      C   SP+   C     S+   YG    F+A L+  +T
Sbjct: 82  ----VSCSTAQCTQARGLT----CPSSSPQPSVC-----SFNQSYGGDSSFSASLV-QDT 127

Query: 220 LRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKF 273
           L      +PNF  GC   +  +   P G+ G GR   SL SQ   L    FSYCL S + 
Sbjct: 128 LTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRS 187

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                S  L L   P S       + YTP  +NP   S      YYV L  + VGS  V 
Sbjct: 188 FYFSGSLKLGLLGQPKS-------IRYTPLLRNPRRPS-----LYYVNLTGVSVGSVQVP 235

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +   YL   ++   G I+DSG+  T    P++EA+  EF +Q+      +          
Sbjct: 236 VDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDT 291

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGP 452
           CF  S       P++ L       + LP EN         L CL +        A+    
Sbjct: 292 CF--SADNENVAPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN-- 346

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             ++ + Q QN  + FD+ N R G A + C
Sbjct: 347 --VIANLQQQNLRILFDVPNSRIGIAPEPC 374


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 110/392 (28%), Positives = 160/392 (40%), Gaps = 62/392 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTP +  T  +FDTGS   W  C     CV   +   +P     F P +S+
Sbjct: 94  GNYVVPVRLGTPAERFT-VVFDTGSDTTWVQCQP---CVAYCYRQKEP----LFDPTKSA 145

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C +  CS ++         GCS  +         Y +QYG G +T G    +TL
Sbjct: 146 TYANISCSSSYCSDLY-------VSGCSGGHCL-------YGIQYGDGSYTIGFYAQDTL 191

Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
                T+ NF  GC   +     + AG+ G GR   SLP Q   K    F+YCL      
Sbjct: 192 TLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL------ 245

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
            A  +    LD GPG+  +       TP    + P         FYYVG+  I VG   +
Sbjct: 246 PATSAGTGFLDLGPGAPAANA---RLTPMLVDRGPT--------FYYVGMTGIKVGGHVL 294

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            IP S          G +VDSG+  T +    +  +   F + M     +A     S L 
Sbjct: 295 PIPGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSA-APAFSILD 348

Query: 393 PCFDISGKK--SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
            C+D++G K  S+ LP + L F+GGA + +       +      CL  F  NA    +  
Sbjct: 349 TCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLA-FAPNADDTDVA- 406

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               I+G+ Q +   + +D+     GFA   C
Sbjct: 407 ----IVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 117/408 (28%), Positives = 172/408 (42%), Gaps = 71/408 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP+     I DTGS ++W  C S   C DC   +     +  F P  SS
Sbjct: 84  GLYFTKVKLGSPPREFNVQI-DTGSDILWVTCNS---CNDCPRTSGLGIELSFFDPSSSS 139

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++ L+ C +P C+ +    V++    CSP++  C     SY   YG G  T G  +S+ L
Sbjct: 140 TTSLVSCSHPICTSL----VQTTAAECSPQSNQC-----SYSFHYGDGSGTTGYYVSDML 190

Query: 221 RFPSKTVPNFLA--------GCSILS-------DRQPAGIAGFGRSSESLPSQL---GL- 261
            F +    + +A        GCS          D+   GI GFG+   S+ SQL   G+ 
Sbjct: 191 YFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGIT 250

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL      +      LVL      G+   P + Y+P   +           Y +
Sbjct: 251 PKVFSHCLKG----EGDGGGKLVL------GEILEPNIIYSPLVPSQ--------SHYNL 292

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I V  +   +P    V  +  N G IVDSG+T T+    L E     F+  +    
Sbjct: 293 NLQSISVNGQ--LLPIDPAVFATSNNQGTIVDSGTTLTY----LVETAYDPFVSAITATV 346

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
            ++     S    C+ +S       P + L F GGA M L P  Y   +G        F+
Sbjct: 347 SSSTTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLG--------FS 398

Query: 441 DNAAGPALG----RGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           D AA   +G      P I ILGD  L++    +DLA+ R G+A   C+
Sbjct: 399 DGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDCS 446


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 108/389 (27%), Positives = 152/389 (39%), Gaps = 62/389 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y IS+  GTP    T  I DTGS + W  C        C  P         F P +SS+ 
Sbjct: 127 YVISVGLGTPAVTQTVTI-DTGSDVSWVQCNP------CPNPPCYAQTGALFDPAKSSTY 179

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE---TL 220
           + + C   +C+ +     E +  GC   N  C      Y +QYG G T     S    TL
Sbjct: 180 RAVSCAAAECAQL-----EQQGNGCGATNYEC-----QYGVQYGDGSTTNGTYSRDTLTL 229

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
              S  V  F  GCS +      Q  G+ G G  ++SL SQ        FSYCL      
Sbjct: 230 SGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL------ 283

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
             P S       G     +   G   + F    +  S     FY   L+ I VG K + +
Sbjct: 284 -PPTS-------GSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGL 335

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
             S    GS      +VDSG+  T +    + A++  F   M  Y  A     +S L  C
Sbjct: 336 SPSVFAAGS------VVDSGTIITRLPPTAYSALSSAFKAGMKQYRSA---PARSILDTC 386

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG-RGPA 453
           FD +G+  + +P + L F GGA + L P              I++ +  A  A G  G  
Sbjct: 387 FDFAGQTQISIPTVALVFSGGAAIDLDPNG------------IMYGNCLAFAATGDDGTT 434

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            I+G+ Q + F + +D+ +   GF    C
Sbjct: 435 GIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 122/407 (29%), Positives = 173/407 (42%), Gaps = 74/407 (18%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           ++ PL+  S G Y +S+S GTPP      + DTGS L+W  C    +C          SR
Sbjct: 81  LQAPLTPGS-GEYLMSVSIGTPP-VDYIGMADTGSDLMWAQCLPCLKCYK-------QSR 131

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
            P F P +S+S   + C +  C  I     +S C         C      Y   YG   +
Sbjct: 132 -PIFDPLKSTSFSHVPCNSQNCKAI----DDSHCGA----QGVC-----DYSYTYGDQTY 177

Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL-----K 262
           T G L  E +   S +V + + GC   S       +G+ G G    SL SQ+       +
Sbjct: 178 TKGDLGFEKITIGSSSVKSVI-GCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISR 236

Query: 263 KFSYCL---LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFY 318
           +FSYCL   LS          N V+     SG    PG+  TP   KNPV        +Y
Sbjct: 237 RFSYCLPTLLSHANGKINFGQNAVV-----SG----PGVVSTPLISKNPV-------TYY 280

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
           YV L  I +G++       ++     GN  VI+DSG+T +F+   L++ V    ++ +  
Sbjct: 281 YVTLEAISIGNER------HMASAKQGN--VIIDSGTTLSFLPKELYDGVVSSLLKVV-- 330

Query: 379 YSRAADVEKKSGLRP-CFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
             +A  V+        CFD  I+   S  +P +  +F GGA + L P N F  V N V C
Sbjct: 331 --KAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNC 388

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           L L     A P    G   I+G+  L NF + +DL   R  F    C
Sbjct: 389 LTL---TPASPTDEFG---IIGNLALANFLIGYDLEAKRLSFKPTVC 429


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 162/389 (41%), Gaps = 66/389 (16%)

Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + +SFGTP  PQ     + DTGS + W       +C  C+     P + P + P  SS
Sbjct: 79  YVVRVSFGTPAVPQV---VVIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSS 129

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +   + C +  C  +      S C       K C  A     + Y  G  T G    + L
Sbjct: 130 TYSAVPCASDVCKKLAADAYGSGCT----SGKQCGFA-----ISYADGTSTVGAYSQDKL 180

Query: 221 RF-PSKTVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
              P   V NF  GC            G+ G GR  ESL ++ G   FSYCL S      
Sbjct: 181 TLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPS------ 233

Query: 277 PVSSN---LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            VSS    L L  G      K P    + F   P+G+      F  V L  I VG K + 
Sbjct: 234 -VSSKPGFLALGAG------KNP----SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLD 282

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +  S        +GG+IVDSG+  T ++   + A+   F + M  Y    + +    L  
Sbjct: 283 LRPSAF------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDT 332

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C++++G K+V +P++ L F GGA + L   N   + G    CL       +GP    G A
Sbjct: 333 CYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG----CLAF---AESGP---DGSA 382

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +LG+   + F + FD +  +FGF  + C
Sbjct: 383 GVLGNVNQRAFEVLFDTSTSKFGFRAKAC 411


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 162/389 (41%), Gaps = 66/389 (16%)

Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y + +SFGTP  PQ     + DTGS + W       +C  C+     P + P + P  SS
Sbjct: 113 YVVRVSFGTPAVPQV---VVIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSS 163

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +   + C +  C  +      S C       K C  A     + Y  G  T G    + L
Sbjct: 164 TYSAVPCASDVCKKLAADAYGSGCT----SGKQCGFA-----ISYADGTSTVGAYSQDKL 214

Query: 221 RF-PSKTVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
              P   V NF  GC            G+ G GR  ESL ++ G   FSYCL S      
Sbjct: 215 TLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPS------ 267

Query: 277 PVSSN---LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            VSS    L L  G      K P    + F   P+G+      F  V L  I VG K + 
Sbjct: 268 -VSSKPGFLALGAG------KNP----SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLD 316

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           +  S        +GG+IVDSG+  T ++   + A+   F + M  Y    + +    L  
Sbjct: 317 LRPSAF------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDT 366

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C++++G K+V +P++ L F GGA + L   N   + G    CL       +GP    G A
Sbjct: 367 CYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG----CLAF---AESGP---DGSA 416

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +LG+   + F + FD +  +FGF  + C
Sbjct: 417 GVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 127/460 (27%), Positives = 195/460 (42%), Gaps = 83/460 (18%)

Query: 44  YLHHSDSDPLKILHS-LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
           Y+   D + ++  HS LA +S + A   K           +G   +   +K+ LS+ S G
Sbjct: 54  YMFAKDEERIRYFHSRLAKNSDANASFKK-----------VGPKLAGIPLKSGLSMGS-G 101

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
            Y + +  G+P +  T  I DTGSS  W    PCT     + C+         P F P  
Sbjct: 102 NYYVKMGLGSPTKYYT-MIVDTGSSFSWLQCQPCT-----IYCHI-----QEDPVFNPSA 150

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S + + + C + +CS +    +      CS ++  C      Y   YG   F+ G L  +
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPT--CSKQSNAC-----VYKASYGDSSFSLGYLSQD 203

Query: 219 TLRF-PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCL- 268
            L   PS+T+ +F+ GC    D Q       GI G   +  S+ SQL  K    FSYCL 
Sbjct: 204 VLTLTPSQTLSSFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLP 261

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQII 326
            S    ++P    L + T      S TP  SY  TP  KNP   S      Y++ L  I 
Sbjct: 262 TSFSTPNSPKEGFLSIGT-----SSLTPSSSYKFTPLLKNPNNPS-----LYFIDLESIT 311

Query: 327 VGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
           V  + + +  S Y VP        I+DSG+  T +  P++  +   ++  +   Y +A  
Sbjct: 312 VAGRPLGVAASSYKVP-------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPG 364

Query: 385 VEKKSGLRPCF--DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
           +   S L  CF   ++G   V  P++ + FKGGA + L   N    +   + CL +    
Sbjct: 365 I---SLLDTCFKGSLAGISEV-APDIRIIFKGGADLQLKGHNSLVELETGITCLAM---- 416

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 G     I+G++Q Q   + +D+ N R GFA   C
Sbjct: 417 -----AGSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 122/412 (29%), Positives = 170/412 (41%), Gaps = 68/412 (16%)

Query: 82  NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
            + +   ++ + T + V ++  Y +++S GTP  + T  + DTGS + W       +C  
Sbjct: 122 QLATGSRSATVPTTMGVGTFQ-YVVTVSLGTPGVSQTVEV-DTGSDVSWV------QCKP 173

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
           C+ P  +  R   F P +SS+   + C    CS +     E+ C G       C      
Sbjct: 174 CSAPACNSQRDQLFDPAKSSTYSAVPCGADACSEL--RIYEAGCSG-----SQC-----G 221

Query: 202 YLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGF---GRSSESLP 256
           Y++ YG G  T G+  S+TL   P  TV  FL GC        AGI G    GR S SL 
Sbjct: 222 YVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLK 281

Query: 257 SQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
           SQ        FSYCL S++      S+   L  G   G S   G + T        ++ A
Sbjct: 282 SQAAGAYGGVFSYCLPSKQ------SAAGYLTLG---GPSSASGFATTGLL-----TAWA 327

Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF- 372
              FY V L  I VG + V +P S         GG +VD+G+  T +    + A+   F 
Sbjct: 328 APTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFR 381

Query: 373 --IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
             I   G  S  A+      L  C+D S    V LP + L F GGA +AL          
Sbjct: 382 GAIAPCGYPSAPAN----GILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL---- 433

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               CL      A  P  G G A ILG+ Q ++F + FD      GF    C
Sbjct: 434 -SSGCL------AFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 114/407 (28%), Positives = 171/407 (42%), Gaps = 68/407 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP+     I DTGS ++W  C+    C   +  N+   ++  F P  SS
Sbjct: 89  GLYFTRVKLGSPPKEYFVQI-DTGSDILWVACSPCTGCPSSSGLNI---QLEFFNPDTSS 144

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +S  I C + +C+     + E+ C+  +  N  C      Y   YG G  T+G  +S+T+
Sbjct: 145 TSSKIPCSDDRCTAALQTS-EAVCQ--TSDNSPC-----GYTFTYGDGSGTSGYYVSDTM 196

Query: 221 RFPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
            F +         +  + + GCS         +DR   GI GFG+   S+ SQL      
Sbjct: 197 YFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVS 256

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL  +  D+      LVL      G+   PGL YTP   +           Y +
Sbjct: 257 PKVFSHCL--KGSDNG--GGILVL------GEIVEPGLVYTPLVPSQ--------PHYNL 298

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  I+V  + + I  S     +    G IVDSG+T  ++    ++         +    
Sbjct: 299 NLESIVVNGQKLPIDSSLFT--TSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSV 356

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLI 437
           R+  V K +    CF  S       P + L F GG  M + PENY    A + N VL  I
Sbjct: 357 RSL-VSKGN---QCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCI 412

Query: 438 LFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +  N       +G  I ILGD  L++    +DLAN R G+    C+
Sbjct: 413 GWQRN-------QGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 452


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 116/426 (27%), Positives = 165/426 (38%), Gaps = 93/426 (21%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           ++ ++ GTPPQ  T  + DTGS L W  C   Y           P   PAF    SSS  
Sbjct: 56  TVPVAVGTPPQNVT-MVLDTGSELSWLLCNGSYA----------PPLTPAFNASGSSSYG 104

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCP--------------LACPSYLLQYGLGF 210
            + C +  C W  G ++       +P +  C               LA  ++LL  G   
Sbjct: 105 AVPCPSTACEW-RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPP 163

Query: 211 TA-GLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
            A G        + S T  N     + +S+    G+ G  R + S  +Q G ++F+YC+ 
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGTRRFAYCI- 221

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQI 325
                 AP     VL  G   G +  P L+YTP  +     S     F    Y V L  I
Sbjct: 222 ------APGEGPGVLLLGDDGGVA--PPLNYTPLIE----ISQPLPYFDRVAYSVQLEGI 269

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ---------- 375
            VG   + IP S L P   G G  +VDSG+ FTF+    + A+  EF  Q          
Sbjct: 270 RVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGE 329

Query: 376 -----MGNYSR-----AADVEKKSGLRPCFD---------ISGKKSVYLPELILKFKGGA 416
                 G +        A V   SGL P            +SG+K +Y+     + +GGA
Sbjct: 330 PGFVFQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGA 389

Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
           +               V CL     + AG +     A ++G    QN ++E+DL N R G
Sbjct: 390 EA--------------VWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVG 430

Query: 477 FAKQKC 482
           FA  +C
Sbjct: 431 FAPARC 436


>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like, partial [Brachypodium distachyon]
          Length = 364

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/262 (32%), Positives = 119/262 (45%), Gaps = 40/262 (15%)

Query: 243 AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LS 299
           AG+ G  R + S  SQ G ++FSYC+  R  DDA V   L+L      G S  P    L+
Sbjct: 110 AGLLGMNRGALSFVSQAGTRRFSYCISDR--DDAGV---LLL------GHSDLPNFLPLN 158

Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
           YTP Y+  +         Y V L  I+VGSK + IP S L P   G G  +VDSG+ FTF
Sbjct: 159 YTPLYQPSLPLPYFDRVAYSVQLLGILVGSKPLPIPASVLAPDHTGAGQTMVDSGTQFTF 218

Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--------ISGKKSVYLPELILK 411
           + G  + A+  EF RQ   + RA D E     +  FD        +S      LP + L+
Sbjct: 219 LLGDAYAALKAEFYRQSTPFLRALD-EPSFAFQGAFDTCFRVPRGMSPPPGRLLPSVTLR 277

Query: 412 FKGGAKMALPPENYFALVGNE-----------VLCLILFTDNAAGPALGRGPAIILGDFQ 460
           F  GA+M +  +     V  E           V CL  F +    P +    A ++G   
Sbjct: 278 FN-GAEMVVGGDRLLYKVPGERRGGAGADDDAVWCLT-FGNADMVPIM----AYVIGHHH 331

Query: 461 LQNFYLEFDLANDRFGFAKQKC 482
             N ++E+DL   R G A+ +C
Sbjct: 332 QMNLWVEYDLERGRVGLAQVRC 353


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 113/400 (28%), Positives = 162/400 (40%), Gaps = 77/400 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+S GTPP      I DTGS L W  C    +C             P F P +S+
Sbjct: 90  GEYLMSVSIGTPP-VDYLGIADTGSDLTWAQCLPCLKCYQ--------QLRPIFNPLKST 140

Query: 162 SSQLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S   + C    C  +   +  V+  C                Y   YG   ++ G L  E
Sbjct: 141 SFSHVPCNTQTCHAVDDGHCGVQGVCD---------------YSYTYGDRTYSKGDLGFE 185

Query: 219 TLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL-----KKFSYCL-- 268
            +   S +V + + GC   S       +G+ G G    SL SQ+       ++FSYCL  
Sbjct: 186 KITIGSSSVKSVI-GCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPT 244

Query: 269 -LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQII 326
            LS          N V+           PG+  TP   KN V        +YY+ L  I 
Sbjct: 245 LLSHANGKINFGENAVV---------SGPGVVSTPLISKNTV-------TYYYITLEAIS 288

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +G++       ++     GN  VI+DSG+T T +   L++ V    ++ +    +A  V+
Sbjct: 289 IGNER------HMAFAKQGN--VIIDSGTTLTILPKELYDGVVSSLLKVV----KAKRVK 336

Query: 387 KKSG-LRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
              G L  CFD  I+   S+ +P +   F GGA + L P N F  V + V CL L    A
Sbjct: 337 DPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTL---KA 393

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A P    G   I+G+    NF + +DL   R  F    CA
Sbjct: 394 ASPTTEFG---IIGNLAQANFLIGYDLEAKRLSFKPTVCA 430


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 109/391 (27%), Positives = 162/391 (41%), Gaps = 55/391 (14%)

Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y ++L FGTP  PQ     + DTGS L W       +C  CN     P + P F P  SS
Sbjct: 122 YVVTLGFGTPAVPQV---LLIDTGSDLSWV------QCQPCNSSTCYPQKDPVFDPSASS 172

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   + C +  C  +   + +S   GC+  N +   +   Y +QYG G  T G+  +ETL
Sbjct: 173 TYAPVPCGSEACRDL---DPDSYANGCT--NSSSGASLCQYGIQYGNGDTTVGVYSTETL 227

Query: 221 RF---PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
                 +  V NF  GC ++         G+ G G + ESL SQ        FSYCL + 
Sbjct: 228 TLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAG 287

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                  ++  +    P +G + T G  +TP              FY V L  I VG K 
Sbjct: 288 N-----STAGFLALGAPATGGNNTAGFQFTPLQVVET-------TFYLVKLTGISVGGKQ 335

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + I  +         GG+I+DSG+  T +    + A+   F   M  Y      + +  L
Sbjct: 336 LDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDED-L 388

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D +G  +V +P + L F+GG  + L   +   L G    CL       AG +   G
Sbjct: 389 DTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLLDG----CLAFV----AGAS--DG 438

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+   + F + +D A    GF    C
Sbjct: 439 DTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 117/404 (28%), Positives = 162/404 (40%), Gaps = 74/404 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  ++  GTP +  +  I DTGS L W  C+   +C   N           F+P  S+
Sbjct: 11  GEYLATVRLGTPERVFS-VIVDTGSDLTWVQCSPCGKCYSQN--------DALFLPNTST 61

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           S   + C +  C+ +  P               C      Y   YG G  T G  + +T+
Sbjct: 62  SFTKLACGSALCNGLPFP--------------MCNQTTCVYWYSYGDGSLTTGDFVYDTI 107

Query: 221 RF-----PSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGL---KKFSYCLL 269
                    + VPNF  GC   ++   AG   I G G+   S  SQL      KFSYCL+
Sbjct: 108 TMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLV 167

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGL---SYTPFYKNPVGSSSAFGEFYYVGLRQII 326
              +   P  ++ +L      GD+  P L    Y P   NP         +YYV L  I 
Sbjct: 168 --DWLAPPTQTSPLL-----FGDAAVPILPDVKYLPILANP-----KVPTYYYVKLNGIS 215

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSRA 382
           VG   + I  +     S G  G I DSG+T T     L EA  KE +  M      YSR 
Sbjct: 216 VGDNLLNISSTVFDIDSVGGAGTIFDSGTTVT----QLAEAAYKEVLAAMNASTMAYSRK 271

Query: 383 ADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFT 440
             ++  S L  C     K  +  +P +   F+GG  M LPP NYF  L  ++  C     
Sbjct: 272 --IDDISRLDLCLSGFPKDQLPTVPAMTFHFEGG-DMVLPPSNYFIYLESSQSYCF---- 324

Query: 441 DNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 A+   P + I+G  Q QNF + +D A  + GF  + C 
Sbjct: 325 ------AMTSSPDVNIIGSVQQQNFQVYYDTAGRKLGFVPKDCV 362


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 117/439 (26%), Positives = 178/439 (40%), Gaps = 78/439 (17%)

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
           A  S++RA H    +   T +S +              +   GGY ++ S GTPP     
Sbjct: 57  ARRSINRANHFFKDSDTSTPESTV--------------IPDRGGYLMTYSVGTPP-TKIY 101

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGS +VW  C    +C +           P F P +SSS + I C +  C  +   
Sbjct: 102 GIADTGSDIVWLQCEPCEQCYN--------QTTPIFNPSKSSSYKNIPCSSKLCHSV--- 150

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK-----TVPNFLAG 233
               R   CS +N     +C  Y + YG    + G L  +TL   S      + P  + G
Sbjct: 151 ----RDTSCSDQN-----SC-QYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIG 200

Query: 234 CSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVL-D 285
           C   +        +GI G G    SL +QLG     KFSYCL+     ++  SS L   D
Sbjct: 201 CGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGD 260

Query: 286 TGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
               SGD    G+  TP   K+PV        FY++ L+   VG+K V+   S    G D
Sbjct: 261 AAVVSGD----GVVSTPLIKKDPV--------FYFLTLQAFSVGNKRVEFGGS--SEGGD 306

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
             G +I+DSG+T T +   ++  +    +  +    R  D  ++  L  C+ +   +  +
Sbjct: 307 DEGNIIIDSGTTLTLIPSDVYTNLESAVV-DLVKLDRVDDPNQQFSL--CYSLKSNEYDF 363

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
            P + + FK GA + L   + F  + + ++C          P LG     I G+   QN 
Sbjct: 364 -PIITVHFK-GADVELHSISTFVPITDGIVCFAF----QPSPQLGS----IFGNLAQQNL 413

Query: 465 YLEFDLANDRFGFAKQKCA 483
            + +DL      F    C 
Sbjct: 414 LVGYDLQQKTVSFKPTDCT 432


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 113/405 (27%), Positives = 164/405 (40%), Gaps = 67/405 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   ++ GTP   +     DTGS + W  C    RC         P   P F P+ S+
Sbjct: 132 GEYMAKIAVGTPAVEAL-LAMDTGSDITWLQCQPCRRCY--------PQSGPVFDPRHST 182

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL--GFTAGLLLSET 219
           S + +G   P C  +       R  G   +  TC      Y + YG     T G  + ET
Sbjct: 183 SYREMGYDAPDCQAL------GRSGGGDAKRMTC-----VYAVGYGDDGSTTVGDFIEET 231

Query: 220 LRFPSKT-VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLL 269
           L F     VP+   GC      L     AGI G GR   S PSQ+      +  FSYCL 
Sbjct: 232 LTFAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCL- 290

Query: 270 SRKFDDAP---VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           +  F  +P   VSS L +  G  +G   +P  S+TP  +N          FYYV L  + 
Sbjct: 291 ADFFLSSPGRSVSSTLTIGDGAAAG---SPPPSFTPTVQN-----LNMATFYYVRLVGVS 342

Query: 327 VG--------SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           VG           +K+ PY+       G GGVI+DSG+  T +    + A    F     
Sbjct: 343 VGGVRVPGVTEDDLKLDPYT-------GRGGVILDSGTAVTRLARRAYIAFRDAFRAAAV 395

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
           +  + +          C+ + G+ ++ +P + + F GG ++ LPP+NY   V +      
Sbjct: 396 DLGQVSIGGPSGFFDTCYTMGGR-AMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCF 454

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            F         G     I+G+ Q Q F + +++   R GFA   C
Sbjct: 455 AFA------GTGDRSVSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 101/401 (25%), Positives = 162/401 (40%), Gaps = 67/401 (16%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + LS GTPPQ    F     S   W  C+S    ++C   ++       F P  S+S   
Sbjct: 1   MDLSLGTPPQP-LNFTLAVDSGFSWVACSSSC-AINCTTASL-------FQPGLSTSHTK 51

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRFPS 224
           + C +P CS      V + C          P +  SY   YG  F+ AG L+S+     S
Sbjct: 52  LPCGSPSCSAFSA--VSTSCG---------PSSSCSYNTSYGTNFSSAGDLVSDIATMDS 100

Query: 225 ----KTVPNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGL----KKFSYCLLSR 271
               K   N   GC      +L     +G  GF + + S   QL       KF YCL S 
Sbjct: 101 VRNRKVAANLSLGCGRDSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSD 160

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
            F    V  N  L        S +  ++YTP   NP  +     E Y++ L  I +    
Sbjct: 161 TFRGKLVIGNYKLRNA-----SISSSMAYTPMITNPQAA-----ELYFINLSTISIDKNK 210

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR-----AADVE 386
            ++P    +  S+G GG ++D+ +  +++    +     + ++ + NY+      ++ V 
Sbjct: 211 FQVPIQGFL--SNGTGGTVIDTTTFLSYLTSDFY----TQLVQAIKNYTTNLVEVSSSVA 264

Query: 387 KKSGLRPCFDISGKKSVYLP-ELILKFKGGAKMALPPENYFALVG----NEVLCLILFTD 441
              G+  C++IS       P  L   F GGA + +    +F L      N  +C+ +   
Sbjct: 265 DALGVELCYNISANSDFPPPATLTYHFLGGAGVEV--STWFLLDDSDSVNNTICMAIGRS 322

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            + GP L      ++G +Q  +  +E+DL   R+GF  Q C
Sbjct: 323 ESVGPNLN-----VIGTYQQLDLTVEYDLEQMRYGFGAQGC 358


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 113/406 (27%), Positives = 171/406 (42%), Gaps = 70/406 (17%)

Query: 104 YSISLSFGTPPQASTPFI-FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y   +  G+PP+    F+  DTGS ++W  C+    C   +  N+   ++  F P  SS+
Sbjct: 117 YFTRVKLGSPPKEY--FVQIDTGSDILWVACSPCTGCPSSSGLNI---QLEFFNPDTSST 171

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR 221
           S  I C + +C+     + E+ C+  +  N  C      Y   YG G  T+G  +S+T+ 
Sbjct: 172 SSKIPCSDDRCTAALQTS-EAVCQ--TSDNSPC-----GYTFTYGDGSGTSGYYVSDTMY 223

Query: 222 FPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQLGL----- 261
           F +         +  + + GCS         +DR   GI GFG+   S+ SQL       
Sbjct: 224 FDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSP 283

Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           K FS+CL  +  D+      LVL      G+   PGL YTP   +           Y + 
Sbjct: 284 KVFSHCL--KGSDNG--GGILVL------GEIVEPGLVYTPLVPSQ--------PHYNLN 325

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L  I+V  + + I  S     +    G IVDSG+T  ++    ++         +    R
Sbjct: 326 LESIVVNGQKLPIDSSLFT--TSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR 383

Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLIL 438
           +  V K +    CF  S       P + L F GG  M + PENY    A + N VL  I 
Sbjct: 384 SL-VSKGN---QCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIG 439

Query: 439 FTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +  N       +G  I ILGD  L++    +DLAN R G+    C+
Sbjct: 440 WQRN-------QGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 478


>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 78/252 (30%), Positives = 112/252 (44%), Gaps = 19/252 (7%)

Query: 239 DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
           D +  G+ G  R S S  SQ+   KFSYC+    F         VL  G  +     P L
Sbjct: 128 DSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSDFSG-------VLLLGDANFSWLMP-L 179

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
           +YTP  +            Y V L  I V SK + +P S  VP   G G  +VDSG+ FT
Sbjct: 180 NYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFT 239

Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDI--SGKKSVYLPELILKFK 413
           F+ GP++ A+  EF+ Q     R  +      + G+  C+ +  S     +LP + L F+
Sbjct: 240 FLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFR 299

Query: 414 GGAKMALPPENYFALVGNEVL---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
            GA+M +  +     V  EV     +  FT       L    A ++G    QN ++EFDL
Sbjct: 300 -GAEMKVSGDRLLYRVPGEVRGSDSVYCFT--FGNSDLLAVEAYVIGHHHQQNVWMEFDL 356

Query: 471 ANDRFGFAKQKC 482
              R GFA+ +C
Sbjct: 357 EKSRIGFAQVQC 368


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 126/487 (25%), Positives = 191/487 (39%), Gaps = 75/487 (15%)

Query: 12  FSLLILLFTTDAGA---GSSAATVTVPLTPLSTK--HYLHHSDSDPLKILHSLASSSLSR 66
           F L+ LLF+T        + + T  + + P+ +K   ++       +  + ++AS    R
Sbjct: 10  FFLVALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPER 69

Query: 67  ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
            ++L T    KT    I              V     Y + +  GTP Q     + DT +
Sbjct: 70  LKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGTPGQQMF-MVLDTSN 119

Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
              W PC+    C    F +        F+P  S++   + C   +CS +          
Sbjct: 120 DAAWVPCSG---CT--GFSST------TFLPNASTTLGSLDCSGAQCSQV---------- 158

Query: 187 GCSPRNKTCPLACPSY-LLQYGLGFTAGL---LLSETLRFPSKTVPNFLAGC-SILSDRQ 241
               R  +CP    S  L     G  + L   L+ + +   +  +P F  GC + +S   
Sbjct: 159 ----RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSGGS 214

Query: 242 --PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
             P G+ G GR   SL SQ G      FSYCL S  F     S +L L  GP  G  K+ 
Sbjct: 215 IPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKL--GP-VGQPKS- 268

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
            +  TP  +NP   S      YYV L  + VG   V IP   LV   +   G I+DSG+ 
Sbjct: 269 -IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTV 322

Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
            T    P++ A+  EF +Q+        +        CF  + +     P + L F+ G 
Sbjct: 323 ITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAATNEAEA--PAITLHFE-GL 374

Query: 417 KMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
            + LP EN      +  L CL +    AA P        ++ + Q QN  + FD  N R 
Sbjct: 375 NLVLPMENSLIHSSSGSLACLSM----AAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRL 430

Query: 476 GFAKQKC 482
           G A++ C
Sbjct: 431 GIARELC 437


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 105/405 (25%), Positives = 171/405 (42%), Gaps = 69/405 (17%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           + + ++G Y +    GTPP      I DT S L+W  C+    C         P   P F
Sbjct: 82  VRIPNHGEYLMRFYIGTPPVERLA-IADTASDLIWVQCSPCETCF--------PQDTPLF 132

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA---CPSYLLQYGLG-FT 211
            P +SS+   + C +  C+              S     CPL    C  Y   YG G  T
Sbjct: 133 EPHKSSTFANLSCDSQPCT--------------SSNIYYCPLVGNLC-LYTNTYGDGSST 177

Query: 212 AGLLLSETLRFPSKTV--PNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLK- 262
            G+L +E++ F S+TV  P  + GC   +D       +  GI G G    SL SQLG + 
Sbjct: 178 KGVLCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQI 237

Query: 263 --KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYY 319
             KFSYCLL       P +S   +    G+  + T  G+  TP   +P      +  +Y+
Sbjct: 238 GHKFSYCLL-------PFTSTSTIKLKFGNDTTITGNGVVSTPLIIDP-----HYPSYYF 285

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           + L  I +G K +++  +        NG +I+D G+  T++E   +          +G  
Sbjct: 286 LHLVGITIGQKMLQVRTT-----DHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGIS 340

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLIL 438
               D+        CF    + ++  P+++ +F  GAK+ L P+N +F      ++CL +
Sbjct: 341 ETKDDIPYPFDF--CF--PNQANITFPKIVFQFT-GAKVFLSPKNLFFRFDDLNMICLAV 395

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             D  A     +G + + G+    +F +E+D    +  FA   C+
Sbjct: 396 LPDFYA-----KGFS-VFGNLAQVDFQVEYDRKGKKVSFAPADCS 434


>gi|361067987|gb|AEW08305.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125859|gb|AFG43520.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125865|gb|AFG43523.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125875|gb|AFG43528.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
          Length = 134

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 15/143 (10%)

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPF---YKNPVGSSSAFGEFYYVGLRQI 325
           +FD+    S +VL      GD   P    L+YTPF   Y+ P   SS +G +YY+GLR +
Sbjct: 1   RFDEENQKSLMVL------GDKAFPNGIPLNYTPFLTNYRAP--PSSQYGVYYYIGLRAV 52

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            +G K +K+P   L   + GNGG I+DSG+TFT     +F+ +A  F  Q+  Y RA DV
Sbjct: 53  SIGGKRMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI-EYRRAVDV 111

Query: 386 EKKSGLRPCFDISGKKSVYLPEL 408
           E  +G+  C+++SG +++ LPE 
Sbjct: 112 EALTGMGLCYNVSGLENIVLPEF 134


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 77/264 (29%), Positives = 118/264 (44%), Gaps = 33/264 (12%)

Query: 234 CSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
           C   +  +  G+ G  R S S  +Q+GL+KFSYC+  +       SS ++L     S  S
Sbjct: 431 CRTRTHSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQD------SSGILLFGE--SSFS 482

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
               L YTP     V  S+    F    Y V L  I V +  +++P S   P   G G  
Sbjct: 483 WLKALKYTPL----VQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 538

Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKSVY-- 404
           +VDSG+ FTF+ GP++ A+  EF+RQ     +  +      +  +  C+ +   +     
Sbjct: 539 MVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPP 598

Query: 405 LPELILKFKGGAKMALPPENYFALV------GNEVLCLILFTDNAAGPALGRGPAIILGD 458
           LP + L F+ GA+M++  E     V       + V C         G       + I+G 
Sbjct: 599 LPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVE-----SYIIGH 652

Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
              QN ++EFDLA  R GFA+ +C
Sbjct: 653 HHQQNVWMEFDLAKSRVGFAEVRC 676



 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 35/78 (44%), Gaps = 13/78 (16%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           LS H     ++SL+ G+PPQ  T  + DTGS L W  C               P+    F
Sbjct: 367 LSFHHNVSLTVSLTVGSPPQTVT-MVLDTGSELSWLHCKK------------APNLHSVF 413

Query: 156 IPKRSSSSQLIGCQNPKC 173
            P RSSS   I C +P C
Sbjct: 414 DPLRSSSYSPIPCTSPTC 431


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 107/405 (26%), Positives = 160/405 (39%), Gaps = 87/405 (21%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP    T  +FDTGS   W  C     CV   +   +      F P RSS
Sbjct: 176 GNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVVVCYEQQEK----LFDPVRSS 227

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   + C  P CS +   N+     GCS  +         Y +QYG G ++ G    +TL
Sbjct: 228 TYANVSCAAPACSDL---NIH----GCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 273

Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
              S   V  F  GC   ++    + AG+ G GR   SLP Q   K    F++CL +R  
Sbjct: 274 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARST 333

Query: 274 DDA--------------PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
                             +++ ++ D GP                            FYY
Sbjct: 334 GTGYLDFGAGSPAAASARLTTPMLTDNGP---------------------------TFYY 366

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV--AKEFIRQMG 377
           +G+  I VG + + IP S          G IVDSG+  T +  P + ++  A        
Sbjct: 367 IGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAAR 421

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
            Y +A  V   S L  C+D +G   V +P + L F+GGA++ +             +CL 
Sbjct: 422 GYKKAPAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLA 478

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            F  N  G  +G     I+G+ QL+ F + +D+     GF    C
Sbjct: 479 -FAANEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|224090425|ref|XP_002308984.1| predicted protein [Populus trichocarpa]
 gi|222854960|gb|EEE92507.1| predicted protein [Populus trichocarpa]
          Length = 416

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 95/347 (27%), Positives = 151/347 (43%), Gaps = 55/347 (15%)

Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK--- 225
           +NP C+         R K C+   K C L+  +   + G   T+  L  + +   S    
Sbjct: 71  KNPSCNTAQCSLAVYRLKTCTVDKKFCVLSPDNTATRTG---TSDYLTQDVVSIQSTDGS 127

Query: 226 ------TVPNFLAGCS---ILSD--RQPAGIAGFGRSSESLPSQLGL-----KKFSYCLL 269
                 +VPNFL  C+   IL    +   G+AG GR+  SLPSQ        KKF+ CL 
Sbjct: 128 NPGRVVSVPNFLFSCAPTFILQGLAKGVKGMAGLGRTKISLPSQFSAAFSFPKKFAICLT 187

Query: 270 SRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYV 320
           S           ++   GP       D  +  L YTP   NPV ++S + E      Y++
Sbjct: 188 SSNAKGV-----VIFGDGPYVLLPHADDLSQSLIYTPLILNPVSTASGYFEGEPSTDYFI 242

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
           G++ I +    V +  S L    +G GG  + + + +T ME  ++ AV   F+R++    
Sbjct: 243 GVKSIKINENVVPLNASLLSINREGYGGTKISTVNAYTVMETTIYNAVTDSFVRELAK-- 300

Query: 381 RAADVEKKSGLRP---CFDISGKKSVYL------PELILKFKGGAKMALPPENYFALVGN 431
             A+V + + + P   CF+     S  +       +L+L+ K      +   N    V +
Sbjct: 301 --ANVPRVASVAPFGACFNSKNIGSTRVGPAVPQIDLVLQSK-NVYWRIFGANSMVQVKD 357

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           +VLCL  F D    P      +I++G  QL++  L+FDLA  R GF+
Sbjct: 358 DVLCL-GFVDGGVNPR----TSIVIGGHQLEDNLLQFDLAASRLGFS 399


>gi|383125861|gb|AFG43521.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
          Length = 134

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 15/143 (10%)

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPF---YKNPVGSSSAFGEFYYVGLRQI 325
           +FD+    S +VL      GD   P    L+YTPF   Y+ P   SS +G +YY+GLR +
Sbjct: 1   RFDEENQKSLMVL------GDKAFPNGIPLNYTPFLTNYRAP--PSSQYGVYYYIGLRAV 52

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            +G K +K+P   L   + GNGG I+DSG+TFT     +F+ +A  F  Q+  Y RA DV
Sbjct: 53  SIGGKRMKLPSKLLRFDAKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI-EYRRAVDV 111

Query: 386 EKKSGLRPCFDISGKKSVYLPEL 408
           E  +G+  C+++SG +++ LPE 
Sbjct: 112 EALTGMGLCYNVSGLENIVLPEF 134


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 165/391 (42%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP    T  +FDTGS   W  C     CV   +      R   F P RSS
Sbjct: 178 GNYVVTVGLGTPVSRYT-VVFDTGSDTTWVQCQP---CVVVCYEQ----REKLFDPARSS 229

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   + C  P CS +   N+     GCS  +         Y +QYG G ++ G    +TL
Sbjct: 230 TYANVSCAAPACSDL---NIH----GCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 275

Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
              S   V  F  GC   ++    + AG+ G GR   SLP Q   K    F++CL +R  
Sbjct: 276 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS- 334

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                +    LD G GS  + +  L+ TP   +   +      FYYVG+  I VG + + 
Sbjct: 335 -----TGTGYLDFGAGSLAAASARLT-TPMLTDNGPT------FYYVGMTGIRVGGQLLS 382

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV--AKEFIRQMGNYSRAADVEKKSGL 391
           IP S          G IVDSG+  T +    + ++  A         Y +A  V   S L
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAV---SLL 434

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D +G   V +P + L F+GGA++ +             +CL  F  N  G  +G  
Sbjct: 435 DTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLA-FAANEDGGDVG-- 491

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ QL+ F + +D+     GF    C
Sbjct: 492 ---IVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 133/473 (28%), Positives = 193/473 (40%), Gaps = 79/473 (16%)

Query: 27  SSAATVTVPLTPLSTKHYLHH------SDSDPLKILHSLASSSLSRARHLKTKTKPKTKD 80
           S++  +TVPL      H+ H       S+  P  +   L    L RA ++K K     K 
Sbjct: 56  STSGGITVPL------HHRHGPCSPVPSNKMPASLEERLQRDQL-RAAYIKRKFS-GAKG 107

Query: 81  SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
            ++  + + ++  T  +  S   Y I++  G+P    T    DTGS + W  C     C 
Sbjct: 108 GDVEQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQT-MSMDTGSDVSWVQCKP---CS 163

Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
            C+   VD      F P  SS+     C +  C  +   +   +  GCS  +  C     
Sbjct: 164 QCH-SEVDS----LFDPSASSTYSPFSCSSAACVQL---SQSQQGNGCS--SSQC----- 208

Query: 201 SYLLQYGLGF-TAGLLLSETLRFPSKTVPNFLAGCSI-----LSDRQPAGIAGFGRSSES 254
            Y++ Y  G  T G   S+TL   S  +  F  GCS       SD Q  G+ G G  ++S
Sbjct: 209 QYIVSYVDGSSTTGTYSSDTLTLGSNAIKGFQFGCSQSESGGFSD-QTDGLMGLGGDAQS 267

Query: 255 LPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL-SYTPFYKNPVGS 310
           L SQ      K FSYCL        P          PGS    T G  S + F K P+  
Sbjct: 268 LVSQTAGTFGKAFSYCL-------PPT---------PGSSGFLTLGAASRSGFVKTPMLR 311

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
           S+    +Y V L  I VG + + IP S    GS      ++DSG+  T +    + A++ 
Sbjct: 312 STQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGS------VMDSGTVITRLPPTAYSALSS 365

Query: 371 EFIRQMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
            F   M  Y  A    + SG L  CFD SG+ SV +P + L F GGA + L        +
Sbjct: 366 AFKAGMKKYPPA----QPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNLDFNGIMLEL 421

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            N  L    F  N+   +LG      +G+ Q + F + +D+     GF    C
Sbjct: 422 DNWCLA---FAANSDDSSLG-----FIGNVQQRTFEVLYDVGGGAVGFRAGAC 466


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 119/437 (27%), Positives = 171/437 (39%), Gaps = 82/437 (18%)

Query: 55  ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
           + H LA  + +RA  +    +  T+    G  +S  ++         G Y  S+  GTPP
Sbjct: 99  LAHRLARDA-ARAEAISVSARNVTR---AGGGFSAPVVSG--LAQGSGEYFASVGVGTPP 152

Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
             +   + DTGS +VW       +C  C        R+  F P+RS S   + C  P C 
Sbjct: 153 TPAL-LVLDTGSDVVWL------QCAPCRQCYAQSGRV--FDPRRSRSYAAVRCGAPPCR 203

Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP-SKTVPNFLA 232
            +           C  R  TC      Y + YG G  TAG L +ETL F     VP    
Sbjct: 204 GLDAGGGGG----CDRRRGTC-----LYQVAYGDGSVTAGDLATETLWFARGARVPRVAV 254

Query: 233 GCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDT 286
           GC   ++      AG+ G GR   SLP+Q      ++FSYC      D   +   +    
Sbjct: 255 GCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDLDHRTIIRTVHQHV 314

Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
           G                                 G R   VG + ++     L P S G 
Sbjct: 315 G---------------------------------GARVRGVGERSLR-----LDP-STGR 335

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
           GGVI+DSG++ T +  P++ AV + F    G    A      S    C+D+ G++ V +P
Sbjct: 336 GGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPG--GFSLFDTCYDLRGRRVVKVP 393

Query: 407 ELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
            + +   GGA++ALPPENY   V      CL L   +        G   I+G+ Q Q F 
Sbjct: 394 TVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTD--------GGVSIVGNIQQQGFR 445

Query: 466 LEFDLANDRFGFAKQKC 482
           + FD    R     + C
Sbjct: 446 VVFDGDRQRVALVPKSC 462


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 114/410 (27%), Positives = 171/410 (41%), Gaps = 75/410 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           G Y + L  GTP    +  I DT S LVW    PC S YR +D           P F P+
Sbjct: 86  GEYLVKLGIGTPQHYFSAAI-DTASDLVWLQCQPCVSCYRQLD-----------PIFNPR 133

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
            SSS  ++ C +  CS + G     RC      ++    AC       G   T G L  +
Sbjct: 134 LSSSYAVVPCSSDTCSQLDG----HRC------DEDDDQACRYNYKYSGNAVTNGTLAID 183

Query: 219 TLRFPSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCL---LSR 271
            L          + GCS  S      Q +G+ G  R   SL SQL +++F YCL   +SR
Sbjct: 184 KLAVGGNVFHAVVLGCSDSSVGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSR 243

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK- 330
                     LVL  G G+   +      T      + SS+ +  +YY+    + VG + 
Sbjct: 244 ------TPGKLVLGAGAGADAVRNVSDRVTV----TMSSSTRYPSYYYLNFDGLAVGDQT 293

Query: 331 --HVKIPYS-----------YLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
              ++ P S               GS  N  G+IVD  ST +F+E  L++ +A +   ++
Sbjct: 294 PGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEI 353

Query: 377 GNYSRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
               RA     + GL  CF +    G   VY+P + + F  G  + L  +  F L    +
Sbjct: 354 -RLPRATP-STRLGLDLCFILPEGVGIDRVYVPTVSMSFD-GRWLELERDRLF-LEDGRM 409

Query: 434 LCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +CL+          +GR   + ILG++Q QN ++ ++L   +  FAK  C
Sbjct: 410 MCLM----------IGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 163/376 (43%), Gaps = 54/376 (14%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGS L W  C     C +   P  DPS         SSS + + C +  C  +   
Sbjct: 151 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 202

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
              S    C   N      C  Y++ YG G +T G L SE++      + N + GC   +
Sbjct: 203 TGNS--GPCGGFNGVVKTTCE-YVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNN 259

Query: 239 DR---QPAGIAGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTGPGSG 291
                  +G+ G GRSS SL SQ  LK F    SYCL S + D A  + +   D    S 
Sbjct: 260 KGLFGGASGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGTLSFGNDF---SV 314

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
              +  + YTP  +NP         FY + L    +G   +K     L  G     G+++
Sbjct: 315 YKNSTSVFYTPLVQNP-----QLRSFYILNLTGASIGGVELKT----LSFGR----GILI 361

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DSG+  T +   +++AV  EF++Q   +  A      S L  CF+++  + + +P + + 
Sbjct: 362 DSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGY---SILDTCFNLTSYEDISIPTIKMI 418

Query: 412 FKGGAKMALPPENYFALVGNE--VLCLILFT---DNAAGPALGRGPAIILGDFQLQNFYL 466
           F+G A++ +     F  V  +  ++CL L +   +N  G         I+G++Q +N  +
Sbjct: 419 FEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQKNQRV 469

Query: 467 EFDLANDRFGFAKQKC 482
            +D   +R G A + C
Sbjct: 470 IYDTTQERLGIAGENC 485


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 157/390 (40%), Gaps = 64/390 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +S+  GTP +     +FDTGS L W       +C  CN  N      P F P +S++ 
Sbjct: 188 YIVSVGLGTP-RRDLLVVFDTGSDLSWV------QCKPCN--NCYKQHDPLFDPSQSTTY 238

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C   +C                  + TC      Y + YG +  T G L  +TL  
Sbjct: 239 SAVPCGAQECL----------------DSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTL 282

Query: 223 --PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
              S  +  F+ GC         +  G+ G GR   SL SQ   +    FSYCL S    
Sbjct: 283 GPSSDQLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRA 342

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
           +  +S             +  P   +T        + S    FYY+ L  I V  + V++
Sbjct: 343 EGYLSLGSA---------AAPPHAQFTAMV-----TRSDTPSFYYLDLVGIKVAGRTVRV 388

Query: 335 -PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
            P  +  PG+      ++DSG+  T +    + A+   F   M  Y RA  +   S L  
Sbjct: 389 APAVFKAPGT------VIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPAL---SILDT 439

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C+D +G+  V +P + L F GGA + L       +      CL  F  N    ++G    
Sbjct: 440 CYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYVANRSQACLA-FASNGDDTSVG---- 494

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            ILG+ Q + F + +DLAN + GF  + C+
Sbjct: 495 -ILGNMQQKTFAVVYDLANQKIGFGAKGCS 523


>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 435

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 107/398 (26%), Positives = 168/398 (42%), Gaps = 84/398 (21%)

Query: 121 IFDTGSSLVWFPCTSRY----------RCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           I D G   +W  C ++Y          R   C+  N D        PK        GC N
Sbjct: 63  IVDLGGQFLWVDCENKYISSTYRPARCRSAQCSLANSDGCGDCFSSPKP-------GCNN 115

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
             C             G +P N     A    L      +Q   GF  G  +  + RF  
Sbjct: 116 NTC-------------GVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVS-RFLF 161

Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK----FD 274
              P FL  G +       +G+AG GR+  +LPSQL       +KF+ CL S K    F 
Sbjct: 162 SCAPTFLLKGLAT----GASGMAGLGRTKIALPSQLASAFSFARKFAICLSSSKGVVLFG 217

Query: 275 DAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIV 327
           D P     N+V D+     DS    L+YTP   NPV ++SAF +      Y++G++ I +
Sbjct: 218 DGPYGFLPNVVFDS-----DS----LTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKI 268

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
             K V +  S L   ++G GG  + +   +T +E  +++AV   F++     S A ++++
Sbjct: 269 DEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKA----SAARNIKR 324

Query: 388 KSGLRP---CF-DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
              + P   C+ +++G +   +V   EL L+        +   N    + +EVLCL    
Sbjct: 325 VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQ-NENVVWRIFGANSMVSINDEVLCLGFVN 383

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
                       +I++G +QL+N  L+FDLA  + GF+
Sbjct: 384 GGK-----NTRTSIVIGGYQLENNLLQFDLAASKLGFS 416


>gi|383125857|gb|AFG43519.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125863|gb|AFG43522.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125867|gb|AFG43524.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125869|gb|AFG43525.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125871|gb|AFG43526.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125873|gb|AFG43527.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
 gi|383125877|gb|AFG43529.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
          Length = 134

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 15/143 (10%)

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPF---YKNPVGSSSAFGEFYYVGLRQI 325
           +FD+    S +VL      GD   P    L+YTPF   Y+ P   SS +G +YY+GLR +
Sbjct: 1   RFDEENQKSLMVL------GDKAFPTGIPLNYTPFLTNYRAP--PSSQYGVYYYIGLRAV 52

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            +G K +K+P   L   + GNGG I+DSG+TFT     +F+ +A  F  Q+  Y RA DV
Sbjct: 53  SIGGKRMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI-EYRRAVDV 111

Query: 386 EKKSGLRPCFDISGKKSVYLPEL 408
           E  +G+  C+++SG +++ LPE 
Sbjct: 112 EALTGMGLCYNVSGLENIVLPEF 134


>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
          Length = 193

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 65/184 (35%), Positives = 87/184 (47%), Gaps = 20/184 (10%)

Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFM 360
           TP   NP+  S     FYY+ L  I VG   + I  S      DG+GGVI+DSG+T T++
Sbjct: 25  TPLITNPLQPS-----FYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGTTITYI 79

Query: 361 EGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMA 419
           E   F+++ KEF  Q        D    +GL  CF + SGK  V +P+L+  FKGG  + 
Sbjct: 80  EENAFDSLKKEFTSQT---KLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGG-DLE 135

Query: 420 LPPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           LP ENY  A     V CL +   N            I G+ Q QN  +  DL  +   F 
Sbjct: 136 LPGENYMIADSSLGVACLAMGASNGMS---------IFGNIQQQNILVNHDLQKETITFI 186

Query: 479 KQKC 482
             +C
Sbjct: 187 PTQC 190


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 156/379 (41%), Gaps = 51/379 (13%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC--SWIFG 178
           I DTGS L W  C     C  C        R P F P  S+S   + C    C  S    
Sbjct: 180 IVDTGSDLTWVQCKP---CSVCY-----AQRDPLFDPSGSASYAAVPCNASACEASLKAA 231

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
             V   C                Y L YG G F+ G+L ++T+     +V  F+ GC  L
Sbjct: 232 TGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-L 290

Query: 238 SDRQ----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
           S+R      AG+ G GR+  SL SQ   +    FSYCL +    DA  S +L  DT   S
Sbjct: 291 SNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTS--S 348

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
             + TP +SYT    +P     A   FY++ +    VG   V                V+
Sbjct: 349 YRNATP-VSYTRMIADP-----AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVL 395

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
           +DSG+  T +   ++ AV  EF RQ G   R       S L  C++++G   V +P L L
Sbjct: 396 LDSGTVITRLAPSVYRAVRAEFARQFGA-ERYPAAPPFSLLDACYNLTGHDEVKVPLLTL 454

Query: 411 KFKGGAKMALPPEN--YFALVGNEVLCLIL----FTDNAAGPALGRGPAIILGDFQLQNF 464
           + +GGA M +      + A      +CL +    F D             I+G++Q +N 
Sbjct: 455 RLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP----------IIGNYQQKNK 504

Query: 465 YLEFDLANDRFGFAKQKCA 483
            + +D    R GFA + C+
Sbjct: 505 RVVYDTVGSRLGFADEDCS 523


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 105/406 (25%), Positives = 167/406 (41%), Gaps = 69/406 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     + DTGS ++W  C +  +C   +   +D   +  + PK SS
Sbjct: 84  GLYYTEIKLGTPPKHYYVQV-DTGSDILWVNCITCEQCPHKSGLGLD---LTLYDPKASS 139

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +  ++ C    C+  FG  +  +C    P        C  Y + YG G  T G  +++ L
Sbjct: 140 TGSMVMCDQAFCAATFGGKLP-KCGANVP--------C-EYSVTYGDGSSTIGSFVTDAL 189

Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
           +F       +T P   + + GC          S++   GI GFG ++ S+ SQL      
Sbjct: 190 QFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKV 249

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
            K F++CL + K              G G    GD   P +  TP   +           
Sbjct: 250 KKIFAHCLDTIK--------------GGGIFSIGDVVQPKVKTTPLVADK--------PH 287

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y V L+ I VG   +++P     PG     G I+DSG+T T++     E V KE +  + 
Sbjct: 288 YNVNLKTIDVGGTTLQLPAHIFEPGE--KKGTIIDSGTTLTYLP----ELVFKEVMLAVF 341

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
           N  +        G   CF   G      P +   F+    + + P  YF   GN+V C +
Sbjct: 342 NKHQDITFHDVQGFL-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFANGNDVYC-V 399

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            F + A+    G+   +++GD  L N  + +DL N   G+    C+
Sbjct: 400 GFQNGASQSKDGK-DIVLMGDLVLSNKLVIYDLENRVIGWTDYNCS 444


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 156/379 (41%), Gaps = 51/379 (13%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC--SWIFG 178
           I DTGS L W  C     C  C        R P F P  S+S   + C    C  S    
Sbjct: 179 IVDTGSDLTWVQCKP---CSVCY-----AQRDPLFDPSGSASYAAVPCNASACEASLKAA 230

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
             V   C                Y L YG G F+ G+L ++T+     +V  F+ GC  L
Sbjct: 231 TGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-L 289

Query: 238 SDRQ----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
           S+R      AG+ G GR+  SL SQ   +    FSYCL +    DA  S +L  DT   S
Sbjct: 290 SNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTS--S 347

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
             + TP +SYT    +P     A   FY++ +    VG   V                V+
Sbjct: 348 YRNATP-VSYTRMIADP-----AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVL 394

Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
           +DSG+  T +   ++ AV  EF RQ G   R       S L  C++++G   V +P L L
Sbjct: 395 LDSGTVITRLAPSVYRAVRAEFARQFGA-ERYPAAPPFSLLDACYNLTGHDEVKVPLLTL 453

Query: 411 KFKGGAKMALPPEN--YFALVGNEVLCLIL----FTDNAAGPALGRGPAIILGDFQLQNF 464
           + +GGA M +      + A      +CL +    F D             I+G++Q +N 
Sbjct: 454 RLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP----------IIGNYQQKNK 503

Query: 465 YLEFDLANDRFGFAKQKCA 483
            + +D    R GFA + C+
Sbjct: 504 RVVYDTVGSRLGFADEDCS 522


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 99/389 (25%), Positives = 149/389 (38%), Gaps = 55/389 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR-YRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y ++L+ GTPPQ  +  I D G  LVW  C     RC   + P  D +    F P+    
Sbjct: 51  YVVNLTIGTPPQPVSAII-DIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPE---- 105

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGFTAGLLLSETLR 221
                           P   + C+    R+          Y      G T G + ++ + 
Sbjct: 106 ----------------PCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVA 149

Query: 222 FPSKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
             +        GC++ S+       +G  G GR++ SL +Q+    FSYCL      D  
Sbjct: 150 IGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAP---PDTG 206

Query: 278 VSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            SS L L       G+G     G   TPF K      S     Y + L  I  G+  + +
Sbjct: 207 KSSALFLGASAKLAGAGK----GAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAM 262

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P S       GN  ++V + +  T +   ++  + K     +G       V+      P 
Sbjct: 263 PQS-------GNT-IMVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPK 314

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
              SG      P+L+L F+GGA+M +P  +Y    GN+  C+ +       PALG     
Sbjct: 315 ASASGGA----PDLVLAFQGGAEMTVPVSSYLFDAGNDTACVAIL----GSPALGG--VS 364

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           ILG  Q  N +L FDL  +   F    C+
Sbjct: 365 ILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 127/503 (25%), Positives = 213/503 (42%), Gaps = 87/503 (17%)

Query: 1   MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKIL---H 57
           MA+    LI + SLL+ L      +G S+  + +P +P       HH  S P  IL   H
Sbjct: 1   MASLWTQLISMASLLLSLARWVPVSGDSSNVLLLP-SP-------HHEGSRPAMILPLHH 52

Query: 58  SLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAS 117
           S+  SS S   H   + + K  DS    ++ N+ ++    +   G Y+  L  GTPPQ  
Sbjct: 53  SVPDSSFS---HFNPRRQLKESDSE---HHPNARMRLYDDLLRNGYYTARLWIGTPPQRF 106

Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
              I DTGS++ + PC++   C  C        + P F P+ S + Q +     KC+W  
Sbjct: 107 A-LIVDTGSTVTYVPCST---CRHCG-----SHQDPKFRPEDSETYQPV-----KCTW-- 150

Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV---PNFLAG 233
                 +C  C    K C     +Y  +Y  +  ++G L  + + F ++T       + G
Sbjct: 151 ------QC-NCDNDRKQC-----TYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFG 198

Query: 234 CS-----ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP 288
           C       + +++  GI G GR   S+  QL  KK    ++S  F        +      
Sbjct: 199 CENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKK----VISDSFSLCYGGMGVGGGAMV 254

Query: 289 GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS-DGNG 347
             G S    + +T    +PV S      +Y + L++I V  K +     +L P   DG  
Sbjct: 255 LGGISPPADMVFT--RSDPVRSP-----YYNIDLKEIHVAGKRL-----HLNPKVFDGKH 302

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-----DISGKKS 402
           G ++DSG+T+ ++    F A     +++  +  R +  + +     CF     D+S + S
Sbjct: 303 GTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYN-DICFSGAEIDVS-QIS 360

Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIILGDFQ 460
              P + + F  G K++L PENY           CL +F++       G  P  +LG   
Sbjct: 361 KSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSN-------GNDPTTLLGGIV 413

Query: 461 LQNFYLEFDLANDRFGFAKQKCA 483
           ++N  + +D  + + GF K  C+
Sbjct: 414 VRNTLVMYDREHTKIGFWKTNCS 436


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 113/393 (28%), Positives = 164/393 (41%), Gaps = 64/393 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRS 160
           G Y   +  GTP + S   + DTGSSL W  C+    C V C+  +      P F P+ S
Sbjct: 119 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSP---CLVSCHRQSG-----PVFNPRSS 169

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET 219
           SS   + C  P+C  +    +      CS  N         Y   YG   F+ G L  +T
Sbjct: 170 SSYASVSCSAPQCDALTTATLNPST--CSTSNVCI------YQASYGDSSFSVGYLSKDT 221

Query: 220 LRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
           + F S +VPNF  GC   ++    Q AG+ G  R+  SL  QL       FSYCL +   
Sbjct: 222 VSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSS 281

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
               +S        PG         SYTP  K+ +  S      Y++ +  I V  K + 
Sbjct: 282 SSGYLSIGSY---NPGQ-------YSYTPMAKSSLDDS-----LYFIKMTGITVAGKPLS 326

Query: 334 I---PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
           +    YS L          I+DSG+  T +   ++ A++K     M    RA+     S 
Sbjct: 327 VSASAYSSL--------PTIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAF---SI 375

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
           L  CF     + + +P++ + F GGA + L   N    V +   CL      A  PA   
Sbjct: 376 LDTCFQGQASR-LRVPQVSMAFAGGAALKLKATNLLVDVDSATTCL------AFAPARS- 427

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             A I+G+ Q Q F + +D+ N + GFA   C+
Sbjct: 428 --AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 119/476 (25%), Positives = 190/476 (39%), Gaps = 87/476 (18%)

Query: 48  SDSDPLKIL-HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YS 105
           SD++ L +  H L   ++ R+R       P+     + ++  N ++     V S GG Y 
Sbjct: 34  SDTESLNLTDHELLRRAIQRSRDRLASIAPRL----LPTSSRNKVVVAEAPVLSAGGEYL 89

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + L  GTP    T  I DT S L+W  C     CV C +  +DP     F P  S+S  +
Sbjct: 90  VKLGLGTPQHCFTAAI-DTASDLIWTQCQP---CVKC-YKQLDP----VFNPVASTSYAV 140

Query: 166 IGCQNPKCSWIFGPNVESRC--KGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
           + C +  C  +       RC   G S     C      Y   YG    T G+L  + L  
Sbjct: 141 VPCNSDTCDELD----THRCARDGDSDDEDAC-----QYTYSYGGNATTRGILAVDRLAI 191

Query: 223 PSKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCL---LSRKFDD 275
                   + GCS  S   P    +G+ G GR + SL SQL +++F YCL   +SR    
Sbjct: 192 GDDVFRGVVFGCSSSSVGGPPPQVSGVVGLGRGALSLVSQLSVRRFMYCLPPPVSRS--- 248

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV--- 332
              +  LVL       D+     + +     P+ + S +  +YY+ L  I +G + +   
Sbjct: 249 ---AGRLVL-----GADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFR 300

Query: 333 -KIPYSYLVPGSDGNG---------------------GVIVDSGSTFTFMEGPLFEAVAK 370
            +   +   PG+                         G+I+D  ST TF+E  L+E +  
Sbjct: 301 SRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVD 360

Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISG---KKSVYLPELILKFKGGAKMALPPENYFA 427
           +   ++    R +  +   GL  CF +        VY P + L F+ G  + L  E  F 
Sbjct: 361 DLEEEI-RLPRGSGSDL--GLDLCFILPEGVPMSRVYAPPVSLAFE-GVWLRLDKEQMFV 416

Query: 428 L-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               + ++CL++   +            ILG++Q QN  + ++L   R  F K  C
Sbjct: 417 EDRASGMMCLMVGKTDGVS---------ILGNYQQQNMQVMYNLRRGRITFIKTAC 463


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 161/400 (40%), Gaps = 66/400 (16%)

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
           L++TP        Y +  S GTPPQ       DT +   W PC     C  C   +  P 
Sbjct: 106 LLQTPT-------YVVRASLGTPPQQLL-LAVDTSNDASWIPCAG---CAGCPTSSAAP- 153

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
               F P  S+S + + C +P C+    PN       C P  K C      + L Y    
Sbjct: 154 ----FDPASSASYRTVPCGSPLCAQ--APNA-----ACPPGGKAC-----GFSLTYADSS 197

Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
               L  ++L      V  +  GC   +  +   P G+ G GR   S  SQ   +    F
Sbjct: 198 LQAALSQDSLAVAGNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATF 257

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLR 323
           SYCL S  F     S  L L      G +  P  +  TP   NP  SS      YYV + 
Sbjct: 258 SYCLPS--FKSLNFSGTLRL------GRNGQPQRIKTTPLLANPHRSS-----LYYVNMT 304

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I VG K V IP      G+    G ++DSG+ FT +  P + AV  E  R++G     A
Sbjct: 305 GIRVGRKVVPIPAFDPATGA----GTVLDSGTMFTRLVAPAYVAVRDEVRRRVG-----A 355

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDN 442
            V    G   CF+ +   +V  P + L F G  ++ LP EN         + CL +    
Sbjct: 356 PVSSLGGFDTCFNTT---AVAWPPVTLLFDG-MQVTLPEENVVIHSTYGTISCLAM---- 407

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           AA P        ++   Q QN  + FD+ N R GFA+++C
Sbjct: 408 AAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 163/380 (42%), Gaps = 62/380 (16%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGS L W  C     C +   P  DPS         SSS + + C +  C  +   
Sbjct: 148 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 199

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
              S    C   N      C  Y++ YG G +T G L SE++      + NF+ GC    
Sbjct: 200 T--SNSGPCGGNNGVVKTPCE-YVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG--- 253

Query: 239 DRQPAGI-------AGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTG 287
            R   G+        G GRSS SL SQ  LK F    SYCL S + D A  S +   D+ 
Sbjct: 254 -RNNKGLFGGSSGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGSLSFGNDS- 309

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
             S  + +  +SYTP  +NP         FY + L    +G   +K         S    
Sbjct: 310 --SVYTNSTSVSYTPLVQNP-----QLRSFYILNLTGASIGGVELK--------SSSFGR 354

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G+++DSG+  T +   +++AV  EF++Q   +  A      S L  CF+++  + + +P 
Sbjct: 355 GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNLTSYEDISIPI 411

Query: 408 LILKFKGGAKMALPPENYFALV--GNEVLCLILFT---DNAAGPALGRGPAIILGDFQLQ 462
           + + F+G A++ +     F  V     ++CL L +   +N  G         I+G++Q +
Sbjct: 412 IKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQK 462

Query: 463 NFYLEFDLANDRFGFAKQKC 482
           N  + +D   +R G   + C
Sbjct: 463 NQRVIYDTTQERLGIVGENC 482


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 119/407 (29%), Positives = 180/407 (44%), Gaps = 68/407 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     I DTGS ++W  C S   C  C   +    ++  F P  SS
Sbjct: 75  GLYYTKVKLGTPPRELYVQI-DTGSDVLWVSCGS---CNGCPQTSGLQIQLNYFDPGSSS 130

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +S LI C + +C       V++    CS RN  C     +Y  QYG G  T+G  +S+ +
Sbjct: 131 TSSLISCLDRRCR----SGVQTSDASCSGRNNQC-----TYTFQYGDGSGTSGYYVSDLM 181

Query: 221 RFPS--------KTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQL---GL- 261
            F S         +  + + GCSIL       S+R   GI GFG+   S+ SQL   G+ 
Sbjct: 182 HFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIA 241

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL   K D++     LVL      G+   P + Y+P   +           Y +
Sbjct: 242 PRVFSHCL---KGDNSG-GGVLVL------GEIVEPNIVYSPLVPSQ--------PHYNL 283

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I V  + V+I  S  V  +  N G IVDSG+T  +    L E     F+  +    
Sbjct: 284 NLQSISVNGQIVRIAPS--VFATSNNRGTIVDSGTTLAY----LAEEAYNPFVIAIAAVI 337

Query: 381 RAADVEKKSGLRPCFDISGKKSVYL-PELILKFKGGAKMALPPENYFA---LVGNEVLCL 436
             +     S    C+ I+   +V + P++ L F GGA + L P++Y      +G   +  
Sbjct: 338 PQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWC 397

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I F    +G ++      ILGD  L++    +DLA  R G+A   C+
Sbjct: 398 IGF-QKISGQSI-----TILGDLVLKDKIFVYDLAGQRIGWANYDCS 438


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 155/385 (40%), Gaps = 55/385 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +  S GTPPQ       DT +   W PC     C  C   +  P     F P  S+S 
Sbjct: 112 YVVRASLGTPPQ-QLLLAVDTSNDASWIPCAG---CAGCPTSSAAP-----FDPAASASY 162

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + + C +P C+    PN       C P  K C      + L Y        L  ++L   
Sbjct: 163 RTVPCGSPLCAQ--APNA-----ACPPGGKAC-----GFSLTYADSSLQAALSQDSLAVA 210

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
              V  +  GC   +  +   P G+ G GR   S  SQ   +    FSYCL S  F    
Sbjct: 211 GNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPS--FKSLN 268

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S  L L       + +   +  TP   NP  SS      YYV +  + VG K V IP  
Sbjct: 269 FSGTLRLGR-----NGQPQRIKTTPLLANPHRSS-----LYYVNMTGVRVGRKVVPIPAF 318

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
               G+    G ++DSG+ FT +  P + AV  E  R++G     A V    G   CF+ 
Sbjct: 319 DPATGA----GTVLDSGTMFTRLVAPAYVAVRDEVRRRVG-----APVSSLGGFDTCFNT 369

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
           +   +V  P + L F G  ++ LP EN   +V +     I     AA P        ++ 
Sbjct: 370 T---AVAWPPMTLLFDG-MQVTLPEEN---VVIHSTYGTISCLAMAAAPDGVNTVLNVIA 422

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
             Q QN  + FD+ N R GFA+++C
Sbjct: 423 SMQQQNHRVLFDVPNGRVGFARERC 447


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 177/409 (43%), Gaps = 69/409 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR-----------CVDCNFPNVDPSRI 152
           Y  +++ GTPP      + DTGS LVW  C +                + + P   P  +
Sbjct: 82  YLAAVNVGTPPVRFLA-VADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAV 140

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
             F P  SSS   +GC  P C         + C G S        AC  +   Y  G +A
Sbjct: 141 VYFNPFDSSSYSRVGCDGPSC---LALATNASCNGDSH-------AC-DFRYSYRDGASA 189

Query: 213 -GLLLSETLRF------PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLK 262
            GLL ++T  F       + +  +   GC+  +   + Q  G+ G G    SL SQLG +
Sbjct: 190 TGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQLG-R 248

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           KFS+CL +   DDA    + +L+ G  +  S  PG + TP     + SSS    +Y + +
Sbjct: 249 KFSFCLTAYDIDDA----SSILNFGARAVVSD-PGAATTPL----IASSSNAAAYYAISI 299

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME-----GPLFEAVAKEFIRQMG 377
             + V  + V        PG+     VIVD+G+  TF++      PL E++A+  +    
Sbjct: 300 DSLKVAGQPV--------PGTTSVSKVIVDTGTVLTFLDRAALLAPLTESLAR--VMDGA 349

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSV--YLPE--LILKFKGGAKMALPPENYFALVGNEV 433
              RA   ++   L  C+D+S  K V   +P+  L+L   GG ++ L  E  F LV   V
Sbjct: 350 GLPRAPPPDET--LELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGV 407

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LCL + T     P L   P  +LG+  LQ+ ++  DL      FA   C
Sbjct: 408 LCLAVVT---TSPEL--QPLSVLGNVALQDLHVGIDLDARTATFATANC 451


>gi|62362434|gb|AAX81588.1| nectarin IV [Nicotiana langsdorffii x Nicotiana sanderae]
          Length = 437

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 107/390 (27%), Positives = 166/390 (42%), Gaps = 67/390 (17%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            D G   +W         VDC+   V  S  PA    RS+   L G     C   F P  
Sbjct: 63  LDLGGQFLW---------VDCDQGYVSSSYKPARC--RSAQCSLAGAGG--CGQCFSPPK 109

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN-----------F 230
                GC+  N TC L   + + +     T+G L S+ ++  S    N           F
Sbjct: 110 ----PGCN--NNTCSLLPDNTITRTA---TSGELASDIVQVQSSNGKNPGRNVTDKDFLF 160

Query: 231 LAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSSNL 282
           + G + L +   +G+   AG GR+  SLPSQ        +KF+ CL S       V    
Sbjct: 161 VCGSTFLLEGLASGVKGMAGLGRTRISLPSQFSAEFSFPRKFAVCLSSSTNSKGVV---- 216

Query: 283 VLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQIIVGSKHVK 333
           +   GP S     +      SYTP + NPV ++SAF  GE    Y++G++ I +  K V 
Sbjct: 217 LFGDGPYSFLPNREFSNNDFSYTPLFINPVSTASAFSSGEPSSEYFIGVKSIKINQKVVP 276

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           I  + L   + G GG  + + + +T +E  ++ AV   F++++ N +R A V        
Sbjct: 277 INTTLLSIDNQGVGGTKISTVNPYTILETSMYNAVTNFFVKELVNITRVASVAP---FGA 333

Query: 394 CFD----ISGKKSVYLPELILKFKG-GAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           CFD    +S +    +P++ L  +       +   N    V   VLCL  F D    P  
Sbjct: 334 CFDSRTIVSTRVGPAVPQIDLVLQNENVFWTIFGANSMVQVSENVLCL-GFVDGGINPR- 391

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
               +I++G + +++  L+FDLA+ R GF 
Sbjct: 392 ---TSIVIGGYTIEDNLLQFDLASSRLGFT 418


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 164/380 (43%), Gaps = 62/380 (16%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGS L W  C     C +   P  DPS         SSS + + C +  C  +   
Sbjct: 100 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 151

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
              S    C   N      C  Y++ YG G +T G L SE++      + NF+ GC    
Sbjct: 152 T--SNSGPCGGNNGVVKTPCE-YVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG--- 205

Query: 239 DRQPAGI-------AGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTG 287
            R   G+        G GRSS SL SQ  LK F    SYCL S + D A  S +   D+ 
Sbjct: 206 -RNNKGLFGGSSGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGSLSFGNDS- 261

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
             S  + +  +SYTP  +NP         FY + L    +G   +K         S    
Sbjct: 262 --SVYTNSTSVSYTPLVQNP-----QLRSFYILNLTGASIGGVELK--------SSSFGR 306

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G+++DSG+  T +   +++AV  EF++Q   +  A      S L  CF+++  + + +P 
Sbjct: 307 GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNLTSYEDISIPI 363

Query: 408 LILKFKGGAKMALPPENYFALVGNE--VLCLILFT---DNAAGPALGRGPAIILGDFQLQ 462
           + + F+G A++ +     F  V  +  ++CL L +   +N  G         I+G++Q +
Sbjct: 364 IKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQK 414

Query: 463 NFYLEFDLANDRFGFAKQKC 482
           N  + +D   +R G   + C
Sbjct: 415 NQRVIYDTTQERLGIVGENC 434


>gi|356518052|ref|XP_003527698.1| PREDICTED: basic 7S globulin 2-like [Glycine max]
          Length = 447

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 99/399 (24%), Positives = 167/399 (41%), Gaps = 64/399 (16%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           ++  GTP Q ST  + D G   +W  C++R                       SSS + I
Sbjct: 59  TIGIGTP-QHSTNLVIDLGGENLWHDCSNRRY--------------------NSSSKRKI 97

Query: 167 GCQNPKCSWIFGPNVESRC-----KGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
            C++ KC       V + C      GC+  + T  ++ P  L Q+   +T   ++ +T+ 
Sbjct: 98  VCKSKKCPE-GAACVSTGCIGPYKPGCAISDCTITVSNP--LAQFSSSYT---MVEDTIF 151

Query: 222 FPSKTVPNFLAGCSILSD-----------RQPAGIAGFGRSSESLPSQLGLK-----KFS 265
                +P FLAGC  L D           R   GI GF  S  +LPSQL L      KFS
Sbjct: 152 LSHTYIPGFLAGCVDLDDGLSGNALQGLPRTSKGIIGFSHSELALPSQLVLSNKLIPKFS 211

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPV--GSSSAFGE---FYYV 320
            C  S   ++     N+ +  G G    ++  L  TP   NPV  G+ S +G     Y++
Sbjct: 212 LCFPSS--NNLKGFGNIFIGAGGGHPQVESKFLQTTPLVVNPVATGAVSIYGAPSIEYFI 269

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            ++ I +    + +  S L     GNGG  + + + +T +   L++   +EFI +     
Sbjct: 270 DVKAIKIDGHVLNLNSSLLSIDKKGNGGTKISTMTPWTELHSSLYKPFVQEFINK-AEGR 328

Query: 381 RAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
           R   V        CFD S  ++    + +P + L   GGA+  +   N   ++ ++ +  
Sbjct: 329 RMKRVAPVPPFDACFDTSTIRNSITGLAVPSIDLVLPGGAQWTIYGANSMTVMTSKNVAC 388

Query: 437 ILFTDNAAGP----ALGRGPAIILGDFQLQNFYLEFDLA 471
           + F D    P    ++    ++++G  QL++  L  D+A
Sbjct: 389 LAFVDGGMKPKEMHSIQLEASVVIGGHQLEDNLLVIDMA 427


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 99/389 (25%), Positives = 149/389 (38%), Gaps = 55/389 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR-YRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y ++L+ GTPPQ  +  I D G  LVW  C     RC   + P  D +    F P+    
Sbjct: 51  YVVNLTIGTPPQPVSAII-DIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPE---- 105

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGFTAGLLLSETLR 221
                           P   + C+    R+          Y      G T G + ++ + 
Sbjct: 106 ----------------PCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVA 149

Query: 222 FPSKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
             +        GC++ S+       +G  G GR++ SL +Q+    FSYCL      D  
Sbjct: 150 IGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAP---PDTG 206

Query: 278 VSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            SS L L       G+G     G   TPF K     +S     Y + L  I  G+  + +
Sbjct: 207 KSSALFLGASAKLAGAGK----GAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAM 262

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P S       GN  + V + +  T +   ++  + K     +G       V+      P 
Sbjct: 263 PQS-------GNT-ITVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPK 314

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
              SG      P+L+L F+GGA+M +P  +Y    GN+  C+ +       PALG     
Sbjct: 315 ASASGGA----PDLVLAFQGGAEMTVPVSSYLFDAGNDTACVAIL----GSPALGG--VS 364

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           ILG  Q  N +L FDL  +   F    C+
Sbjct: 365 ILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 150/360 (41%), Gaps = 62/360 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  GTPP+     + DTGS ++W  C S   C  C   +    ++  F P  S 
Sbjct: 79  GLYYTKLRLGTPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  I C + +CSW     ++S   GCS +N  C     +Y  QYG G  T+G  +S+ L
Sbjct: 135 TASPISCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185

Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
           +F     S  VPN  A    GCS       + SDR   GI GFG+   S+ SQL      
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL             LVL      G+   P + +TP   +           Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L  I V  + + I  S     S  NG G I+D+G+T  ++     EA    F+  + N 
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLI 437
              +     S    C+ I+       P + L F GGA M L P++Y     N    LC +
Sbjct: 341 VSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVASALCFL 400


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 109/392 (27%), Positives = 166/392 (42%), Gaps = 46/392 (11%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           +SL  GTPPQ  T  + DTGS L W  C  +   +    P +   +  +F P  SSS  L
Sbjct: 68  VSLPIGTPPQP-TDLVLDTGSQLSWIQCHDKK--IKKRLPPLPKPKTTSFDPSLSSSFSL 124

Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-P 223
           + C +P C   I    + + C     +N+ C     SY    G     G L+ E   F  
Sbjct: 125 LPCNHPICKPRIPDFTLPTSCD----QNRLCHY---SYFYADGT-LAEGNLVREKFTFSK 176

Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
           S + P  + GC+  S     GI G  R   S  SQ  + KFSYC+ SR   +   +    
Sbjct: 177 SLSTPPVILGCAQASTEN-RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNP--TGLFY 233

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKHVKIPYSYLVP 341
           L   P S   K     Y      P   SS   +   Y + ++ I +  K + +P +   P
Sbjct: 234 LGDNPNSSKFK-----YVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKP 288

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKKSGLRPC 394
            + G+G  ++DSGS  T++    +E V +E +R +G        Y+  AD+        C
Sbjct: 289 DAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADM--------C 340

Query: 395 FD--ISGKKSVYLPELILKFKGGAKMAL-PPENYFALVGNEVLCLILFTDNAAGPALGRG 451
           FD  ++ +    +  +  +F  G ++ +   E     V   V C+ +         LG G
Sbjct: 341 FDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGI----GRSERLGIG 396

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             II G    QN ++E+DLAN R GF   +C+
Sbjct: 397 SNII-GTVHQQNMWVEYDLANKRVGFGGAECS 427


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 116/414 (28%), Positives = 169/414 (40%), Gaps = 75/414 (18%)

Query: 100 SYGGYSISLSF-----GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
           +Y  Y + L F     G+PP+     I DTGS ++W  C S   C  C  P      IP 
Sbjct: 74  TYDPYRVGLYFTRVLLGSPPKEFYVQI-DTGSDVLWVSCGS---CNGC--PQSSGLHIPL 127

Query: 155 --FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-T 211
             F P  SS++ LI C + +CS      V+S   GCS +   C      Y  QYG G  T
Sbjct: 128 NFFDPGSSSTASLISCSDQRCSL----GVQSSDAGCSSQGNQCI-----YTFQYGDGSGT 178

Query: 212 AGLLLSETLRFPS---KTVPN----FLAGCSI-------LSDRQPAGIAGFGRSSESLPS 257
           +G  +S+ L F +    +V N     + GCSI        SDR   GI GFG+   S+ S
Sbjct: 179 SGYYVSDLLNFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVIS 238

Query: 258 QL---GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
           Q+   G+  K FS+CL         +    +++            + Y+P   +      
Sbjct: 239 QMSSQGITPKVFSHCLKGDGGGGGILVLGEIVE----------EDIVYSPLVPSQ----- 283

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
                Y + L+ I V  K + I     V  +  N G IVDSG+T  ++    ++      
Sbjct: 284 ---PHYNLNLQSISVNGKSLAIDPE--VFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAI 338

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL---V 429
              +    R       S    C+ I+       P + L F GG  M L PE+Y      +
Sbjct: 339 TEAVSQSVRPL----LSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSI 394

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G+  +  I F         G+G   ILGD  L++    +DLA  R G+A   C+
Sbjct: 395 GDAAVWCIGFQK-----IQGQG-ITILGDLVLKDKIFVYDLAGQRIGWANYDCS 442


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 163/380 (42%), Gaps = 62/380 (16%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGS L W  C     C +   P  DPS         SSS + + C +  C  +   
Sbjct: 148 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 199

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
              S    C   N      C  Y++ YG G +T G L SE++      + NF+ GC    
Sbjct: 200 T--SNSGPCGGNNGVVKTPCE-YVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG--- 253

Query: 239 DRQPAGI-------AGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTG 287
            R   G+        G GRSS SL SQ  LK F    SYCL S + D A  S +   D+ 
Sbjct: 254 -RNNKGLFGGSSGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGSLSFGNDS- 309

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
             S  + +  +SYTP  +NP         FY + L    +G   +K         S    
Sbjct: 310 --SVYTNSTSVSYTPLVQNP-----QLRSFYILNLTGASIGGVELK--------SSSFGR 354

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
           G+++DSG+  T +   +++AV  EF++Q   +  A      S L  CF+++  + + +P 
Sbjct: 355 GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNLTSYEDISIPI 411

Query: 408 LILKFKGGAKMALPPENYFALV--GNEVLCLILFT---DNAAGPALGRGPAIILGDFQLQ 462
           + + F+G A++ +     F  V     ++CL L +   +N  G         I+G++Q +
Sbjct: 412 IKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQK 462

Query: 463 NFYLEFDLANDRFGFAKQKC 482
           N  + +D   +R G   + C
Sbjct: 463 NQRVIYDSTQERLGIVGENC 482


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 157/385 (40%), Gaps = 61/385 (15%)

Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
           +  GTP       + DTGSSL W  C+     V C+  +      P F PK SS+   +G
Sbjct: 1   MGLGTPATQYV-MVVDTGSSLTWLQCSPCL--VSCHRQSG-----PVFNPKSSSTYASVG 52

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT 226
           C   +CS +  P+       CS  N         Y   YG   F+ G L  +T+ F S +
Sbjct: 53  CSAQQCSDL--PSATLNPSACSSSNVCI------YQASYGDSSFSVGYLSKDTVSFGSTS 104

Query: 227 VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSS 280
           +PNF  GC   ++    + AG+ G  R+  SL  QL       F+YCL S          
Sbjct: 105 LPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSS----SGY 160

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK--HVKIPYSY 338
             +    PG         SYTP       SSS     Y++ L  + V      V      
Sbjct: 161 LSLGSYNPGQ-------YSYTPMV-----SSSLDDSLYFIKLSGMTVAGNPLSVSSSAYS 208

Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
            +P        I+DSG+  T +   ++ A++K     M   SRA+     S L  CF   
Sbjct: 209 SLP-------TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRAS---AYSILDTCFKGQ 258

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
             + V  P + + F GGA + L  +N    V +   CL      A  PA  R  AII G+
Sbjct: 259 ASR-VSAPAVTMSFAGGAALKLSAQNLLVDVDDSTTCL------AFAPA--RSAAII-GN 308

Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
            Q Q F + +D+ + R GFA   C+
Sbjct: 309 TQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 113/400 (28%), Positives = 154/400 (38%), Gaps = 82/400 (20%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS +VW P  +    +            PA  P+ + 
Sbjct: 120 GEYFAQVGVGTPATTAL-MVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN- 177

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
                 C  P C  +          GC  R  +C      Y + YG G  TAG   SETL
Sbjct: 178 ------CVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 219

Query: 221 RFP-SKTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
            F     V     GC    D +   IA       GR   S PSQ+     + FSYCL+ R
Sbjct: 220 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDR 277

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                   S                            G +     FYYV L    VG   
Sbjct: 278 TSSRRARPSRRW-------------------------GGTPRMATFYYVHLLGFSVGGAR 312

Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
           VK +  S L +  + G GGVI+DSG++ T +  P++EAV   F        RAA V  + 
Sbjct: 313 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 364

Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
                S    C+++SG++ V +P + +   GGA +ALPPENY   V      C  +   +
Sbjct: 365 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 424

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                   G   I+G+ Q Q F + FD    R GF  + C
Sbjct: 425 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 155/397 (39%), Gaps = 54/397 (13%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +S+  GTP       + DTGS L W  C   Y C  C  PN  P R+  F    SSS 
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNC--EYWCKSCPKPNPHPGRV--FRANDSSSF 174

Query: 164 QLIGCQNPKC--------SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
           + I C +  C        S    PN  + C     R    P A   +  +     T GL 
Sbjct: 175 RTIPCSSDDCKIELQDYFSLTECPNPNAPCL-FDYRYLNGPRAIGVFANET---VTVGLN 230

Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
             + +R     + + L GC+   +     P G+ G G    SL  +L      KFSYCL+
Sbjct: 231 DHKKIR-----LFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLV 285

Query: 270 SRKFDDAPVSSN----LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
                D   SSN    L     P   + K P + +T      + +      FY V +  I
Sbjct: 286 -----DHLSSSNHKNFLSFGDIP---EMKLPKMQHTELLLGYINA------FYPVNVSGI 331

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   + I  S  +    G GG+IVDSG++ T + G  ++ V          + +   +
Sbjct: 332 SVGGSMLSI--SSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPI 389

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
           E       CF+  G     +P L++ F  GA    P ++Y   V   + CL +   +  G
Sbjct: 390 ELPELNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPG 449

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +       ILG+   QN   E+DL   + GF    C
Sbjct: 450 SS-------ILGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 116/414 (28%), Positives = 169/414 (40%), Gaps = 75/414 (18%)

Query: 100 SYGGYSISLSF-----GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
           +Y  Y + L F     G+PP+     I DTGS ++W  C S   C  C  P      IP 
Sbjct: 59  TYDPYRVGLYFTRVLLGSPPKEFYVQI-DTGSDVLWVSCGS---CNGC--PQSSGLHIPL 112

Query: 155 --FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-T 211
             F P  SS++ LI C + +CS      V+S   GCS +   C      Y  QYG G  T
Sbjct: 113 NFFDPGSSSTASLISCSDQRCSL----GVQSSDAGCSSQGNQCI-----YTFQYGDGSGT 163

Query: 212 AGLLLSETLRFPS---KTVPN----FLAGCSI-------LSDRQPAGIAGFGRSSESLPS 257
           +G  +S+ L F +    +V N     + GCSI        SDR   GI GFG+   S+ S
Sbjct: 164 SGYYVSDLLNFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVIS 223

Query: 258 QL---GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
           Q+   G+  K FS+CL         +    +++            + Y+P   +      
Sbjct: 224 QMSSQGITPKVFSHCLKGDGGGGGILVLGEIVE----------EDIVYSPLVPSQ----- 268

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
                Y + L+ I V  K + I     V  +  N G IVDSG+T  ++    ++      
Sbjct: 269 ---PHYNLNLQSISVNGKSLAIDPE--VFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAI 323

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL---V 429
              +    R       S    C+ I+       P + L F GG  M L PE+Y      +
Sbjct: 324 TEAVSQSVRPL----LSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSI 379

Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G+  +  I F         G+G   ILGD  L++    +DLA  R G+A   C+
Sbjct: 380 GDAAVWCIGFQ-----KIQGQG-ITILGDLVLKDKIFVYDLAGQRIGWANYDCS 427


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 112/407 (27%), Positives = 166/407 (40%), Gaps = 68/407 (16%)

Query: 102 GGYSISLSFGTPPQASTPFI-FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           G Y   +  G P  A   F+  DTGS ++W  C+    C   +  N+   ++  F P  S
Sbjct: 87  GLYFTRVKLGNP--AKEYFVQIDTGSDILWVACSPCTGCPTSSGLNI---QLEFFNPDSS 141

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
           S+S  I C + +C+       E+ C+     +  C      Y   YG G  T+G  +S+T
Sbjct: 142 STSSRIPCSDDRCTAALQTG-EAVCQSSDSPSSPC-----GYTFTYGDGSGTSGFYVSDT 195

Query: 220 LRFPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL 261
           + F +         +  + + GCS       + +DR   GI GFG+   S+ SQL   G+
Sbjct: 196 MYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGV 255

Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
             K FS+CL  +  D+      LVL      G+   PGL +TP   +           Y 
Sbjct: 256 SPKTFSHCL--KGSDNG--GGILVL------GEIVEPGLVFTPLVPSQ--------PHYN 297

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           + L  I V  +  K+P    +  +    G IVDSG+T  +    L +     FI  +   
Sbjct: 298 LNLESIAVSGQ--KLPIDSSLFATSNTQGTIVDSGTTLVY----LVDGAYDPFINAIAAA 351

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCL 436
              +     S    CF  +       P   L FKGG  M + PENY      V N VL  
Sbjct: 352 VSPSVRSVVSKGIQCFVTTSSVDSSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWC 411

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I +  +            ILGD  L++    +DLAN R G+A   C+
Sbjct: 412 IGWQRSQG--------ITILGDLVLKDKIFVYDLANMRMGWADYDCS 450


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 115/407 (28%), Positives = 168/407 (41%), Gaps = 72/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     I DTGS ++W  CTS   C  C   +    ++  F P  SS
Sbjct: 82  GLYYTKVKLGTPPREFNVQI-DTGSDVLWVSCTS---CNGCPKTSELQIQLSFFDPGVSS 137

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S+ L+ C + +C   F    ES   GCSP N  C     SY  +YG G  T+G  +S+ +
Sbjct: 138 SASLVSCSDRRCYSNF--QTES---GCSP-NNLC-----SYSFKYGDGSGTSGFYISDFM 186

Query: 221 RFPSKTVPN--------FLAGCSILSD-------RQPAGIAGFGRSSESLPSQLGL---- 261
            F +             F+ GCS L         R   GI G G+ S S+ SQL +    
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL      D      +VL      G  K P   YTP   +           Y V
Sbjct: 247 PRVFSHCLKG----DKSGGGIMVL------GQIKRPDTVYTPLVPSQ--------PHYNV 288

Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L+ I V  + + I P  + +   DG    I+D+G+T  ++    +    +     +  Y
Sbjct: 289 NLQSIAVNGQILPIDPSVFTIATGDGT---IIDTGTTLAYLPDEAYSPFIQAIANAVSQY 345

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY---FALVGNEVLCL 436
            R    E       CF+I+       PE+ L F GGA M L P  Y   F+  G+ + C+
Sbjct: 346 GRPITYESYQ----CFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIFSSSGSSIWCI 401

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                      +      ILGD  L++  + +DL   R G+A+  C+
Sbjct: 402 -------GFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 117/439 (26%), Positives = 177/439 (40%), Gaps = 78/439 (17%)

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
           A  S++RA H    +   T +S +              +   GGY ++ S GTPP     
Sbjct: 57  ARRSINRANHFFKDSDTSTPESTV--------------IPDRGGYLMTYSVGTPP-TKIY 101

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGS +VW  C    +C +           P F P +SSS + I C +  C  +   
Sbjct: 102 GIADTGSDIVWLQCEPCEQCYN--------QTTPIFNPSKSSSYKNIPCLSKLCHSV--- 150

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK-----TVPNFLAG 233
               R   CS +N     +C  Y + YG    + G L  +TL   S      + P  + G
Sbjct: 151 ----RDTSCSDQN-----SC-QYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIG 200

Query: 234 CSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVL-D 285
           C   +        +GI G G    SL +QLG     KFSYCL+     ++  SS L   D
Sbjct: 201 CGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGD 260

Query: 286 TGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
               SGD    G+  TP   K+PV        FY++ L+   VG+K V+   S    G D
Sbjct: 261 AAVVSGD----GVVSTPLIKKDPV--------FYFLTLQAFSVGNKRVEFGGS--SEGGD 306

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
             G +I+DSG+T T +   ++  +    +  +    R  D  ++  L  C+ +   +  +
Sbjct: 307 DEGNIIIDSGTTLTLIPSDVYTNLESAVV-DLVKLDRVDDPNQQFSL--CYSLKSNEYDF 363

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
            P +   FK GA + L   + F  + + ++C          P LG     I G+   QN 
Sbjct: 364 -PIITAHFK-GADIELHSISTFVPITDGIVCFAF----QPSPQLGS----IFGNLAQQNL 413

Query: 465 YLEFDLANDRFGFAKQKCA 483
            + +DL      F    C 
Sbjct: 414 LVGYDLQQKTVSFKPTDCT 432


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 110/395 (27%), Positives = 170/395 (43%), Gaps = 63/395 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y + +S GTPP  +   I DTGS+L W  C + + +C D         +I  F P  SS+
Sbjct: 25  YFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYD---QAAKAGQI--FNPYNSST 78

Query: 163 SQLIGCQNPKCSWIFGPNVESRCK-GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
              +GC    C+   G +++   + GC   + TC      Y L+YG G ++ G L  + L
Sbjct: 79  YSKVGCSTEACN---GMHMDLAVEYGCVEEDDTCI-----YSLRYGSGEYSVGYLGKDRL 130

Query: 221 RFPS-KTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKF 273
              S +++ NF+ GC    L +   AGI GFG  S S  +Q+        FSYC      
Sbjct: 131 TLASNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHE 190

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
           ++        L  GP + D          +   P          Y +    ++V    ++
Sbjct: 191 NEGS------LTIGPYARDINLMWTKLIYYDHKPA---------YAIQQLDMMVNGIRLE 235

Query: 334 I-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSG 390
           I PY Y+   +      IVDSG+  T++  P+F+A+ K   ++M    Y+R  D      
Sbjct: 236 IDPYIYISKMT------IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER---- 285

Query: 391 LRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
            R CF IS   S      P + +K    + + LP EN F    N V+C     D+A    
Sbjct: 286 -RICF-ISNSGSANWNDFPTVEMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDA---- 338

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G     +LG+  +++F L FD+    FGF  + C
Sbjct: 339 -GVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 110/444 (24%), Positives = 169/444 (38%), Gaps = 81/444 (18%)

Query: 69  HLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH---------SYGGYSISLSFGTPPQASTP 119
           H ++   P      I  +YS+ ++K   S            Y  + ++ S G PP     
Sbjct: 49  HHESSLSPYNSKDTIWDHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLA 108

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGSSL W  C   + C  C+        +P F P +SS+   + C           
Sbjct: 109 -VMDTGSSLTWVMC---HPCSSCS-----QQSVPIFDPSKSSTYSNLSC----------- 148

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKT-----VPNFLAG 233
              S C  C   N  CP     Y ++Y G G + G+   E L   +       VP+ + G
Sbjct: 149 ---SECNKCDVVNGECP-----YSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFG 200

Query: 234 C----SILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
           C    SI S+  P     G+ G G    SL    G KKFSYC+ + +  +   +  ++ D
Sbjct: 201 CGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFG-KKFSYCIGNLRNTNYKFNRLVLGD 259

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSD 344
                GDS T                +     YYV L  I +G + + I P  +    +D
Sbjct: 260 KANMQGDSTTL---------------NVINGLYYVNLEAISIGGRKLDIDPTLFERSITD 304

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF------DIS 398
            N GVI+DSG+  T++    FE ++ E    +      A  +K +    C+      D+S
Sbjct: 305 NNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLS 364

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
           G      P +   F  GA + L   + F        C+ +   N  G       +I  G 
Sbjct: 365 G-----FPLVTFHFAEGAVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSI--GM 417

Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
              QN+ + +DL   R  F +  C
Sbjct: 418 LAQQNYNVGYDLNRMRVYFQRIDC 441


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/410 (24%), Positives = 172/410 (41%), Gaps = 75/410 (18%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPAFI 156
           S G Y   +  G+PP+     + DTGS ++W  C    +C    D   P      +  + 
Sbjct: 74  SIGLYFTKIKLGSPPKEYYVQV-DTGSDILWVNCAPCPKCPVKTDLGIP------LSLYD 126

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTA-GL 214
            K SS+S+ +GC++  CS+I              +++TC    P SY + YG G T+ G 
Sbjct: 127 SKTSSTSKNVGCEDDFCSFIM-------------QSETCGAKKPCSYHVVYGDGSTSDGD 173

Query: 215 LLSETLRFPS-----KTVP---NFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL 259
            + + +         +T P     + GC          +D    GI GFG+S+ S+ SQL
Sbjct: 174 FIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQL 233

Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
                  + FS+CL            N+        G+ ++P +  TP   N V      
Sbjct: 234 AAGGSTKRIFSHCL-----------DNMNGGGIFAVGEVESPVVKTTPIVPNQV------ 276

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
              Y V L+ + V    + +P S  +  ++G+GG I+DSG+T  ++   L+ ++ ++   
Sbjct: 277 --HYNVILKGMDVDGDPIDLPPS--LASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITA 332

Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
           +     +   +        CF  +       P + L F+   K+++ P +Y   +  ++ 
Sbjct: 333 K-----QQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMY 387

Query: 435 CLILFTDNAAGPALGRGP-AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           C   F   + G     G   I+LGD  L N  + +DL N+  G+A   C+
Sbjct: 388 C---FGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 434


>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 435

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 176/411 (42%), Gaps = 75/411 (18%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T  +  TP +      D G   +W  C + Y                 + P R  
Sbjct: 42  TLQYITQIKQRTPLVPENLVLDIGGQFLWVDCDNNYVS-------------STYRPARCG 88

Query: 162 SSQLIGCQNPKCSWIFG---PNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
           S+Q    ++  C   F    P   +   G +P N     A    L Q  +       L  
Sbjct: 89  SAQCSLARSDSCGNCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQDVVS------LQS 142

Query: 219 TLRF---PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG-----LKKFS 265
           T  F    + TV  FL  C+     Q      +G+AG GR+  +LPSQL       +KF+
Sbjct: 143 TNGFNPIQNATVSRFLFSCAPTFLLQGLATGVSGMAGLGRTRIALPSQLASAFSFRRKFA 202

Query: 266 YCLLSRK----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE----- 316
            CL S      F D P    ++L   P    S+   L++TP   NPV ++SAF +     
Sbjct: 203 VCLSSSNGVAFFGDGPY---VLL---PNVDASQL--LTFTPLLINPVSTASAFSQGEPSA 254

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
            Y++G++ I +  K V +  + L   S G GG  + S + +T +E  +F+AV + F++  
Sbjct: 255 EYFIGVKSIKIDEKTVPLNTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVTEAFVKA- 313

Query: 377 GNYSRAADVEKKSGLRP---CFD----ISGKKSVYLP--ELILKFKGGAKMALPPENYFA 427
              S A ++ + + + P   CF     ++ +    +P  EL+L+ +          +  +
Sbjct: 314 ---SSARNITRVASVAPFEVCFSRENVLATRLGAAVPTIELVLQNQKTVWRIFGANSMVS 370

Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           +  ++VLCL  F +    P      +I++G +QL++  L+FDLA  R GF+
Sbjct: 371 VSDDKVLCL-GFVNGGENPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 416


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 110/392 (28%), Positives = 165/392 (42%), Gaps = 46/392 (11%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           +SL  GTPPQ  T  + DTGS L W  C  +   V    P +   +  +F P  SSS  L
Sbjct: 68  VSLPIGTPPQP-TDLVLDTGSQLSWIQCHDKK--VKKRLPPLPKPKTASFDPSLSSSFSL 124

Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-P 223
           + C +P C   I    + + C     +N+ C     SY    G     G L+ E   F  
Sbjct: 125 LPCNHPICKPRIPDFTLPTSCD----QNRLCHY---SYFYADGT-LAEGNLVREKFTFSK 176

Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
           S + P  + GC+  S     GI G      S  SQ  + KFSYC+ SR   +   +    
Sbjct: 177 SLSTPPVILGCAQASTEN-RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNP--TGLFY 233

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKHVKIPYSYLVP 341
           L   P S   K     Y      P   SS   +   Y + ++ I +  K + IP +   P
Sbjct: 234 LGDNPNSSKFK-----YVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKP 288

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKKSGLRPC 394
            + G+G  ++DSGS  T++    +E V +E +R +G        Y+  AD+        C
Sbjct: 289 DAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADM--------C 340

Query: 395 FD--ISGKKSVYLPELILKFKGGAKMAL-PPENYFALVGNEVLCLILFTDNAAGPALGRG 451
           FD  ++ +    +  +  +F  G ++ +   E     V   V C+ +         LG G
Sbjct: 341 FDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGI----GRSERLGIG 396

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             II G    QN ++E+DLAN R GF   +C+
Sbjct: 397 SNII-GTVHQQNMWVEYDLANKRVGFGGAECS 427


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/410 (24%), Positives = 172/410 (41%), Gaps = 75/410 (18%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPAFI 156
           S G Y   +  G+PP+     + DTGS ++W  C    +C    D   P      +  + 
Sbjct: 70  SIGLYFTKIKLGSPPKEYYVQV-DTGSDILWVNCAPCPKCPVKTDLGIP------LSLYD 122

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTA-GL 214
            K SS+S+ +GC++  CS+I              +++TC    P SY + YG G T+ G 
Sbjct: 123 SKTSSTSKNVGCEDDFCSFIM-------------QSETCGAKKPCSYHVVYGDGSTSDGD 169

Query: 215 LLSETLRFPS-----KTVP---NFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL 259
            + + +         +T P     + GC          +D    GI GFG+S+ S+ SQL
Sbjct: 170 FIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQL 229

Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
                  + FS+CL            N+        G+ ++P +  TP   N V      
Sbjct: 230 AAGGSTKRIFSHCL-----------DNMNGGGIFAVGEVESPVVKTTPIVPNQV------ 272

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
              Y V L+ + V    + +P S  +  ++G+GG I+DSG+T  ++   L+ ++ ++   
Sbjct: 273 --HYNVILKGMDVDGDPIDLPPS--LASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITA 328

Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
           +     +   +        CF  +       P + L F+   K+++ P +Y   +  ++ 
Sbjct: 329 K-----QQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMY 383

Query: 435 CLILFTDNAAGPALGRGP-AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           C   F   + G     G   I+LGD  L N  + +DL N+  G+A   C+
Sbjct: 384 C---FGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 430


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 126/454 (27%), Positives = 180/454 (39%), Gaps = 90/454 (19%)

Query: 52  PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
           P + + S    S+SR  H        TK+S+I ++ + S +     + + G Y +  S G
Sbjct: 50  PTQRIVSAVRRSMSRVHHFS-----PTKNSDIFTDTAQSEM-----ISNQGEYLMKFSLG 99

Query: 112 TPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNP 171
           TP       I DTGS L+W  C    +C +           P F PK SS+ + I C   
Sbjct: 100 TP-AFDILAIADTGSDLIWTQCKPCDQCYE--------QDAPLFDPKSSSTYRDISCSTK 150

Query: 172 KCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT---- 226
           +C  +      + C G    NKTC      Y   YG   FT+G + ++T+   S +    
Sbjct: 151 QCDLL---KEGASCSG--EGNKTC-----HYSYSYGDRSFTSGNVAADTITLGSTSGRPV 200

Query: 227 -VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPV 278
            +P  + GC   +      + +GI G G    SL SQLG     KFSYCL+       P+
Sbjct: 201 LLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLV-------PL 253

Query: 279 SSNLV----LDTG-----PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
           SSN      L+ G      G G   TP +S  P              FY++ L  + VGS
Sbjct: 254 SSNATNSSKLNFGSNGIVSGGGVQSTPLISKDP------------DTFYFLTLEAVSVGS 301

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           + +K P S         G +I+DSG+T T      F     E    + +      VE  S
Sbjct: 302 ERIKFPGSSF---GTSEGNIIIDSGTTLTLFPEDFF----SELSSAVQDAVAGTPVEDPS 354

Query: 390 G-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
           G L  C+ I     +  P +   F  GA + L P N F  V + VLC      N+     
Sbjct: 355 GILSLCYSIDAD--LKFPSITAHFD-GADVKLNPLNTFVQVSDTVLCFAFNPINSGA--- 408

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 I G+    NF + +DL      F    C
Sbjct: 409 ------IFGNLAQMNFLVGYDLEGKTVSFKPTDC 436


>gi|449432735|ref|XP_004134154.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
 gi|449527085|ref|XP_004170543.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 435

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 179/409 (43%), Gaps = 72/409 (17%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           SL + T     TP +      D G   +W         VDC+   V  S  PA    R  
Sbjct: 41  SLQYITEIHQRTPLVPVKLTVDLGGQFMW---------VDCDRGYVSSSYKPA----RCR 87

Query: 162 SSQL-IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY---GLGFTAGLLLS 217
           S+Q  +  ++  C   F P       GC+  N TC L   + +++    G   +  + +S
Sbjct: 88  SAQCSLASKSSACGQCFSPPRP----GCN--NNTCSLFPGNTIIRLSTSGEVASDVVSVS 141

Query: 218 ETLRF-PSKTV--PNFLAGCS---ILSDRQPA--GIAGFGRSSESLPSQLGL-----KKF 264
            T  F P++ V  PNFL  C    +L    P   G+AGFGR+  SLPSQ        +KF
Sbjct: 142 STNGFNPTRAVSIPNFLFVCGSTFLLEGLAPGVTGMAGFGRNGISLPSQFAAAFSFNRKF 201

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSS--SAFGE- 316
           + CL          SS  V+ +G G          T   +YTP + NPV ++  S+ GE 
Sbjct: 202 AVCL------SGSTSSPGVIFSGNGPYHFLPNIDLTNSFTYTPLFINPVSTAGVSSAGEK 255

Query: 317 --FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
              Y++G+  I+V SK V +  + L   S+GNGG  + + + FT +E  +++A+ K F  
Sbjct: 256 STEYFIGVTSIVVNSKPVPLNTTLLKIDSNGNGGTKISTVNPFTVLESSIYKALVKAFTT 315

Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFAL 428
           ++    R   V        C+      S  L       +L+L+ K     ++   N    
Sbjct: 316 EVSKVPRVGAVAP---FEVCYSSKSFPSTRLGAGVPTIDLVLQNK-KVIWSMFGANSMVQ 371

Query: 429 VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           V +EVLCL  F D      +    AI++G  Q+++  LEFDLA  R GF
Sbjct: 372 VNDEVLCL-GFVDG----GVDVRTAIVIGAHQIEDKLLEFDLATSRLGF 415


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 110/404 (27%), Positives = 162/404 (40%), Gaps = 66/404 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+S GTPP      I DTGS L W  C    +C   N         P F  K+SS
Sbjct: 83  GEYFMSISIGTPPSKFLA-IADTGSDLTWVQCKPCQQCYKQN--------TPLFDKKKSS 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + +   C +  C+ +         +GC      C      Y   YG   FT G + +ET+
Sbjct: 134 TYKTESCDSITCNAL-----SEHEEGCDESRNACK-----YRYSYGDESFTKGEVATETI 183

Query: 221 RFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
              S      + P    GC   +    +   +GI G G    SL SQLG    KKFSYCL
Sbjct: 184 SIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCL 243

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGD---SKTPGLSYTPF-YKNPVGSSSAFGEFYYVGLRQ 324
                  A  +   V++ G  S     SK   +  TP   K+P         +Y++ L  
Sbjct: 244 ---SHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDP-------ETYYFLTLEA 293

Query: 325 IIVGSKHVKIPYS-----YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           I VG    K+PY+      L   S   G +I+DSG+T T ++   ++         +   
Sbjct: 294 ITVG--KTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGA 351

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            R +D   +  L  CF  SG K + LP + + F  GA + L P N F  +  +++CL + 
Sbjct: 352 KRVSD--PQGILTHCFK-SGDKEIGLPTITMHFT-GADVKLSPINSFVKLSEDIVCLSMI 407

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                          I G+    +F + +DL      F +  C+
Sbjct: 408 PTTEVA---------IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 110/395 (27%), Positives = 170/395 (43%), Gaps = 63/395 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           Y + +S GTPP  +   I DTGS+L W  C + + +C D         +I  F P  SS+
Sbjct: 6   YFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYD---QAAKAGQI--FNPYNSST 59

Query: 163 SQLIGCQNPKCSWIFGPNVESRCK-GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
              +GC    C+   G +++   + GC   + TC      Y L+YG G ++ G L  + L
Sbjct: 60  YSKVGCSTEACN---GMHMDLAVEYGCVEEDDTCI-----YSLRYGSGEYSVGYLGKDRL 111

Query: 221 RFPS-KTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKF 273
              S +++ NF+ GC    L +   AGI GFG  S S  +Q+        FSYC      
Sbjct: 112 TLASNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHE 171

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
           ++        L  GP + D          +   P          Y +    ++V    ++
Sbjct: 172 NEGS------LTIGPYARDINLMWTKLIYYDHKPA---------YAIQQLDMMVNGIRLE 216

Query: 334 I-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSG 390
           I PY Y+   +      IVDSG+  T++  P+F+A+ K   ++M    Y+R  D      
Sbjct: 217 IDPYIYISKMT------IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER---- 266

Query: 391 LRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
            R CF IS   S      P + +K    + + LP EN F    N V+C     D+A    
Sbjct: 267 -RICF-ISNSGSANWNDFPTVEMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDA---- 319

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G     +LG+  +++F L FD+    FGF  + C
Sbjct: 320 -GVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 353


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 158/371 (42%), Gaps = 57/371 (15%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DT   + W       +C  C  P   P R P F P  SS++  + C++P C  + GP  
Sbjct: 152 IDTTVDVPWI------QCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSL-GPY- 203

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCS---- 235
                GCS  N++    C  YL++Y     TAG  +++TL     T V NF  GCS    
Sbjct: 204 ---GNGCS--NRSANAEC-RYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGCSHAVR 257

Query: 236 -ILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
              SD   AG    G  ++SL +Q        FSYC+          +S  +   GP + 
Sbjct: 258 GRFSDLT-AGTMSLGGGAQSLLAQTARSLGNAFSYCVPQAS------ASGFLSIGGPATT 310

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
           +S T   + TP  ++ +  S      Y V L+ I+V  + + IP          + G ++
Sbjct: 311 NSTTV-FATTPLVRSAINPS-----LYLVRLQGIVVAGRRLGIPPVAF------SAGAVM 358

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DS +  T +    + A+ + F   M  Y R+        L  C+D  G  +V +P + L 
Sbjct: 359 DSSAVITQLPPTAYRALRRAFRNAMRAYPRSG---ATGTLDTCYDFLGLTNVRVPAVSLV 415

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F GGA + L P     ++G    CL  FT  ++  ALG      +G+ Q Q   + +D+A
Sbjct: 416 FGGGAVVVLDPPA--VMIGG---CLA-FTATSSDLALG-----FIGNVQQQTHEVLYDVA 464

Query: 472 NDRFGFAKQKC 482
               GF +  C
Sbjct: 465 AGGVGFRRGAC 475


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 158/388 (40%), Gaps = 60/388 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P    T    DTGS + W  C     C  C+   VD      F P  SS+ 
Sbjct: 122 YVITVGIGSPAVTQT-MSMDTGSDVSWVQCKP---CSQCH-SEVDS----LFDPSSSSTY 172

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET-LRF 222
               C +  C+ +   +      GC   +  C      Y++ YG   +     S   L  
Sbjct: 173 SPFSCSSAPCAQL---SQSQEGNGC--MSSQC-----QYIVNYGDSSSTTGTYSSDTLTL 222

Query: 223 PSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
            S  + +F  GCS       + Q  G+ G G  ++SL SQ        FSYCL       
Sbjct: 223 GSSAMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCL------- 275

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P S         GS    T G   + F K P+  S+    +Y V L  I VGS+ + +P
Sbjct: 276 PPTS---------GSSGFLTLGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLP 326

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
            S    GS      ++DSG+  T +    + A++  F   M  Y  A      SG L  C
Sbjct: 327 TSVFSAGS------LMDSGTIITRLPPTAYSALSSAFKAGMQQYPPAT----PSGILDTC 376

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           FD SG+ S+ +P + L F GGA + L  +     + + + CL  FT N    +LG     
Sbjct: 377 FDFSGQSSISIPTVTLVFSGGAAVDLAFDGIMLEISSSIRCLA-FTPNGDDSSLG----- 430

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q + F + +D+     GF    C
Sbjct: 431 IIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 102/356 (28%), Positives = 149/356 (41%), Gaps = 35/356 (9%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           I+++ GTP   +   + D  S  VW  C               P    AF P  S++   
Sbjct: 90  INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAA-----GCLPPPATAFRPNGSATFSP 144

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA-CPSYLLQYG--LGFTAGLLLSETLRF 222
           + C +  C     P +   C            A C SY L YG     T+G L ++T  F
Sbjct: 145 LPCSSDMCL----PVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTF 200

Query: 223 PSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGLKKFSYCLLS-RKFDDAPV 278
            +  VP  + GCS  S    AG   + G GR + SL SQL   KFSY LL+    DD   
Sbjct: 201 GATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSA 260

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-GSKHVKIPYS 337
            S +        GD   P          P+ SS+ + +FYYV L  + V G++   IP  
Sbjct: 261 DSVIRF------GDDAVPKTKRG--QSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAG 312

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSRAADVEKKSGLRP 393
                ++G GGVI+ S +  T++E   ++ V      ++G    N S A +++       
Sbjct: 313 TFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDL------ 366

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
           C++ S    V +P+L L F GGA M L   NYF +  +  L  +    +  G  LG
Sbjct: 367 CYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLG 422


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 116/415 (27%), Positives = 173/415 (41%), Gaps = 74/415 (17%)

Query: 97  SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
           +V  YG +  +L  GTP +     I DTGS++ + PC S  R  +C   + D     AF 
Sbjct: 55  AVKDYGYFYATLHLGTPAR-QFAVIVDTGSTITYVPCASCGR--NCGPHHKDA----AFD 107

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLL 216
           P  SSSS +IGC + KC     P     C GCS + +     C           +AGLL+
Sbjct: 108 PASSSSSAVIGCDSDKCICGRPP-----C-GCSEKRE-----CTYQRTYAEQSSSAGLLV 156

Query: 217 SETLRFPSKTVPNFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGLKK-----FSY 266
           S+ L+     V   + GC       + +++  GI G G S  SL +QL         F+ 
Sbjct: 157 SDQLQLRDGAV-EVVFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFAL 215

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           C  S + D A       L  G          L YT        SS A   +Y V L  + 
Sbjct: 216 CFGSVEGDGA-------LMLGDVDAAEYDVALQYTALL-----SSLAHPHYYSVQLEALW 263

Query: 327 VGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF----EAVAKEFIRQMGNYSR 381
           VG + + + P  Y     +   G ++DSG+TFT++    F    EAV+   +    N  +
Sbjct: 264 VGGQQLPVKPERY-----EEGYGTVLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVK 318

Query: 382 AADVEKKSGLR---PCF---------DISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
             D ++KS  +    CF         D S  + V+ P   L+F  G ++   P NY  + 
Sbjct: 319 GPDPKEKSFAQFHDICFGGAPHAGHADQSKLEKVF-PVFELQFADGVRLRTGPLNYLFMH 377

Query: 430 GNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             E+   CL +F + A+G         +LG    +N  +++D  N R GF    C
Sbjct: 378 TGEMGAYCLGVFDNGASG--------TLLGGISFRNILVQYDRRNRRVGFGAASC 424


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 114/417 (27%), Positives = 175/417 (41%), Gaps = 71/417 (17%)

Query: 84  GSNYSNSLIKTPLSVHSYGGYSI--SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
           GS  SN+  K  +S  S  G +I  ++S G PP      + DTGS ++W  CT    C +
Sbjct: 80  GSLVSNNEYKARVSP-SLTGRTIMANISIGQPPIPQL-VVMDTGSDILWVMCTP---CTN 134

Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC--KGCSPRNKTCPLAC 199
           C+                   + L    +P  S  F P  ++ C  KGCS R    P   
Sbjct: 135 CD-------------------NHLGLLFDPSMSSTFSPLCKTPCDFKGCS-RCDPIP--- 171

Query: 200 PSYLLQYGLGFTA-GLLLSETLRFPSKT-----VPNFLAGC--SILSDRQPA--GIAGFG 249
             + + Y    TA G+   +T+ F +       +P+ L GC  +I  D  P   GI G  
Sbjct: 172 --FTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNIGQDTDPGHNGILGLN 229

Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
              +SL +++G +KFSYC+     D       L+L  G     +   G S TPF      
Sbjct: 230 NGPDSLATKIG-QKFSYCI-GDLADPYYNYHQLILGEG-----ADLEGYS-TPF------ 275

Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
                  FYYV +  I VG K + I         +  GGVI+D+GST TF+   +   ++
Sbjct: 276 --EVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSVHRLLS 333

Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
           KE    +G   R   +EK   ++  +    +  V  P +   F  GA +AL   ++F  +
Sbjct: 334 KEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDSGSFFNQL 393

Query: 430 GNEVLCLILFTDNAAGPA----LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            + V C+ +      GP     L   P++I G    Q++ + +DL N    F +  C
Sbjct: 394 NDNVFCMTV------GPVSSLNLKSKPSLI-GLLAQQSYSVGYDLVNQFVYFQRIDC 443


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 158/389 (40%), Gaps = 58/389 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + +  GTPP   T  +FDTGS   W  C     CV   +   D  R+  F P +SS+ 
Sbjct: 163 YVVPIGLGTPPSRFT-VVFDTGSDTTWVQCRP---CVVSCYKQKD--RL--FDPAKSSTY 214

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
             + C +P C+ +          GC+  +         Y +QYG G +T G    +TL  
Sbjct: 215 ANVSCADPACADL-------DASGCNAGHCL-------YGIQYGDGSYTVGFFAKDTLAV 260

Query: 223 PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDA 276
               +  F  GC   +     Q AG+ G GR   S+  Q   K    FSYCL +      
Sbjct: 261 AQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATG 320

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIP 335
            +    +  +  GS    TP L+     K P         FYYVGL  I VG K +  IP
Sbjct: 321 YLEFGPLSPSSSGSNAKTTPMLTD----KGPT--------FYYVGLTGIRVGGKQLGAIP 368

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFM--EGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
            S        N G +VDSG+  T +        + A         Y +AA     S L  
Sbjct: 369 ESVF-----SNSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAA---AYSILDT 420

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           C+D +G   V LP + L F+GGA + L        +    +CL  F  N    ++G    
Sbjct: 421 CYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCL-GFASNGDDESVG---- 475

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            I+G+ Q + + + +D++    GFA   C
Sbjct: 476 -IVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 102/356 (28%), Positives = 149/356 (41%), Gaps = 35/356 (9%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           I+++ GTP   +   + D  S  VW  C               P    AF P  S++   
Sbjct: 90  INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAA-----GCLPPPATAFRPNGSATFSP 144

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA-CPSYLLQYG--LGFTAGLLLSETLRF 222
           + C +  C     P +   C            A C SY L YG     T+G L ++T  F
Sbjct: 145 LPCSSDMCL----PVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTF 200

Query: 223 PSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGLKKFSYCLLS-RKFDDAPV 278
            +  VP  + GCS  S    AG   + G GR + SL SQL   KFSY LL+    DD   
Sbjct: 201 GATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSA 260

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-GSKHVKIPYS 337
            S +        GD   P          P+ SS+ + +FYYV L  + V G++   IP  
Sbjct: 261 DSVIRF------GDDAVPKTKRG--RSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAG 312

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSRAADVEKKSGLRP 393
                ++G GGVI+ S +  T++E   ++ V      ++G    N S A +++       
Sbjct: 313 TFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDL------ 366

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
           C++ S    V +P+L L F GGA M L   NYF +  +  L  +    +  G  LG
Sbjct: 367 CYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLG 422


>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
          Length = 435

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 106/398 (26%), Positives = 167/398 (41%), Gaps = 84/398 (21%)

Query: 121 IFDTGSSLVWFPCTSRY----------RCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           I D G   +W  C ++Y          R   C+  N D        PK        GC N
Sbjct: 63  IVDLGGQFLWVDCENKYISSTYRPARCRSAQCSLANSDGCGDCFSSPKP-------GCNN 115

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
             C             G +P N     A    L      +Q   GF  G  +  + RF  
Sbjct: 116 NTC-------------GVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVS-RFLF 161

Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK----FD 274
              P FL  G +       +G+AG GR+  +LPSQL       +KF+ CL S K    F 
Sbjct: 162 SCAPTFLLKGLAT----GASGMAGLGRTKIALPSQLASAFSFARKFAICLSSSKGVVLFG 217

Query: 275 DAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIV 327
           D P     N+V D+     DS    L+YTP   NPV ++SAF +      Y++G++ I +
Sbjct: 218 DGPYGFLPNVVFDS-----DS----LTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKI 268

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
             K V +  S L   ++G GG  + +   +T +E  +++AV   F++       A ++++
Sbjct: 269 DEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKAPA----ARNIKR 324

Query: 388 KSGLRP---CF-DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
              + P   C+ +++G +   +V   EL L+        +   N    + +EVLCL    
Sbjct: 325 VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQ-NENVVWRIFGANSMVSINDEVLCLGFVN 383

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
                       +I++G +QL+N  L+FDLA  + GF+
Sbjct: 384 GGK-----NTRTSIVIGGYQLENNLLQFDLAASKLGFS 416


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 109/392 (27%), Positives = 163/392 (41%), Gaps = 96/392 (24%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA----FIPKR 159
           Y ++++ G+PP+ S   I DTGS LVW         V C   N D S   A    F P R
Sbjct: 101 YLMTVNLGSPPR-SMLAIADTGSDLVW---------VKCKKGNNDTSSAAAPTTQFDPSR 150

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
           SS+   + CQ   C        E+  +        C     +YL  YG G  T G+L +E
Sbjct: 151 SSTYGRVSCQTDAC--------EALGRATCDDGSNC-----AYLYAYGDGSNTTGVLSTE 197

Query: 219 TLRF--------PSKT-VPNFLAGCSILSDRQ--PAGIAGFGRSSESLPSQLGL-----K 262
           T  F        P +  +     GCS  +       G+ G G  + SL +QLG      +
Sbjct: 198 TFTFDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGR 257

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           +FSYCL+        V+++  L+ G    D   PG + TP   N   +S+A         
Sbjct: 258 RFSYCLVPHS-----VNASSALNFG-ALADVTEPGAASTPLVGNKTVASAASSR------ 305

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
                                     +IVDSG+T TF++  L   +  E  R++      
Sbjct: 306 --------------------------IIVDSGTTLTFLDPSLLGPIVDELSRRI----TL 335

Query: 383 ADVEKKSGL-RPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
             V+   GL + C++++G++      +P+L L+F GGA +AL PEN F  V    LCL +
Sbjct: 336 PPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAI 395

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
                      + P  ILG+   QN ++ +DL
Sbjct: 396 VATTE------QQPVSILGNLAQQNIHVGYDL 421



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 70/139 (50%), Gaps = 14/139 (10%)

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFDISGKK---SVY 404
           +IVDSG+T TF++  L   +  E  R++        V+   GL + C++++G++      
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRI----TLPPVQSPDGLLQLCYNVAGREVEAGES 494

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           +P+L L+F GGA +AL PEN F  V    LCL +           + P  ILG+   QN 
Sbjct: 495 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTE------QQPVSILGNLAQQNI 548

Query: 465 YLEFDLANDRFGFAKQKCA 483
           ++ +DL      FA   CA
Sbjct: 549 HVGYDLDAGTVTFAVADCA 567


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 157/390 (40%), Gaps = 62/390 (15%)

Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y ++L FGTP  PQ     + DTGS + W  CT       CN     P + P F P +SS
Sbjct: 131 YVVTLGFGTPSVPQV---LLMDTGSDVSWVQCTP------CNSTKCYPQKDPLFDPSKSS 181

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +   I C    C  +     +    GC+     C      Y ++Y  G  + G+  +ETL
Sbjct: 182 TYAPIACNTDACRKLG----DHYHNGCTSGGTQC-----GYSVEYADGSHSRGVYSNETL 232

Query: 221 RF-PSKTVPNFLAGCSILSDRQPA----GIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
              P  TV +F  GC     R P+    G+ G G +  SL  Q        FSYCL +  
Sbjct: 233 TLAPGITVEDFHFGCG-RDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN 291

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
            +    +  LVL + P    S      +TP    P      +  FY V +  I VG K +
Sbjct: 292 SE----AGFLVLGSPPSGNKSA---FVFTPMRHLP-----GYATFYMVTMTGISVGGKPL 339

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            IP S         GG+I+DSG+  T +    + A+     + +    +A  +       
Sbjct: 340 HIPQSAF------RGGMIIDSGTVDTELPETAYNALEAALRKAL----KAYPLVPSDDFD 389

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            C++ +G  ++ +P +   F GGA + L   N   ++ N+ L         +GP  G G 
Sbjct: 390 TCYNFTGYSNITVPRVAFTFSGGATIDLDVPN--GILVNDCLAF-----QESGPDDGLG- 441

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             I+G+   +   + +D      GF    C
Sbjct: 442 --IIGNVNQRTLEVLYDAGRGNVGFRAGAC 469


>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 441

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 122/466 (26%), Positives = 187/466 (40%), Gaps = 88/466 (18%)

Query: 42  KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
           KH  HH+ +D   +     S  L+ + +  TKT P               I   +S + Y
Sbjct: 26  KHNKHHNVNDSFSL-----SFPLTLSINSTTKTNP---------------IVPSISPYKY 65

Query: 102 G-GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
                ++L  GTPPQ     + DTGS + W  C ++            P    +F P  S
Sbjct: 66  SMALVVTLPIGTPPQLQQ-MVLDTGSQVSWIHCDNKK-----GPQKKQPPTTSSFDPSLS 119

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGFTAGL----- 214
           SS   + C +P C     P V          + + P  C +  L  Y   +T G      
Sbjct: 120 SSFFALPCNHPLCK----PQVP---------DISLPTDCDANRLCHYSFSYTDGTVVEGN 166

Query: 215 LLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
           L+ E +   PS T P  + GC+  SD    GI G      S P+Q  + KFSY +  ++ 
Sbjct: 167 LVRENIALSPSLTTPPIILGCANQSD-DARGILGMNLGRLSFPNQAKITKFSYFVPVKQ- 224

Query: 274 DDAPVSSNLVLDTGPGSG---DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
              P S +L L   P S      K    S +   + P     AF     + ++ I +G K
Sbjct: 225 -TQPGSGSLYLGNNPNSSCFRYVKLLTFSKSQSQRMPNLDPLAF----TLPMQGISIGGK 279

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAA 383
            + IP S   P + G G  I+DSGS F++M    +  +  E ++++G+       Y   A
Sbjct: 280 KLNIPPSVFKPDTTGFGQTIIDSGSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVA 339

Query: 384 DVEKKSGLRPCFDISGKKSVYLP-ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
           D+        CFD    +   L  +++ +F+ G ++ +P E     V   V C       
Sbjct: 340 DI--------CFDGDATEIGRLVGDMVFEFEKGVEIVIPKERVLIEVDGGVHCF------ 385

Query: 443 AAGPALGRGP-----AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                +GR         I+G+F  QN ++EFDLA  R GF    C+
Sbjct: 386 ----GIGRAEGLGGGGNIIGNFYQQNLWVEFDLAKHRVGFRGANCS 427


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 107/407 (26%), Positives = 164/407 (40%), Gaps = 70/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP+     I DTGS ++W  C S   C +C   +    ++  F    SS
Sbjct: 64  GLYFTKVKLGSPPREFNVQI-DTGSDVLWVCCNS---CNNCPRTSGLGIQLNFFDSSSSS 119

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++ L+ C +P C+      V++    CSP+   C     SY  QY  G  T+G  +S+TL
Sbjct: 120 TAGLVHCSDPICT----SAVQTTVTQCSPQTNQC-----SYTFQYEDGSGTSGYYVSDTL 170

Query: 221 RFPS----KTVPN----FLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGL---- 261
            F +      V N     + GCS        ++D+   GI GFG+   S+ SQL      
Sbjct: 171 YFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGIT 230

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL         +    +L+          PG+ Y+P   +           Y +
Sbjct: 231 PRVFSHCLKGEGIGGGILVLGEILE----------PGMVYSPLVPSQ--------PHYNL 272

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I V  K + I  S  V  +  + G IVDSG+T  ++    ++     F+  +    
Sbjct: 273 NLQSIAVNGKLLPIDPS--VFATSNSQGTIVDSGTTLAYLVAEAYDP----FVSAVNVIV 326

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLCL 436
             +     S    C+ +S   S   P     F GGA M L PE+Y        G  V+  
Sbjct: 327 SPSVTPIISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMWC 386

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I F               ILGD  L++    +DL   R G+A   C+
Sbjct: 387 IGFQKVQG--------VTILGDLVLKDKIFVYDLVRQRIGWANYDCS 425


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 115/397 (28%), Positives = 154/397 (38%), Gaps = 64/397 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y ++L  GTP    T  I DTGS L W       +C  C        + P F P  SSS 
Sbjct: 91  YVVTLGIGTPAVQQTVLI-DTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSY 143

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C +  C  +        C G S        A   Y ++YG    T G+  +ETL  
Sbjct: 144 ASVPCDSDACRKLAAGAYGHGCTGVSGGAA----ALCEYGIEYGNRATTTGVYSTETLTL 199

Query: 223 -PSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
            P   V +F  GC    D Q        G+ G G + ESL SQ   +    FSYCL    
Sbjct: 200 KPGVVVADFGFGCG---DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL---- 252

Query: 273 FDDAPVSSN---LVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               P S     L L   P S  S    GLS+TP  + P     +   FY V L  I VG
Sbjct: 253 ---PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLP-----SVPTFYIVTLTGISVG 304

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
              + IP S        + G+++DSG+  T +    + A+   F   M  Y R       
Sbjct: 305 GAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNG 357

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF---TDNAAG 445
             L  C+D +G  +V +P + L F GGA + L       + G    CL      TDNA G
Sbjct: 358 GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNAIG 413

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                    I+G+   + F + +D      GF    C
Sbjct: 414 ---------IIGNVNQRTFEVLYDSGKGTVGFRAGAC 441


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 121/423 (28%), Positives = 184/423 (43%), Gaps = 82/423 (19%)

Query: 89  NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
           NS +    +V  YG +  +L  GTP +     I DTGS++ + PC+S   C     PN  
Sbjct: 63  NSTMPLHGAVKDYGYFYATLYLGTPAKKFA-VIVDTGSTMTYVPCSS---CGSGCGPN-- 116

Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
             +  AF P+ SS++  I C +PKCS         RC GCS +  T      SY  Q   
Sbjct: 117 -HQDAAFDPEASSTASRISCTSPKCSC-----GSPRC-GCSTQQCT---YTRSYAEQSS- 165

Query: 209 GFTAGLLLSETLRFPSKTVPN--FLAGCSILSD----RQPA-GIAGFGRSSESLPSQL-- 259
             ++G+LL + L      +P    + GC         RQ A G+ G G S  S+ +QL  
Sbjct: 166 --SSGILLEDVLAL-HDGLPGAPIIFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVK 222

Query: 260 -GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSA 313
            G+    FS C    + D A     L+L      GD++ PG   L YTP       +S+ 
Sbjct: 223 AGVIDDVFSLCFGMVEGDGA-----LLL------GDAEVPGSISLQYTPLL-----TSTT 266

Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
              +Y V +  + V  + + +  S      D   G ++DSG+TFT+M  P+F+A    F 
Sbjct: 267 HPFYYNVKMLSLAVEGQLLPVSQSLF----DQGYGTVLDSGTTFTYMPSPVFKA----FA 318

Query: 374 RQMGNYSRAADVEKKSGLRP-----CF-------DISGKKSVYLPELILKFKGGAKMALP 421
             +  Y+ +  +++  G  P     CF       D+    SV+ P + ++F  G  + L 
Sbjct: 319 GAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSVF-PSMEVQFDQGTSLVLG 377

Query: 422 PENYFAL--VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           P NY  +    +   CL +F +  AG         +LG    +N  + +D AN R GF  
Sbjct: 378 PLNYLFVHTFNSGKYCLGVFDNGRAG--------TLLGGITFRNVLVRYDRANQRVGFGP 429

Query: 480 QKC 482
             C
Sbjct: 430 ALC 432


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 106/388 (27%), Positives = 161/388 (41%), Gaps = 70/388 (18%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS ++W  C +   C +C   +     +  F    SS++ LI C +  C+      V
Sbjct: 85  IDTGSDILWVNCNT---CSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICT----SGV 137

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF--------PSKTVPNFLA 232
           +     CSPR   C     SY  QYG G  T+G  +S+ + F           +    + 
Sbjct: 138 QGAAAECSPRVNQC-----SYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVF 192

Query: 233 GCSI-------LSDRQPAGIAGFGRSSESLPSQL---GL--KKFSYCLLSRKFDDAPVSS 280
           GCSI        +D+   GI GFG    S+ SQL   G+  K FS+CL      D     
Sbjct: 193 GCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKG----DGNGGG 248

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYL 339
            LVL      G+   P + Y+P   +           Y + L+ I V  + + I P  + 
Sbjct: 249 ILVL------GEILEPSIVYSPLVPSQ--------PHYNLNLQSIAVNGQPLPINPAVFS 294

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
           +  S+  GG IVD G+T  ++    ++ +       +   +R    +  S    C+ +S 
Sbjct: 295 I--SNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSAR----QTNSKGNQCYLVST 348

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLCLILFTDNAAGPALGRGPAII 455
                 P + L F+GGA M L PE Y      L G E+ C + F     G       A I
Sbjct: 349 SIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWC-VGFQKLQEG-------ASI 400

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LGD  L++  + +D+A  R G+A   C+
Sbjct: 401 LGDLVLKDKIVVYDIAQQRIGWANYDCS 428


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 168/400 (42%), Gaps = 61/400 (15%)

Query: 93  KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
           K+ LS+++ G Y + +  GTP  A    +FDTGS   W  C     CV   +      + 
Sbjct: 155 KSGLSLNT-GNYVVPIRLGTP-AARFTVVFDTGSDTTWVQCQP---CVAYCYQQ----KE 205

Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FT 211
           P F P +S++   I C +  CS     ++++R  GCS  +         Y +QYG G +T
Sbjct: 206 PLFTPTKSATYANISCTSSYCS-----DLDTR--GCSGGHCL-------YAVQYGDGSYT 251

Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK---FS 265
            G    +TL     TV +F  GC   +     + AG+ G GR   S+P Q   K    F+
Sbjct: 252 VGFYAQDTLTLGYDTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFA 311

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           YC+       A  S    LD GPG+  +    L+       P         FYYVG+  I
Sbjct: 312 YCI------PATSSGTGFLDFGPGAPAAANARLTPMLVDNGPT--------FYYVGMTGI 357

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   + IP +        + G +VDSG+  T +    +E +   F + M         
Sbjct: 358 KVGGHLLSIPATVF-----SDAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKT-A 411

Query: 386 EKKSGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPEN--YFALVGNEVLCLILFTDN 442
              S L  C+D++G + S+ LP + L F+GGA + +      Y A V    L      D+
Sbjct: 412 PAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAANDDD 471

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                       I+G+ Q + + + +DL     GFA   C
Sbjct: 472 T--------DMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 100/388 (25%), Positives = 152/388 (39%), Gaps = 47/388 (12%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  + + GTPPQ ++  I D    LVW  C++  RC   +        +P F+P  SS+ 
Sbjct: 62  YVANFTIGTPPQPASA-IVDVAGELVWTQCSACRRCFKQD--------LPVFVPNASSTF 112

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           +   C    C  I  P      + CS     C    P   L+   G T+G   ++T    
Sbjct: 113 KPEPCGTAVCESI--PT-----RSCS--GDVCSYKGPPTQLR---GNTSGFAATDTFAIG 160

Query: 224 SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
           + TV     GC + SD      P+G  G GR+  SL +Q+ L +FSYCL  R       S
Sbjct: 161 TATV-RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK---S 216

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           S L L  G  +  +     S  PF K      S    +Y + L  I  G+  +    S  
Sbjct: 217 SRLFL--GSSAKLAGGESTSTAPFIKTSPDDDSH--HYYLLSLDAIRAGNTTIATAQS-- 270

Query: 340 VPGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DI 397
                  GG++V  + S F+ +    + A  K     +G  +             CF   
Sbjct: 271 -------GGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKA 323

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAII 455
           +G      P+L+  F+G A + +PP  Y   VG E    C  + +  A     G     +
Sbjct: 324 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS-MAWLNRTGLEGVSV 382

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LG  Q ++ +  +DL  +   F    C+
Sbjct: 383 LGSLQQEDVHFLYDLKKETLSFEPADCS 410


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 109/382 (28%), Positives = 159/382 (41%), Gaps = 79/382 (20%)

Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR 191
           PC S YR +D           P F PK SSS  ++ C +  C+ + G        G    
Sbjct: 5   PCVSCYRQLD-----------PVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDG---- 49

Query: 192 NKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA----GIA 246
                 AC  Y  +Y G G T G L  + L          + GCS  S   PA    G+ 
Sbjct: 50  ------AC-QYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLV 102

Query: 247 GFGRSSESLPSQLGLKKFSYCL---LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
           G GR   SL SQL + +F YCL   +SR       S  LVL  G  +  + +  ++ T  
Sbjct: 103 GLGRGPLSLVSQLSVHRFMYCLPPPMSR------TSGKLVLGAGADAVRNMSDRVTVT-- 154

Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG---------------- 347
               + SS+ +  +YY+ L  + VG +      +   P S G G                
Sbjct: 155 ----MSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGA 210

Query: 348 ---GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI---SGKK 401
              G+IVD  ST +F+E  L++ +A +   ++    RA     + GL  CF +    G  
Sbjct: 211 NAYGMIVDVASTISFLETSLYDELADDLEEEI-RLPRATP-SLRLGLDLCFILPEGVGMD 268

Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQ 460
            VY+P + L F  G  + L  +  F   G  ++CL+          +GR   + ILG+FQ
Sbjct: 269 RVYVPTVSLSFD-GRWLELDRDRLFVTDG-RMMCLM----------IGRTSGVSILGNFQ 316

Query: 461 LQNFYLEFDLANDRFGFAKQKC 482
           LQN  + F+L   +  FAK  C
Sbjct: 317 LQNMRVLFNLRRGKITFAKASC 338


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 99/388 (25%), Positives = 154/388 (39%), Gaps = 47/388 (12%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  + + GTPPQ ++  I D    LVW  C++  RC   +        +P F+P  SS+ 
Sbjct: 45  YVANFTIGTPPQPASA-IVDVAGELVWTQCSACRRCFKQD--------LPVFVPNASSTF 95

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           +   C    C  I  P      + CS     C    P   L+   G T+G   ++T    
Sbjct: 96  KPEPCGTAVCESI--PT-----RSCS--GDVCSYKGPPTQLR---GNTSGFAATDTFAIG 143

Query: 224 SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
           + TV     GC + SD      P+G  G GR+  SL +Q+ L +FSYCL  R       S
Sbjct: 144 TATV-RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK---S 199

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           S L L  G  +  + +   S  PF K       +   +Y + L  I  G+  +    S  
Sbjct: 200 SRLFL--GSSAKLAGSESTSTAPFIKTSPDDDGS--NYYLLSLDAIRAGNTTIATAQS-- 253

Query: 340 VPGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DI 397
                  GG++V  + S F+ +    ++A  K     +G  +             CF   
Sbjct: 254 -------GGILVMHTVSPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKA 306

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAII 455
           +G      P+L+  F+G A + +PP  Y   VG E    C  + +  A     G     +
Sbjct: 307 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS-MAWLNRTGLEGVSV 365

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LG  Q ++ +  +DL  +   F    C+
Sbjct: 366 LGSLQQEDVHFLYDLKKETLSFEPADCS 393


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 109/393 (27%), Positives = 169/393 (43%), Gaps = 63/393 (16%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           + +S GTPP  +   I DTGS+L W  C + + +C D         +I  F P  SS+  
Sbjct: 1   MGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYD---QAAKAGQI--FNPYNSSTYS 54

Query: 165 LIGCQNPKCSWIFGPNVESRCK-GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
            +GC    C+   G +++   + GC   + TC      Y L+YG G ++ G L  + L  
Sbjct: 55  KVGCSTEACN---GMHMDLAVEYGCVEEDDTCI-----YSLRYGSGEYSVGYLGKDRLTL 106

Query: 223 PS-KTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKFDD 275
            S +++ NF+ GC    L +   AGI GFG  S S  +Q+        FSYC      ++
Sbjct: 107 ASNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE 166

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI- 334
                   L  GP + D          +   P          Y +    ++V    ++I 
Sbjct: 167 GS------LTIGPYARDINLMWTKLIYYDHKPA---------YAIQQLDMMVNGIRLEID 211

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLR 392
           PY Y+   +      IVDSG+  T++  P+F+A+ K   ++M    Y+R  D       R
Sbjct: 212 PYIYISKMT------IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER-----R 260

Query: 393 PCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
            CF IS   S      P + +K    + + LP EN F    N V+C     D+A     G
Sbjct: 261 ICF-ISNSGSANWNDFPTVEMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDA-----G 313

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                +LG+  +++F L FD+    FGF  + C
Sbjct: 314 VRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 125/452 (27%), Positives = 174/452 (38%), Gaps = 102/452 (22%)

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS--VHSYGGYSISLSFGTPPQAS 117
           A  S++RA H                 Y  SL   P S  +   G Y ++ S GTPP   
Sbjct: 57  ARRSINRANHF----------------YKYSLANIPQSTVIPDIGEYLMTYSVGTPP-FK 99

Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
              I DTGS +VW  C     C +           P F P +SSS + I C +  C  + 
Sbjct: 100 LYGIVDTGSDIVWLQCEPCQECYN--------QTTPMFNPSKSSSYKNIPCPSKLCQSM- 150

Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS-ETLRFPSK-----TVPNFL 231
                     C+ +N         Y   YG    +G  LS +TL   S      + PN +
Sbjct: 151 ------EDTSCNDKNYC------EYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIV 198

Query: 232 AGC---SILS-DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVL 284
            GC   +ILS +   +GI GFG    S  +QLG     KFSYC L+  F    + SN   
Sbjct: 199 IGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYC-LTPLFSVTNIQSNATS 257

Query: 285 -----DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
                D    SGD    G+  TP  K    +      FYY+ L    VG++ V+I     
Sbjct: 258 KLNFGDAATVSGD----GVVTTPILKKDPET------FYYLTLEAFSVGNRRVEIGG--- 304

Query: 340 VPGSDGNGGVIVDSGSTFT--------FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           VP  D  G +I+DSG+T T        F+E  + + V  E         R  D  +   L
Sbjct: 305 VPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLE---------RVDDPTQT--L 353

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+ +  +   + P + + FK GA + L P + F  V + V CL   +           
Sbjct: 354 NLCYSVKAEGYDF-PIITMHFK-GADVDLHPISTFVSVADGVFCLAFESSQDHA------ 405

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              I G+   QN  + +DL      F    C 
Sbjct: 406 ---IFGNLAQQNLMVGYDLQQKIVSFKPSDCT 434


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 107/406 (26%), Positives = 167/406 (41%), Gaps = 69/406 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     + DTGS ++W  C +  +C   +   +D   +  + PK SS
Sbjct: 86  GLYYTEVRLGTPPKRFYVQV-DTGSDILWVNCITCDQCPHKSGLGLD---LTLYDPKASS 141

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +   + C    C+  FG     R   CS  N  C      Y + YG G  T G  +++ L
Sbjct: 142 TGSTVMCDQGFCADTFG----GRLPKCSA-NVPC-----EYSVTYGDGSSTVGSFVNDAL 191

Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
           +F       +T P   + + GC          S +   GI GFG ++ S+ SQL      
Sbjct: 192 QFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKV 251

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
            K F++CL + K              G G    GD   P +  TP   +           
Sbjct: 252 KKIFAHCLDTIK--------------GGGIFAIGDVVQPKVKTTPLVADK--------PH 289

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y V L+ I VG   +++P     PG     G I+DSG+T T++   +F+ V      +  
Sbjct: 290 YNVNLKTIDVGGTTLELPADIFKPGE--KRGTIIDSGTTLTYLPELVFKKVMLAVFNKHQ 347

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
           + +   DV+       CF+ SG      P L   F+    + + P  YF   GN+V C +
Sbjct: 348 DIT-FHDVQD----FLCFEYSGSVDDGFPTLTFHFEDDLALHVYPHEYFFPNGNDVYC-V 401

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            F + A     G+   +++GD  L N  + +DL N   G+    C+
Sbjct: 402 GFQNGALQSKDGK-DIVLMGDLVLSNKLVVYDLENRVIGWTDYNCS 446


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 165/414 (39%), Gaps = 82/414 (19%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPA 154
           V S G Y   +  G+PP+     + DTGS ++W  C     C    + NF       +  
Sbjct: 68  VDSVGLYFTKIKLGSPPKEYHVQV-DTGSDILWVNCKPCPECPSKTNLNF------HLSL 120

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           F    SS+S+ +GC +  CS+I      S+   C P      + C  +++      + G 
Sbjct: 121 FDVNASSTSKKVGCDDDFCSFI------SQSDSCQP-----AVGCSYHIVYADESTSEGN 169

Query: 215 LLSETLRFPS-----KTVP---NFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL 259
            + + L         +T P     + GC          SD    G+ GFG+S+ S+ SQL
Sbjct: 170 FIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQL 229

Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG---SGDSKTPGLSYTPFYKNPVGSS 311
                  + FS+CL + K              G G    G   +P +  TP   N +   
Sbjct: 230 AATGDAKRVFSHCLDNVK--------------GGGIFAVGVVDSPKVKTTPMVPNQM--- 272

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                 Y V L  + V    + +P S +      NGG IVDSG+T  +    L++++ + 
Sbjct: 273 -----HYNVMLMGMDVDGTALDLPPSIM-----RNGGTIVDSGTTLAYFPKVLYDSLIET 322

Query: 372 FI-RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
            + RQ        D  +      CF  S    V  P +  +F+   K+ + P +Y   + 
Sbjct: 323 ILARQPVKLHIVEDTFQ------CFSFSENVDVAFPPVSFEFEDSVKLTVYPHDYLFTLE 376

Query: 431 NEVLCLILFTDNAAGPALG-RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            E+ C   F   A G   G R   I+LGD  L N  + +DL N+  G+A   C+
Sbjct: 377 KELYC---FGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNCS 427


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 129/500 (25%), Positives = 196/500 (39%), Gaps = 97/500 (19%)

Query: 11  LFSLLILLFTTDAGAGSSAAT-----VTVPLTPLSTKHYLHHSDS------DPLKILHSL 59
           +FSL+I++    + A  SAAT      TV L          H DS      +PL+  +  
Sbjct: 4   IFSLVIVIIFLISTAVVSAATGPDYGFTVELI---------HRDSPKSPMYNPLENHYHR 54

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
            + +L R+    T     T ++ I +N               G Y + LS GTPP     
Sbjct: 55  VADTLRRSISHNTGLVTNTVEAPIYNNR--------------GEYLMKLSVGTPPFPIIA 100

Query: 120 FIFDTGSSLVW---FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
            + DTGS ++W    PCT+ Y+             +P F P +S++ + + C +P CS+ 
Sbjct: 101 -VADTGSDIIWTQCVPCTNCYQ-----------QDLPMFNPSKSTTYRKVSCSSPVCSFT 148

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-----VPNF 230
              N  S    C            +Y + YG    + G    +TL   S +      P  
Sbjct: 149 GEDNSCSFKPDC------------TYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRT 196

Query: 231 LAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
             GC   +    D   +GI G G    SL  Q+G     KFSYCL     DD   +    
Sbjct: 197 AIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNK--- 253

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
           L+ G  +  S +  +S TP Y      S  F  FY + L+ + VG  +     +  + G 
Sbjct: 254 LNFGSNANVSGSGAVS-TPIYI-----SDKFKSFYSLKLKAVSVGRNNTFYSTANSILG- 306

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
            G   +I+DSG+T T +   L+   AK     + N  R  D  +   L  CF+ +     
Sbjct: 307 -GKANIIIDSGTTLTLLPVDLYHNFAKAISNSI-NLQRTDDPNQF--LEYCFETT-TDDY 361

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            +P + + F+ GA + L  EN    V + V+CL       A          I G+    N
Sbjct: 362 KVPFIAMHFE-GANLRLQRENVLIRVSDNVICL-------AFAGAQDNDISIYGNIAQIN 413

Query: 464 FYLEFDLANDRFGFAKQKCA 483
           F + +D+ N    F    C 
Sbjct: 414 FLVGYDVTNMSLSFKPMNCV 433


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 126/510 (24%), Positives = 209/510 (40%), Gaps = 101/510 (19%)

Query: 1   MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKIL---H 57
           MA+    LI   SL++ L    A +G S   +  P          HH  S P  IL   H
Sbjct: 1   MASLWTQLISTVSLILSLARWVAVSGDSGNVLLFPSR--------HHEGSRPAMILPLHH 52

Query: 58  SLASSSLSR---ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
           S+  SSLS     RHL+              ++ N+ ++    +   G Y+  L  GTPP
Sbjct: 53  SVPESSLSHFNPRRHLQGSQ---------SEHHPNARMRLFDDLLRNGYYTTRLWIGTPP 103

Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
           Q     I DTGS++ + PC++   C  C        + P F P+ S + Q +     KC+
Sbjct: 104 QRFA-LIVDTGSTVTYVPCST---CKHCG-----SHQDPKFRPEASETYQPV-----KCT 149

Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV---PNF 230
           W        +C  C    K C     +Y  +Y  +  ++G+L  + + F +++       
Sbjct: 150 W--------QC-NCDDDRKQC-----TYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRA 195

Query: 231 LAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
           + GC       + +++  GI G GR   S+  QL  KK    ++S  F        +   
Sbjct: 196 IFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKK----VISDAFSLCYGGMGVGGG 251

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS-D 344
                G S    + +T  + +PV S      +Y + L++I V  K +     +L P   D
Sbjct: 252 AMVLGGISPPADMVFT--HSDPVRSP-----YYNIDLKEIHVAGKRL-----HLNPKVFD 299

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD---ISGKK 401
           G  G ++DSG+T+ ++    F A     +++  +  R       SG  P ++    SG +
Sbjct: 300 GKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRI------SGPDPHYNDICFSGAE 353

Query: 402 ------SVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPA 453
                 S   P + + F  G K++L PENY           CL +F++       G  P 
Sbjct: 354 INVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSN-------GNDPT 406

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +LG   ++N  + +D  + + GF K  C+
Sbjct: 407 TLLGGIVVRNTLVMYDREHSKIGFWKTNCS 436


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 112/417 (26%), Positives = 168/417 (40%), Gaps = 71/417 (17%)

Query: 88  SNSLIKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           S+S +  P+S  +Y G   Y + L  GTP Q  T  + DTGS L W  C           
Sbjct: 97  SSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFT-LVADTGSDLTWVKCAG--------- 146

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
               P R+  F PK S S   I C +  C      +V      CS     C      Y  
Sbjct: 147 -ASPPGRV--FRPKTSRSWAPIPCSSDTCKL----DVPFTLANCSSPASPCTY---DYRY 196

Query: 205 QYGLGFTAGLLLSE--TLRFPSKTVP---NFLAGCSILSD----RQPAGIAGFGRSSESL 255
           + G     G++ +E  T+  P   V    + + GCS   D    R   G+   G +  S 
Sbjct: 197 KEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISF 256

Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
            +Q   +    FSYCL+      AP ++   L  GPG    +TP  + T  + +P     
Sbjct: 257 ATQAAARFGGSFSYCLVDHL---APRNATGYLAFGPGQ-VPRTPA-TQTKLFLDPEM--- 308

Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
               FY V +  I V  K + IP       S   GGVI+DSG+T T +  P ++AV    
Sbjct: 309 ---PFYGVKVDAIHVAGKALDIPAEVWDAKS---GGVILDSGNTLTVLAAPAYKAVVAAL 362

Query: 373 IRQMGNYSRAADVEKKSGLRP---CFDISGKK---SVYLPELILKFKGGAKMALPPENYF 426
            + +       D   K    P   C++ + ++      +P+L ++F G A++  P ++Y 
Sbjct: 363 SKHL-------DGVPKVSFPPFEHCYNWTARRPGAPEIIPKLAVQFAGSARLEPPAKSYV 415

Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             V   V C+        G   G  P + ++G+   Q    EFDL N +  F +  C
Sbjct: 416 IDVKPGVKCI--------GVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNC 464


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 100/410 (24%), Positives = 175/410 (42%), Gaps = 75/410 (18%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPAFI 156
           S G Y   +  G+PP+     + DTGS ++W  C    +C    D   P      +  + 
Sbjct: 73  SIGLYFTKIKLGSPPKEYYVQV-DTGSDILWVNCAPCPKCPVKTDLGIP------LSLYD 125

Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTA-GL 214
            K SS+S+ +GC++  CS+I              +++TC    P SY + YG G T+ G 
Sbjct: 126 SKASSTSKNVGCEDAFCSFIM-------------QSETCGAKKPCSYHVVYGDGSTSDGD 172

Query: 215 LLSETLRFPS-----KTVP---NFLAGCS-----ILSDRQPA--GIAGFGRSSESLPSQL 259
            + + +         +T P     + GC       L   + A  GI GFG+S+ S+ SQL
Sbjct: 173 FVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQL 232

Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
                  + FS+CL     D+        +      G+ ++P +  TP   N V      
Sbjct: 233 AAGGSVKRIFSHCL-----DNMNGGGIFAI------GEVESPVVKTTPLVPNQV------ 275

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
              Y V L+ + V  + + +P S  +  ++G+GG I+DSG+T  ++   L+ ++ ++   
Sbjct: 276 --HYNVILKGMDVDGEPIDLPPS--LASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITA 331

Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
           +     +   +        CF  +       P + L F+   K+++ P +Y   +  ++ 
Sbjct: 332 K-----QQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMY 386

Query: 435 CLILFTDNAAGPALGRGP-AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           C   F   + G     G   I+LGD  L N  + +DL N+  G+A   C+
Sbjct: 387 C---FGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 433


>gi|357440781|ref|XP_003590668.1| Basic 7S globulin [Medicago truncatula]
 gi|355479716|gb|AES60919.1| Basic 7S globulin [Medicago truncatula]
          Length = 434

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 106/396 (26%), Positives = 173/396 (43%), Gaps = 81/396 (20%)

Query: 121 IFDTGSSLVWFPCTSRY----------RCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           I D G   +W  C ++Y          R   C+    D   +    PK        GC N
Sbjct: 63  IVDLGGLFLWVDCENQYISSTYRPARCRSAQCSLAKFDDCGVCFSSPKP-------GCNN 115

Query: 171 PKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAG--LLLSETLRFPSKTV 227
             CS   G +V +S   G         LA     +Q   GF  G  +++S  L   ++T 
Sbjct: 116 NTCSVAPGNSVTQSAMSG--------ELAEDILSIQSSNGFNPGQNVMVSRFLFSCARTF 167

Query: 228 PNFLAGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK----FDDAPV 278
              L G +       +G+AG GR+  +LPSQL       KKF+ CL S K    F D P 
Sbjct: 168 --LLEGLA----SGASGMAGLGRNKLALPSQLASAFSFAKKFAICLSSSKGVVLFGDGPY 221

Query: 279 S--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF-----YYVGLRQIIVGSKH 331
               N+V D+           L+YTP   NP  S++AF +      Y++G++ I +  K 
Sbjct: 222 GFLPNVVFDS---------KSLTYTPLLINPF-STAAFAKSEPSAEYFIGVKTIKIDGKV 271

Query: 332 VKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
           V +  S L +  S+G GG  + +   +T +E  +++AV   F++     S A ++++   
Sbjct: 272 VSLDTSLLSIDSSNGAGGTKISTVDPYTVLEASIYKAVTDAFVKA----SAARNIKRVDS 327

Query: 391 LRP---CF-DISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
           + P   C+ +++G +    +P + L  +      +   N    + +EVLCL        G
Sbjct: 328 VAPFEFCYTNVTGTRLGADVPTIELYLQNNVIWRIFGANSMVNINDEVLCL--------G 379

Query: 446 PALG---RGPAIILGDFQLQNFYLEFDLANDRFGFA 478
             +G      +I++G +QL+N  L+FDLA  + GF+
Sbjct: 380 FVIGGENTWASIVIGGYQLENNLLQFDLAASKLGFS 415


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 112/405 (27%), Positives = 175/405 (43%), Gaps = 72/405 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRS 160
           G + + +S GTPP A+   + DTGS+L W  C    RC + C+     P     F P +S
Sbjct: 73  GKFFMDISLGTPPVANLVTV-DTGSTLSWVVCQ---RCQISCH--TTAPEAGSVFDPDKS 126

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG----FTAGLLL 216
           ++ +L+GC +  C+ +    V     GC     TC      Y L+YG G    ++AG L 
Sbjct: 127 TTYELVGCSSRDCADVQRSLVAPF--GCIEETDTC-----LYSLRYGSGPSGQYSAGRLG 179

Query: 217 SETLRFPSKT--VPNFLAGCSILSDRQ--PAGIAGFGRSSESLPSQLG----LKKFSYCL 268
           ++ L   S +  +  F+ GCS     +   +G+ GFG ++ S  +Q+      + FSYC 
Sbjct: 180 TDKLTLASSSSIIDGFIFGCSGDDSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCF 239

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSS---AFGEFYYVGLRQ 324
                                 GD    G LS   + K+ +  ++    FG+     L+Q
Sbjct: 240 ---------------------PGDHTAEGFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQ 278

Query: 325 I--IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           I  +V    +++  S           ++VDSG+  TF+ GP+F+A +K     M      
Sbjct: 279 IDMMVDGNRLQVDQSEYTKRM-----MVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFL 333

Query: 383 ADVEKKSGLRPCFDISGKKSV---YLPELILKFKGGAKMALPPENYFA--LVGNEVLCLI 437
           +D     G   CF  +G  SV    LP + ++F  G  + LPPEN F   L  ++ +CL 
Sbjct: 334 SDT---VGTETCFRPNGGDSVDSGDLPTVEMRFI-GTTLKLPPENVFHDLLPSHDKICLA 389

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              D A     G     ILG+    +F + +DL    FGF    C
Sbjct: 390 FKPDVA-----GVRNVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 114/397 (28%), Positives = 154/397 (38%), Gaps = 64/397 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y ++L  GTP    T  + DTGS L W       +C  C        + P F P  SSS 
Sbjct: 171 YVVTLGIGTPAVQQT-VLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSY 223

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C +  C  +        C G S        A   Y ++YG    T G+  +ETL  
Sbjct: 224 ASVPCDSDACRKLAAGAYGHGCTGVSGGAA----ALCEYGIEYGNRATTTGVYSTETLTL 279

Query: 223 -PSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
            P   V +F  GC    D Q        G+ G G + ESL SQ   +    FSYCL    
Sbjct: 280 KPGVVVADFGFGCG---DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL---- 332

Query: 273 FDDAPVSSN---LVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               P S     L L   P S  S    GLS+TP  + P     +   FY V L  I VG
Sbjct: 333 ---PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLP-----SVPTFYIVTLTGISVG 384

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
              + IP S        + G+++DSG+  T +    + A+   F   M  Y R       
Sbjct: 385 GAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNG 437

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF---TDNAAG 445
             L  C+D +G  +V +P + L F GGA + L       + G    CL      TDNA G
Sbjct: 438 GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNAIG 493

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                    I+G+   + F + +D      GF    C
Sbjct: 494 ---------IIGNVNQRTFEVLYDSGKGTVGFRAGAC 521


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 101/384 (26%), Positives = 156/384 (40%), Gaps = 53/384 (13%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           ++S G PP      + DTGS ++W  CT    C +      DPS+   F P   +     
Sbjct: 104 NISIGQPPIPQL-VVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCDFE 162

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
           GC+                  C P   T   A  S        F    ++ ET    +  
Sbjct: 163 GCR------------------CDPIPFTVTYADNS---TASGTFGRDTVVFETTDEGTSR 201

Query: 227 VPNFLAGC--SILSDRQPA--GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
           + + L GC  +I  D  P   GI G     +SL ++LG +KFSYC+     D       L
Sbjct: 202 ISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLG-QKFSYCI-GNLADPYYNYHQL 259

Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
           +L  G     +   G S TPF          +  FYYV +  I VG K + I        
Sbjct: 260 ILGEG-----ADLEGYS-TPF--------EVYNGFYYVTMEGISVGEKRLDIAPETFEMK 305

Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
            +  GGVI+D+GST TF+   + + ++KE    +G   R A +EK   ++  +    +  
Sbjct: 306 ENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDL 365

Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA----LGRGPAIILGD 458
           V  P +   F  GA +AL   ++F  + + V C+ +      GP     +   P++I G 
Sbjct: 366 VGFPVVTFHFSDGADLALDSGSFFNQLNDNVFCMTV------GPVSSLNIKSKPSLI-GL 418

Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
              Q++ + +DL N    F +  C
Sbjct: 419 LAQQSYNVGYDLVNQFVYFQRIDC 442


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 109/407 (26%), Positives = 166/407 (40%), Gaps = 69/407 (16%)

Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           +G Y  S+  G+P Q +   I DTGS L W  C     C     P+VD      +   RS
Sbjct: 97  FGEYYTSIKLGSPGQEAI-LIVDTGSELTWLQCLPCKVCA----PSVDT----IYDAARS 147

Query: 161 SSSQLIGCQNPK-CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           +S + + C N + CS     N          R   C  A       YG G F+ G L ++
Sbjct: 148 ASYRPVTCNNSQLCS-----NSSQGTYAYCARGSQCQFAA-----FYGDGSFSYGSLSTD 197

Query: 219 TLRFPSK------TVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
           TL   +       TV +F  GC+     L     +GI G      +LP QLG +   KFS
Sbjct: 198 TLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFS 257

Query: 266 YCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           +C         P  S+ +  TG    G+ +     + YT      + +S    +FY+V L
Sbjct: 258 HCF--------PDRSSHLNSTGVVFFGNAELPHEQVQYTSV---ALTNSELQRKFYHVAL 306

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           + + + S  +     +L  GS     VI+DSGS+F+    P    + + F++      + 
Sbjct: 307 KGVSINSHEL----VFLPRGSV----VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKH 358

Query: 383 ADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYF---ALVGNEVLC 435
            + +    L  CF +S          LP L L F+ G  + +P        A   N V  
Sbjct: 359 LEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKM 418

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              F D       G  P  ++G++Q QN ++E+D+   R GFA+  C
Sbjct: 419 CFAFEDG------GPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 106/392 (27%), Positives = 162/392 (41%), Gaps = 59/392 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y ++L  GTP    T  I DTGS L W       +C  CN  +  P + P + P  SS+ 
Sbjct: 127 YVVTLGIGTPAVQQTVLI-DTGSDLSWV------QCKPCNSSSCYPQKDPLYDPTASSTY 179

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TAGLLLSE 218
             + C +  C  +  P+      GC+  + T        L QYG+ +     T G+  +E
Sbjct: 180 APVPCDSKACKDLV-PDAYDH--GCTNSSGTS-------LCQYGIEYGNRDTTVGVYSTE 229

Query: 219 TLRF-PSKTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
           TL   P  +V +F  GC ++   +     G+ G G + ESL SQ        FSYCL   
Sbjct: 230 TLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPG 289

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                  S+   L  G  + ++ T G  +TP +  P  ++     FY V L  + VG K 
Sbjct: 290 N------STTGFLALGAPTNNNDTAGFLFTPLHSLPEQAT-----FYLVNLTGVSVGGKP 338

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + IP + L      +GG+I+DSG+  T +    + A+   F   M  Y           L
Sbjct: 339 LDIPPTVL------SGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPP-NNDDVL 391

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA-GPALGR 450
             C++ +G  +V +P + L F GGA + L           +V   +L  D  A       
Sbjct: 392 DTCYNFTGIANVTVPTVALTFDGGATIDL-----------DVPSGVLIQDCLAFAGGASD 440

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           G   I+G+   + F + +D      GF    C
Sbjct: 441 GDVGIIGNVNQRTFEVLYDSGRGHVGFRPGAC 472


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 172/411 (41%), Gaps = 61/411 (14%)

Query: 85  SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           S   N+ +K    + S G Y+  L  GTPPQ     I DTGS++ + PC++   C  C  
Sbjct: 57  SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFA-LIVDTGSTVTYVPCST---CKQCG- 111

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
                 + P F P+ S+S Q + C NP C      N +   K C    +   ++  S +L
Sbjct: 112 ----KHQDPKFQPELSTSYQALKC-NPDC------NCDDEGKLCVYERRYAEMSSSSGVL 160

Query: 205 QYGLGFTAGLLLSETLRFPSKTVPNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQL 259
              L  + G   +E+   P + V     GC       L  ++  GI G GR   S+  QL
Sbjct: 161 SEDL-ISFG---NESQLSPQRAV----FGCENEETGDLFSQRADGIMGLGRGKLSVVDQL 212

Query: 260 GLKKFSYCLLSRKFDDAPVSSN-LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
             K     + S  +    V    +VL        S  PG+ ++  + +P  S      +Y
Sbjct: 213 VDKGVIEDVFSLCYGGMEVGGGAMVL-----GKISPPPGMVFS--HSDPFRSP-----YY 260

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            + L+Q+ V  K +K+         +G  G ++DSG+T+ +     F A+    I+++ +
Sbjct: 261 NIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPS 316

Query: 379 YSRAADVEKKSGLRPCFDISGKKSV----YLPELILKFKGGAKMALPPENYF--ALVGNE 432
             R    +       CF  +G+       + PE+ ++F  G K+ L PENY         
Sbjct: 317 LKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRG 375

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             CL +F D        R    +LG   ++N  + +D  ND+ GF K  C+
Sbjct: 376 AYCLGIFPD--------RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score = 91.7 bits (226), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 152/387 (39%), Gaps = 61/387 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +++S GTP  A T    DTGS + W       +C  C  P     R P F P RSSS 
Sbjct: 142 YVVTVSLGTPAVAQT-LEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSY 194

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
             + C    CS      +     GCS     C      Y++ YG G  T G+  S+TL  
Sbjct: 195 SAVPCAAASCS-----QLALYSNGCS--GGQC-----GYVVSYGDGSTTTGVYSSDTLTL 242

Query: 223 P-SKTVPNFLAGCSILSDRQPAGIA---GFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
             S  +  FL GC        AG+    G GR  +SL SQ        FSYCL       
Sbjct: 243 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------- 295

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P + N V     G G S T G S TP       ++S    +Y V L  I VG + + I 
Sbjct: 296 -PPTQNSVGYISLG-GPSSTAGFSTTPLL-----TASNDPTYYIVMLAGISVGGQPLSID 348

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S    G+      +VD+G+  T +    + A+   F   M  Y   +       L  C+
Sbjct: 349 ASVFASGA------VVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPS-APATGILDTCY 401

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           D +   +V LP + + F GGA M L              CL      A  P  G   A I
Sbjct: 402 DFTRYGTVTLPTISIAFGGGAAMDLGTSGIL-----TSGCL------AFAPTGGDSQASI 450

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG+ Q ++F + FD      GF    C
Sbjct: 451 LGNVQQRSFEVRFD--GSTVGFMPASC 475


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 107/412 (25%), Positives = 174/412 (42%), Gaps = 63/412 (15%)

Query: 85  SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           S   N+ +K    + S G Y+  L  GTPPQ     I DTGS++ + PC++   C  C  
Sbjct: 57  SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFA-LIVDTGSTVTYVPCST---CKQCG- 111

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
                 + P F P+ S+S Q + C NP C      N +   K C    +   ++  S +L
Sbjct: 112 ----KHQDPKFQPELSTSYQALKC-NPDC------NCDDEGKLCVYERRYAEMSSSSGVL 160

Query: 205 QYGLGFTAGLLLSETLRFPSKTVPNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQL 259
              L  + G   +E+   P + V     GC       L  ++  GI G GR   S+  QL
Sbjct: 161 SEDL-ISFG---NESQLSPQRAV----FGCENEETGDLFSQRADGIMGLGRGKLSVVDQL 212

Query: 260 GLKKFSYCLLSRKFDDAPVSSN-LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
             K     + S  +    V    +VL        S  PG+ ++  + +P  S      +Y
Sbjct: 213 VDKGVIEDVFSLCYGGMEVGGGAMVL-----GKISPPPGMVFS--HSDPFRSP-----YY 260

Query: 319 YVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
            + L+Q+ V  K +K+ P  +     +G  G ++DSG+T+ +     F A+    I+++ 
Sbjct: 261 NIDLKQMHVAGKSLKLNPKVF-----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIP 315

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSV----YLPELILKFKGGAKMALPPENYF--ALVGN 431
           +  R    +       CF  +G+       + PE+ ++F  G K+ L PENY        
Sbjct: 316 SLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVR 374

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              CL +F D        R    +LG   ++N  + +D  ND+ GF K  C+
Sbjct: 375 GAYCLGIFPD--------RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 112/448 (25%), Positives = 190/448 (42%), Gaps = 105/448 (23%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R RHL+   KP +         SN+ ++    + + G Y+  L  G+PPQ     I DTG
Sbjct: 60  RLRHLQNLVKPHS---------SNARMRLHDDLLTNGYYTTRLWIGSPPQ-EFALIVDTG 109

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S++ + PC++   CV C        + P F P+ SS+ Q + C N  C+           
Sbjct: 110 STVTYVPCSN---CVQCG-----NHQDPRFQPELSSTYQPVKC-NADCN----------- 149

Query: 186 KGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILSD-- 239
             C      C     +Y  +Y  +  ++G+L  + + F   S+ VP   + GC  +    
Sbjct: 150 --CDENGVQC-----TYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGD 202

Query: 240 ---RQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS- 290
              ++  GI G GR + S+  QL  K      FS C                +D G G+ 
Sbjct: 203 LYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGG-------------MDVGGGAM 249

Query: 291 ---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGN 346
              G S  PG+ ++  + +P  S      +Y + L++I V  K +K+ P ++     DG 
Sbjct: 250 VLGGISSPPGMVFS--HSDPSRSP-----YYNIELKEIHVAGKPLKLNPRTF-----DGK 297

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKK 401
            G I+DSG+T+ +     + A     ++++      + +++ SG  P     CF  +G+ 
Sbjct: 298 YGAILDSGTTYAYFPEKAYYAFKDAIMKKI------SFLKQISGPDPNFKDICFSGAGRD 351

Query: 402 SVYL----PELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAII 455
              L    PE+ + F  G K++L PENY       +   CL +F +       G     +
Sbjct: 352 VTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKN-------GNDQTTL 404

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LG   ++N  + ++  N   GF K  C+
Sbjct: 405 LGGIIVRNTLVTYNRENSTIGFWKTNCS 432


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 113/399 (28%), Positives = 160/399 (40%), Gaps = 62/399 (15%)

Query: 91  LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
           L++TP        Y +    GTPPQ       DT +   W PC+    C  C      P+
Sbjct: 102 LLQTPT-------YVVRARLGTPPQQLL-LAVDTSNDAAWIPCSG---CAGC------PT 144

Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
             P F P  S S + + C +P CS    P+       CS   K+C      + L Y    
Sbjct: 145 TTP-FNPAASKSYRAVPCGSPACSRAPNPS-------CSLNTKSC-----GFSLTYADSS 191

Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
               L  ++L   +  V ++  GC   +  +   P G+ G GR   S  SQ   +    F
Sbjct: 192 LEAALSQDSLAVANDVVKSYTFGCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTF 251

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           SYCL S  F     S  L L      G    P  +  TP   NP  SS      YYV + 
Sbjct: 252 SYCLPS--FKSLNFSGTLRL------GRKGQPLRIKTTPLLVNPHRSS-----LYYVSMT 298

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I VG K V IP + L        G ++DSG+ FT +  P + AV  E  R++    R A
Sbjct: 299 GIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRI----RGA 354

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
            +    G   C++     +V  P +   F  G ++ LP +N   LV +           A
Sbjct: 355 PLSSLGGFDTCYNT----TVKWPPVTFMFT-GMQVTLPADN---LVIHSTYGTTSCLAMA 406

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           A P        ++   Q QN  + FD+ N R GFA+++C
Sbjct: 407 AAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQC 445


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 116/407 (28%), Positives = 179/407 (43%), Gaps = 68/407 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     I DTGS ++W  C S   C  C   +    ++  F P+ SS
Sbjct: 75  GLYYTKVKLGTPPREFYVQI-DTGSDVLWVSCGS---CNGCPQTSGLQIQLNYFDPRSSS 130

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +S LI C + +C       V++    CS +N  C     +Y  QYG G  T+G  +S+ +
Sbjct: 131 TSSLISCSDRRCR----SGVQTSDASCSSQNNQC-----TYTFQYGDGSGTSGYYVSDLM 181

Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
            F         + +  + + GCSIL       S+R   GI GFG+   S+ SQL L    
Sbjct: 182 HFAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIA 241

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL   K D++     LVL      G+   P + Y+P  ++           Y +
Sbjct: 242 PRVFSHCL---KGDNSG-GGVLVL------GEIVEPNIVYSPLVQSQ--------PHYNL 283

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I V  + V  P +  V  +  N G IVDSG+T  +    L E     F+  +    
Sbjct: 284 NLQSISVNGQIV--PIAPAVFATSNNRGTIVDSGTTLAY----LAEEAYNPFVNAITALV 337

Query: 381 RAADVEKKSGLRPCFDISGKKSVYL-PELILKFKGGAKMALPPENYFA---LVGNEVLCL 436
             +     S    C+ I+   +V + P++ L F GGA + L P++Y      +G   +  
Sbjct: 338 PQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWC 397

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I F     G ++      ILGD  L++    +DLA  R G+A   C+
Sbjct: 398 IGF-QRIPGQSI-----TILGDLVLKDKIFVYDLAGQRIGWANYDCS 438


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 160/391 (40%), Gaps = 64/391 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAFIPKRS 160
           G Y I++ FGTP +  T  +FDTGS + W  C     RC           + P F P  S
Sbjct: 14  GNYVITVGFGTPTRTQT-VVFDTGSDVNWLQCKPCAVRCY--------AQQEPLFDPSLS 64

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           S+ + + C  P C       V    +GCS  + TC      Y + YG G  T G L  +T
Sbjct: 65  STYRNVSCTEPAC-------VGLSTRGCS--SSTCL-----YGVFYGDGSSTIGFLAMDT 110

Query: 220 LRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSE-SLPSQLG---LKKFSYCLLSR 271
               P++   NF+ GC   +    +  AG+ G GRSS  SL SQ+       FSYCL S 
Sbjct: 111 FMLTPAQKFKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPST 170

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
                  S+   L+ G       TPG  YT    +     +     Y++ L  I VG   
Sbjct: 171 S------SATGYLNIG---NPQNTPG--YTAMLTD-----TRVPTLYFIDLIGISVGGTR 214

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + +  +        + G I+DSG+  T +    + A+       M  Y+ A  V     L
Sbjct: 215 LSLSSTVF-----QSVGTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTI---L 266

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+D S   SV  P ++L F  G  + +P    F +  +  +CL  F  N     +G  
Sbjct: 267 DTCYDFSRTTSVVYPVIVLHFA-GLDVRIPATGVFFVFNSSQVCL-AFAGNTDSTMIG-- 322

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+ Q     + +D    R GF+   C
Sbjct: 323 ---IIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 129/500 (25%), Positives = 196/500 (39%), Gaps = 97/500 (19%)

Query: 11  LFSLLILLFTTDAGAGSSAAT-----VTVPLTPLSTKHYLHHSDS------DPLKILHSL 59
           +FSL+I++    + A  SAAT      TV L          H DS      +PL+  +  
Sbjct: 4   IFSLVIVIIFLISTAVVSAATGPDYGFTVELI---------HRDSPKSPMYNPLENHYHR 54

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
            + +L R+    T     T ++ I +N               G Y + LS GTPP     
Sbjct: 55  VADTLRRSISHNTGLVTNTVEAPIYNNR--------------GEYLMKLSVGTPPFPIIA 100

Query: 120 FIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
            + DTGS ++W    PCT+ Y+             +P F P +S++ + + C +P CS+ 
Sbjct: 101 -VADTGSDIIWTQCEPCTNCYQ-----------QDLPMFNPSKSTTYRKVSCSSPVCSFT 148

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-----VPNF 230
              N  S    C            +Y + YG    + G    +TL   S +      P  
Sbjct: 149 GEDNSCSFKPDC------------TYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRT 196

Query: 231 LAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
             GC   +    D   +GI G G    SL  Q+G     KFSYCL     DD   +    
Sbjct: 197 AIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNK--- 253

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
           L+ G  +  S +  +S TP Y      S  F  FY + L+ + VG  +     +  + G 
Sbjct: 254 LNFGSNANVSGSGAVS-TPIYI-----SDKFKSFYSLKLKAVSVGRNNTFYSTANSILG- 306

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
            G   +I+DSG+T T +   L+   AK     + N  R  D  +   L  CF+ +     
Sbjct: 307 -GKANIIIDSGTTLTLLPVDLYHNFAKAISNSI-NLQRTDDPNQF--LEYCFETT-TDDY 361

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            +P + + F+ GA + L  EN    V + V+CL       A          I G+    N
Sbjct: 362 KVPFIAMHFE-GANLRLQRENVLIRVSDNVICL-------AFAGAQDNDISIYGNIAQIN 413

Query: 464 FYLEFDLANDRFGFAKQKCA 483
           F + +D+ N    F    C 
Sbjct: 414 FLVGYDVTNMSLSFKPMNCV 433


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 114/407 (28%), Positives = 168/407 (41%), Gaps = 72/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     I DTGS ++W  CTS   C  C   +    ++  F P  SS
Sbjct: 82  GLYYTKVKLGTPPREFNVQI-DTGSDVLWVSCTS---CNGCPKTSELQIQLSFFDPGVSS 137

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S+ L+ C + +C   F    ES   GCSP N  C     SY  +YG G  T+G  +S+ +
Sbjct: 138 SASLVSCSDRRCYSNF--QTES---GCSP-NNLC-----SYSFKYGDGSGTSGYYISDFM 186

Query: 221 RFPSKTVPN--------FLAGCSILSD-------RQPAGIAGFGRSSESLPSQLGL---- 261
            F +             F+ GCS L         R   GI G G+ S S+ SQL +    
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            + FS+CL      D      +VL      G  K P   YTP   +           Y V
Sbjct: 247 PRVFSHCLKG----DKSGGGIMVL------GQIKRPDTVYTPLVPSQ--------PHYNV 288

Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L+ I V  + + I P  + +   DG    I+D+G+T  ++    +    +     +  Y
Sbjct: 289 NLQSIAVNGQILPIDPSVFTIATGDGT---IIDTGTTLAYLPDEAYSPFIQAVANAVSQY 345

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY---FALVGNEVLCL 436
            R    E       CF+I+       P++ L F GGA M L P  Y   F+  G+ + C+
Sbjct: 346 GRPITYESYQ----CFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCI 401

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                      +      ILGD  L++  + +DL   R G+A+  C+
Sbjct: 402 -------GFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 131/468 (27%), Positives = 193/468 (41%), Gaps = 90/468 (19%)

Query: 37  TPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL 96
           +PLS  +  +H+D D L+   +  S S+SR    KTK        +I S + N L+    
Sbjct: 43  SPLSPLYNPNHTDFDRLR---NAFSRSISRVNVFKTKA------VDINS-FQNDLVPNG- 91

Query: 97  SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW---FPCTSRYRCVDCNFPNVDPSRIP 153
                G Y + +S GTP       I DTGS L W    PC   YR            + P
Sbjct: 92  -----GEYFMKMSIGTP-LVEVIVIADTGSDLTWVQCLPCDPCYR-----------QKSP 134

Query: 154 AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTA 212
            F P RSSS + + C +  C+ +     +   + C+     C      Y   YG   +T 
Sbjct: 135 LFDPSRSSSYRHMLCGSRFCNAL-----DVSEQACTMDTNIC-----EYHYSYGDKSYTN 184

Query: 213 GLLLSETLRFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLG--L 261
           G L +E     S +     +   + GC   +    D   +GI G G  + SL SQL   +
Sbjct: 185 GNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSII 244

Query: 262 K-KFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
           K KFSYCL+    + + V+S +   T     G     TP +S  P              +
Sbjct: 245 KGKFSYCLVPLS-EQSNVTSKIKFGTDSVISGPQVVSTPLVSKQP------------DTY 291

Query: 318 YYVGLRQIIVGSKHVKIPYSY-LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           YYV L  I VG+K  ++PY+  L+ G+   G VI+DSG+T TF++   F     E  R +
Sbjct: 292 YYVTLEAISVGNK--RLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFF----TELERVL 345

Query: 377 GNYSRAADVEKKSGL-RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
               +A  V    GL   CF  +G   + LP + + F   A + L P N F     ++LC
Sbjct: 346 EETVKAERVSDPRGLFSVCFRSAGD--IDLPVIAVHFN-DADVKLQPLNTFVKADEDLLC 402

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             + + N  G         I G+    +F + +DL      F    C 
Sbjct: 403 FTMISSNQIG---------IFGNLAQMDFLVGYDLEKRTVSFKPTDCT 441


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 92/340 (27%), Positives = 136/340 (40%), Gaps = 59/340 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC----NFPNVDPSRIPAFIPKR 159
           Y++   +GTP Q   P  FD           S  RC  C    +      +   AF P  
Sbjct: 138 YTVLAGYGTPAQ-QLPLFFDVSG-------MSNMRCKPCFSGSSGGETTTTCDVAFDPSM 189

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
           SSS + + C +P C              CS        +C   L      F  G ++ +T
Sbjct: 190 SSSFRSVLCGSPDCGG----------HSCSAGG-----SCTFTLQNSTFVFGNGTIVMDT 234

Query: 220 LRF-PSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQL------GLKKFSYC 267
           L   PS T  NF  GC      + +D    G      S  SL +++      G+  FSYC
Sbjct: 235 LTLSPSATFENFAVGCMQLDNDLFTDGVAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYC 294

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           L       A   ++  L   P   D S   G+ Y P   NP G +     FYYV L  I 
Sbjct: 295 L------PADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPN-----FYYVDLVAIA 343

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           +  + + IP +       GNG  ++DS S FT++  P++ A+  EF + M  Y     V 
Sbjct: 344 INGEDLPIPPALFT----GNG-TMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQ---PVP 395

Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF 426
              GL  C++ +  +++YLP++ L+F  G  M L    + 
Sbjct: 396 AFGGLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFM 435


>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
          Length = 382

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 81/250 (32%), Positives = 118/250 (47%), Gaps = 28/250 (11%)

Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGL 298
           P+G+ G GR   SL SQ G  KFSYC L+  F +   + +L +       G GD  T   
Sbjct: 151 PSGLMGLGRGRLSLVSQTGATKFSYC-LTPYFHNNGATGHLFVGASASLGGHGDVMT--- 206

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY-----LVPGSDGNGGVIVDS 353
             T F K P GS      FYY+ L  + VG   + IP +      + PG   +GGVI+DS
Sbjct: 207 --TQFVKGPKGS-----PFYYLPLIGLTVGETRLPIPATVFDLREVAPGL-FSGGVIIDS 258

Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFK 413
           GS FT +    ++A+A E   ++     A   +   G   C        V +P ++  F+
Sbjct: 259 GSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGAL-CVARRDVGRV-VPAVVFHFR 316

Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
           GGA MA+P E+Y+A V             +AGP   +    ++G++Q QN  + +DLAN 
Sbjct: 317 GGADMAVPAESYWAPVDKAA---ACMAIASAGPYRRQS---VIGNYQQQNMRVLYDLANG 370

Query: 474 RFGFAKQKCA 483
            F F    C+
Sbjct: 371 DFSFQPADCS 380


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 121/418 (28%), Positives = 173/418 (41%), Gaps = 61/418 (14%)

Query: 77  KTKDSNIGSNY--SNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT 134
           K K   +G +   S+S+  TP +  + G Y   L  GTP   S   + DTGSSL W  C+
Sbjct: 102 KKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTP-ATSYVMVVDTGSSLTWLQCS 160

Query: 135 SRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNK 193
               C V C+         P F P+ S +   + C + +C  +    +      CS  N 
Sbjct: 161 P---CSVSCHR-----QAGPVFDPRASGTYAAVQCSSSECGELQAATLNP--SACSVSNV 210

Query: 194 TCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFG 249
                   Y   YG   ++ G L  +T+ F S + P F  GC   ++    + AG+ G  
Sbjct: 211 CI------YQASYGDSSYSVGYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLA 264

Query: 250 RSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYK 305
           ++  SL  QL       FSYCL        P SS      G  S  S  PG  SYTP   
Sbjct: 265 KNKLSLLYQLAPSLGYAFSYCL--------PTSSAAA---GYLSIGSYNPGQYSYTP--- 310

Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
             + SSS     Y+V L  I V    + +P     P    +   I+DSG+  T +   ++
Sbjct: 311 --MASSSLDASLYFVTLSGISVAGAPLAVP-----PSEYRSLPTIIDSGTVITRLPPNVY 363

Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY 425
            A+++     M   S A      S L  CF  S    + +P + + F GGA +AL P N 
Sbjct: 364 TALSRAVAAAM--ASAAPRAPTYSILDTCFRGS-AAGLRVPRVDMAFAGGATLALSPGNV 420

Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              V +   CL      A  P    G   I+G+ Q Q F + +D+A  R GFA   C+
Sbjct: 421 LIDVDDSTTCL------AFAPT---GGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469


>gi|224066523|ref|XP_002302122.1| predicted protein [Populus trichocarpa]
 gi|222843848|gb|EEE81395.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 104/410 (25%), Positives = 168/410 (40%), Gaps = 89/410 (21%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           PQ     + D G   +W         VDC+   V  +  PA             C +  C
Sbjct: 54  PQVPINLVVDLGGQFLW---------VDCDKNYVSSTYRPA------------RCGSALC 92

Query: 174 SWIFGPNVESRCKGCS-----PR----NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
           S        +R  GC      PR    N TC +   + + +   G   G L ++ +   S
Sbjct: 93  SL-------ARAGGCGDCFSGPRPGCNNNTCGVIPDNTVTRTATG---GELATDVVSVNS 142

Query: 225 K---------TVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL-----KKFS 265
                     +VP FL  C+     Q       G+AG GR+  + PSQ        +KF+
Sbjct: 143 TNGSNPGREASVPRFLFSCAPTFLLQGLASGVVGMAGLGRTRIAFPSQFASAFSFNRKFA 202

Query: 266 YCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE----- 316
            CL S     AP    ++   GP     +    +  LS+TP + NPV ++SAF +     
Sbjct: 203 ICLTS----PAPAKGVIIFGDGPYNFLPNIQLTSQSLSFTPLFINPVSTASAFSQGEPSA 258

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
            Y++G++ I +  K V +  + L   S G GG  + + + +T +E  +F AV + FI   
Sbjct: 259 EYFIGVKSIRISDKTVPLNATLLSIDSQGKGGTKISTVNPYTVLESSIFNAVTRAFI--- 315

Query: 377 GNYSRAADVEKKSGLRP---CFD----ISGKKSVYLPELILKFKG-GAKMALPPENYFAL 428
            N S A ++ + + + P   CF      S +    +P + L  +       +   N    
Sbjct: 316 -NESAARNITRVASVAPFDVCFSSDNIFSTRLGAAVPTISLVLQNENVIWRIFGANSMVQ 374

Query: 429 VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           V + VLCL  F +  + P      +I++G +QL++   +FDLA  R GF+
Sbjct: 375 VSDNVLCL-GFVNGGSNPTT----SIVIGGYQLEDNLFQFDLAASRLGFS 419


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 108/418 (25%), Positives = 170/418 (40%), Gaps = 62/418 (14%)

Query: 90  SLIKTPLSVHSY---GGYSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNF 144
           S    PL+  +Y   G Y +    GTP Q   PF+   DTGS L W  C  R      + 
Sbjct: 93  SAFAMPLTSGAYTGTGQYFVQFRVGTPAQ---PFVLVADTGSDLTWVKCRGR----RASS 145

Query: 145 PNVDPSRIP-AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
           P+  P   P  F P  S S   I C +  C      ++ +   G +P     P  C  Y 
Sbjct: 146 PDASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTP-----PAPC-GYD 199

Query: 204 LQY-------GLGFT--AGLLLSETLRFPSKTVPNFLAGCSILSDRQP----AGIAGFGR 250
            +Y       G+  T  A + LS +       +   + GC+   D Q      G+   G 
Sbjct: 200 YRYKDKSSARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGN 259

Query: 251 SSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP 307
           S+ S  S+   +   +FSYCL+      AP ++   L  GP        G +++P  + P
Sbjct: 260 SNISFASRAAARFGGRFSYCLVDHL---APRNATSYLTFGP-------VGAAHSP-SRTP 308

Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
           +   +    FY V +  + V  K + IP    V     NGG I+DSG++ T +  P ++A
Sbjct: 309 LLLDAQVAPFYAVTVDAVSVAGKALNIPAE--VWDVKKNGGAILDSGTSLTILATPAYKA 366

Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYF 426
           V     +Q+    R            C++ +  ++   +P L ++F G A++  P ++Y 
Sbjct: 367 VVAALSKQLARVPRV----TMDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYV 422

Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                 V C+        G   G  P + ++G+   Q    EFDLAN    F + +CA
Sbjct: 423 IDAAPGVKCI--------GLQEGVWPGVSVIGNILQQEHLWEFDLANRWLRFQESRCA 472


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 112/448 (25%), Positives = 190/448 (42%), Gaps = 105/448 (23%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R RHL+   KP +         SN+ ++    + + G Y+  L  G+PPQ     I DTG
Sbjct: 60  RLRHLQNLVKPHS---------SNARMRLHDDLLTNGYYTTRLWIGSPPQ-EFALIVDTG 109

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S++ + PC++   CV C        + P F P+ SS+ Q + C N  C+           
Sbjct: 110 STVTYVPCSN---CVQCG-----NHQDPRFQPELSSTYQPVKC-NADCN----------- 149

Query: 186 KGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILSD-- 239
             C      C     +Y  +Y  +  ++G+L  + + F   S+ VP   + GC  +    
Sbjct: 150 --CDENGVQC-----TYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGD 202

Query: 240 ---RQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS- 290
              ++  GI G GR + S+  QL  K      FS C                +D G G+ 
Sbjct: 203 LYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGG-------------MDVGGGAM 249

Query: 291 ---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGN 346
              G S  PG+ ++  + +P  S      +Y + L++I V  K +K+ P ++     DG 
Sbjct: 250 VLGGISSPPGMVFS--HSDPSRSP-----YYNIELKEIHVAGKPLKLNPRTF-----DGK 297

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKK 401
            G I+DSG+T+ +     + A     ++++      + +++ SG  P     CF  +G+ 
Sbjct: 298 YGAILDSGTTYAYFPEKAYYAFKDAIMKKI------SFLKQISGPDPNFKDICFSGAGRD 351

Query: 402 SVYL----PELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAII 455
              L    PE+ + F  G K++L PENY       +   CL +F +       G     +
Sbjct: 352 VTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKN-------GNDQTTL 404

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LG   ++N  + ++  N   GF K  C+
Sbjct: 405 LGGIIVRNTLVTYNRENSTIGFWKTNCS 432


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 134/510 (26%), Positives = 204/510 (40%), Gaps = 95/510 (18%)

Query: 1   MAACPFSLICLFSLLILLFTTDAGAGSSAATVTV-----PLTPLSTKHYLHHSDSDPLKI 55
           MA   F L C    +   F +++ A     TV +     P +PL   +  HH+ SD L  
Sbjct: 1   MATKTF-LYCSLLAISFFFASNSSANRENLTVELIHRDSPHSPL---YNPHHTVSDRL-- 54

Query: 56  LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQ 115
            ++    S+SR+R   TKT                 +++ L + + G Y +S+S GTPP 
Sbjct: 55  -NAAFLRSISRSRRFTTKTD----------------LQSGL-ISNGGEYFMSISIGTPP- 95

Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSW 175
           +    I DTGS L W  C    +C   N         P F  K+SS+ +   C +  C  
Sbjct: 96  SKVFAIADTGSDLTWVQCKPCQQCYKQN--------SPLFDKKKSSTYKTESCDSKTCQA 147

Query: 176 IFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTV-----PN 229
           +         +GC      C      Y   YG   FT G + +ET+   S +      P 
Sbjct: 148 L-----SEHEEGCDESKDICK-----YRYSYGDNSFTKGDVATETISIDSSSGSSVSFPG 197

Query: 230 FLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNL 282
            + GC   +    +   +GI G G    SL SQLG    KKFSYCL       A  +   
Sbjct: 198 TVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCL---SHTAATTNGTS 254

Query: 283 VLDTGPG---SGDSKTPGLSYTPF-YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS- 337
           V++ G     S  SK      TP   K+P         +Y++ L  + VG    K+PY+ 
Sbjct: 255 VINLGTNSIPSNPSKDSATLTTPLIQKDP-------ETYYFLTLEAVTVG--KTKLPYTG 305

Query: 338 --YLVPG--SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
             Y + G  S   G +I+DSG+T T ++   ++         +    R +D   +  L  
Sbjct: 306 GGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSD--PQGLLTH 363

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
           CF  SG K + LP + + F   A + L P N F  +  + +CL +               
Sbjct: 364 CFK-SGDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTTEVA-------- 413

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            I G+    +F + +DL      F +  C+
Sbjct: 414 -IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 102/413 (24%), Positives = 163/413 (39%), Gaps = 80/413 (19%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPA 154
           V S G Y   +  G+PP+     + DTGS ++W  C    +C    + NF      R+  
Sbjct: 68  VDSVGLYFTKIKLGSPPKEYHVQV-DTGSDILWINCKPCPKCPTKTNLNF------RLSL 120

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           F    SS+S+ +GC +  CS+I      S+   C P      L C  +++      + G 
Sbjct: 121 FDMNASSTSKKVGCDDDFCSFI------SQSDSCQP-----ALGCSYHIVYADESTSDGK 169

Query: 215 LLSETLRFPS-----KTVP---NFLAGCSILS-------DRQPAGIAGFGRSSESLPSQL 259
            + + L         KT P     + GC           D    G+ GFG+S+ S+ SQL
Sbjct: 170 FIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQL 229

Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG---SGDSKTPGLSYTPFYKNPVGSS 311
                  + FS+CL + K              G G    G   +P +  TP   N +   
Sbjct: 230 AATGDAKRVFSHCLDNVK--------------GGGIFAVGVVDSPKVKTTPMVPNQM--- 272

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                 Y V L  + V    + +P S +      NGG IVDSG+T  +    L++++ + 
Sbjct: 273 -----HYNVMLMGMDVDGTSLDLPRSIV-----RNGGTIVDSGTTLAYFPKVLYDSLIET 322

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
            + +     +   +        CF  S       P +  +F+   K+ + P +Y   +  
Sbjct: 323 ILAR-----QPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEE 377

Query: 432 EVLCLILFTDNAAGPALG-RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           E+ C   F   A G     R   I+LGD  L N  + +DL N+  G+A   C+
Sbjct: 378 ELYC---FGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 116/453 (25%), Positives = 179/453 (39%), Gaps = 83/453 (18%)

Query: 49  DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
           D D +  +H LA+     AR   T   P +    +       +   PL   +Y    +S+
Sbjct: 94  DQDRVDSIHRLAA-----ARPSSTADDPSSASKGVSLPARRGV---PLGTANY---IVSV 142

Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
             GTP +     +FDTGS L W  C     C  C +   D    P F P +S++   + C
Sbjct: 143 GLGTPKR-DLLVVFDTGSDLSWVQCKP---CDGC-YQQHD----PLFDPSQSTTYSAVPC 193

Query: 169 QNPKCSWIFGPNVES-RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF---- 222
              +C  +   +  S +C+               Y + YG +  T G L  +TL      
Sbjct: 194 GAQECRRLDSGSCSSGKCR---------------YEVVYGDMSQTDGNLARDTLTLGPSS 238

Query: 223 ---PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
               S  +  F+ GC         +  G+ G GR   SL SQ   K    FSYCL S   
Sbjct: 239 SSSSSDQLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSST 298

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            +  +S           G +  P   +T        + S    FYY+ L  I V  + V+
Sbjct: 299 AEGYLS----------LGSAAPPNARFTAMV-----TRSDTPSFYYLNLVGIKVAGRTVR 343

Query: 334 I-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS--RAADVEKKSG 390
           + P  +  PG+      ++DSG+  T +    + A+   F   M  YS  RA  +   S 
Sbjct: 344 VSPAVFRTPGT------VIDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPAL---SI 394

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
           L  C+D +G+  V +P + L F GGA + L       +      CL  F  N    ++  
Sbjct: 395 LDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANKSQACLA-FASNGDDTSIA- 452

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               ILG+ Q + F + +D+AN + GF  + C+
Sbjct: 453 ----ILGNMQQKTFAVVYDVANQKIGFGAKGCS 481


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 152/387 (39%), Gaps = 61/387 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +++S GTP  A T    DTGS + W       +C  C  P     R P F P RSSS 
Sbjct: 131 YVVTVSLGTPAVAQT-LEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSY 183

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
             + C    CS      +     GCS     C      Y++ YG G  T G+  S+TL  
Sbjct: 184 SAVPCAAASCS-----QLALYSNGCS--GGQC-----GYVVSYGDGSTTTGVYSSDTLTL 231

Query: 223 P-SKTVPNFLAGCSILSDRQPAGIA---GFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
             S  +  FL GC        AG+    G GR  +SL SQ        FSYCL       
Sbjct: 232 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------- 284

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
            P + N V     G G S T G S TP       ++S    +Y V L  I VG + + I 
Sbjct: 285 -PPTQNSVGYISLG-GPSSTAGFSTTPLL-----TASNDPTYYIVMLAGISVGGQPLSID 337

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            S    G+      +VD+G+  T +    + A+   F   M  Y   +       L  C+
Sbjct: 338 ASVFASGA------VVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPS-APATGILDTCY 390

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           D +   +V LP + + F GGA M L              CL      A  P  G   A I
Sbjct: 391 DFTRYGTVTLPTISIAFGGGAAMDLGTSGIL-----TSGCL------AFAPTGGDSQASI 439

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           LG+ Q ++F + FD      GF    C
Sbjct: 440 LGNVQQRSFEVRFD--GSTVGFMPASC 464


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 107/391 (27%), Positives = 162/391 (41%), Gaps = 56/391 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+P Q     + DT +   W PCT    C  C+      S    + P+ S+
Sbjct: 106 GSYVVRVKLGSPNQLFF-MVLDTSTDEAWVPCTG---CTGCS------SSSTYYSPQAST 155

Query: 162 S-SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL 220
           +    + C  P+C+         + +G  P   T   AC       G  F+A  L+ ++L
Sbjct: 156 TYGGAVACYAPRCA---------QARGALPCPYTGSKACTFNQSYAGSTFSA-TLVQDSL 205

Query: 221 RFPSKTVPNFLAGCS------ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
           R    T+P++  GC        L  +   G+     S  S  S+L    FSYCL S  F 
Sbjct: 206 RLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPS--FQ 263

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
            +  S +L L  GP     +   +  TP  +NP   S      YYV L  + VG   V +
Sbjct: 264 SSYFSGSLKL--GPTGQPRR---IRTTPLLQNPRRPS-----LYYVNLTGVTVGRVKVPL 313

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGLR 392
           P  YL    +   G I+DSG+  T   GP++ A+  EF  Q+    +SR        G  
Sbjct: 314 PIEYLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKGPFFSRG-------GFD 366

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDNAAGPALGRG 451
            CF  + +     P + L+F  G  + LP EN         + CL +    AA P     
Sbjct: 367 TCFVKTYEN--LTPLIKLRFT-GLDVTLPYENTLIHTAYGGMACLAM----AAAPNNVNS 419

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              ++ ++Q QN  + FD  N+R G A++ C
Sbjct: 420 VLNVIANYQQQNLRVLFDTVNNRVGIARELC 450


>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 450

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 115/413 (27%), Positives = 175/413 (42%), Gaps = 74/413 (17%)

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSSS 163
           ++S+  GTPPQ  T  + DTGS L           + CN  ++ P   PA F    S + 
Sbjct: 66  TVSVVVGTPPQNVT-MVLDTGSEL---------SGLLCNGSSLSP---PAPFNASASLTY 112

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
             + C +P C W  G ++  R    +P + +C ++     + Y    +A G L+++T   
Sbjct: 113 SAVDCSSPACVW-RGRDLPVRPFCDAPPSTSCRVS-----ISYADASSADGHLVADTFIL 166

Query: 223 PSKTVPNFLAGC-----------SILSDRQPA--GIAGFGRSSESLPSQLGLKKFSYCLL 269
            ++ VP    GC           S  +D   A  G+ G  R S S  +Q    +F+YC+ 
Sbjct: 167 GTQAVPALF-GCITSYSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQTATLRFAYCI- 224

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--NPVGSSSAFGEFYYVGLRQIIV 327
                 AP     +L  G        P L+YTP  +   P+         Y V L  I V
Sbjct: 225 ------APGQGPGILLLG--GDGGAAPPLNYTPLIEISQPLPYFDRVA--YSVQLEGIRV 274

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           GS  ++IP S L P   G G  +VDSG+ FTF+    + A+  EF+ Q    S  A + +
Sbjct: 275 GSALLQIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFLNQA--RSLLAPLGE 332

Query: 388 -----KSGLRPCF----DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE------ 432
                +     CF    +     S  LPE+ L  + GA++A+  E     V  E      
Sbjct: 333 PGFVFQGAFDACFRGPEERVSAASRLLPEVGLVLR-GAEVAVAGEKLLYSVPGERRGEEG 391

Query: 433 ---VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              V CL     + AG +     A ++G    Q+ ++E+DL N R GFA  +C
Sbjct: 392 AEAVWCLTFGNSDMAGMS-----AYVIGHHHQQDVWVEYDLQNGRVGFAPARC 439


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 105/397 (26%), Positives = 162/397 (40%), Gaps = 62/397 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +S GTPP  S   + DTGS ++W  C     C   N P  DPS        +S+
Sbjct: 81  GEYLVEISVGTPP-FSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPS--------KST 131

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + + + C +P CS+            CS  ++        Y + YG    + G L  +T+
Sbjct: 132 TYKNVACSSPVCSY------SGDGSSCSDDSECL------YSIAYGDDSHSQGNLAVDTV 179

Query: 221 RFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
              S +      P  + GC   +    +   +GI G GR   SL +QLG     KFSYCL
Sbjct: 180 TMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCL 239

Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
           +         + +  L+ G  +  S + G   TP Y     SS+ +  FY + L  + VG
Sbjct: 240 I--PIGTGSTNDSTKLNFGSNANVSGS-GTVSTPIY-----SSAQYKTFYSLKLEAVSVG 291

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
                 P      G + N  +I+DSG+T T++   L  +     I Q  +   A D  + 
Sbjct: 292 DTKFNFPEGASKLGGESN--IIIDSGTTLTYLPSALLNSFGSA-ISQSMSLPHAQDPSEF 348

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
             L  CF  +      +P + + F+ GA + L  EN F  + ++ +CL          A 
Sbjct: 349 --LDYCFATT-TDDYEMPPVTMHFE-GADVPLQRENLFVRLSDDTICL----------AF 394

Query: 449 GRGP---AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           G  P     I G+    NF + +D+ N    F    C
Sbjct: 395 GSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|225432542|ref|XP_002277699.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
          Length = 435

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 109/401 (27%), Positives = 170/401 (42%), Gaps = 78/401 (19%)

Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSW 175
            S P   D G   +W         VDC+   V  S    + P R  S+Q    ++  C  
Sbjct: 56  VSIPLTLDLGGQFLW---------VDCDQGYVSSS----YRPVRCGSAQCSLTRSKACGE 102

Query: 176 IF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN----- 229
            F GP      KGC+    TC L+  + +       T+G +  + +   S    N     
Sbjct: 103 CFSGP-----VKGCN--YSTCVLSPDNTVTGTA---TSGEVGEDAVSIQSTDGSNPGRVV 152

Query: 230 ------FLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK--- 272
                 F  G + L +    +  G+AG GRS  +LPSQ        +KFS CL S     
Sbjct: 153 SVRRLLFTCGSTFLLEGLASRVKGMAGLGRSRVALPSQFSSAFSFNRKFSICLSSSTKST 212

Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF--GEF---YYVGLR 323
               F D P      +D         +  L+YTP   NPV ++SA+  GE    Y++G++
Sbjct: 213 GVVFFGDGPYVLLPKVD--------ASQSLTYTPLITNPVSTASAYFQGEASVEYFIGVK 264

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I +  K V +  + L   S G GG  + +   +T +E  +++AV + F++++   +R A
Sbjct: 265 SIKINGKAVPLNATLLSIDSQGYGGTKISTVHPYTVLETSIYKAVTQAFLKELSTITRVA 324

Query: 384 DVEKKSGLRPCF---DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
            V   S    CF   DI   +   +V   +L+L+ +      +   N    V + VLCL 
Sbjct: 325 SV---SPFGACFSSKDIGSTRVGPAVPPIDLVLQ-RQSVYWRVFGANSMVQVSDNVLCL- 379

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            F D    P      +I++G  QL++  L+FDLA  R GF+
Sbjct: 380 GFVDGGVNPR----TSIVIGGRQLEDNLLQFDLATSRLGFS 416


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 181/406 (44%), Gaps = 82/406 (20%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++S GTPP      I DTGS L+W  C     C DC +  VD    P F PK SS
Sbjct: 88  GEYLMNVSIGTPPFPIMA-IADTGSDLLWTQCAP---CDDC-YTQVD----PLFDPKTSS 138

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           + + + C + +C+ +     E++   CS  + TC     SY L YG   +T G +  +TL
Sbjct: 139 TYKDVSCSSSQCTAL-----ENQA-SCSTNDNTC-----SYSLSYGDNSYTKGNIAVDTL 187

Query: 221 RF-PSKTVP----NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
               S T P    N + GC   +    +++ +GI G G    SL  QLG     KFSYCL
Sbjct: 188 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCL 247

Query: 269 L---SRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           +   S+K   + ++  +N ++    GSG   TP +           + ++   FYY+ L+
Sbjct: 248 VPLTSKKDQTSKINFGTNAIV---SGSGVVSTPLI-----------AKASQETFYYLTLK 293

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRA 382
            I VGSK ++   S         G +I+DSG+T T +          EF  ++ +  + +
Sbjct: 294 SISVGSKQIQYSGSDSESSE---GNIIIDSGTTLTLL--------PTEFYSELEDAVASS 342

Query: 383 ADVEKK----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
            D EKK    SGL  C+  +G   V  P + + F  GA + L   N F  V  +++C   
Sbjct: 343 IDAEKKQDPQSGLSLCYSATGDLKV--PVITMHFD-GADVKLDSSNAFVQVSEDLVCF-- 397

Query: 439 FTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   A    P+  I G+    NF + +D  +    F    CA
Sbjct: 398 --------AFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 118/465 (25%), Positives = 185/465 (39%), Gaps = 88/465 (18%)

Query: 57  HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
           H L   ++ R+R+          ++   S     + +TP+ + + G Y + L  GTPP  
Sbjct: 45  HELLRRAIQRSRYRLAGIGMARGEA--ASARKAVVAETPI-MPAGGEYLVKLGIGTPPYK 101

Query: 117 STPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
            T  I DT S L+W    PCT  Y  VD           P F P+ SS+   + C +  C
Sbjct: 102 FTAAI-DTASDLIWTQCQPCTGCYHQVD-----------PMFNPRVSSTYAALPCSSDTC 149

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVPNFLA 232
             +   +V  RC      +++C      Y   Y G   T G L  + L            
Sbjct: 150 DEL---DVH-RCG--HDDDESC-----QYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAF 198

Query: 233 GCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCL---LSRKFDDAPVSSNLVL 284
           GCS  S       Q +G+ G GR   SL SQL +++F+YCL    SR      +   LVL
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASR------IPGKLVL 252

Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI---------- 334
                +  + T  ++  P  ++P      +  +YY+ L  +++G + + +          
Sbjct: 253 GADADAARNATNRIA-VPMRRDP-----RYPSYYYLNLDGLLIGDRAMSLPPTTTTTATA 306

Query: 335 ------------PYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
                       P +  V   D N  G+I+D  ST TF+E  L++ +  +   ++    R
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEI-RLPR 365

Query: 382 AADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLI 437
                   GL  CF   D      VY+P + L F  G  + L     FA      ++CL+
Sbjct: 366 GTG--SSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLM 422

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +    A       G   ILG+FQ QN  + ++L   R  F +  C
Sbjct: 423 VGRAEA-------GSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 118/465 (25%), Positives = 185/465 (39%), Gaps = 88/465 (18%)

Query: 57  HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
           H L   ++ R+R+          ++   S     + +TP+ + + G Y + L  GTPP  
Sbjct: 45  HELLRRAIQRSRYRLAGIGMARGEA--ASARKAVVAETPI-MPAGGEYLVKLGIGTPPYK 101

Query: 117 STPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
            T  I DT S L+W    PCT  Y  VD           P F P+ SS+   + C +  C
Sbjct: 102 FTAAI-DTASDLIWTQCQPCTGCYHQVD-----------PMFNPRVSSTYAALPCSSDTC 149

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVPNFLA 232
             +   +V  RC      +++C      Y   Y G   T G L  + L            
Sbjct: 150 DEL---DVH-RCG--HDDDESC-----QYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAF 198

Query: 233 GCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCL---LSRKFDDAPVSSNLVL 284
           GCS  S       Q +G+ G GR   SL SQL +++F+YCL    SR      +   LVL
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASR------IPGKLVL 252

Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI---------- 334
                +  + T  ++  P  ++P      +  +YY+ L  +++G + + +          
Sbjct: 253 GADADAARNATNRIA-VPMRRDP-----RYPSYYYLNLDGLLIGDRTMSLPPTTTTTATA 306

Query: 335 ------------PYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
                       P +  V   D N  G+I+D  ST TF+E  L++ +  +   ++    R
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEI-RLPR 365

Query: 382 AADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLI 437
                   GL  CF   D      VY+P + L F  G  + L     FA      ++CL+
Sbjct: 366 GTG--SSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLM 422

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +    A       G   ILG+FQ QN  + ++L   R  F +  C
Sbjct: 423 VGRAEA-------GSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 140/513 (27%), Positives = 208/513 (40%), Gaps = 101/513 (19%)

Query: 1   MAACPFSLICLFSLLILL-----FTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKI 55
           MAA   + + LF + + L      T     GS  A++    +P+S    L++  +     
Sbjct: 1   MAAFSITHLSLFVIFVALISKTSLTASMNNGSFTASLIHRDSPISP---LYNPKNTYFDR 57

Query: 56  LHSLASSSLSRARHL--KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
           L S    S+SRA      + +  KT + +I              +   G Y + +S GTP
Sbjct: 58  LQSSFHRSISRANRFTPNSVSAAKTLEYDI--------------IPGGGEYFMRISIGTP 103

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P      I DTGS L+W  C     C +C        + P F PK+SS+ + + C+   C
Sbjct: 104 P-IEVLVIADTGSDLIWVQCQP---CQECY-----KQKSPIFNPKQSSTYRRVLCETRYC 154

Query: 174 SWIFGPNVESRCKGCSPRN--KTCPLACPSYLLQYG-LGFTAGLLLSETLRFPS--KTVP 228
           + +      S  + CS     K C      Y   YG   FT G L +E     S   ++ 
Sbjct: 155 NAL-----NSDMRACSAHGFFKAC-----GYSYSYGDHSFTMGYLATERFIIGSTNNSIQ 204

Query: 229 NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPV--S 279
               GC   +    D   +GI G G  S SL SQLG K   KFSYCL+       P+   
Sbjct: 205 ELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLV-------PILEK 257

Query: 280 SNLVLDT---GPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
           SN  L     G  S  S +     TP   K P         FYY+ L  I VG++ +   
Sbjct: 258 SNFSLGKIVFGDNSFISGSDTYVSTPLVSKEP-------ETFYYLTLEAISVGNERL--- 307

Query: 336 YSYLVPGSDGN---GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
            +Y    +DGN   G +I+DSG+T TF++  L+  +  E + +     +A + E+ S   
Sbjct: 308 -AYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKL--ELVLE-----KAVEGERVSDPN 359

Query: 393 PCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
             F I    K  + LP + + F   A + L P N FA    ++LC  +   N        
Sbjct: 360 GIFSICFRDKIGIELPIITVHFT-DADVELKPINTFAKAEEDLLCFTMIPSNGIA----- 413

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               I G+    NF + +DL  +   F    C+
Sbjct: 414 ----IFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 181/406 (44%), Gaps = 82/406 (20%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++S GTPP      I DTGS L+W  C     C DC +  VD    P F PK SS
Sbjct: 88  GEYLMNVSIGTPPFPIMA-IADTGSDLLWTQCAP---CDDC-YTQVD----PLFDPKTSS 138

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           + + + C + +C+ +     E++   CS  + TC     SY L YG   +T G +  +TL
Sbjct: 139 TYKDVSCSSSQCTAL-----ENQA-SCSTNDNTC-----SYSLSYGDNSYTKGNIAVDTL 187

Query: 221 RF-PSKTVP----NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
               S T P    N + GC   +    +++ +GI G G    SL  QLG     KFSYCL
Sbjct: 188 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCL 247

Query: 269 L---SRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           +   S+K   + ++  +N ++    GSG   TP +           + ++   FYY+ L+
Sbjct: 248 VPLTSKKDQTSKINFGTNAIV---SGSGVVSTPLI-----------AKASQETFYYLTLK 293

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRA 382
            I VGSK ++   S         G +I+DSG+T T +          EF  ++ +  + +
Sbjct: 294 SISVGSKQIQYSGSDSESSE---GNIIIDSGTTLTLL--------PTEFYSELEDAVASS 342

Query: 383 ADVEKK----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
            D EKK    SGL  C+  +G   V  P + + F  GA + L   N F  V  +++C   
Sbjct: 343 IDAEKKQDPQSGLSLCYSATGDLKV--PVITMHFD-GADVKLDSSNAFVQVSEDLVCF-- 397

Query: 439 FTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   A    P+  I G+    NF + +D  +    F    CA
Sbjct: 398 --------AFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 112/440 (25%), Positives = 172/440 (39%), Gaps = 81/440 (18%)

Query: 59  LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
           + S+  SR +  +  T P   +  + S   ++L          G Y   +  G+P Q   
Sbjct: 78  VVSNYDSRRKGFEMTTTPAEVEMPMHSGRDDAL----------GEYFAEVKVGSPGQ-RF 126

Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
             + DTGS   W  C+  +  V C                R     L        S +F 
Sbjct: 127 WLVVDTGSEFTWLNCSKSFEAVTC--------------ASRKCKVDL--------SELFS 164

Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF-----PSKTVPNFLA 232
            +V      C   +  C      Y + Y  G +A G   ++++           + N   
Sbjct: 165 LSV------CPKPSDPCL-----YDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTI 213

Query: 233 GC--SILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
           GC  S+L+    + +  GI G G + +S   +   K   KFSYCL+        VSSNL 
Sbjct: 214 GCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDH-LSHRSVSSNLT 272

Query: 284 LDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
           +    G  ++K  G +  T     P         FY V +  I +G + +KIP    V  
Sbjct: 273 IG---GHHNAKLLGEIRRTELILFP--------PFYGVNVVGISIGGQMLKIPPQ--VWD 319

Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
            +  GG ++DSG+T T +  P +EAV +   + +    R    E    L  CFD  G   
Sbjct: 320 FNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTG-EDFDALEFCFDAEGFDD 378

Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
             +P L+  F GGA+   P ++Y   V   V C+ +       P  G G A ++G+   Q
Sbjct: 379 SVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIV------PIDGIGGASVIGNIMQQ 432

Query: 463 NFYLEFDLANDRFGFAKQKC 482
           N   EFDL+ +  GFA   C
Sbjct: 433 NHLWEFDLSTNTVGFAPSTC 452


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/403 (26%), Positives = 165/403 (40%), Gaps = 66/403 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S+  GTP +  T  +FDTGS L W       +C  C+       + P F P  SS
Sbjct: 152 GNYVVSVGLGTPARDLT-VVFDTGSDLSWV------QCGPCSSGGCYKQQDPLFAPSDSS 204

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   + C   +C           C G SP +  CP     Y + YG    T G L ++TL
Sbjct: 205 TFSAVRCGAREC------RARQSCGG-SPGDDRCP-----YEVVYGDKSRTQGHLGNDTL 252

Query: 221 RFPS-----------KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---K 263
              +             +P F+ GC   +     Q  G+ G GR   SL SQ   K    
Sbjct: 253 TLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEG 312

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL       AP   +L          +  P  ++  F   P+ + +    FYYV L 
Sbjct: 313 FSYCL-PSSSSSAPGYLSL---------GTPVPAPAHAQF--TPMLNRTTTPSFYYVKLV 360

Query: 324 QIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
            I V  + +++    + +P       +IVDSG+  T +    + A+   F+  MG Y   
Sbjct: 361 GIRVAGRAIRVSSPRVALP-------LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYK 413

Query: 383 ADVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
               + S L  C+D +     +V +P + L F GGA +++       +      CL  F 
Sbjct: 414 -RAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLA-FA 471

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            N  G + G     ILG+ Q +   + +D+A  + GFA + C+
Sbjct: 472 PNGDGRSAG-----ILGNTQQRTLAVVYDVARQKIGFAAKGCS 509


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 107/388 (27%), Positives = 150/388 (38%), Gaps = 63/388 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y ++ S GTP  A T    DTGS L W       +C  C  P+    + P F P +SSS 
Sbjct: 137 YVVTASLGTPGMAQT-LEVDTGSDLSWV------QCKPCAAPSCYRQKDPLFDPAQSSSY 189

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             + C              S C G       C  A   Y++ YG G  T G+  S+TL  
Sbjct: 190 AAVPCG------------RSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 237

Query: 223 PSK-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            +  TV  FL GC             G+ GFGR   SL  Q        FSYCL ++   
Sbjct: 238 AANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKS-- 295

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
               S+   L  G  SG    PG S T    +P   +     +Y V L  I VG + + +
Sbjct: 296 ----STTGYLTLGGPSG--VAPGFSTTQLLPSPNAPT-----YYVVMLTGISVGGQPLSV 344

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P S    G+      +VD+G+  T +    + A+   F   M +Y  A  +     L  C
Sbjct: 345 PASAFAAGT------VVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGI---LDTC 395

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +  +G  +V L  + L F  GA M L  +   +       CL   +  +       G   
Sbjct: 396 YSFAGYGTVNLTSVALTFSSGATMTLGADGIMSFG-----CLAFASSGS------DGSMA 444

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ILG+ Q ++F +  D      GF    C
Sbjct: 445 ILGNVQQRSFEVRID--GSSVGFRPSSC 470


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 124/420 (29%), Positives = 170/420 (40%), Gaps = 79/420 (18%)

Query: 89  NSLIKTPLS-VHSY-GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
           NSL  TP S V SY G Y +S S GTPP  S   I DTGS +VW  C    +C +     
Sbjct: 70  NSLASTPESTVISYEGDYIMSYSVGTPPIKSYG-IVDTGSDIVWLQCEPCEQCYN----- 123

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
                 P F P +SSS + I C +  C  +       R   C+ + K C      Y + Y
Sbjct: 124 ---QTTPKFNPSKSSSYKNISCSSKLCQSV-------RDTSCNDK-KNCE-----YSINY 167

Query: 207 G-LGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQPAGIAGFGRSS-------- 252
           G    + G L  ETL   S T      P  + GC          I  F R S        
Sbjct: 168 GNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCG------TNNIGSFKRVSSGVVGLGG 221

Query: 253 --ESLPSQLGLK---KFSYCL--LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK 305
              SL +QLG     KFSYCL  +S    +  + S+  L+ G  +  S    LS TP  K
Sbjct: 222 GPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSK-LNFGDVAIVSGHNVLS-TPIVK 279

Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG--NGGVIVDSGSTFTFMEGP 363
                  +F  FYY+ +    VG K V+   S     S G   G +I+DS +  TF+   
Sbjct: 280 K----DHSF--FYYLTIEAFSVGDKRVEFAGS-----SKGVEEGNIIIDSSTIVTFVPSD 328

Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
           ++  +    +  +    R  D  ++  L  C+++S  +    P +   FK GA + L   
Sbjct: 329 VYTKLNSAIV-DLVTLERVDDPNQQFSL--CYNVSSDEEYDFPYMTAHFK-GADILLYAT 384

Query: 424 NYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           N F  V  +VLC       A  P+ G     I G F  Q+F + +DL      F    C 
Sbjct: 385 NTFVEVARDVLCF------AFAPSNG---GAIFGSFSQQDFMVGYDLQQKTVSFKSVDCT 435


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/406 (26%), Positives = 166/406 (40%), Gaps = 72/406 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPC------TSRYRCVDCNFPNVDPSRIPAFIP 157
           Y   +  G P Q     I DTGS ++WF C      +S+   + C+   +    I  + P
Sbjct: 88  YYAQIGVGHPVQFLNA-IVDTGSDILWFKCKLCQGCSSKKNVIVCS-SIIMQGPITLYDP 145

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLL 216
           + S ++    C +P CS             C   N +C     +Y + Y     + G+  
Sbjct: 146 ELSITASPATCSDPLCS---------EGGSCRGNNNSC-----AYDISYEDTSSSTGIYF 191

Query: 217 SETLRFPSKTVPN---FLAGCSILSDRQPA-GIAGFGRSSESLPSQLGLKKFSYCLLSRK 272
            + +    K   N   FL   + +S   P  GI GFGRS  S+P+QL  +  SY +    
Sbjct: 192 RDVVHLGHKASLNTTMFLGCATSISGLWPVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHC 251

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
                    +++    G  D + P + YTP   N +         Y V L  + V SK +
Sbjct: 252 LSGEKEGGGILV---LGKND-EFPEMVYTPMLANDI--------VYNVKLVSLSVNSKAL 299

Query: 333 KIPYS-YLVPGSDGNGGVIVDSG-STFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEKKS 389
            I  S +    + GNGG I+DSG S+ TF    L       F++ +  ++ A      +S
Sbjct: 300 PIEASEFEYNATVGNGGTIIDSGTSSATFPSKAL-----ALFVKAVSKFTTAIPTAPLES 354

Query: 390 GLRPCF-DISGKKSVYL--PELILKFKGGAKMALPPENYFALV------------GNEVL 434
              PCF  IS + SV +  P + LKF GGA M L   NY   V            G  ++
Sbjct: 355 SGSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLV 414

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
           C+          +   G + ILGD  L++  + +D+   R G+ KQ
Sbjct: 415 CI----------SWSVGNSTILGDAILKDKVVVYDMEKSRIGWVKQ 450


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 109/407 (26%), Positives = 165/407 (40%), Gaps = 69/407 (16%)

Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           +G Y  S+  G+P Q +   I DTGS L W  C     C     P+VD      +   RS
Sbjct: 97  FGEYYTSIKLGSPGQEAI-LIVDTGSELTWLKCLPCKVCA----PSVDT----IYDAARS 147

Query: 161 SSSQLIGCQNPK-CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
            S + + C N + CS     N          R   C  A       YG G F+ G L ++
Sbjct: 148 VSYKPVTCNNSQLCS-----NSSQGTYAYCARGSQCQFAA-----FYGDGSFSYGSLSTD 197

Query: 219 TLRFPSK------TVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
           TL   +       TV +F  GC+     L     +GI G      +LP QLG +   KFS
Sbjct: 198 TLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFS 257

Query: 266 YCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           +C         P  S+ +  TG    G+ +     + YT      + +S    +FY+V L
Sbjct: 258 HCF--------PDRSSHLNSTGVVFFGNAELPHEQVQYTSV---ALTNSELQRKFYHVAL 306

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           + + + S  + +    L  GS     VI+DSGS+F+    P    + + F++      + 
Sbjct: 307 KGVSINSHELVL----LPRGSV----VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKH 358

Query: 383 ADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYF---ALVGNEVLC 435
            + +    L  CF +S          LP L L F+ G  + +P        A   N V  
Sbjct: 359 LEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKM 418

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              F D       G  P  ++G++Q QN ++E+D+   R GFA+  C
Sbjct: 419 CFAFEDG------GPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 135/500 (27%), Positives = 200/500 (40%), Gaps = 85/500 (17%)

Query: 8   LICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR- 66
           L+C+F    L     AG GS    VTVP +          +   P +   ++    L R 
Sbjct: 6   LLCIFLCFYLSIVNGAGNGS---FVTVPSSSFVPDTVCSGALVKPEQNGSAVYVPLLHRH 62

Query: 67  ---ARHLKTKTKPKTKDSNIGSNYSNSLIKT--PLSVHSYGGYSI-------SLSFGTP- 113
              A  L T T P   +    S+   S I +   +SV ++ G S+       ++SFGTP 
Sbjct: 63  GPCAPSLSTDTPPSMSEMFRRSHARLSYIVSGKKVSVPAHLGTSVKSLEYVATVSFGTPA 122

Query: 114 -PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
            PQ     + DTGS L W       +C  C+     P + P F P  SS+   + C + +
Sbjct: 123 VPQV---VVIDTGSDLTWL------QCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCASGE 173

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF-PSKTVPNF 230
           C  +      S C    P    C  A     + Y  G  T G+   + L   P   V +F
Sbjct: 174 CKKLAADAYGSGCSNGQP----CGFA-----ISYVDGTSTVGVYGKDKLTLAPGAIVKDF 224

Query: 231 LAGCSILSDRQPAGIAGFGRS---SESLPSQLGLKK-FSYCLLSRKFDDAPVSSNLVLDT 286
             GC       P    G       SESL +Q G    FSYCL +             +++
Sbjct: 225 YFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPA-------------VNS 271

Query: 287 GPGS---GDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
            PG    G  + P G  +TP  + P   +     F  V L  I VG K + +  S     
Sbjct: 272 KPGFLAFGAGRNPSGFVFTPMGRVPGQPT-----FSTVTLAGITVGGKKLDLRPSAF--- 323

Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
              +GG+IVDSG+  T ++  ++ A+   F   M  Y           L  C+D++G K+
Sbjct: 324 ---SGGMIVDSGTVVTVLQSTVYRALRAAFREAMKAYRLV-----HGDLDTCYDLTGYKN 375

Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
           V +P++ L F GGA + L   N   + G    CL  F +         G A +LG+   +
Sbjct: 376 VVVPKIALTFSGGATINLDVPNGILVNG----CLA-FAETGK-----DGTAGVLGNVNQR 425

Query: 463 NFYLEFDLANDRFGFAKQKC 482
            F + FD +  +FGF  + C
Sbjct: 426 TFEVLFDTSASKFGFRAKAC 445


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 116/406 (28%), Positives = 156/406 (38%), Gaps = 75/406 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +S G P Q     I DTGS L+W  C     C   N         P F P+RSS
Sbjct: 91  GEYLMRISIGNP-QVEILAIADTGSDLIWVQCQPCEMCYKQN--------SPIFDPRRSS 141

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRN--KTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S + + C N  C+ + G       + C  R   KTC      Y   YG   F+ G L  E
Sbjct: 142 SYRNVLCGNEFCNKLDG-----EARSCDARGFVKTC-----GYTYSYGDQSFSDGHLAIE 191

Query: 219 TLRFPSKTVPNFLA---------GCSILS----DRQPAGIAGFGRSSESLPSQLGLK--- 262
                S       A         GC   +    D   +GI G G  S SL SQLG K   
Sbjct: 192 RFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSG 251

Query: 263 KFSYCLLSRKFDDAPVSS----NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
           KFSYCL+         S     N +  +G       TP L   P              +Y
Sbjct: 252 KFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKP------------ETYY 299

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
           Y+ L  I V +K  ++PY+ L  G    G +I+DSG+T TF++   F  +          
Sbjct: 300 YLTLEAISVENK--RLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAV------ 351

Query: 379 YSRAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
              A   E+ S     F+I    +K++ LP +   F  GA + L P N FA V  ++LC 
Sbjct: 352 -EEAVKGERVSDPHGLFNICFKDEKAIELPIITAHFT-GADVELQPVNTFAKVEEDLLCF 409

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +   N            I G+    NF + +DL      F    C
Sbjct: 410 TMIPSNDIA---------IFGNLAQMNFLVGYDLEKKAVSFLPTDC 446


>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 77/229 (33%), Positives = 111/229 (48%), Gaps = 25/229 (10%)

Query: 257 SQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFG 315
           SQLG +KFSYCL S    +   SS L    G  +  +  PG +  TP  +NP   S    
Sbjct: 173 SQLGTQKFSYCLTS--IHENKTSSLLF---GSLAYSNFNPGKIPRTPLIQNPFLPS---- 223

Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
            +YY+ L+ I VG   + IP      G DG+GG+I+DSG+T T+++   F+ +   FI Q
Sbjct: 224 -YYYLALKGITVGYTLLPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFISQ 282

Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALVGNEV 433
                + A+    +GL  CF +  K +  V +P+LI  FK G  +ALP ENY  +V +  
Sbjct: 283 --TELQVAN-SSTTGLDLCFHLPVKNAAEVKVPKLIFHFK-GLDLALPVENY--MVSDPE 336

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           + LI    +A       G   I G+ Q QN  +  DL          +C
Sbjct: 337 MGLICLAIDAT------GSLSIFGNIQQQNMLVLHDLKKSTLSLVPTQC 379


>gi|255552241|ref|XP_002517165.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543800|gb|EEF45328.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 434

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 107/416 (25%), Positives = 171/416 (41%), Gaps = 85/416 (20%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T     TP +      D G   +W  C   Y                      SS
Sbjct: 41  TLQYLTSINQRTPLVPVKLTLDLGGQYLWVDCDQGYV---------------------SS 79

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPR----NKTCPLACPSYLLQYGLGFTAGLLLS 217
           S + + C++ +CS     +  S C   SPR    N TC L   + +   G   T+G +  
Sbjct: 80  SYKPVRCRSAQCSLAKSKSCISECFS-SPRPGCNNDTCALLPDNTVTHSG---TSGEVGQ 135

Query: 218 ETLRFPSK---------TVPNFLAGC--SILSDRQPAGI---AGFGRSSESLPSQLGL-- 261
           + +   S          +VP  +  C  + L +   +G+   AG GR+  SLPSQ     
Sbjct: 136 DVVTVQSTDGFSPGRVVSVPKLIFTCATTFLLEGLASGVKGMAGLGRTKISLPSQFSAAF 195

Query: 262 ---KKFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
              +KF+ CL S        F D P      +D         +  L YTP   NPV ++S
Sbjct: 196 SFDRKFAICLTSSNAKGIVFFGDGPYVFLPNIDV--------SKSLIYTPLILNPVSTAS 247

Query: 313 AF-----GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
           AF        Y++G++ I +  K V +  S L    +G GG  + +   +T +E  +++A
Sbjct: 248 AFFKGDPSSEYFIGVKSIKINGKAVPLNTSLLFIDKEGVGGTKISTVDPYTVLETTIYQA 307

Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKGGAKM-ALPP 422
           V K FI+++    R A V   S    CF+ S   S      +P++ L  +  +    +  
Sbjct: 308 VTKVFIKELAEVPRVAPV---SPFGVCFNSSNIGSTRVGPAVPQIDLVLQSSSVFWRIFG 364

Query: 423 ENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            N    V ++VLCL  F D    P      +I++G  Q+++  L+FDLA  + GF+
Sbjct: 365 ANSMVQVKSDVLCL-GFVDGGLNPR----TSIVIGGHQIEDNLLQFDLAASKLGFS 415


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 122/403 (30%), Positives = 156/403 (38%), Gaps = 68/403 (16%)

Query: 102 GGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           G Y   +  GTP    TP   + DTGS +VW  C    RC D +    D        P+ 
Sbjct: 145 GEYFTKIGVGTP---VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFD--------PRA 193

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           S S   + C  P C  +          GC  R K C      Y + YG G  TAG   +E
Sbjct: 194 SHSYGAVDCAAPLCRRL-------DSGGCDLRRKAC-----LYQVAYGDGSVTAGDFATE 241

Query: 219 TLRFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSR 271
           TL F S   VP    GC   ++      AG+ G GR S S PSQ+     + FSYCL+ R
Sbjct: 242 TLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDR 301

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
               A  +S     T  GSG     G        +P G     G+             + 
Sbjct: 302 TSSSASATSRSSTVTF-GSGARGALGRRVL----HPDGEEPQDGDVLLRAAHGHQRRRRA 356

Query: 332 VKIPYSYLVP--GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
                    P   S G GGVIVDSG        P +    +         SRAA     +
Sbjct: 357 RPGRGRVRPPPDPSTGRGGVIVDSG-----RPSPAWARAGR--TPPCATRSRAA----AA 405

Query: 390 GLR----------PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           GLR           C+D+SG K V +P + + F GGA+ ALPPENY   V +       F
Sbjct: 406 GLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 465

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                G +       I+G+ Q Q F + FD    R GF  + C
Sbjct: 466 AGTDGGVS-------IIGNIQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 115/447 (25%), Positives = 173/447 (38%), Gaps = 67/447 (14%)

Query: 46  HHSDSDPLKILHSLASSSLSRARHLKT-KTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
            H     +  + ++AS   +R  +L +    PK     I S            V + G Y
Sbjct: 49  QHKAGSWVNTVINMASKDPARVTYLSSLVASPKATSVPIASGQQ---------VLNIGNY 99

Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
            + +  GTP Q     + DT     W PC     C  C+ P         F P  SS+  
Sbjct: 100 VVRVKLGTPGQLMF-MVLDTSRDAAWVPCAD---CAGCSSPT--------FSPNTSSTYA 147

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE-TLRFP 223
            + C  P+C+ + G +  +        N+T           YG   +   +LS+ +L   
Sbjct: 148 SLQCSVPQCTQVRGLSCPTTGTAACFFNQT-----------YGGDSSFSAMLSQDSLGLA 196

Query: 224 SKTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
             T+P++  GC      S   P G+ G GR   SL SQ G      FSYC  S  F    
Sbjct: 197 VDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPS--FKSYY 254

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP  G  K   +  TP  +NP   +      YYV L  + VG   V +   
Sbjct: 255 FSGSLRL--GP-LGQPK--NIRTTPLLRNPHRPT-----LYYVNLTGVSVGRVLVPVAPE 304

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFD 396
            L    +   G I+DSG+  T    P++ A+  EF +Q+ G ++     +       CF 
Sbjct: 305 LLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATIGAFDT------CFA 358

Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAII 455
            + +     P +   F  G  + LP EN         + CL +    AA P        +
Sbjct: 359 ATNED--IAPPVTFHFT-GMDLKLPLENTLIHSSAGSLACLAM----AAAPNNVNSVLNV 411

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
           + + Q QN  + FD+ N R G A++ C
Sbjct: 412 IANLQQQNLRIMFDVTNSRLGIARELC 438


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 106/407 (26%), Positives = 162/407 (39%), Gaps = 71/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP      I DTGS ++W  C+S   C   +   +D   +  F    S 
Sbjct: 98  GLYFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSF 153

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  + C +P CS +F    ++    CS  N+        Y  +YG G  T+G  +++T 
Sbjct: 154 TAGSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTF 203

Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
            F +    + +A        GCS         SD+   GI GFG+   S+ SQL  +   
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263

Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
              FS+CL      D       VL      G+   PG+ Y+P   +            + 
Sbjct: 264 PPVFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLLPSQP----------HY 303

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  + +G     +P    V  +    G IVD+G+T T+    L +     F+  + N  
Sbjct: 304 NLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTY----LVKEAYDPFLNAISNSV 359

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
                   S    C+ +S   S   P + L F GGA M L P++Y    G        F 
Sbjct: 360 SQLVTLIISNGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYG--------FY 411

Query: 441 DNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           D A+   +G   A     ILGD  L++    +DLA  R G+A   C+
Sbjct: 412 DGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDCS 458


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 116/458 (25%), Positives = 171/458 (37%), Gaps = 65/458 (14%)

Query: 39  LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-S 97
           LS  H +H     PL+ + +LA +  +R   L +K              S  +   P+ S
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAA-----------SSGGITSAPVAS 72

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--- 154
             +   Y +    GTP Q       DT +   W  C     C  C          PA   
Sbjct: 73  GQTPPSYVVRAGLGTPVQQLL-LALDTSADATWSHCAP---CDTC----------PAGSR 118

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           FIP  SSS   + C +  C    G    +       ++ + PL   ++   +        
Sbjct: 119 FIPASSSSYASLPCASDWCPLFEGQPCPAN------QDASAPLPACAFSKPFADTSFQAS 172

Query: 215 LLSETLRFPSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
           L S+TLR     +  +  GC        ++    G+ G GR   SL SQ G +    FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSY 232

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           CL S  +     S +L L      G +  P  + YTP   NP   S      YYV +  +
Sbjct: 233 CLPS--YRSYYFSGSLRL------GAAGQPRNVRYTPLLTNPHRPS-----LYYVNVTGL 279

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   VK+P            G ++DSG+  T    P++ A+ +EF RQ+   S     
Sbjct: 280 SVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS---GY 336

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAA 444
                   CF+     +   P + L   GG  + LP EN         L CL +    A 
Sbjct: 337 TSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM----AE 392

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            P        ++ + Q QN  +  D+A  R GFA++ C
Sbjct: 393 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 103/417 (24%), Positives = 174/417 (41%), Gaps = 73/417 (17%)

Query: 85  SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           S   N+ +K    + S G Y+  L  GTPPQ     I DTGS++ + PC++   C  C  
Sbjct: 61  SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFA-LIVDTGSTVTYVPCST---CKQCG- 115

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
                 + P F P+ SSS + + C NP C+             C    K C      Y  
Sbjct: 116 ----KHQDPKFQPELSSSYKALKC-NPDCN-------------CDDEGKLC-----VYER 152

Query: 205 QYG-LGFTAGLLLSETLRFPSK---TVPNFLAGCS-----ILSDRQPAGIAGFGRSSESL 255
           +Y  +  ++G+L  + + F ++   T    + GC       L  ++  GI G GR   S+
Sbjct: 153 RYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSV 212

Query: 256 PSQLGLKKFSYCLLSRKFDDAPVSSNLVL--DTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
             QL  K     + S  +    V    ++     P +G       S++  +++P      
Sbjct: 213 VDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPAGMV----FSHSDPFRSP------ 262

Query: 314 FGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
              +Y + L+Q+ V  K +K+ P  +     +G  G ++DSG+T+ +     F A+    
Sbjct: 263 ---YYNIDLKQMHVAGKSLKLNPKVF-----NGKHGTVLDSGTTYAYFPKEAFIAIKDAI 314

Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSV----YLPELILKFKGGAKMALPPENYF-- 426
           I+++ +  R    +       CF  +G+       + PE+ ++F  G K+ L PENY   
Sbjct: 315 IKEIPSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFR 373

Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   CL +F D        R    +LG   ++N  + +D  ND+ GF K  C+
Sbjct: 374 HTKVRGAYCLGIFPD--------RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 422


>gi|255552253|ref|XP_002517171.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543806|gb|EEF45334.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 437

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 112/401 (27%), Positives = 165/401 (41%), Gaps = 71/401 (17%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P        D G SL+W  C   Y                      SSS + + C +  C
Sbjct: 54  PLVPVKLTVDLGGSLMWINCEEGYV---------------------SSSYRPLSCDSALC 92

Query: 174 SWIFGPNVESRCKGC--SPR----NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK-- 225
           S     N +S  K C  SP+    N TC  +  + ++  G G   G  +     F  K  
Sbjct: 93  SL---SNSQSCNKECYSSPKPGCYNNTCGQSSNNRVVYIGTGGDLGQDVVALQSFDGKNL 149

Query: 226 ----TVPNFLAGCSI---LSDRQPA--GIAGFGRSSESLP----SQLGLKK-FSYCLLSR 271
               +VPNF   C I   L D      G+AG GRS+ SLP    S +G  K FS CL   
Sbjct: 150 GRIVSVPNFPFVCGITWLLDDLADGVTGMAGLGRSNISLPAYFSSAIGFSKTFSICL--- 206

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS--SSAFGEF---YYVGLRQII 326
               +   SN V+  G G     +  L Y     NPVG+   S+ GE    YY+G++ I 
Sbjct: 207 ---SSSTKSNGVIVFGDGPSSIVSNDLIYIRLILNPVGTPGYSSLGESSADYYIGVKSIR 263

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
           V  K VK   + L    DGNGG ++ + + +T +   +++A+ K FI+++          
Sbjct: 264 VDGKEVKFDKTLLSIDKDGNGGTMLSTVNPYTVLHTSIYKALLKAFIKKLVFRFSLVVPS 323

Query: 387 KKSGLRPCFDISGKKSV-----YLPELILKFK----GGAKMALPPENYFALVGNEVLCLI 437
                  C   +G ++      Y+P + L+ +          +   N    V +  +CL 
Sbjct: 324 VPVPFGACVFSNGFRTTEEFLSYVPIINLELESEQGNSVYWRILGANSMVAVNSYTMCLA 383

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            F D  + P   R P II+G  QL++  L FDLA+ R GF+
Sbjct: 384 -FIDGGSQP---RTP-IIIGGHQLEDNLLHFDLASSRLGFS 419


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 129/461 (27%), Positives = 192/461 (41%), Gaps = 83/461 (18%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKT---KDSNIGSNYSNSLIKTPLSVHSY 101
           L H DS P    ++ A +S  R R+   ++   T    + +   N   S I +     + 
Sbjct: 30  LIHRDS-PKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITS-----NR 83

Query: 102 GGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           G Y +++S GTPP    P   I DTGS L+W  C     C DC          P F PK 
Sbjct: 84  GEYLMNISIGTPP---VPILAIADTGSDLIWTQCNP---CEDCY-----QQTSPLFDPKE 132

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           SS+ + + C + +C  +           CS    TC     SY + YG   +T G +  +
Sbjct: 133 SSTYRKVSCSSSQCRAL-------EDASCSTDENTC-----SYTITYGDNSYTKGDVAVD 180

Query: 219 TLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
           T+   S      ++ N + GC   +    D   +GI G G  S SL SQL      KFSY
Sbjct: 181 TVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSY 240

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           CL+    +    S       G  SGD     +S +   K+P         +Y++ L  I 
Sbjct: 241 CLVPFTSETGLTSKINFGTNGIVSGDGV---VSTSMVKKDP-------ATYYFLNLEAIS 290

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF---EAVAKEFIRQMGNYSRAA 383
           VGSK  KI ++  + G+ G G +++DSG+T T +    +   E+V    I       +A 
Sbjct: 291 VGSK--KIQFTSTIFGT-GEGNIVIDSGTTLTLLPSNFYYELESVVASTI-------KAE 340

Query: 384 DVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
            V+   G L  C+  S   S  +P++ + FKGG  + L   N F  V  +V C       
Sbjct: 341 RVQDPDGILSLCYRDS--SSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAF---- 393

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           AA   L      I G+    NF + +D  +    F K  C+
Sbjct: 394 AANEQL-----TIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429


>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 435

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 103/406 (25%), Positives = 166/406 (40%), Gaps = 89/406 (21%)

Query: 114 PQASTPFIFDTGSSLVWFPC----TSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI--- 166
           P  +     D G + +W  C    +S Y  V C+      S +       S +++     
Sbjct: 58  PLVAVKLTVDLGGTFMWVDCDNYVSSSYTPVRCD------SALCKLADSHSCTTECYSSP 111

Query: 167 --GCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
             GC N  CS I + P V     G                    +G     L S   ++P
Sbjct: 112 KPGCYNNTCSHIPYNPVVHVSTSG-------------------DIGLDVVSLQSMDGKYP 152

Query: 224 SK--TVPN--FLAGCSILSDRQP---AGIAGFGRSSESLP----SQLGLK-KFSYCLLSR 271
            +  +VPN  F+ G   + +       G+AG GR + SLP    S LGL+ KF+ CL S 
Sbjct: 153 GRNVSVPNVPFVCGTGFMLENLADGVLGVAGLGRGNISLPAYFSSALGLQSKFAICLSSL 212

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQII 326
                  +S+ V+  G   G   +  L YTP  +NPV ++ A+ E      Y++ ++ + 
Sbjct: 213 ------TNSSGVIYFGDSIGPLSSDFLIYTPLVRNPVSTAGAYFEGQSSTDYFIAVKTLR 266

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--------- 377
           VG K +K   + L   ++G GG  + +   +T +   +++AV K F +QM          
Sbjct: 267 VGGKEIKFNKTLLSIDNEGKGGTRISTVHPYTLLHTSIYKAVIKAFAKQMKFLIEVNPPI 326

Query: 378 ------NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
                   S A D+ +   + P  D           L+L+ +G     +   N    + +
Sbjct: 327 APFGLCYQSAAMDINEYGPVVPFID-----------LVLESQGSVYWRIWGANSMVKISS 375

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
            V+CL  F D    P      +II+G  QL++  L+FDLA+ R GF
Sbjct: 376 YVMCL-GFVDGGLKP----DSSIIIGGRQLEDNLLQFDLASARLGF 416


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 117/452 (25%), Positives = 182/452 (40%), Gaps = 82/452 (18%)

Query: 40  STKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS-- 97
           S+K  L+    +  + + + A  S++RA H                 Y  +L  TP S  
Sbjct: 37  SSKSPLYQPTQNKYQHIVNAARRSINRANHF----------------YKTALTNTPQSTV 80

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
           +  +G Y ++ S GTPP      I DTGS +VW  C     C +           P F P
Sbjct: 81  IPDHGEYLMTYSVGTPP-FKLYGIADTGSDIVWLQCEPCKECYN--------QTTPKFKP 131

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
            +SS+ + I C +              CK     N    L+  +  L+   G        
Sbjct: 132 SKSSTYKNIPCSS------------DLCKSGQQGN----LSVDTLTLESSTG-------- 167

Query: 218 ETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
             + FP KTV       ++  +   +GI G G    SL +QLG     KFSYCLL    +
Sbjct: 168 HPISFP-KTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVE 226

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
               S     DT   SGD    G+  TP   K+P+        FYY+ L    VG+K ++
Sbjct: 227 SNTTSKLNFGDTAVVSGD----GVVSTPIVKKDPI-------VFYYLTLEAFSVGNKRIE 275

Query: 334 IPYSYLVPGSDG--NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
              S     S+G   G +I+DSG+T T +   ++  + +  + ++    R  D  +   L
Sbjct: 276 FEGS-----SNGGHEGNIIIDSGTTLTVIPTDVYNNL-ESAVLELVKLKRVNDPTRLFNL 329

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             C+ ++     + P +   FK GA + L P + F  V + ++CL   T +A  P+    
Sbjct: 330 --CYSVTSDGYDF-PIITTHFK-GADVKLHPISTFVDVADGIVCLAFATTSAFIPS---D 382

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              I G+   QN  + +DL      F    C+
Sbjct: 383 VVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 116/458 (25%), Positives = 171/458 (37%), Gaps = 65/458 (14%)

Query: 39  LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-S 97
           LS  H +H     PL+ + +LA +  +R   L +K              S  +   P+ S
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAA-----------SSGGVTSAPVAS 72

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--- 154
             +   Y +    GTP Q       DT +   W  C     C  C          PA   
Sbjct: 73  GQTPPSYVVRAGLGTPVQQLL-LALDTSADATWSHCAP---CDTC----------PAGSR 118

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           FIP  SSS   + C +  C    G    +       ++ + PL   ++   +        
Sbjct: 119 FIPASSSSYASLPCASDWCPLFEGQPCPAN------QDASAPLPACAFSKPFADTSFQAS 172

Query: 215 LLSETLRFPSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
           L S+TLR     +  +  GC        ++    G+ G GR   SL SQ G +    FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSY 232

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           CL S  +     S +L L      G +  P  + YTP   NP   S      YYV +  +
Sbjct: 233 CLPS--YRSYYFSGSLRL------GAAGQPRNVRYTPLLTNPHRPS-----LYYVNVTGL 279

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   VK+P            G ++DSG+  T    P++ A+ +EF RQ+   S     
Sbjct: 280 SVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS---GY 336

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAA 444
                   CF+     +   P + L   GG  + LP EN         L CL +    A 
Sbjct: 337 TSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM----AE 392

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            P        ++ + Q QN  +  D+A  R GFA++ C
Sbjct: 393 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 93/334 (27%), Positives = 137/334 (41%), Gaps = 51/334 (15%)

Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT 226
           C    CS I   + E        R  TC     +Y   YG G  T G+  +E   F S  
Sbjct: 3   CAGTLCSDILHHSCE--------RPDTC-----TYRYNYGDGTMTVGVYATERFTFASSG 49

Query: 227 VPNFLA-------GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
                        GC   ++ S    +GI GFGR+  SL SQL +++FSYCL S  +   
Sbjct: 50  GGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTS--YASR 107

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
             S+ L      G     T  +  TP  ++P   +     FYYV    + VG++ ++IP 
Sbjct: 108 RQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPT-----FYYVHFTGLTVGARRLRIPE 162

Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
           S      DG+GGVIVDSG+  T +   +   V + F RQ      A     + G+  CF 
Sbjct: 163 SAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAF-RQQLRLPFANGGNPEDGV--CFL 219

Query: 397 I-------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPAL 448
           +       S    + +P ++L F+ GA + LP  NY         LCL+L      G   
Sbjct: 220 VPAAWRRSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGDDGST- 277

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                  +G+   Q+  + +DL  +    A  +C
Sbjct: 278 -------IGNLVQQDMRVLYDLEAETLSIAPARC 304


>gi|359806276|ref|NP_001241217.1| uncharacterized protein LOC100818868 precursor [Glycine max]
 gi|255644718|gb|ACU22861.1| unknown [Glycine max]
          Length = 450

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 106/416 (25%), Positives = 163/416 (39%), Gaps = 72/416 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           YS S+  GTPP  +   + D     +WF C + Y     N     P R      K++  +
Sbjct: 50  YSTSIDMGTPP-LTLDLVIDIRERFLWFECGNDY-----NSSTYYPVRCGTKKCKKAKGT 103

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL--- 220
             I C N                GC+  N TC +        +G  F +G +  + L   
Sbjct: 104 ACITCTNHPLK-----------TGCT--NNTCGV---DPFNPFGEFFVSGDVGEDILSSL 147

Query: 221 ------RFPSKT-VPNFLAGCSILSDR------------QPAGIAGFGRSSESLPSQLGL 261
                 R PS   VP F++ C +  D+               G+ G  R++ SLP+QL  
Sbjct: 148 HSTSGARAPSTLHVPRFVSTC-VYPDKFGVEGFLQGLAKGKKGVLGLARTAISLPTQLAA 206

Query: 262 K-----KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG------LSYTPFYKNPVGS 310
           K     KF+ CL S          N + D   G G    P       LSYTP   NP  +
Sbjct: 207 KYNLEPKFALCLPSTS------KYNKLGDLFVGGGPYYLPPHDASKFLSYTPILTNPQST 260

Query: 311 SSAF----GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFE 366
              F       Y++ ++ I +  K V +  S L     GNGG  + +   +T     +++
Sbjct: 261 GPIFDADPSSEYFIDVKSIKLDGKIVNVNTSLLSIDRQGNGGCKLSTVVPYTKFHTSIYQ 320

Query: 367 AVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSV--YLPELILKFKGGAKMALPP 422
            +  +F++Q     +   V   +    CFD    GK      +P + L  KGG +  +  
Sbjct: 321 PLVNDFVKQAA-LRKIKRVTSVAPFGACFDSRTIGKTVTGPNVPTIDLVLKGGVQWRIYG 379

Query: 423 ENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            N    V   VLCL  F D    P      +I++G +Q+++  LEFDL + + GF+
Sbjct: 380 ANSMVKVSKNVLCL-GFVDGGLEPGSPIATSIVIGGYQMEDNLLEFDLVSSKLGFS 434


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 104/407 (25%), Positives = 162/407 (39%), Gaps = 71/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP      I DTGS ++W  C+S   C   +   +D   +  F    S 
Sbjct: 98  GLYFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSL 153

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  + C +P CS +F    ++    CS  N+        Y  +YG G  T+G  +++T 
Sbjct: 154 TAGSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTF 203

Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
            F +    + +A        GCS         SD+   GI GFG+   S+ SQL  +   
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263

Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
              FS+CL      D       VL      G+   PG+ Y+P   +            + 
Sbjct: 264 PPVFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLVPSQP----------HY 303

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  + +G     +P    V  +    G IVD+G+T T++    ++     F+  + N  
Sbjct: 304 NLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSV 359

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
                   S    C+ +S   S   P + L F GGA M L P++Y    G          
Sbjct: 360 SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYG--------IY 411

Query: 441 DNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           D A+   +G   A     ILGD  L++    +DLA  R G+A   C+
Sbjct: 412 DGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 458


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 119/405 (29%), Positives = 165/405 (40%), Gaps = 70/405 (17%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPA 154
           V S G Y ++LS GTPP      I DTGS L W    PCT  Y+ V           +P 
Sbjct: 86  VPSAGEYIMNLSIGTPPVPVIA-IVDTGSDLTWTQCRPCTHCYKQV-----------VPF 133

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
           F PK SS+ +   C    C        +  C+      K C     +++  Y  G FT G
Sbjct: 134 FDPKNSSTYRDSSCGTSFC---LALGNDRSCR----NGKKC-----TFMYSYADGSFTGG 181

Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK-- 262
            L  ETL   S      + P F  GC   S    D   +GI G G +  S+ SQL     
Sbjct: 182 NLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTIN 241

Query: 263 -KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF-YKNPVGSSSAFGEFYYV 320
            +FSYCLL   F D+ +SS +       SG     G   TP   K P         +Y +
Sbjct: 242 GRFSYCLLP-VFTDSSMSSRINFGR---SGIVSGAGTVSTPLVMKGPD------TYYYLI 291

Query: 321 GLRQIIVGSKHVKIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPL-FEAVAKEFIRQMGN 378
            L    VG K +    +S      +GN  +IVDSG+T+T++  PL F    +E +     
Sbjct: 292 TLEGFSVGKKRLSYKGFSKKAEVEEGN--IIVDSGTTYTYL--PLEFYVKLEESVAHSIK 347

Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
             R  D    S L  C++ +  + +  P +   FK  A + L P N F  +  +++C  +
Sbjct: 348 GKRVRDPNGISSL--CYNTTVDQ-IDAPIITAHFK-DANVELQPWNTFLRMQEDLVCFTV 403

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              +  G         ILG+    NF + FDL   R  F    C 
Sbjct: 404 LPTSDIG---------ILGNLAQVNFLVGFDLRKKRVSFKAADCT 439


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 122/440 (27%), Positives = 178/440 (40%), Gaps = 56/440 (12%)

Query: 58  SLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAS 117
           +LAS   +R  +L+ +  P    S+  S  S   I +    H  G Y + +  G+PP   
Sbjct: 81  ALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS----HGSGEYLVRVGIGSPPLEQ 136

Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
              + DTGS ++W  C+    C DC +   DP     F P  S+S   + C +  C    
Sbjct: 137 H-LVADTGSDVIWVQCSP---CSDC-YAQGDP----LFDPANSASFSPVPCNSGVC---- 183

Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT-VPNFLAGCS 235
                +R    S            Y + YG   +T G+L  ETL     T V     GC 
Sbjct: 184 --RAAARYSSSSCGGGG---GECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGVAMGCG 238

Query: 236 ILSD---RQPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPG 289
             +     + AG+ G G    SL  QLG      FSYCL      +   S +LVL    G
Sbjct: 239 HENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVL----G 294

Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
             D+   G  + P  +NP   S     FYYVG+  + V  + +++       G DG GGV
Sbjct: 295 REDAAPTGAVWVPLVRNPDAPS-----FYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGV 349

Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFDISGKKSVYLPEL 408
           ++D+G+  T +    + A+   F    G +   A       L   C+D+SG  SV +P +
Sbjct: 350 VMDTGTAVTRLPAEAYAALRGAF---AGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTV 406

Query: 409 ILKFKG------GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
            L F G       A + LP  N    V +     + F   A+GP+       ILG+ Q Q
Sbjct: 407 ALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPS-------ILGNIQQQ 459

Query: 463 NFYLEFDLANDRFGFAKQKC 482
              +  D A+   GF    C
Sbjct: 460 GIEITVDSASGYVGFGPATC 479


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 104/406 (25%), Positives = 161/406 (39%), Gaps = 71/406 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP      I DTGS ++W  C+S   C   +   +D   +  F    S 
Sbjct: 98  GLYFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSL 153

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  + C +P CS +F    ++    CS  N+        Y  +YG G  T+G  +++T 
Sbjct: 154 TAGSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTF 203

Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
            F +    + +A        GCS         SD+   GI GFG+   S+ SQL  +   
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263

Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
              FS+CL      D       VL      G+   PG+ Y+P   +            + 
Sbjct: 264 PPVFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLVPSQP----------HY 303

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  + +G     +P    V  +    G IVD+G+T T++    ++     F+  + N  
Sbjct: 304 NLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSV 359

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
                   S    C+ +S   S   P + L F GGA M L P++Y    G          
Sbjct: 360 SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYG--------IY 411

Query: 441 DNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           D A+   +G   A     ILGD  L++    +DLA  R G+A   C
Sbjct: 412 DGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 87/305 (28%), Positives = 142/305 (46%), Gaps = 44/305 (14%)

Query: 194 TCPLACP--SYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAG 247
            C  A P  +Y + YG G FT G L  E L+F +  V +F+ GC   +       +G+ G
Sbjct: 125 VCGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMG 184

Query: 248 FGRSSESLPSQL-GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
            GRS  SL SQ  G+    FSYCL S    +   S +L+L        + +P +SY    
Sbjct: 185 LGRSDLSLISQTSGIFGGVFSYCLPS---TERKGSGSLILGGNSSVYRNSSP-ISYAKMI 240

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
           +NP         FY++ L  I +G   ++ P       S G   ++VDSG+  T +   +
Sbjct: 241 ENP-----QLYNFYFINLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTI 288

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN 424
           ++A+  EF++Q   +  A      S L  CF++S  + V +P + + F+G A++ +    
Sbjct: 289 YKALKAEFLKQFTGFPPAPAF---SILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTG 345

Query: 425 YFALVGNEV--LCLIL----FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            F  V ++   +CL L    + D  A          ILG++Q +N  + +D    + GFA
Sbjct: 346 VFYFVKSDASQVCLALASLEYQDEVA----------ILGNYQQKNLRVIYDTKETKVGFA 395

Query: 479 KQKCA 483
            + C+
Sbjct: 396 LETCS 400


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 156/374 (41%), Gaps = 54/374 (14%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGSSL W  C     C        DP     + P  S + + + C + +CS +   
Sbjct: 1   MILDTGSSLSWLQCQP---CAVYCHAQADP----LYDPSVSKTYKKLSCASVECSRLKAA 53

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPS-KTVPNFLAGCSIL 237
            +        P  +T   AC  Y   YG   F+ G L  + L   S +T+P F  GC   
Sbjct: 54  TLND------PLCETDSNACL-YTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCG-- 104

Query: 238 SDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPG 289
            D Q      AGI G  R   S+ +QL  K    FSYCL        P +++     G  
Sbjct: 105 QDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL--------PTANSGSSGGGFL 156

Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGG 348
           S  S +P    T +   P+ + S     Y++ L  I V  + + +  + Y VP       
Sbjct: 157 SIGSISP----TSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVP------- 205

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
            ++DSG+  T +   ++ A+ + F++ M   ++ A     S L  CF  S K    +PE+
Sbjct: 206 TLIDSGTVITRLPMSMYAALRQAFVKIMS--TKYAKAPAYSILDTCFKGSLKSISAVPEI 263

Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
            + F+GGA + L   +        + CL       AG + G     I+G+ Q Q + + +
Sbjct: 264 KMIFQGGADLTLRAPSILIEADKGITCLAF-----AGSS-GTNQIAIIGNRQQQTYNIAY 317

Query: 469 DLANDRFGFAKQKC 482
           D++  R GFA   C
Sbjct: 318 DVSTSRIGFAPGSC 331


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 108/407 (26%), Positives = 163/407 (40%), Gaps = 71/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP+     I DTGS ++W  C S   C +C   +    ++  F    SS
Sbjct: 64  GLYFTKVKLGSPPREFNVQI-DTGSDVLWVCCNS---CNNCPRTSGLGIQLNFFDSSSSS 119

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++  + C +P C+      V++    CS +   C     SY  QYG G  T+G  +S+TL
Sbjct: 120 TAGQVRCSDPICT----SAVQTTATQCSSQTDQC-----SYTFQYGDGSGTSGYYVSDTL 170

Query: 221 RFPS----KTVPN----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
            F +      + N     + GCS         +D+   GI GFG+   S+ SQL  +   
Sbjct: 171 YFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGIT 230

Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
              FS+CL      D      LVL      G+   PG+ Y+P   +           Y +
Sbjct: 231 PRVFSHCLKG----DGSGGGILVL------GEILEPGIVYSPLVPSQ--------PHYNL 272

Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
            L  I V  + + I P ++    S G    IVDSG+T  ++    ++     F+  +   
Sbjct: 273 NLLSIAVNGQLLPIDPAAFATSNSQGT---IVDSGTTLAYLVAEAYD----PFVSAVNAI 325

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY---FALVGNEVLCL 436
              +     S    C+ +S   S   P     F GGA M L PE+Y   F   G   +  
Sbjct: 326 VSPSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWC 385

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I F               ILGD  L++    +DL   R G+A   C+
Sbjct: 386 IGFQKVQG--------VTILGDLVLKDKIFVYDLVRQRIGWANYDCS 424


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 133/302 (44%), Gaps = 51/302 (16%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
           + + G Y   +S GTPPQ     + DTGS++ W  C     C  C      P  +  F P
Sbjct: 35  IFAMGLYYTRISLGTPPQQFYVDV-DTGSNVAWVKCAP---CTGCEHSGDVPVPMSTFDP 90

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLL 216
           ++S++   I C + +C       V ++   CSP   +CP     Y L YG G  TAG  L
Sbjct: 91  RKSTTKISISCTDAEC------GVLNKKLQCSPERLSCP-----YSLLYGDGSSTAGYYL 139

Query: 217 SETLRFPSKTVPN-----------FLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFS 265
           ++   F      N           F  G +        G+ GFG ++ SLP+QL  +  S
Sbjct: 140 NDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWSVDGLLGFGPTTVSLPNQLAQQNIS 199

Query: 266 YCLLSRKFD-DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLR 323
             + +     D     +LV+      G  + P L YTP           FGE +Y V L 
Sbjct: 200 VNIFAHCLQGDVSGRGSLVI------GTIREPDLVYTPM---------VFGEDHYNVQLL 244

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I +  ++V  P S+ +   +  GGVI+DSG+T T++  P ++    EF R +  + +++
Sbjct: 245 NIGISGRNVTTPASFDL---EYTGGVIIDSGTTLTYLVQPAYD----EFRRGVSVFKQSS 297

Query: 384 DV 385
           D+
Sbjct: 298 DL 299


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 115/435 (26%), Positives = 174/435 (40%), Gaps = 78/435 (17%)

Query: 63  SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF-- 120
           S++RA H            N  S YSN+ +++P+++   G Y +S S GTPP    P   
Sbjct: 59  SMNRANHF-----------NQISVYSNA-VESPVTLLDDGDYLMSYSLGTPP---FPVYG 103

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I DT S ++W  C     C +   P  DPS    +     SS+     Q   CS      
Sbjct: 104 IVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCS------ 157

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-----PSKTVPNFLAGC 234
                   S   K C      + + Y  G  + G L+ ET+       P    P  + GC
Sbjct: 158 --------SDERKIC-----EHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGC 204

Query: 235 SILSDR--QPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVSS-NLVLDTGP 288
              ++      GI G G    SL  QL     KKFSYCL       AP+S  +  L  G 
Sbjct: 205 IRNTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCL-------APISDRSSKLKFGD 257

Query: 289 GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
            +  S    +S    +K+       + +FYY+ L    VG+   +I +      S G G 
Sbjct: 258 AAMVSGDGTVSTRIVFKD-------WKKFYYLTLEAFSVGNN--RIEFRSSSSRSSGKGN 308

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
           +I+DSG+TFT +   ++  + +  +  +    RA D  K+  L  C+  +  K V +P +
Sbjct: 309 IIIDSGTTFTVLPDDVYSKL-ESAVADVVKLERAEDPLKQFSL--CYKSTYDK-VDVPVI 364

Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
              F  GA + L   N F +  + V+CL   +  +           I G+   QNF + +
Sbjct: 365 TAHF-SGADVKLNALNTFIVASHRVVCLAFLSSQSGA---------IFGNLAQQNFLVGY 414

Query: 469 DLANDRFGFAKQKCA 483
           DL      F    C 
Sbjct: 415 DLQRKIVSFKPTDCT 429


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 115/439 (26%), Positives = 169/439 (38%), Gaps = 81/439 (18%)

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFI--FD 123
           R +   +  KPK   S + +N+  SL  T         Y  SL  GTP   +T  +   D
Sbjct: 110 RRKVTASSNKPKGGVSLL-ANWGKSLSTT--------NYVASLRLGTP---ATELVVELD 157

Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
           TGS   W  C     C DC        R P F P  SS+   + C   +C  +   +   
Sbjct: 158 TGSDQSWVQCKP---CADCY-----EQRDPVFDPTASSTYSAVPCGARECQELASSSSSR 209

Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-------SKTVPNFLAGCSI 236
            C   + +N      CP  +       T G L  +TL          + TVP F+ GC  
Sbjct: 210 NCSSDNNKN------CPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCG- 262

Query: 237 LSDRQPAGIAG-------FGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDT 286
                 AG  G        G    SLPSQ+  +    FSYCL S        S+   L  
Sbjct: 263 ---HSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSP------SAAGYLSF 313

Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
           G   G +      +T        +S      YY+ L  I+V  + +K+P S     +   
Sbjct: 314 G---GAAARANAQFTEMVTGQDPTS------YYLNLTGIVVAGRAIKVPASAFATAA--- 361

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
            G I+DSG+ F+ +    + A+   F   MG Y R            C+D +G ++V +P
Sbjct: 362 -GTIIDSGTAFSRLPPSAYAALRSSFRSAMGRY-RYKRAPSSPIFDTCYDFTGHETVRIP 419

Query: 407 ELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
            + L F  GA + L P        N+V   CL    ++  G         ILG+ Q +  
Sbjct: 420 AVELVFADGATVHLHPSGVL-YTWNDVAQTCLAFVPNHDLG---------ILGNTQQRTL 469

Query: 465 YLEFDLANDRFGFAKQKCA 483
            + +D+ + R GF ++ CA
Sbjct: 470 AVIYDVGSQRIGFGRKGCA 488


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 104/406 (25%), Positives = 161/406 (39%), Gaps = 73/406 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y   +  G+PP      I DTGS ++W  C+S   C   +   +D   +  F    S ++
Sbjct: 105 YFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSLTA 160

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             + C +P CS +F    ++    CS  N+        Y  +YG G  T+G  +++T  F
Sbjct: 161 GSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTFYF 210

Query: 223 PSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK---- 263
            +    + +A        GCS         SD+   GI GFG+   S+ SQL  +     
Sbjct: 211 DAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPP 270

Query: 264 -FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVG 321
            FS+CL      D       VL      G+   PG+ Y+P     P           +  
Sbjct: 271 VFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLVPSQP-----------HYN 309

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L  + +G     +P    V  +    G IVD+G+T T++    ++     F+  + N   
Sbjct: 310 LNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSVS 365

Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
                  S    C+ +S   S   P + L F GGA M L P++Y    G          D
Sbjct: 366 QLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYG--------IYD 417

Query: 442 NAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            A+   +G   A     ILGD  L++    +DLA  R G+A   C+
Sbjct: 418 GASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 463


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 115/410 (28%), Positives = 176/410 (42%), Gaps = 75/410 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
           G Y   +  G PP+     I DTGS ++W  C S   C  C  P     +IP   F P  
Sbjct: 81  GLYYTRVQLGNPPKDFYVQI-DTGSDVLWVSCNS---CNGC--PATSGLQIPLNFFDPGS 134

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
           S+++ L+ C +  C+      V+S    C  ++  C     +Y+ QYG G  T+G  + +
Sbjct: 135 STTASLVSCSDQICAL----GVQSSDSACFGQSNQC-----AYVFQYGDGSGTSGYYVMD 185

Query: 219 TLRFP--------SKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL---G 260
            +           S +  + + GCS         SDR   GI GFG+   S+ SQL   G
Sbjct: 186 MIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRG 245

Query: 261 L--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
           +  K FS+CL   K DD+     LVL      G+   P + YTP   +           Y
Sbjct: 246 IAPKVFSHCL---KGDDSG-GGILVL------GEIVEPNVVYTPLVPSQ--------PHY 287

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
            + L+ I V  +   +P S  V  +  + G I+DSG+T  ++    + A    F+  + N
Sbjct: 288 NLNLQSISVNGQ--VLPISPAVFATSSSQGTIIDSGTTLAYLAEEAYNA----FVVAVTN 341

Query: 379 -YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEV 433
             S++       G R C+  S   S   P++ L F GGA + L  ++Y     ++ G  V
Sbjct: 342 IVSQSTQSVVLKGNR-CYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTV 400

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            C+            G+G  I LGD  L++    +DLAN R G+    C+
Sbjct: 401 WCI------GFQKIPGQGITI-LGDLVLKDKIFIYDLANQRIGWTNYDCS 443


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 117/427 (27%), Positives = 169/427 (39%), Gaps = 95/427 (22%)

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIF---DTGSSLVWFPCTSRYRCVDCN 143
           Y  S I++P+S +    Y + LS GTPP      I+   DTGS LVWF C    +C    
Sbjct: 44  YKPSTIQSPVSAYDCE-YLMELSIGTPPIK----IYAEADTGSDLVWFQCIPCTKCY--- 95

Query: 144 FPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
                  + P F P+ SSS   I C    C+ +           CS   KTC     +Y 
Sbjct: 96  -----KQQNPMFDPRSSSSYTNITCGTESCNKL-------DSSLCSTDQKTC-----NYT 138

Query: 204 LQYGLG-FTAGLLLSETLRFPSKT-----VPNFLAGC----SILSDRQPAGIAGFGRSSE 253
             Y     T G+L  ETL   S T         + GC    S  +DR+  G+ G GR   
Sbjct: 139 YSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDRE-MGLIGLGRGPL 197

Query: 254 SLPSQLGL------KKFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFY 304
           SL SQ+G         FS CL+     D  ++S +    G    G+G   TP +S     
Sbjct: 198 SLISQIGSSLGAGGNMFSQCLVPFN-TDPSITSQMNFGKGSEVLGNGTVSTPLIS----- 251

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP-GSDGNGGVIVDSGSTFTFMEGP 363
           K+  G        Y+  L  I V  + + +P+S     G+   G +++DSG+T T++   
Sbjct: 252 KDGTG--------YFATLLGISV--EDINLPFSNGSSLGTITKGNILIDSGTTITYL--- 298

Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL-------PELILKFKGGA 416
                 +EF  ++        V  K  L P F I G +  Y        P L + F+GG 
Sbjct: 299 -----PEEFYHRL-----IEQVRNKVALEP-FRIDGYELCYQTPTNLNGPTLTIHFEGG- 346

Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
            + L P   F  V ++  C  +F  N           +  G++   N+ + FDL      
Sbjct: 347 DVLLTPAQMFIPVQDDNFCFAVFDTNEE--------YVTYGNYAQSNYLIGFDLERQVVS 398

Query: 477 FAKQKCA 483
           F    C 
Sbjct: 399 FKATDCT 405


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 101/403 (25%), Positives = 159/403 (39%), Gaps = 63/403 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     + DTGS ++W  C S  +C   +   +D   +  + PK SS
Sbjct: 82  GLYFTEIKLGTPPKRYYVQV-DTGSDILWVNCISCEKCPRKSGLGLD---LTFYDPKASS 137

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S   + C    C+  +G     +  GC+  N  C      Y + YG G  T G  +++ L
Sbjct: 138 SGSTVSCDQGFCAATYG----GKLPGCTA-NVPC-----EYSVMYGDGSSTTGFFVTDAL 187

Query: 221 RF----------PSKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL---- 261
           +F          P      F  G     D     +   GI GFG+++ S+ SQL      
Sbjct: 188 QFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKV 247

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K F++CL + K        N+V            P +  TP   +           Y V
Sbjct: 248 KKIFAHCLDTIKGGGIFAIGNVV-----------QPKVKTTPLVADM--------PHYNV 288

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I VG   +++P      G     G I+DSG+T T++     E V KE +  + N  
Sbjct: 289 NLKSIDVGGTTLQLPAHVFETGE--RKGTIIDSGTTLTYLP----ELVFKEVMAAIFNKH 342

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
           +            CF   G      P +   F+    + + P  YF   GN++ C + F 
Sbjct: 343 QDIVFHNVQDFM-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGNDMYC-VGFQ 400

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           + A     G+   +++GD  L N  + +DL N   G+    C+
Sbjct: 401 NGALQSKDGK-DIVLMGDLVLSNKLVIYDLENQVIGWTDYNCS 442


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 104/407 (25%), Positives = 159/407 (39%), Gaps = 71/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV-DCNFPNVDPSRIPAFIPKRS 160
           G Y   +  GTPP      + DTGS + W  C     CV +   P++   ++  + P RS
Sbjct: 35  GLYYTKIYLGTPPVGYYVQV-DTGSDVTWLNCAPCTSCVTETQLPSI---KLTTYDPSRS 90

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
           S+   + C++  C    G N E  C        T    C +Y   YG G  T G  + + 
Sbjct: 91  STDGALSCRDSNCGAALGSN-EVSC--------TSAGYC-AYSTTYGDGSSTQGYFIQDV 140

Query: 220 LRFP---SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
           + F    + T  N  A    GC        ++S R   G+ GFG+++ S+PSQL      
Sbjct: 141 MTFQEIHNNTQVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKV 200

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
             +F++CL      D      +V+      G    P +SYTP               Y V
Sbjct: 201 GNRFAHCLQG----DNQGGGTIVI------GSVSEPNISYTPIVSR---------NHYAV 241

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
           G++ I V  ++V  P S+    S   GGVI+DSG+T  ++  P +     +F+  +  + 
Sbjct: 242 GMQNIAVNGRNVTTPASFDTT-STSAGGVIMDSGTTLAYLVDPAY----TQFVNAVSTFE 296

Query: 381 RAADVEKKSGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLC 435
            +      S    C  ++        P + L F  GA M L P NY        G    C
Sbjct: 297 SS----MFSSHSQCLQLAWCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYC 352

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +           L      ILGD  L++  + +D  N   G+    C
Sbjct: 353 MGWQKSTTKAGYLSYS---ILGDIVLKDHLVVYDNDNRVVGWKSFDC 396


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 110/398 (27%), Positives = 163/398 (40%), Gaps = 70/398 (17%)

Query: 96  LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
           LS  +   Y +++  GTP +   P IFDTGS L+W  C     C    +P     ++P F
Sbjct: 124 LSKITASDYIVNVGIGTP-KKEMPLIFDTGSGLIWTQCKPCKAC----YP-----KVPVF 173

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGL 214
            P +S+S + + C +  C        +S  +GCS    T       YL  Y     + G 
Sbjct: 174 DPTKSASFKGLPCSSKLC--------QSIRQGCSSPKCT-------YLTAYVDNSSSTGT 218

Query: 215 LLSETLRFP--SKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLG---LKKFSY 266
           L +ET+ F        N L GCS     +    +GI G  RS  SL SQ      K FSY
Sbjct: 219 LATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSY 278

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           C+ S      P S      TG  +   K P  + ++P  K    S       Y + +  I
Sbjct: 279 CIPS-----TPGS------TGHLTFGGKVPNDVRFSPVSKTAPSSD------YDIKMTGI 321

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG + + I  S     S       +DSG+  T +    + A+   F   M  Y     +
Sbjct: 322 SVGGRKLLIDASAFKIAS------TIDSGAVLTRLPPKAYSALRSVFREMMKGYPL---L 372

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAA 444
           ++   L  C+D S   +V +P + + F+GG +M +        V G++V CL        
Sbjct: 373 DQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAF------ 426

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             A       I G+FQ + + + FD A +R GFA   C
Sbjct: 427 --AELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|255552239|ref|XP_002517164.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543799|gb|EEF45327.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 433

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 104/393 (26%), Positives = 157/393 (39%), Gaps = 73/393 (18%)

Query: 121 IFDTGSSLVWFPCTSRY--------RC--VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           I D G   +W  C   Y        RC    CN  N +      F   R       GC N
Sbjct: 60  ILDLGGLYLWVDCDRGYVSSTYRPARCNSAQCNLANANGCITACFDAPRP------GCNN 113

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
             C+ +    V +             L      LQ   G   G ++S         V NF
Sbjct: 114 NTCALLVDNTVTNI-------GTDGELGQDVVSLQSTDGSNPGRVVS---------VSNF 157

Query: 231 LAGC--SILSDRQPAG---IAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDA 276
           L  C  S + +  P+G   +AG GR+  SLPSQ        +KF+ CL S K    F   
Sbjct: 158 LFVCAPSFILNGLPSGTEGMAGLGRTKVSLPSQFAAAFSFNRKFAICLSSSKGVVFFGKE 217

Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKH 331
           P      +D       SK   L+YTP   NPV +++AF +      Y++G++ I +  K 
Sbjct: 218 PYIIQPNIDV------SKI--LTYTPLIINPVSTAAAFVQGDPSSDYFIGVKSININGKP 269

Query: 332 VKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
           V +  + L +    G GG ++ +   +T ME  ++ A    F++++ +  R A V     
Sbjct: 270 VPLNTTLLSINSQTGFGGTMISTVVPYTVMETTIYNAFVNAFVKELVDVPRVASVAP--- 326

Query: 391 LRPCFD----ISGKKSVYLPELILKFKG-GAKMALPPENYFALVGNEVLCLILFTDNAAG 445
              CFD    +  +    +P + L  +       +   N    V  +VLCL  F D    
Sbjct: 327 FGACFDASKIVGTRLGAAVPSIDLVLQSSNVFWRIVGANSMVQVNEDVLCL-GFVDGGEN 385

Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           P      +I++G  QL++  L+FDLA  R GF+
Sbjct: 386 PR----TSIVIGGHQLEDNLLQFDLATSRLGFS 414


>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
          Length = 382

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 92/382 (24%), Positives = 155/382 (40%), Gaps = 36/382 (9%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           S + GTPPQ ++ FI D G  LVW  C+        N        +P F P +SS+ +  
Sbjct: 27  SFTIGTPPQPASAFI-DVGGLLVWTQCSQCSSSSCFN------QELPPFDPTKSSTYRPE 79

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
            C    C   F P     C G       C     + L ++    T+G + ++ +   + T
Sbjct: 80  PCGTALCE--FFPASIRNCSG-----DVCAYEASTQLFEH----TSGKIGTDAVAIGTAT 128

Query: 227 VPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
             +   GC + SD +     P+G  G  R+  SL +Q+ +  FS+CL          +S 
Sbjct: 129 AASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHCLAPHDGGGGK-NSR 187

Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
           L L                TPF K+      +   +Y + L  I  G + +       VP
Sbjct: 188 LFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSL--YYLINLEGIKAGDEAI-----ITVP 240

Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK 401
            S     V++ + S  +F+   +++ + K     +G  +     + +S    CF   G  
Sbjct: 241 QSGRT--VLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSIFDLCFKRGGVS 298

Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQL 461
               P+++L F+G A + +PP NY   VG++ +C+ + +          G + ILG  Q 
Sbjct: 299 GA--PDVVLTFQGAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMS-ILGGLQQ 355

Query: 462 QNFYLEFDLANDRFGFAKQKCA 483
           QN +  +DL  +   F    C+
Sbjct: 356 QNVHFLYDLEKETLSFEAADCS 377


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 118/448 (26%), Positives = 175/448 (39%), Gaps = 104/448 (23%)

Query: 63  SLSRARHL-KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFI 121
           S++RA HL ++   P + ++ +              + + G Y IS S GTP       I
Sbjct: 61  SINRANHLNQSFVSPNSPETTV--------------ISALGEYLISYSVGTP-SLQVFGI 105

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DTGS ++W  C    +C +           P F   +S + + + C +  C  + G   
Sbjct: 106 LDTGSDIIWLQCQPCKKCYE--------QTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFC 157

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKT-----VPNFLAGC- 234
            SR        K C      Y + Y  G  + G L  ETL   S        P  + GC 
Sbjct: 158 SSR--------KHCL-----YSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCG 204

Query: 235 ---SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLL--------SRKFDDAPVSS 280
              +I  + + +GI G GR   SL +QL      KFSYCL+           F +A V S
Sbjct: 205 RYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNAAVVS 264

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
                   G G   TP      F KN +        FY++ L    VG   ++    +  
Sbjct: 265 --------GRGTVSTP-----LFSKNGL-------VFYFLTLEAFSVGRNRIE----FGS 300

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFE----AVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
           PGS G G +I+DSG+T T +   ++     AVAK  I Q     R  D  +  GL  C+ 
Sbjct: 301 PGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQ-----RVRDPNQVLGL--CYK 353

Query: 397 IS-GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           ++  K    +P +   F  GA + L   N F  V ++V+C   F     G         +
Sbjct: 354 VTPDKLDASVPVITAHFS-GADVTLNAINTFVQVADDVVCFA-FQPTETGA--------V 403

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            G+   QN  + +DL  +   F    C 
Sbjct: 404 FGNLAQQNLLVGYDLQMNTVSFKHTDCT 431


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 116/458 (25%), Positives = 170/458 (37%), Gaps = 65/458 (14%)

Query: 39  LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-S 97
           LS  H +H     PL+ + +LA +  +R   L +K              S  +   P+ S
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAA-----------SSGGVTSAPVAS 72

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--- 154
             +   Y +    GTP Q       DT +   W  C     C  C          PA   
Sbjct: 73  GQTPPSYVVRAGLGTPVQQLL-LALDTSADATWSHCAP---CDTC----------PAGSR 118

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           FIP  SSS   + C +  C    G    +       ++ + PL   ++   +        
Sbjct: 119 FIPASSSSYASLPCASDWCPLFEGQPCPAN------QDASAPLPACAFSKPFADTSFQAS 172

Query: 215 LLSETLRFPSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
           L S+TLR     +  +  GC        ++    G+ G GR   SL SQ G      FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSY 232

Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
           CL S  +     S +L L      G +  P  + YTP   NP   S      YYV +  +
Sbjct: 233 CLPS--YRSYYFSGSLRL------GAAGQPRNVRYTPLLTNPHRPS-----LYYVNVTGL 279

Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
            VG   VK+P            G ++DSG+  T    P++ A+ +EF RQ+   S     
Sbjct: 280 SVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS---GY 336

Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAA 444
                   CF+     +   P + L   GG  + LP EN         L CL +    A 
Sbjct: 337 TSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM----AE 392

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            P        ++ + Q QN  +  D+A  R GFA++ C
Sbjct: 393 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 108/395 (27%), Positives = 160/395 (40%), Gaps = 61/395 (15%)

Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
           GY IS   GTPP      + DT +  +WF C     C +   P  DPS        +SS+
Sbjct: 88  GYIISFLIGTPP-FQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPS--------KSST 138

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF 222
            + I C +PKC      NVE+    CS  +K     C       G  ++ G L  +TL  
Sbjct: 139 YKTIPCSSPKCK-----NVEN--THCSSDDKK---VCEYSFTYGGEAYSQGDLSIDTLTL 188

Query: 223 PSK-----TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLGLK---KFSYCLL 269
            S      +  N + GC    ++ P     +G  G GR   S  SQL      KFSYCL+
Sbjct: 189 NSNNDTPISFKNIVIGCG-HRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLV 247

Query: 270 SRKFDDAPVSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
              F +  +S  L   D    SG     G   TP     +G        Y   L  + VG
Sbjct: 248 PL-FSNEGISGKLHFGDKSVVSG----VGTVSTPITAGEIG--------YSTTLNALSVG 294

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
              +K   S     +D  G  I+DSG+T T +   ++  + +  +  M    RA    ++
Sbjct: 295 DHIIKFENS--TSKNDNLGNTIIDSGTTLTILPENVYSRL-ESIVTSMVKLERAKSPNQQ 351

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
              + C+  +  K++ +P +   F  GA + L   N F  + +EV+C        A  ++
Sbjct: 352 --FKLCYKAT-LKNLDVPIITAHFN-GADVHLNSLNTFYPIDHEVVCF-------AFVSV 400

Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G  P  I+G+   QNF + FDL  +   F    C 
Sbjct: 401 GNFPGTIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435


>gi|255552237|ref|XP_002517163.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223543798|gb|EEF45326.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 469

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 100/397 (25%), Positives = 162/397 (40%), Gaps = 66/397 (16%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P      I D G+  +W  C   Y                      SSS   + C +  C
Sbjct: 89  PLVPVKLIVDLGARFMWVDCEEGYV---------------------SSSYTPVSCDSLLC 127

Query: 174 SWIFGPNVESRCK-----GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-- 226
                    + C      GC   N TC  +  + +++ G     G  +     F  KT  
Sbjct: 128 KLANSLACATECNSTPKPGC--HNNTCAHSPENPVIRLGTSGQIGQDVVSLQSFNGKTPD 185

Query: 227 ----VPNF--LAGCSILSDRQP---AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK 272
               VPNF  + G + L +       G+AG G S+ SLP+Q        KKF+ CL +  
Sbjct: 186 RIVSVPNFPFVCGPTFLLENLADGVTGLAGLGNSNISLPAQFSSAFGFPKKFAVCLSNS- 244

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS--SAFGEF---YYVGLRQIIV 327
                  SN ++  G G   +    L+YTP   NPV ++  S  GE    Y++G++ I +
Sbjct: 245 -----TKSNGLIFFGDGPYSNLPNDLTYTPLIHNPVSTAGGSYLGEASVEYFIGVKSIRI 299

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
           G K VK   + L   S+G GG  + +   +T +   +++AV K F+++M           
Sbjct: 300 GGKDVKFNKTLLSIDSEGKGGTKISTVDPYTVLHTSIYKAVVKAFVKEMDKKFIPQVQPP 359

Query: 388 KSGLRPCFDI----SGKKSVYLP--ELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
            +    CF      S +    LP  +L+L+ +G     +   N    + + V+CL  F D
Sbjct: 360 IAPFGACFQSIVIDSNEFGPVLPFIDLVLEGQGSVTWRIWGANSMVKISSLVMCL-GFVD 418

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
               P      +I++G  Q+++  L+FDLA+ + GF+
Sbjct: 419 GGIEPRT----SIVIGGRQIEDNLLQFDLASSKLGFS 451


>gi|147801500|emb|CAN61502.1| hypothetical protein VITISV_011733 [Vitis vinifera]
          Length = 415

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 106/389 (27%), Positives = 155/389 (39%), Gaps = 80/389 (20%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + D G   +W  C   Y         V  S  P  +          GC N  CS +   
Sbjct: 59  LVVDLGGQFLWVDCEQNY---------VSSSYRPGAVQP--------GCNNNTCS-VLPD 100

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
           N  +R            LA  +  +Q   G   G          S +V  FL  C+  S 
Sbjct: 101 NTVTRTASSDE------LAEDAVSVQSTDGSNPG---------RSVSVSKFLFSCAPTSL 145

Query: 240 RQ-----PAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
            +       G+AG GR+  +LPSQ        +KF+ CL S    D       V+  G G
Sbjct: 146 LEGLASGAKGMAGLGRTRIALPSQFASAFSFHRKFAICLSSSTTADG------VILLGDG 199

Query: 290 S-----GDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKIPYSYL 339
           S         +  L YTP   NPV ++SA  +      Y++G++ I +  K V +  S L
Sbjct: 200 SYGLLPNVDASQLLIYTPLILNPVSTASAHSQGEPSAEYFIGVKSIQINEKAVPLNTSLL 259

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDI 397
              S G GG  + + + +T ME  ++ A  K FI      N +R A V   S    CF  
Sbjct: 260 SINSKGVGGTKISTVNPYTVMETSIYSAFTKAFISAAASMNITRVAAVAPFS---VCFS- 315

Query: 398 SGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLILFTDNAAGPALG 449
              K+VY       +P + L  +  + +  +   N    V  +VLCL  F D  A P   
Sbjct: 316 --SKNVYSTRGGAAVPTIGLVLQNNSVVWRIFGANSMVFVNGDVLCL-GFVDGGANPR-- 370

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFA 478
              +I++G +QL++  L+FDLA  R GF+
Sbjct: 371 --TSIVIGGYQLEDNLLQFDLAASRLGFS 397


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 103/404 (25%), Positives = 165/404 (40%), Gaps = 65/404 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS ++W  C S  RC   +   ++   +  + PK SS
Sbjct: 87  GLYYTEIGIGTPTKRYYVQV-DTGSDILWVNCISCDRCPRKSGLGLE---LTLYDPKDSS 142

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +   + C    C+  +G        GC     T  L C  Y + YG G  T G  +S+ L
Sbjct: 143 TGSKVSCDQGFCAATYG----GLLPGC-----TTSLPC-EYSVTYGDGSSTTGYFVSDLL 192

Query: 221 RF----------PSKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL---- 261
           +F          P+ +   F  G     D     +   GI GFG+S+ S+ SQL      
Sbjct: 193 QFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKV 252

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K F++CL +          N+V            P +  TP   N           Y V
Sbjct: 253 KKIFAHCLDTINGGGIFAIGNVV-----------QPKVKTTPLVPNM--------PHYNV 293

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I VG   +K+P      G     G I+DSG+T T++     E V KE +  +  ++
Sbjct: 294 NLKSIDVGGTALKLPSHMFDTGE--KKGTIIDSGTTLTYLP----EIVYKEIM--LAVFA 345

Query: 381 RAADVEKKSGLR-PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           +  D+   +     CF   G+     P++   F+    + + P +YF   G+ + C + F
Sbjct: 346 KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFENDLPLNVYPHDYFFENGDNLYC-VGF 404

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +       G+G  ++LGD  L N  + +DL N   G+ +  C+
Sbjct: 405 QNGGLQSKDGKG-MVLLGDLVLSNKLVVYDLENQVIGWTEYNCS 447


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 96/349 (27%), Positives = 145/349 (41%), Gaps = 52/349 (14%)

Query: 154 AFIPKRSSSSQLIGCQNPKC----SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
            F P RS S Q + C + KC    S +F  ++      C   +  C      Y + Y  G
Sbjct: 190 VFCPHRSKSFQAVTCASQKCKIDLSQLFSLSL------CPKPSDPCL-----YDISYADG 238

Query: 210 FTA-GLLLSETLRFPSKT-----VPNFLAGCS------ILSDRQPAGIAGFGRSSESLPS 257
            +A G   ++T+    K      + N   GC+      +  +    GI G G + +S   
Sbjct: 239 SSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFID 298

Query: 258 QLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSA 313
           +   +   KFSYCL+        VSS L +    G  ++K  G +  T     P      
Sbjct: 299 KAAYEYGAKFSYCLVDH-LSHRNVSSYLTIG---GHHNAKLLGEIKRTELILFP------ 348

Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
              FY V +  I +G + +KIP    V   +  GG ++DSG+T T +  P +E V +  I
Sbjct: 349 --PFYGVNVVGISIGGQMLKIPPQ--VWDFNSQGGTLIDSGTTLTALLVPAYEPVFEALI 404

Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
           + +    R    E    L  CFD  G     +P L+  F GGA+   P ++Y   V   V
Sbjct: 405 KSLTKVKRVTG-EDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLV 463

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            C+ +       P  G G A ++G+   QN   EFDL+ +  GFA   C
Sbjct: 464 KCIGIV------PIDGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506


>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
          Length = 424

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 99/418 (23%), Positives = 149/418 (35%), Gaps = 93/418 (22%)

Query: 95  PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD------ 148
           PL       Y  S   G PPQ +   + DTGS LVW  C++      C  P         
Sbjct: 69  PLRWSGKTQYIASYGIGDPPQPAEAVV-DTGSDLVWTQCST------CRLPAAAAAGGGG 121

Query: 149 --PSRIPAFIPKRSSSSQLIGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
             P  +P +    S +++ + C +   +     P      +G    +  C +A       
Sbjct: 122 CFPQNLPYYNFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAA-----S 176

Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQL 259
           YG G   G+L ++   FPS +      GC   +   P      +GI G GR + SL    
Sbjct: 177 YGAGVALGVLGTDAFTFPSSSSVTLAFGCVSQTRISPGALTGASGIIGLGRGALSL---- 232

Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
                                                         NP    S F  FYY
Sbjct: 233 ----------------------------------------------NP--KDSPFSTFYY 244

Query: 320 VGLRQIIVGSKHVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
           + L  +  G+  V +P               GG ++DSGS FT +  P   A+ KE  RQ
Sbjct: 245 LPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQ 304

Query: 376 M-GNYSRAADVEKKSG-LRPCF----DISGKKSVYLPELILKFK----GGAKMALPPENY 425
           + G+ S      K  G L  C     D     +  +P L+L+F     GG ++ +P E Y
Sbjct: 305 LRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKY 364

Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +A V     C+ + +  +    L      I+G+F  Q+  + +DLAN    F    C+
Sbjct: 365 WARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 422


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 107/391 (27%), Positives = 159/391 (40%), Gaps = 58/391 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSS 162
           Y IS+  G+P   +   + DTGS + W       +C  C  P+   +   A F P  SS+
Sbjct: 135 YVISVGLGSPAM-TQRVVIDTGSDVSWV------QCEPCPAPSPCHAHAGALFDPAASST 187

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR 221
                C    C+ + G + E+   GC  +++        Y+++YG G  T G   S+ L 
Sbjct: 188 YAAFNCSAAACAQL-GDSGEA--NGCDAKSRC------QYIVKYGDGSNTTGTYSSDVLT 238

Query: 222 FP-SKTVPNFLAGCSILS-----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
              S  V  F  GCS        D +  G+ G G  ++SL SQ      K FSYCL +  
Sbjct: 239 LSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPA-- 296

Query: 273 FDDAPVSSN-LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
               P SS  L L      G       + TP  +     S     +Y+  L  I VG K 
Sbjct: 297 ---TPASSGFLTLGAPASGGGGGASRFATTPMLR-----SKKVPTYYFAALEDIAVGGKK 348

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + +  S    GS      +VDSG+  T +    + A++  F   M  Y+RA   E    L
Sbjct: 349 LGLSPSVFAAGS------LVDSGTVITRLPPAAYAALSSAFRAGMTRYARA---EPLGIL 399

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
             CF+ +G   V +P + L F GGA + L   +   +V    L      D+    A G  
Sbjct: 400 DTCFNFTGLDKVSIPTVALVFAGGAVVDL---DAHGIVSGGCLAFAPTRDD---KAFG-- 451

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               +G+ Q + F + +D+    FGF    C
Sbjct: 452 ---TIGNVQQRTFEVLYDVGGGVFGFRAGAC 479


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 105/410 (25%), Positives = 171/410 (41%), Gaps = 72/410 (17%)

Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           YG Y+  +  GTPP+  T  I DTGS ++W  C +   C +C   +     +  F    S
Sbjct: 81  YGLYTTKVKMGTPPREFTVQI-DTGSDILWINCNT---CSNCPKSSGLGIELNFFDTVGS 136

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
           S++ L+ C +P C+      ++     CSP+   C     SY  QY  G  T+G+ +S+ 
Sbjct: 137 STAALVPCSDPMCA----SAIQGAAAQCSPQVNQC-----SYTFQYEDGSGTSGVYVSDA 187

Query: 220 LRFP---SKTVPNFLA-------GCSIL-------SDRQPAGIAGFGRSSESLPSQL--- 259
           + F     ++ P  +A       GCS         +D+   GI GFG    S+ SQL   
Sbjct: 188 MYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSR 247

Query: 260 GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
           G+  K FS+CL      D      LVL      G+   P + Y+P   +           
Sbjct: 248 GITPKVFSHCLKG----DGNGGGILVL------GEILEPSIVYSPLVPSQ--------PH 289

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y + L+ I V  + + I  +  V  +    G I+DSG+T +++    ++ +       + 
Sbjct: 290 YNLNLQSIAVNGQVLSINPA--VFATSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVS 347

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
            ++ +  + K S    C+ +        P +   F+GGA M L P  Y    G       
Sbjct: 348 QFATSF-ISKGS---QCYLVLTSIDDSFPTVSFNFEGGASMDLKPSQYLLNRG------- 396

Query: 438 LFTDNAAGPALG----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            F D A    +G    +    ILGD  L++  + +DLA  + G+    C+
Sbjct: 397 -FQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQQIGWTNYDCS 445


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 126/291 (43%), Gaps = 36/291 (12%)

Query: 202 YLLQYGLG-FTAGLLLSETLRFPSK-TVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
           Y +QYG G +T G    +TL   S   +  F  GC   ++    + AG+ G GR   SLP
Sbjct: 23  YGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLP 82

Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
            Q   K    F++C  +R       S    L+ GPGS  + +  LS TP   +  G +  
Sbjct: 83  VQTYDKYGGVFAHCFPARS------SGTGYLEFGPGSSPAVSAKLSTTPMLID-TGPT-- 133

Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
              FYYVG+  I VG K + IP S          G IVDSG+  T +    + ++   F 
Sbjct: 134 ---FYYVGMTGIRVGGKLLPIPQSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSAFA 185

Query: 374 RQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
             M    Y RA  +   S L  C+D++G   V +P + L F+GG  + +           
Sbjct: 186 ASMAARGYKRAPAL---SLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAASV 242

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              CL    + AA          I+G+ QL+ F + +D+A+   GF    C
Sbjct: 243 SQACLGFAGNEAA------DDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|24796804|gb|AAN64480.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 161

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/140 (37%), Positives = 80/140 (57%), Gaps = 3/140 (2%)

Query: 197 LACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESL 255
           LA  +  + Y  G T  LL+S+TLR P +T+ NF+ GCS++S  +Q +G+ GF     S+
Sbjct: 3   LAADAIGVVYSSGSTTRLLISDTLRTPGRTIRNFVVGCSLMSVYQQSSGLTGFSCGVPSV 62

Query: 256 PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG 315
           PSQLGL KF Y LL+R+FDD   +S+ ++  G G  D     + Y P  ++   +     
Sbjct: 63  PSQLGLTKFFYFLLARRFDDNATASDELILGGAGGKDDNVR-MQYIPLARS-ASTRPLCS 120

Query: 316 EFYYVGLRQIIVGSKHVKIP 335
            +YY+ L  I V  K V++P
Sbjct: 121 VYYYLALIAITVRRKSVQLP 140


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 108/390 (27%), Positives = 158/390 (40%), Gaps = 57/390 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  GTP Q     + DT +   + PC+    C D  F            PK S+
Sbjct: 98  GNYVVRVKLGTPGQLLF-MVLDTSTDEAFVPCSGCTGCSDTTFS-----------PKAST 145

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           S   + C  P+C  + G +  +   G    N+       SY    G  F+A  L+ ++LR
Sbjct: 146 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQ-------SYA---GSSFSA-TLVQDSLR 194

Query: 222 FPSKTVPNFLAGC--SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
             +  +PN+  GC  +I     PA      G       S+S  +  G+  FSYCL S  F
Sbjct: 195 LATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGI--FSYCLPS--F 250

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
                S +L L  GP  G  K+  +  TP  ++P   S      YYV    I VG   V 
Sbjct: 251 KSYYFSGSLKL--GP-VGQPKS--IRTTPLLRSPHRPS-----LYYVNFTGISVGRVLVP 300

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
            P  YL    +   G I+DSG+  T    P++ AV +EF +Q+G                
Sbjct: 301 FPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVG----GTTFTSIGAFDT 356

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
           CF          P + L F+ G  + LP EN         + CL +    AA P      
Sbjct: 357 CF--VKTYETLAPPITLHFE-GLDLKLPLENSLIHSSAGSLACLAM----AAAPDNVNSV 409

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             ++ +FQ QN  + FD  N++ G A++ C
Sbjct: 410 LNVIANFQQQNLRILFDTVNNKVGIAREVC 439


>gi|356576537|ref|XP_003556387.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 438

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 106/404 (26%), Positives = 175/404 (43%), Gaps = 79/404 (19%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P  +     D G   +W  C   Y                      SS+S+   C + +C
Sbjct: 55  PLVAVKLTVDLGGGYLWVNCEKGYV---------------------SSTSRPARCGSAQC 93

Query: 174 SW--IFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL-RFPSK--TVP 228
           S   ++G + E +  G SP N    ++       YG      + ++ T    P+K  +VP
Sbjct: 94  SLFGLYGCSTEDKICGRSPSNTVTGVS------TYGDIHADVVAVNSTDGNNPTKVVSVP 147

Query: 229 NFL--AGCSILSDRQPAGI---AGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
            FL   G +++     +G+   AG GR+  SLPSQ        +KF+ CL S    +   
Sbjct: 148 KFLFICGSNVVQKGLASGVTGMAGLGRTKVSLPSQFASAFSFHRKFAICLSSSTMTNGV- 206

Query: 279 SSNLVLDTGP------GSGDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQIIV 327
              +    GP       S  SK   L++TP   NPV ++ ++  GE    Y++G++ I V
Sbjct: 207 ---MFFGDGPYNFGYLNSDLSKV--LTFTPLISNPVSTAPSYFQGEPSVEYFIGVKSIKV 261

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
             K+V +  + L    +G GG  + + + +T ME  +++AV++ F++++G    A  V  
Sbjct: 262 SDKNVALNTTLLSIDRNGIGGTKISTVNPYTVMETTIYKAVSEVFVKEVG----APTVAP 317

Query: 388 KSGLRPCF---DI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
            +    CF   DI S +    +P + L  +      +   N    V N+V+CL  F D  
Sbjct: 318 VAPFGTCFATKDIGSTRMGPAVPGIDLVLQNDVVWTIIGANSMVYV-NDVICL-GFVDAG 375

Query: 444 AGPAL--------GRGP--AIILGDFQLQNFYLEFDLANDRFGF 477
           + P++        G  P  +I +G  QL+N  L+FDLA  R GF
Sbjct: 376 SSPSVAQVGFVAGGSHPRTSITIGAHQLENNLLQFDLATSRLGF 419


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/402 (25%), Positives = 164/402 (40%), Gaps = 65/402 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y   +  GTP +     + DTGS ++W  C S  RC   +   ++   +  + PK SS+ 
Sbjct: 4   YYTEIGIGTPTKRYYVQV-DTGSDILWVNCISCDRCPRKSGLGLE---LTLYDPKDSSTG 59

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             + C    C+  +G        GC     T  L C  Y + YG G  T G  +S+ L+F
Sbjct: 60  SKVSCDQGFCAATYG----GLLPGC-----TTSLPC-EYSVTYGDGSSTTGYFVSDLLQF 109

Query: 223 ----------PSKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL-----K 262
                     P+ +   F  G     D     +   GI GFG+S+ S+ SQL       K
Sbjct: 110 DQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKK 169

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
            F++CL +          N+V            P +  TP   N           Y V L
Sbjct: 170 IFAHCLDTINGGGIFAIGNVV-----------QPKVKTTPLVPNM--------PHYNVNL 210

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           + I VG   +K+P      G     G I+DSG+T T++     E V KE +  +  +++ 
Sbjct: 211 KSIDVGGTALKLPSHMFDTGE--KKGTIIDSGTTLTYLP----EIVYKEIM--LAVFAKH 262

Query: 383 ADVEKKSGLR-PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
            D+   +     CF   G+     P++   F+    + + P +YF   G+ + C + F +
Sbjct: 263 KDITFHNVQEFLCFQYVGRVDDDFPKITFHFENDLPLNVYPHDYFFENGDNLYC-VGFQN 321

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                  G+G  ++LGD  L N  + +DL N   G+ +  C+
Sbjct: 322 GGLQSKDGKG-MVLLGDLVLSNKLVVYDLENQVIGWTEYNCS 362


>gi|225436984|ref|XP_002272235.1| PREDICTED: basic 7S globulin [Vitis vinifera]
          Length = 436

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/400 (25%), Positives = 162/400 (40%), Gaps = 70/400 (17%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P      + D G+  +W  C   Y                ++ P R  S+Q    +   C
Sbjct: 53  PLVPVKLVVDLGAQFLWVDCEQNYVS-------------SSYRPARCRSAQCSLARANGC 99

Query: 174 SWIFG---PNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
              F    P   +   G  P N     A    L      +Q   G   G ++S + +F  
Sbjct: 100 GDCFSAPRPGCNNNTCGVLPDNTVTRTATSGELAEDFVSVQSTDGSNPGRVVSVS-KFLF 158

Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
              P FL  G +        G+AG GR+  + PSQ        +KF+ CL S        
Sbjct: 159 SCAPTFLLEGLA----SSAMGMAGLGRTRIAFPSQFASAFSFHRKFATCLSSS------T 208

Query: 279 SSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVG 328
           ++N V+  G G          +  L YTP Y NPV ++SA+ +      Y++ ++ I + 
Sbjct: 209 TANGVVFFGDGPYRLLPNIDASQSLIYTPLYINPVSTASAYTQGEPSAEYFIRVKSIRIN 268

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVE 386
            K + +  S L   S+G GG  + + + +T ME  +++A  K FI      N +R A V 
Sbjct: 269 EKAISLNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKAFTKAFISAAAAINITRVAAVA 328

Query: 387 KKSGLRPCFDISGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLIL 438
                  CF     K+VY       +P + L  +  +    +   N    V ++VLCL  
Sbjct: 329 P---FNVCFS---SKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCL-G 381

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           F D  A P      +I++G +QL++  L+FDLA  R GF+
Sbjct: 382 FVDGGANPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 417


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/404 (25%), Positives = 164/404 (40%), Gaps = 64/404 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP + S     DTGS ++W  C    +C  C   +     +  +    S 
Sbjct: 78  GLYYAKIGIGTPAK-SYYVQVDTGSDIMWVNCI---QCKQCPRRSTLGIELTLYNIDESD 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S +L+ C +  C  I G  + S CK     N +CP     YL  YG G  TAG  + + +
Sbjct: 134 SGKLVSCDDDFCYQISGGPL-SGCKA----NMSCP-----YLEIYGDGSSTAGYFVKDVV 183

Query: 221 RFPS-----KTVP---NFLAGC------SILSDRQPA--GIAGFGRSSESLPSQLG---- 260
           ++ S     KT     + + GC       + S  + A  GI GFG+++ S+ SQL     
Sbjct: 184 QYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR 243

Query: 261 -LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
             K F++CL  R          +V            P ++ TP   N           Y 
Sbjct: 244 VKKIFAHCLDGRNGGGIFAIGRVV-----------QPKVNMTPLVPNQ--------PHYN 284

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           V +  + VG + + IP     PG     G I+DSG+T  ++   ++E + K+   Q    
Sbjct: 285 VNMTAVQVGQEFLTIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPLVKKITSQ---- 338

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
             A  V        CF  SG+     P +   F+    + + P +Y  L  +E +  I +
Sbjct: 339 EPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDY--LFPHEGMWCIGW 396

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             N+A  +  R    +LGD  L N  + +DL N   G+ +  C+
Sbjct: 397 -QNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 150/361 (41%), Gaps = 64/361 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP      I DTGS ++W  C S   C  C   +    ++  F P  SS
Sbjct: 23  GLYYTKVQLGTPPVEFNVQI-DTGSDVLWVSCNS---CSGCPQTSGLQIQLNFFDPGSSS 78

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           +S +I C + +C+      ++S    CS +N  C     SY  QYG G  T+G  +S+ +
Sbjct: 79  TSSMIACSDQRCN----NGIQSSDATCSSQNNQC-----SYTFQYGDGSGTSGYYVSDMM 129

Query: 221 R----FPSKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL- 261
                F      N  A    GCS         SDR   GI GFG+   S+ SQL   G+ 
Sbjct: 130 HLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIA 189

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYY 319
            + FS+CL      D+     LVL      G+   P + YT      P          Y 
Sbjct: 190 PRVFSHCLKG----DSSGGGILVL------GEIVEPNIVYTSLVPAQP---------HYN 230

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           + L+ I V  + ++I  S  V  +  + G IVDSG+T  ++    ++         +   
Sbjct: 231 LNLQSIAVNGQTLQIDSS--VFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQS 288

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLC 435
              A     S    C+ I+   +   P++ L F GGA M L P++Y     ++ G  V C
Sbjct: 289 VHTA----VSRGNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWC 344

Query: 436 L 436
           +
Sbjct: 345 I 345


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 108/248 (43%), Gaps = 33/248 (13%)

Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
           P+G  G GR+  SL +Q+ L +FSYCL      D   +S L L    G+      G ++T
Sbjct: 148 PSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL----GASAKLAGGGAWT 200

Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
           PF K     +    ++Y + L +I  G   + +P         G   V+V +      + 
Sbjct: 201 PFVKT--SPNDGMSQYYPIELEEIKAGDATITMPR--------GRNTVLVQTAVVRVSL- 249

Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSG--LRPCFDISGKKSVYLPELILKFKGGAKMA 419
             L ++V +EF + +     AA      G     CF  +G      P+L+  F+ GA + 
Sbjct: 250 --LVDSVYQEFKKAVMASVGAAPTATPVGAPFEVCFPKAGVSGA--PDLVFTFQAGAALT 305

Query: 420 LPPENYFALVGNEVLCL----ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
           +PP NY   VGN+ +CL    I   +  A   L      ILG FQ +N +L FDL  D  
Sbjct: 306 VPPANYLFDVGNDTVCLSVMSIALLNITALDGLN-----ILGSFQQENVHLLFDLDKDML 360

Query: 476 GFAKQKCA 483
            F    C+
Sbjct: 361 SFEPADCS 368


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 114/405 (28%), Positives = 162/405 (40%), Gaps = 62/405 (15%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVD 148
           I + LS+ S G Y   +  G P Q S     DTGS + W    PC+S Y  VD       
Sbjct: 1   ISSGLSLGS-GEYFARMGIGNP-QRSYYLELDTGSDVTWIQCAPCSSCYSQVD------- 51

Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG- 207
               P + P  SSS + + C +  C  +      S C+G         + C SY + YG 
Sbjct: 52  ----PIYDPSNSSSYRRVYCGSALCQAL----DYSACQG---------MGC-SYRVVYGD 93

Query: 208 LGFTAGLLLSETLRF---PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL 261
              ++G L  E+       S  + N   GC   +    R  AG+ G G  + S  SQ+  
Sbjct: 94  SSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAA 153

Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEF 317
                FSYCL+ R       SS L+       G +  P    +TP  KNP         F
Sbjct: 154 SIGPAFSYCLVDRYSQLQSRSSPLIF------GRTAIPFAARFTPLLKNP-----RINTF 202

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           YY  L  I VG   + IP +      +G GG I+DSG++ T +  P +  +   +     
Sbjct: 203 YYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTRVVPPAYAVLRDAYRAASR 262

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
           N   A  V     L  CF+  G  +V +P L+L F  G  M LP  N    V       +
Sbjct: 263 NLPPAPGVYL---LDTCFNFQGLPTVQIPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCL 319

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            F  ++        P  ++G+ Q Q F + FDL       A ++C
Sbjct: 320 AFAPSSM-------PISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357


>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
          Length = 492

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 138/342 (40%), Gaps = 51/342 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +S S GTPPQ  T  + D  S  VW  C++   C  C       +  P F      
Sbjct: 95  GMYVLSFSVGTPPQVVTG-VLDITSDFVWMQCSA---CATCGADAPAATSAPPF------ 144

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSE 218
                        + F    ++R    +P    C      Y   YG G    TAGLL  +
Sbjct: 145 -------------YAFLSFHDTR----APTTPPC-----GYSYVYGGGAANTTAGLLAVD 182

Query: 219 TLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
              F +      + GC++ ++    G+ G GR   S  SQL + +FSY L     DDA  
Sbjct: 183 AFAFATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVD 239

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
             + +L       D   P  S       P+ +S A    YYV L  I V  + + IP   
Sbjct: 240 VGSFILFL-----DDAKPRTSRA--VSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGT 292

Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
               +DG+GGV++      TF++   ++ V +    ++    RAAD   + GL  C+   
Sbjct: 293 FDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI--ELRAAD-GSELGLDLCYTSE 349

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFAL---VGNEVLCLI 437
              +  +P + L F GGA M L   NYF +    G E L ++
Sbjct: 350 SLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTIL 391


>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
 gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
          Length = 500

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 117/466 (25%), Positives = 182/466 (39%), Gaps = 103/466 (22%)

Query: 56  LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSL-IKTPLSVHSYGGYSISLSFGTPP 114
           LH L+ ++ S  +HL +   P TKDS     Y   +  +TPL                  
Sbjct: 21  LHQLSLNNHSDPKHLFS---PVTKDSATTLQYIAQINQRTPL------------------ 59

Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
                 + D G   +W  C + Y                 + P R  S+Q    ++  C 
Sbjct: 60  -VPLNLVVDLGGKFLWVDCENHYTS-------------STYRPVRCPSAQCSLAKSDSC- 104

Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-------- 226
              G    S   GC   N TC L  P   + +    T G L  + L   S +        
Sbjct: 105 ---GDCFSSPKPGC---NNTCGLI-PDNTITHSA--TRGDLAEDVLSIQSTSGFNTGQNV 155

Query: 227 -VPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK--- 272
            V  FL  C+  S  +      +G+AG GR+  +LPSQL       +KF++C  S     
Sbjct: 156 VVSRFLFSCAPTSLLRGLAGGASGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSDGVI 215

Query: 273 -FDDAPVS--------SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF--GEF---Y 318
            F D P S         N+V D+           L+YTP   N V ++SAF  GE    Y
Sbjct: 216 IFGDGPYSFLADNPSLPNVVFDS---------KSLTYTPLLINHVSTASAFLQGESSVEY 266

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
           ++G++ I +  K V +  S L   + G GG  + +   +T +E  +++AV   F++   +
Sbjct: 267 FIGVKTIKIDGKVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVK--AS 324

Query: 379 YSRAADVEKKS-GLRPCFDISGKKSVYL----PELILKFKGGAKMALPPENYFALVGNEV 433
            +R    E  S     C+         L    P + L  +     ++   N    + +EV
Sbjct: 325 VARNITTEDSSPPFEFCYSFDNLPGTPLGASVPTIELLLQNNVIWSMFGANSMVNINDEV 384

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
           LCL           +    +I++G +QL+N  L+FDLA  R GF+ 
Sbjct: 385 LCL-----GFVNGGVNLRTSIVIGGYQLENNLLQFDLAASRLGFSN 425


>gi|388509650|gb|AFK42891.1| unknown [Lotus japonicus]
          Length = 347

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 90/288 (31%), Positives = 136/288 (47%), Gaps = 52/288 (18%)

Query: 226 TVPNFL--AGCSILSD---RQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK--- 272
           +VPNFL   G  ++ +   +   G+AG GR+  SLPSQ        +KF+ CL +     
Sbjct: 57  SVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGAD 116

Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS-SAF-GE---FYYVGLR 323
               F D P   NL  D       SK   L+YTP   NPV ++ SAF GE    Y++G++
Sbjct: 117 GVMFFGDGPY--NLNQDV------SKV--LTYTPLITNPVSTAPSAFLGEPSVEYFIGVK 166

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I V  K+V +  + L    +G GG  + + + +T ME  +++AVA  F++ +G    A 
Sbjct: 167 SIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLG----AP 222

Query: 384 DVEKKSGLRPCF---DIS-GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            V   +    CF   DIS  +    +P + L  + G +  +   N      ++V+CL  F
Sbjct: 223 TVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICL-GF 280

Query: 440 TDNAAGPAL--------GRGP--AIILGDFQLQNFYLEFDLANDRFGF 477
            D  + P          G  P  +I +G  QL+N  L+FDLA  R GF
Sbjct: 281 VDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 328


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 104/406 (25%), Positives = 166/406 (40%), Gaps = 71/406 (17%)

Query: 104 YSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           Y   +  GTPP+   PF    DTGS ++W  C S  +C   +   +D   +  + PK SS
Sbjct: 87  YYTKIEIGTPPK---PFHVQVDTGSDILWVNCVSCDKCPTKSGLGID---LALYDPKGSS 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S   + C N  C+  +G     +  GC+   K C      Y  +YG G  TAG  +S++L
Sbjct: 141 SGSAVSCDNKFCAATYGSG--EKLPGCTA-GKPC-----EYRAEYGDGSSTAGSFVSDSL 192

Query: 221 RFPS--------KTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
           ++              N + GC          +++   GI GFG+S+ S  SQL      
Sbjct: 193 QYNQLSGNAQTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEV 252

Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
            K FS+CL + K              G G    G+   P +  TP   N           
Sbjct: 253 KKIFSHCLDTIK--------------GGGIFAIGEVVQPKVKSTPLLPNM--------SH 290

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y V L+ I V    +++P  ++   S+  G  I+DSG+T T++     E V K+ +  + 
Sbjct: 291 YNVNLQSIDVAGNALQLP-PHIFETSEKRG-TIIDSGTTLTYLP----ELVYKDILAAVF 344

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
              +        G   CF+ S       P++   F+    + + P +YF   G+ + CL 
Sbjct: 345 QKHQDITFRTIQGFL-CFEYSESVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCL- 402

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            F +    P   +   ++LGD  L N  + +DL     G+    C+
Sbjct: 403 GFQNGGFQPKDAK-DMVLLGDLVLSNKVVVYDLEKQVIGWTDYNCS 447


>gi|388508700|gb|AFK42416.1| unknown [Lotus japonicus]
          Length = 440

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 85/288 (29%), Positives = 134/288 (46%), Gaps = 52/288 (18%)

Query: 226 TVPNFL--AGCSILSD---RQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK--- 272
           +VPNFL   G  ++ +   +   G+AG GR+  SLPSQ        +KF+ CL +     
Sbjct: 150 SVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGAD 209

Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS-SAF-GE---FYYVGLR 323
               F D P + N             +  L+YTP   NPV ++ SAF GE    Y++G++
Sbjct: 210 GVMFFGDGPYNLN----------QDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVK 259

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            + V  K+V +  + L    +G GG  + + + +T ME  +++AVA  F++ +G    A 
Sbjct: 260 SVKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLG----AP 315

Query: 384 DVEKKSGLRPCF---DIS-GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            V   +    CF   DIS  +    +P + L  + G +  +   N      ++V+CL  F
Sbjct: 316 TVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICL-GF 373

Query: 440 TDNAAGPAL--------GRGP--AIILGDFQLQNFYLEFDLANDRFGF 477
            D  + P          G  P  +I +G  QL+N  L+FDLA  R GF
Sbjct: 374 VDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 103/413 (24%), Positives = 170/413 (41%), Gaps = 67/413 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     + DTGS ++W  C S  +C   +   +D   +  + PK SS
Sbjct: 85  GLYFTEIKLGTPPKRYYVQV-DTGSDILWVNCISCSKCPRKSGLGLD---LTFYDPKASS 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S   + C    C+  +G     +  GC+  N  C      Y + YG G  T G  +++ L
Sbjct: 141 SGSTVSCDQGFCAATYG----GKLPGCTA-NVPC-----EYSVMYGDGSSTTGFFITDAL 190

Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
           +F       +T P       GC          S++   GI GFG+++ S+ SQL      
Sbjct: 191 QFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKA 250

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF------ 314
            K F++CL + K        N+V            P   +  F+ + + +   F      
Sbjct: 251 KKIFAHCLDTIKGGGIFAIGNVV-----------QPKCYFVFFFAHGLLNIPLFLLVMIL 299

Query: 315 --GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
                Y V L+ I VG   +++P      G     G I+DSG+T T++   +F+ V    
Sbjct: 300 LSRPHYNVNLKSIDVGGTTLQLPAHVFETGE--KKGTIIDSGTTLTYLPELVFKQVMDVV 357

Query: 373 IRQMGNYSRAADVEKKSGLRP--CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
                 +S+  D+   + L+   CF  SG      P +   F+    + + P  YF   G
Sbjct: 358 ------FSKHRDIAFHN-LQDFLCFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFPNG 410

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           N++ C + F + A     G+   +++GD  L N  + +DL N   G+    C+
Sbjct: 411 NDIYC-VGFQNGALQSKDGK-DIVLMGDLVLSNKLVVYDLENQVIGWTDYNCS 461


>gi|383143511|gb|AFG53183.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
          Length = 135

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 79/152 (51%), Gaps = 20/152 (13%)

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGL 322
           YCL     D    SS +V+      G+   PG   L+YTP   NP+     +  FYY+GL
Sbjct: 1   YCL-----DYVNNSSKIVV------GNKAVPGDISLTYTPLIINPI-----YPFFYYLGL 44

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             + +G K + +P++     S GNGG I+DSG++FT     ++  +A EF  Q+G Y R 
Sbjct: 45  EAVSIGRKRMNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRV 103

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
              E  +GL  C+++SG ++   P+    FKG
Sbjct: 104 PGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 110/409 (26%), Positives = 161/409 (39%), Gaps = 87/409 (21%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF--IPKR 159
           G Y IS S G PP      I DTGS ++W  C    +C +      DPS+   +  +P  
Sbjct: 84  GEYLISYSVGIPP-FQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFS 142

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           S++ Q +  ++  CS              S   K C      Y + YG G ++ G L  E
Sbjct: 143 STTCQSV--EDTSCS--------------SDNRKMCE-----YTIYYGDGSYSQGDLSVE 181

Query: 219 TLRFPSKTVPNF-----LAGC----SILSDRQPAGIAGFGRSSESLPSQLGL------KK 263
           TL   S    +      + GC    ++  + + +GI G G    SL +QL        +K
Sbjct: 182 TLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRK 241

Query: 264 FSYCLLSR-------KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
           FSYCL S         F DA V S        G G   TP +++ P              
Sbjct: 242 FSYCLASMSNISSKLNFGDAAVVS--------GDGTVSTPIVTHDP------------KV 281

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           FYY+ L    VG+  ++   S    G  GN  +I+DSG+T T +   ++  + +  +  +
Sbjct: 282 FYYLTLEAFSVGNNRIEFTSSSFRFGEKGN--IIIDSGTTLTLLPNDIYSKL-ESAVADL 338

Query: 377 GNYSRAADVEKKSGL--RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
               R  D  K+  L  R  FD      +  P ++  F  GA + L   N F  V   V 
Sbjct: 339 VELDRVKDPLKQLSLCYRSTFD-----ELNAPVIMAHF-SGADVKLNAVNTFIEVEQGVT 392

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           CL  F  +  GP        I G+   QNF + +DL      F    C+
Sbjct: 393 CL-AFISSKIGP--------IFGNMAQQNFLVGYDLQKKIVSFKPTDCS 432


>gi|297843130|ref|XP_002889446.1| EDGP precursor [Arabidopsis lyrata subsp. lyrata]
 gi|297335288|gb|EFH65705.1| EDGP precursor [Arabidopsis lyrata subsp. lyrata]
          Length = 433

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 76/255 (29%), Positives = 121/255 (47%), Gaps = 33/255 (12%)

Query: 243 AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDS 293
            G+AG GR +  LPSQ        +KF+ CL S +    F + P    + L   PG    
Sbjct: 174 VGMAGMGRHNIGLPSQFAAAFSFNRKFAVCLTSGRGVAFFGNGPY---VFL---PGI--- 224

Query: 294 KTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNG 347
           +  GL  TP   NPV ++SAF +      Y++G+  I +  K V I P    +  S G G
Sbjct: 225 QISGLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIKIVEKTVPINPTLLKINASTGFG 284

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVY- 404
           G  + S + +T +E  ++ A   EF++Q    N +R A V+  S      ++   +  Y 
Sbjct: 285 GTKISSVNPYTVLESSIYNAFTSEFVKQAAARNITRVASVKPFSACFSTKNVGVTRLGYA 344

Query: 405 LPELILKFKGG-AKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
           +PE+ L          +   N    V ++V+CL  F D           ++++G FQL++
Sbjct: 345 VPEIQLVLHSNDVVWRIFGANSMVSVSDDVICL-GFVDGGVNAR----TSVVIGGFQLED 399

Query: 464 FYLEFDLANDRFGFA 478
             +EFDLA++RFGF+
Sbjct: 400 NLIEFDLASNRFGFS 414


>gi|383143501|gb|AFG53178.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143503|gb|AFG53179.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143507|gb|AFG53181.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143509|gb|AFG53182.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143517|gb|AFG53186.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143519|gb|AFG53187.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
          Length = 135

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 79/152 (51%), Gaps = 20/152 (13%)

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGL 322
           YCL     D    SS +V+      G+   PG   L+YTP   NP+     +  FYY+GL
Sbjct: 1   YCL-----DYVNNSSKIVV------GNKAVPGDISLTYTPLIINPI-----YPFFYYLGL 44

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             + +G K + +P++     S GNGG I+DSG++FT     ++  +A EF  Q+G Y R 
Sbjct: 45  EAVSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRV 103

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
              E  +GL  C+++SG ++   P+    FKG
Sbjct: 104 PGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 87/296 (29%), Positives = 121/296 (40%), Gaps = 65/296 (21%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y + L+ GTPP+       DTGS LVW  C     C D   P +DP+         SS+ 
Sbjct: 86  YLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAA--------SSTY 136

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             + C  P+C  +      + C G     ++C      Y+  YG    T G + ++   F
Sbjct: 137 AALPCGAPRCRAL----PFTSCGG-----RSC-----VYVYHYGDKSVTVGKIATDRFTF 182

Query: 223 PSKTVPN----------FLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
                 N             GC   +         GIAGFGR   SLPSQL    FSYC 
Sbjct: 183 GDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCF 242

Query: 269 LSRKFDDAPVSSNLVLDTGPG-------SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
            S  FD    SS + L   P        SG+ +T     TP +KNP   S      Y++ 
Sbjct: 243 TS-MFDSK--SSIVTLGGAPAALYSHAHSGEVRT-----TPLFKNPSQPS-----LYFLS 289

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           L+ I VG   + +P +            I+DSG++ T +   ++EAV  EF  Q+G
Sbjct: 290 LKGISVGKTRLPVPETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQVG 338


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 76/248 (30%), Positives = 114/248 (45%), Gaps = 23/248 (9%)

Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYC 267
           T+G L ++T  F +  VP  + GCS  S       +G+ G GR + SL SQL   KFSY 
Sbjct: 129 TSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQ 188

Query: 268 LLS-RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
           LL+    DD    S +        GD   P          P+ SS+ + +FYYV L  + 
Sbjct: 189 LLAPEATDDGSADSVIRF------GDDAVPKTKRG--RSTPLLSSTLYPDFYYVNLTGVR 240

Query: 327 V-GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSR 381
           V G++   IP       ++G GGVI+ S +  T++E   ++ V      ++G    N S 
Sbjct: 241 VDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSA 300

Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
           A +++       C++ S    V +P+L L F GGA M L   NYF +  +  L  +    
Sbjct: 301 ALELDL------CYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLP 354

Query: 442 NAAGPALG 449
           +  G  LG
Sbjct: 355 SQGGSVLG 362


>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 598

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 78/249 (31%), Positives = 112/249 (44%), Gaps = 35/249 (14%)

Query: 242 PAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKT 295
           P G+ GFG    S PSQ     G   FSYCL S K      SSN    L  GP     + 
Sbjct: 375 PQGLVGFGCGPLSFPSQNKDVYGFV-FSYCLPSYK------SSNFSSTLRLGPAGQPKR- 426

Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
             +  TP   NP   S      YYV +  I VG + + +P S L        G IVD+G+
Sbjct: 427 --IKMTPLLSNPHRPS-----LYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIVDAGT 479

Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
            FT +  P++ AV   F  ++    RA       G   C+++    ++ +P +   F G 
Sbjct: 480 MFTRLSAPVYAAVRDVFRSRV----RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGR 531

Query: 416 AKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLAND 473
             + LP EN      ++ + CL +    AAGP+ G    + +L   Q QN  + FD+AN 
Sbjct: 532 VSVTLPEENVVIRSSSDGIACLAM----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANG 587

Query: 474 RFGFAKQKC 482
           R GF+++ C
Sbjct: 588 RVGFSRELC 596


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 110/400 (27%), Positives = 173/400 (43%), Gaps = 72/400 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y ++L  GTPP      I DTGS L+W  C+    C         P   P F P +SS
Sbjct: 90  GEYLMTLYIGTPPVERLA-IADTGSDLIWVQCSPCQNCF--------PQDTPLFEPLKSS 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + +   C +  C+ +  P  + +C     +   C      Y   YG   FT G++ +ETL
Sbjct: 141 TFKAATCDSQPCTSV--PPSQRQCG----KVGQC-----IYSYSYGDKSFTVGVVGTETL 189

Query: 221 RFPS----KTV--PNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLK---KFS 265
            F S    +TV  P+ + GC + ++       +  G+ G G    SL SQLG +   KFS
Sbjct: 190 SFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFS 249

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
           YCLL       P SSN       GS     T G+  TP    P+     F  FY++ L  
Sbjct: 250 YCLL-------PFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPL-----FPSFYFLNLEA 297

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           + +G K        +VP    +G +I+DSG+  T++E   +       ++++ +   A D
Sbjct: 298 VTIGQK--------VVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVAS-LQEVLSVESAQD 348

Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNA 443
           +      + CF     + + +P +  +F  GA +AL P+N    L    +LCL +   + 
Sbjct: 349 LPFP--FKFCFPY---RDMTIPVIAFQFT-GASVALQPKNLLIKLQDRNMLCLAVVPSSL 402

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +G +       I G+    +F + +DL   +  FA   C 
Sbjct: 403 SGIS-------IFGNVAQFDFQVVYDLEGKKVSFAPTDCT 435


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 101/410 (24%), Positives = 161/410 (39%), Gaps = 80/410 (19%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPA 154
           V S G Y   +  G+PP+     + DTGS ++W  C    +C    + NF      R+  
Sbjct: 68  VDSVGLYFTKIKLGSPPKEYHVQV-DTGSDILWINCKPCPKCPTKTNLNF------RLSL 120

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
           F    SS+S+ +GC +  CS+I      S+   C P      L C  +++      + G 
Sbjct: 121 FDMNASSTSKKVGCDDDFCSFI------SQSDSCQP-----ALGCSYHIVYADESTSDGK 169

Query: 215 LLSETLRFPS-----KTVP---NFLAGCSILS-------DRQPAGIAGFGRSSESLPSQL 259
            + + L         KT P     + GC           D    G+ GFG+S+ S+ SQL
Sbjct: 170 FIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQL 229

Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG---SGDSKTPGLSYTPFYKNPVGSS 311
                  + FS+CL + K              G G    G   +P +  TP   N +   
Sbjct: 230 AATGDAKRVFSHCLDNVK--------------GGGIFAVGVVDSPKVKTTPMVPNQM--- 272

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                 Y V L  + V    + +P S +      NGG IVDSG+T  +    L++++ + 
Sbjct: 273 -----HYNVMLMGMDVDGTSLDLPRSIV-----RNGGTIVDSGTTLAYFPKVLYDSLIET 322

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
            + +     +   +        CF  S       P +  +F+   K+ + P +Y   +  
Sbjct: 323 ILAR-----QPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEE 377

Query: 432 EVLCLILFTDNAAGPALG-RGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
           E+ C   F   A G     R   I+LGD  L N  + +DL N+  G+A  
Sbjct: 378 ELYC---FGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADH 424


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 103/404 (25%), Positives = 164/404 (40%), Gaps = 64/404 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP ++    + DTGS ++W  C    +C  C   +     +  +    S 
Sbjct: 78  GLYYAKIGIGTPAKSYYVQV-DTGSDIMWVNCI---QCKQCPRRSTLGIELTLYNIDESD 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S +L+ C +  C  I G  + S CK     N +CP     YL  YG G  TAG  + + +
Sbjct: 134 SGKLVSCDDDFCYQISGGPL-SGCKA----NMSCP-----YLEIYGDGSSTAGYFVKDVV 183

Query: 221 RFPS-----KTVP---NFLAGC------SILSDRQPA--GIAGFGRSSESLPSQLG---- 260
           ++ S     KT     + + GC       + S  + A  GI GFG+++ S+ SQL     
Sbjct: 184 QYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR 243

Query: 261 -LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
             K F++CL  R          +V            P ++ TP   N           Y 
Sbjct: 244 VKKIFAHCLDGRNGGGIFAIGRVV-----------QPKVNMTPLVPNQ--------PHYN 284

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           V +  + VG + + IP     PG     G I+DSG+T  ++   ++E + K+   Q    
Sbjct: 285 VNMTAVQVGQEFLNIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPLVKKITSQ---- 338

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
             A  V        CF  SG+     P +   F+    + + P +Y  L   E +  I +
Sbjct: 339 EPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDY--LFPYEGMWCIGW 396

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             N+A  +  R    +LGD  L N  + +DL N   G+ +  C+
Sbjct: 397 -QNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439


>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
          Length = 274

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 107/248 (43%), Gaps = 77/248 (31%)

Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD-------SKTP 296
           GIAGFGR   SLPSQL +  FSYC  S  FD     S+ V+  G  + +       + T 
Sbjct: 88  GIAGFGRGRWSLPSQLNVTSFSYCFTS-MFD---TKSSSVVTLGAAAAELLHTHHAAHTG 143

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
            +  T   KNP   S      Y+V LR I VG   V +P S L          I+DSG++
Sbjct: 144 DVRTTRLIKNPSQPS-----LYFVPLRGISVGGARVAVPESRL------RSSTIIDSGAS 192

Query: 357 FTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
            T +   ++EAV  EF+ Q+  GNY                       V+          
Sbjct: 193 ITTLPEDVYEAVKAEFVSQLPRGNY-----------------------VF---------- 219

Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
                   E+Y A     VLC++L  D AA      G  +++G++Q QN ++ +DL ND 
Sbjct: 220 --------EDYAA----RVLCVVL--DAAA------GEQVVIGNYQQQNTHVVYDLENDV 259

Query: 475 FGFAKQKC 482
             FA  +C
Sbjct: 260 LSFAPARC 267


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 104/404 (25%), Positives = 160/404 (39%), Gaps = 72/404 (17%)

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
           HSY  +  +L  GTP + +   I DTGS++ + PC     C  C     +      F P 
Sbjct: 10  HSY--FYTTLKLGTP-ERTFSVIIDTGSTITYIPCKD---CSHCGKHTAE-----WFDPD 58

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
           +S++++ + C +P C+             C   + TC      Y   Y     + G ++ 
Sbjct: 59  KSTTAKKLACGDPLCN-------------CGTPSCTCNNDRCYYSRTYAERSSSEGWMIE 105

Query: 218 ETLRFPSKTVPNFLA-GCSILSD----RQPA-GIAGFGRSSESLPSQLGLKKFSYCLLSR 271
           +T  FP    P  L  GC         RQ A GI G G +  +  SQL  +K    + S 
Sbjct: 106 DTFGFPDSDSPVRLVFGCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSL 165

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
            F   P    L+L      GD   P      YTP        +     +Y V +  I V 
Sbjct: 166 CFG-YPKDGILLL------GDVTLPEGANTVYTPLL------THLHLHYYNVKMDGITVN 212

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
            + +    S      D   G ++DSG+TFT++    F+A+AK     +G+Y     ++  
Sbjct: 213 GQTLAFDASVF----DRGYGTVLDSGTTFTYLPTDAFKAMAK----AVGDYVEKKGLQST 264

Query: 389 SGLRPCF-DISGKKS--------VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            G  P + DI  K +         Y P     F GGAK+ LPP  Y  L      CL +F
Sbjct: 265 PGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTLPPLRYLFLSKPAEYCLGIF 324

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +  +G         ++G   +++  + +D  N + GF    CA
Sbjct: 325 DNGNSG--------ALVGGVSVRDVVVTYDRRNSKVGFTTMACA 360


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 107/415 (25%), Positives = 165/415 (39%), Gaps = 67/415 (16%)

Query: 88  SNSLIKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
           S+S +  P+S  +Y G   Y + +  GTP Q  T  + DTGS L W  C           
Sbjct: 72  SSSAVSLPMSSGAYAGTGQYFVKVLVGTPAQEFT-LVADTGSELTWVKCAG--------- 121

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
               P  +  F P+ S S   + C +  C      +V      CS     C     SY  
Sbjct: 122 -GASPPGL-VFRPEASKSWAPVPCSSDTCKL----DVPFSLANCSSSASPC-----SYDY 170

Query: 205 QYGLGFTAGLLL----SETLRFPSKTVP---NFLAGCSILSDRQP----AGIAGFGRSSE 253
           +Y  G    L +    S T+  P   V    + + GCS   D Q      G+   G +  
Sbjct: 171 RYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKI 230

Query: 254 SLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
           S  S+   +    FSYCL+      AP ++   L  GPG    +TP  + T  + +P   
Sbjct: 231 SFASRAAARFGGSFSYCLVDHL---APRNATGYLAFGPGQ-VPRTPA-TQTKLFLDPAM- 284

Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
                 FY V +  + V  + + IP     P S   GGVI+DSG+T T +  P ++AV  
Sbjct: 285 -----PFYGVKVDAVHVAGQALDIPAEVWDPKS---GGVILDSGTTLTVLATPAYKAVVA 336

Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKK--SVYLPELILKFKGGAKMALPPENYFAL 428
              + +    +  D         C++ +  +  +  +P+L ++F G A++  P ++Y   
Sbjct: 337 ALTKLLAGVPK-VDFPP---FEHCYNWTAPRPGAPEIPKLAVQFTGCARLEPPAKSYVID 392

Query: 429 VGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           V   V C+        G   G  P + ++G+   Q    EFDL N    F    C
Sbjct: 393 VKPGVKCI--------GLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 106/462 (22%), Positives = 173/462 (37%), Gaps = 73/462 (15%)

Query: 52  PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY---GGYSISL 108
           P   L   A   L R  +++++    ++     +    S    PLS  +Y   G Y +  
Sbjct: 47  PGASLSDRARDDLHRHAYIRSQLA-SSRRGRRAAEVGASAFAMPLSSGAYTGTGQYFVRF 105

Query: 109 SFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
             GTP Q   PF+   DTGS L W  C  R            P+R+  F    S S   I
Sbjct: 106 RVGTPAQ---PFVLVADTGSDLTWVKC--RGAGAAAGTGAGSPARV--FRTAASKSWAPI 158

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP-- 223
            C +  C+      V      CS     C     +Y  +Y  G  A G++ +++      
Sbjct: 159 ACSSDTCT----SYVPFSLANCSSPASPC-----AYDYRYRDGSAARGVVGTDSATIALS 209

Query: 224 --------------SKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLK--- 262
                            +   + GC+   D Q      G+   G S+ S  S+   +   
Sbjct: 210 SGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGG 269

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           +FSYCL+      AP ++   L  GPG+           P  + P+        FY V +
Sbjct: 270 RFSYCLVDHL---APRNATSYLTFGPGA---------TAPAAQTPLLLDRRMTPFYAVTV 317

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             + V  + + IP    V   D NGG I+DSG++ T +  P + AV     + +    R 
Sbjct: 318 DAVYVAGEALDIPAD--VWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRV 375

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
                      C++ +   ++ +P++ + F G A++  P ++Y       V C+      
Sbjct: 376 ----TMDPFEYCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCI------ 425

Query: 443 AAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             G   G  P + ++G+   Q    EFDL +    F   +CA
Sbjct: 426 --GVQEGSWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRCA 465


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 113/435 (25%), Positives = 182/435 (41%), Gaps = 75/435 (17%)

Query: 69  HLKTKT-KPKTKDSNIGSNYSNSLIKTPLSVH---SYGGYSISLSFGTPPQASTPFIFDT 124
            L+ K+   +  + N GS++       P+        G Y + ++ GTP + S     DT
Sbjct: 6   QLRVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTP-KLSLSLALDT 64

Query: 125 GSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
           GS + W  C     CV   +          F P++SSS + + C +  C  I        
Sbjct: 65  GSDITWTQCEP---CVGSCYRQAQTK----FDPRKSSSYKNVSCSSSSCRII---TDSGG 114

Query: 185 CKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQP 242
            +GC   + TC      Y +QYG G ++ G   +E L   PS  + NFL GC     +Q 
Sbjct: 115 ARGCV--SSTCI-----YKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCG----QQN 163

Query: 243 AGIAGFGRSSESL------------PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
           AG   FGR +  L             S+     F+YCL S     +  + +L L      
Sbjct: 164 AG--RFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFS---SSSTGHLTL------ 212

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
           G      + +TP        S AF    FY + ++ + VG   + I  S        N G
Sbjct: 213 GGQVPKSVKFTPL-------SPAFKNTPFYGIDIKGLSVGGHVLPIDASVF-----SNAG 260

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
            I+DSG+  T ++  ++ A++ +F + M +Y +    +  S L  C+D SG +S+ +P +
Sbjct: 261 AIIDSGTVITRLQPTVYSALSSKFQQLMKDYPK---TDGFSILDTCYDFSGNESISVPRI 317

Query: 409 ILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
              FKGG ++ +       ++   + +CL      A  P    G  ++ G+ Q Q + + 
Sbjct: 318 SFFFKGGVEVDIKFFGILTVINAWDKVCL------AFAPNDDDGDFVVFGNSQQQTYDVV 371

Query: 468 FDLANDRFGFAKQKC 482
            DLA  R GFA   C
Sbjct: 372 HDLAKGRIGFAPSGC 386


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 150/357 (42%), Gaps = 54/357 (15%)

Query: 92  IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
           +  P+ + S G Y  + + GTPPQ  +  +  TG  LVW  CT    C + + P  DP++
Sbjct: 45  VAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGE-LVWTQCTPCQPCFEQDLPLFDPTK 103

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
                   SS+ + + C +  C  I  P     C      +  C    P+       G T
Sbjct: 104 --------SSTFRGLPCGSHLCESI--PESSRNCT-----SDVCIYEAPTKA-----GDT 143

Query: 212 AGLLLSETLRF-PSKTVPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLKKF 264
            G   ++T     +K    F  GC +++D++      P+GI G GR+  SL +Q+ +  F
Sbjct: 144 GGKAGTDTFAIGAAKETLGF--GCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAF 201

Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLR 323
           SYCL  +        S+  L  G  +        S TPF  K   GSS      YY    
Sbjct: 202 SYCLAGK--------SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYY---- 249

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
             +V    +K   + L   S     V++D+ S  +++    ++A+ K     +G    A+
Sbjct: 250 --MVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVAS 307

Query: 384 DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
             +        +D+   K+V    PEL+  F GGA + +PP NY    GN  +CL +
Sbjct: 308 PPKP-------YDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTI 357


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 160/403 (39%), Gaps = 63/403 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS ++W  C S   C  C   +     +  + P+ S 
Sbjct: 88  GLYFTRIGIGTPAKRYYVQV-DTGSDILWVNCVS---CDGCPRKSNLGIELTMYDPRGSQ 143

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S +L+ C    C   +G  V   C   SP        C  Y + YG G  TAG  +++ L
Sbjct: 144 SGELVTCDQQFCVANYG-GVLPSCTSTSP--------C-EYSISYGDGSSTAGFFVTDFL 193

Query: 221 RF----------PSKTVPNFLAGCSILSDRQPA-----GIAGFGRSSESLPSQLGL---- 261
           ++          P+    +F  G  +  D   +     GI GFG+S+ S+ SQL      
Sbjct: 194 QYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKV 253

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K F++CL +          N+V            P +  TP   +           Y V
Sbjct: 254 RKMFAHCLDTVNGGGIFAIGNVV-----------QPKVKTTPLVSDM--------PHYNV 294

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I VG   + +P +    G+  + G I+DSG+T  ++     E V K     + +  
Sbjct: 295 ILKGIDVGGTALGLPTNIFDSGN--SKGTIIDSGTTLAYVP----EGVYKALFAMVFDKH 348

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
           +   V+       CF  SG      PE+   F+G   + + P +Y    G  + C+  F 
Sbjct: 349 QDISVQTLQDF-SCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCM-GFQ 406

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +       G+   ++LGD  L N  + +DL N   G+A   C+
Sbjct: 407 NGGVQTKDGK-DMVLLGDLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 103/385 (26%), Positives = 153/385 (39%), Gaps = 56/385 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  S   GTPPQ  +  + D  S LVW  C +                   F P RS+
Sbjct: 98  GMYVFSYGIGTPPQQVSGAL-DISSDLVWTACGA----------------TAPFNPVRST 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           +   + C +  C   F P  ++   G    +  C     +Y+   G   T GLL +E   
Sbjct: 141 TVADVPCTDDACQQ-FAP--QTCGAGAGAGSSECAY---TYMYGGGAANTTGLLGTEAFT 194

Query: 222 FPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
           F    +   + GC + +       +G+ G GR + SL SQL + +FSY       DD+  
Sbjct: 195 FGDTRIDGVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVD 251

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY-S 337
           + + +L      GD  TP  S+T      + +S A    YYV L  I V  K + IP  +
Sbjct: 252 TQSFIL-----FGDDATPQTSHT--LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGT 304

Query: 338 YLVPGSDGNGGVIVDSGSTFTFME----GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           + +   DG+GGV +      T +E     PL +AVA +      N S         GL  
Sbjct: 305 FDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSAL-------GLDL 357

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGP 452
           C+         +P + L F GGA M L   NYF +     L CL +   +A       G 
Sbjct: 358 CYTGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSA-------GD 410

Query: 453 AIILGDFQLQNFYLEFDLANDRFGF 477
             +LG       ++ +D+   +  F
Sbjct: 411 GSVLGSLIQVGTHMMYDINGSKLVF 435


>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 430

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 100/432 (23%), Positives = 162/432 (37%), Gaps = 96/432 (22%)

Query: 63  SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIF 122
           +   ARH +    P     N       S++ + L       Y  ++  GTPP+     + 
Sbjct: 44  TFDSARHGRLLQSPVHGSFNWKVERDTSILLSAL-------YYTTVQIGTPPR-ELDVVI 95

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           DTGS LVW  C S   CV C   NV       F P  SSS+  + C + +CS       +
Sbjct: 96  DTGSDLVWVSCNS---CVGCPLHNV-----TFFDPGASSSAVKLACSDKRCS--SDLQKK 145

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ 241
           SRC           L   +Y ++YG G  T+G  +S+ + F + +   ++A         
Sbjct: 146 SRCS---------LLESCTYKVEYGDGSVTSGYYISDLISFDTMSDWTYIA--------- 187

Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
                 F  +S   P                     V    ++ T P    +    +S  
Sbjct: 188 ------FRDNSTWHPW--------------------VRQGAIIGTFPALCSTPCSTVSSQ 221

Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
           P Y NP  S              + V    +++P    V       G I+DSG+T     
Sbjct: 222 PLYYNPQFS------------HMMTVAVNDLRLPIDPSVFSVAKGYGTIIDSGTTLVHFP 269

Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL------PELILKFKGG 415
           G  ++ + +  +  +  Y R    E       CF+I+   S +L      PE+ L F GG
Sbjct: 270 GEAYDPLIQAILNVVSQYGRPIPYESFQ----CFNITSGISSHLVIADMFPEVHLGFAGG 325

Query: 416 AKMALPPENY----FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           A M + PE Y    F  + N + CL  ++  +           I+G+  +++    +DL 
Sbjct: 326 ASMVIKPEAYLFQKFLDLTNAIWCLGFYSSTSR-------RITIIGEVAIRDKMFVYDLD 378

Query: 472 NDRFGFAKQKCA 483
           + R G+A+  C+
Sbjct: 379 HQRIGWAEYNCS 390


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 104/385 (27%), Positives = 151/385 (39%), Gaps = 60/385 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y  S   GTPPQ  +  + D  S LVW  C +                   F P RS+
Sbjct: 98  GMYVFSYGIGTPPQQVSGAL-DISSDLVWTACGA----------------TAPFNPVRST 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           +   + C +  C   F P        C      C     +Y+   G   T GLL +E   
Sbjct: 141 TVADVPCTDDACQQ-FAPQT------CGAGASECAY---TYMYGGGAANTTGLLGTEAFT 190

Query: 222 FPSKTVPNFLAGCSI--LSD-RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
           F    +   + GC +  + D    +G+ G GR + SL SQL + +FSY       DD+  
Sbjct: 191 FGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVD 247

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY-S 337
           + + +L      GD  TP  S+T      + +S A    YYV L  I V  K + IP  +
Sbjct: 248 TQSFIL-----FGDDATPQTSHT--LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGT 300

Query: 338 YLVPGSDGNGGVIVDSGSTFTFME----GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           + +   DG+GGV +      T +E     PL +AVA +      N S         GL  
Sbjct: 301 FDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSAL-------GLDL 353

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGP 452
           C+         +P + L F GGA M L   NYF +     L CL +   +A       G 
Sbjct: 354 CYTGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSA-------GD 406

Query: 453 AIILGDFQLQNFYLEFDLANDRFGF 477
             +LG       ++ +D+   +  F
Sbjct: 407 GSVLGSLIQVGTHMMYDINGSKLVF 431


>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 436

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 165/363 (45%), Gaps = 71/363 (19%)

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
           SS+ + I C + +CS +FG +      GCS + K C  +   Y +  G+  T+G + S+ 
Sbjct: 81  SSTLKPILCSSSQCS-LFGSH------GCSDK-KICGRS--PYNIVTGVS-TSGDIQSDI 129

Query: 220 LRFPSK---------TVPNFL--AGCSILSD---RQPAGIAGFGRSSESLPSQLG----- 260
           +   S          +VPNFL   G +++ +   +   G+AG GR+  SLPSQ       
Sbjct: 130 VSVQSTNGNYSGRFVSVPNFLFICGSNVVQNGLAKGVKGMAGLGRTKVSLPSQFSSAFSF 189

Query: 261 LKKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS--SAF 314
             KF+ CL ++     F D P   N            ++  L YTP   NPV +S  S  
Sbjct: 190 KNKFAICLGTQNGVLFFGDGPYLFNF----------DESKNLIYTPLITNPVSTSPSSFL 239

Query: 315 GE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
           GE    Y++G++ I V SK+VK+  + L    +G GG  + + + +T ME  +++AVA  
Sbjct: 240 GEKSVEYFIGVKSIRVSSKNVKLNTTLLSIDQNGFGGTKISTVNPYTIMETSIYKAVADA 299

Query: 372 FIRQMGNYSRAADVEKKSGLRPCF---DISGKK---SVYLPELILKFKGGAKMALPPENY 425
           F++ +      + VE  +    CF    IS  +    V   +L+L+ +      +     
Sbjct: 300 FVKAL----NVSTVEPVAPFGTCFASQSISSSRMGPDVPSIDLVLQNENVVWNIIGANAM 355

Query: 426 FALVGNEVLCLILFTDNAAGPAL---------GRGP--AIILGDFQLQNFYLEFDLANDR 474
             +   +V+CL  F D  +  A          G  P  +I +G  QL+N  L+FDLA  R
Sbjct: 356 VRINDKDVICL-GFVDAGSDFAKTSQVGFVVGGSKPMTSITIGAHQLENNLLQFDLATSR 414

Query: 475 FGF 477
            GF
Sbjct: 415 LGF 417


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 109/391 (27%), Positives = 156/391 (39%), Gaps = 77/391 (19%)

Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
            GTPP      I DTGS L W  C    +C             P F P +S+S   + C 
Sbjct: 86  IGTPP-VDYLGIADTGSDLTWAQCLPCLKCYQ--------QLRPIFNPLKSTSFSHVPCN 136

Query: 170 NPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT 226
              C  +   +  V+  C                Y   YG   ++ G L  E +   S +
Sbjct: 137 TQTCHAVDDGHCGVQGVCD---------------YSYTYGDRTYSKGDLGFEKITIGSSS 181

Query: 227 VPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL-----KKFSYCL---LSRKFDD 275
           V + + GC   S       +G+ G G    SL SQ+       ++FSYCL   LS     
Sbjct: 182 VKSVI-GCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK 240

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
                N V+     SG    PG+  TP   KN V        +YY+ L  I +G++    
Sbjct: 241 INFGQNAVV-----SG----PGVVSTPLISKNTV-------TYYYITLEAISIGNER--- 281

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP- 393
              ++     GN  VI+DSG+T +F+   L++ V    ++ +    +A  V+        
Sbjct: 282 ---HMAFAKQGN--VIIDSGTTLSFLPKELYDGVVSSLLKVV----KAKRVKDPGNFWDL 332

Query: 394 CFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
           CFD  I+   S  +P +  +F GGA + L P N F  V N V CL L     A P    G
Sbjct: 333 CFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTL---TPASPTDEFG 389

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+G+  L NF + +DL   R  F    C
Sbjct: 390 ---IIGNLALANFLIGYDLEAKRLSFKPTVC 417


>gi|383143497|gb|AFG53176.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143499|gb|AFG53177.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143505|gb|AFG53180.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143513|gb|AFG53184.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
 gi|383143515|gb|AFG53185.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
          Length = 135

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 51/152 (33%), Positives = 79/152 (51%), Gaps = 20/152 (13%)

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGL 322
           YCL     D    SS +V+      G+   PG   L+YTP   NP+     +  FYY+GL
Sbjct: 1   YCL-----DYVNNSSKIVV------GNKAVPGDISLTYTPLIINPI-----YPFFYYLGL 44

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             + +G K + +P++     S GNGG I+DSG++FT     ++  +A EF  Q+G Y R 
Sbjct: 45  EAVSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRV 103

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
              E  + L  C+++SG +++  P+    FKG
Sbjct: 104 PGAESTTALGLCYNVSGVENIQFPQFAFHFKG 135


>gi|147857949|emb|CAN80378.1| hypothetical protein VITISV_038701 [Vitis vinifera]
          Length = 436

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 102/400 (25%), Positives = 161/400 (40%), Gaps = 70/400 (17%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P      + D G+  +W  C   Y                ++ P R  S+Q    +   C
Sbjct: 53  PLVPVKLVVDLGAQFLWVDCEQNYVS-------------SSYRPARCRSAQCSLARANGC 99

Query: 174 SWIFG---PNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
              F    P   +   G  P N     A    L      +Q   G   G ++S + +F  
Sbjct: 100 GDCFSAPRPGCNNNTCGVLPDNTVTRTATSGELAEDFVSVQSTDGSNPGRVVSVS-KFLF 158

Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
              P FL  G +        G+AG GR+  + PSQ        +KF+ CL S        
Sbjct: 159 SCAPTFLLEGLA----SSAMGMAGLGRTRIAFPSQFASAFSFHRKFATCLSSS------T 208

Query: 279 SSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVG 328
           ++N V+  G G          +  L YTP Y NPV ++SA+ +      Y++ ++ I + 
Sbjct: 209 TANGVVFFGDGPYRLLPNIDASQSLIYTPLYINPVSTASAYTQGEPSAEYFIRVKSIRIN 268

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVE 386
            K + +  S L   S+G GG  + + + +T ME  +++   K FI      N +R A V 
Sbjct: 269 EKAISLNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKXFTKAFISAAAAINITRVAAVA 328

Query: 387 KKSGLRPCFDISGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLIL 438
                  CF     K+VY       +P + L  +  +    +   N    V ++VLCL  
Sbjct: 329 P---FNVCFS---SKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCL-G 381

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           F D  A P      +I++G +QL++  L+FDLA  R GF+
Sbjct: 382 FVDGGANPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 417


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 112/427 (26%), Positives = 177/427 (41%), Gaps = 94/427 (22%)

Query: 86  NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
           N SN+ ++    +   G Y+  L  GTPPQ     I DTGS++ + PC++   C  C   
Sbjct: 65  NLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFA-LIVDTGSTVTYVPCST---CEQCG-- 118

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
                + P F P+ SS+ + I C N  C              C      C      Y  Q
Sbjct: 119 ---RHQDPKFDPESSSTYKPIKC-NIDCI-------------CDSDGVQC-----VYERQ 156

Query: 206 YG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILS-----DRQPAGIAGFGRSSESLP 256
           Y  +  ++G+L  + + F   S+ +P   + GC  +       ++  GI G G    SL 
Sbjct: 157 YAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLV 216

Query: 257 SQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNP 307
            QL  K      FS C                +D G G+    G S    + +T  Y +P
Sbjct: 217 DQLVEKGAINDSFSLCYGG-------------MDIGGGAMVLGGISPPSDMIFT--YSDP 261

Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
           V S      +Y V L++I V  K  K+P S  +   DG  G ++DSG+T+ ++    F A
Sbjct: 262 VRSP-----YYNVDLKEIHVAGK--KLPLSSGI--FDGRYGAVLDSGTTYAYLPAEAFSA 312

Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKM 418
                + ++ +      ++K  G  P     CF  +G  +  L    P + + F+ G K+
Sbjct: 313 FKDAIMDEIHS------LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366

Query: 419 ALPPENYFALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
           +L PENYF      +   CL +F +       G     +LG   ++N  + +D AN + G
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFEN-------GNDQTTLLGGIVVRNTLVMYDRANSKIG 419

Query: 477 FAKQKCA 483
           F K  C+
Sbjct: 420 FWKTNCS 426


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 112/427 (26%), Positives = 177/427 (41%), Gaps = 94/427 (22%)

Query: 86  NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
           N SN+ ++    +   G Y+  L  GTPPQ     I DTGS++ + PC++   C  C   
Sbjct: 65  NLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFA-LIVDTGSTVTYVPCST---CEQCG-- 118

Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
                + P F P+ SS+ + I C N  C              C      C      Y  Q
Sbjct: 119 ---RHQDPKFDPESSSTYKPIKC-NIDCI-------------CDSDGVQC-----VYERQ 156

Query: 206 YG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILS-----DRQPAGIAGFGRSSESLP 256
           Y  +  ++G+L  + + F   S+ +P   + GC  +       ++  GI G G    SL 
Sbjct: 157 YAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLV 216

Query: 257 SQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNP 307
            QL  K      FS C                +D G G+    G S    + +T  Y +P
Sbjct: 217 DQLVEKGAINDSFSLCYGG-------------MDIGGGAMVLGGISPPSDMIFT--YSDP 261

Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
           V S      +Y V L++I V  K  K+P S  +   DG  G ++DSG+T+ ++    F A
Sbjct: 262 VRSP-----YYNVDLKEIHVAGK--KLPLSSGI--FDGRYGAVLDSGTTYAYLPAEAFSA 312

Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKM 418
                + ++ +      ++K  G  P     CF  +G  +  L    P + + F+ G K+
Sbjct: 313 FKDAIMDEIHS------LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366

Query: 419 ALPPENYFALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
           +L PENYF      +   CL +F +       G     +LG   ++N  + +D AN + G
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFEN-------GNDQTTLLGGIVVRNTLVMYDRANSKIG 419

Query: 477 FAKQKCA 483
           F K  C+
Sbjct: 420 FWKTNCS 426


>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 537

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 78/249 (31%), Positives = 112/249 (44%), Gaps = 35/249 (14%)

Query: 242 PAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKT 295
           P G+ GFG    S PSQ     G   FSYCL S K      SSN    L  GP     + 
Sbjct: 314 PQGLVGFGCGPLSFPSQNKDVYGFV-FSYCLPSYK------SSNFSSTLRLGPAGQPKR- 365

Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
             +  TP   NP   S      YYV +  I VG + + +P S L        G IVD+G+
Sbjct: 366 --IKMTPLLSNPHRPS-----LYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIVDAGT 418

Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
            FT +  P++ AV   F  ++    RA       G   C+++    ++ +P +   F G 
Sbjct: 419 MFTRLSAPVYAAVRDVFRSRV----RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGR 470

Query: 416 AKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLAND 473
             + LP EN      ++ + CL +    AAGP+ G    + +L   Q QN  + FD+AN 
Sbjct: 471 VSVTLPEENVVIRSSSDGIACLAM----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANG 526

Query: 474 RFGFAKQKC 482
           R GF+++ C
Sbjct: 527 RVGFSRELC 535


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 121/459 (26%), Positives = 169/459 (36%), Gaps = 105/459 (22%)

Query: 53  LKILHSLASSSL----SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV--HSYGGYSI 106
           L+++H  +S S     ++ ++ +     +   + +   Y  SL  TP S      G Y +
Sbjct: 31  LELIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFYKYSLTSTPQSTVNSDKGEYLM 90

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           S S GTPP     F+ DTGS LVW  C    +C         P   P F P  SSS Q I
Sbjct: 91  SYSIGTPPFKVFGFV-DTGSDLVWLQCEPCKQCY--------PQITPIFDPSLSSSYQNI 141

Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
            C +  C  +       R   C  R                     G L  ETL   S T
Sbjct: 142 PCLSDTCHSM-------RTTSCDVR---------------------GYLSVETLTLDSTT 173

Query: 227 -----VPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLK---KFSYCL------ 268
                 P  + GC   +        +GI G G    SLPSQLG     KFSYCL      
Sbjct: 174 GYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPN 233

Query: 269 --LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
                 F DA +            GD    G   TP  K    S       YY+ L    
Sbjct: 234 STSKLNFGDAAIV----------YGD----GAMTTPIVKKDAQSG------YYLTLEAFS 273

Query: 327 VGSKHVKIPYSYLVPGSDGN-GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
           VG+K ++    +  P   GN G +++DSG+TFTF+       V   F   +  Y     V
Sbjct: 274 VGNKLIE----FGGPTYGGNEGNILIDSGTTFTFLP----YDVYYRFESAVAEYINLEHV 325

Query: 386 EKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
           E  +G  + C++++       P +   FK GA + L   + F  V + + CL       A
Sbjct: 326 EDPNGTFKLCYNVA-YHGFEAPLITAHFK-GADIKLYYISTFIKVSDGIACLAFIPSQTA 383

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                     I G+   QN  + ++L  +   F    C 
Sbjct: 384 ----------IFGNVAQQNLLVGYNLVQNTVTFKPVDCT 412


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 157/398 (39%), Gaps = 69/398 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +   +G P Q   P  FDT   +    C        C+         PAF P RSSS 
Sbjct: 88  YRVLAGYGAPAQ-RFPVAFDTNFGVSVLRCKPCVGGAPCD---------PAFEPSRSSSF 137

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             I C +P+C+          C G S     CP     + +Q+G +    G L+ +TL  
Sbjct: 138 AAIPCGSPECAV--------ECTGAS-----CP-----FTIQFGNVTVANGTLVRDTLTL 179

Query: 223 -PSKTVPNFLAGC-SILSDRQ----PAGIAGFGRSSESLPSQL-------GLKKFSYCLL 269
            PS T   F  GC  + +D        G+    RSS SL S++           FSYCL 
Sbjct: 180 PPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLP 239

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
           S     +  SS   L  G    +     + Y P   NP   +S     Y+V L  I VG 
Sbjct: 240 S----SSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS-----YFVELVGISVGG 290

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           + + +P     P      G ++++ + FTF+    + A+   F R M  Y  A       
Sbjct: 291 EDLPVP-----PAVFAAHGTLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRV-- 343

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN--YFA---LVGNEVLCLILFTDNAA 444
            L  C++++G  S+ +P + L+F GG ++ L      YFA    V + V CL        
Sbjct: 344 -LDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLP 402

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              +      ++G    ++  + +DL   R GF   +C
Sbjct: 403 AFPVS-----VIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 100/405 (24%), Positives = 162/405 (40%), Gaps = 64/405 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+P +     + DTGS ++W  C     C  C   +     +  + PKRS 
Sbjct: 67  GLYFTKIGLGSPSKDYYVQV-DTGSDILWVNCV---ECTRCPRKSDIGIGLTLYDPKRSK 122

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +S+ + C++  CS  +    E R  GC   N      CP Y + YG G  T G  + + L
Sbjct: 123 TSEFVSCEHNFCSSTY----EGRILGCKAEN-----PCP-YSISYGDGSATTGYYVQDYL 172

Query: 221 RF------PSKTVPN--FLAGCSIL--------SDRQPAGIAGFGRSSESLPSQLGL--- 261
            F      P     N   + GC           S+    GI GFG+++ S+ SQL     
Sbjct: 173 TFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGK 232

Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
             K FS+CL      D  V   +        G+   P +  TP   N           Y 
Sbjct: 233 VKKIFSHCL------DTNVGGGIF-----SIGEVVEPKVKTTPLVPNMA--------HYN 273

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           V L+ I V    +++P       S+   G ++DSG+T  ++   +++ +  + + +    
Sbjct: 274 VILKNIEVDGDILQLPSDTF--DSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRL 331

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLIL 438
                 E+ S    CF  +G      P + L F+    + + P +Y F   G+   C I 
Sbjct: 332 KVYLVEEQYS----CFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWC-IG 386

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +  +A+    G+    +LGDF L N  + +DL N   G+    C+
Sbjct: 387 WQKSASETKNGK-DMTLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 117/437 (26%), Positives = 177/437 (40%), Gaps = 72/437 (16%)

Query: 58  SLASSSLSRARHLKTKTKPKTKDSN-IGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
           ++AS    R ++L T    KT  +  I S  + ++          G Y + +  GTP Q 
Sbjct: 62  NMASKDPVRVKYLSTLVSQKTVSTAPIASGQAFNI----------GNYVVRVKLGTPGQL 111

Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
               + DT +   + PC+    C D  F            PK S+S   + C  P+C  +
Sbjct: 112 LF-MVLDTSTDEAFVPCSGCTGCSDTTFS-----------PKASTSYGPLDCSVPQCGQV 159

Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC-- 234
            G +  +   G    N+       SY    G  F+A  L+ + LR  +  +P +  GC  
Sbjct: 160 RGLSCPATGTGACSFNQ-------SYA---GSSFSA-TLVQDALRLATDVIPYYSFGCVN 208

Query: 235 SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP 288
           +I     PA      G       S+S  +  G+  FSYCL S  F     S +L L  GP
Sbjct: 209 AITGASVPAQGLLGLGRGPLSLLSQSGSNYSGI--FSYCLPS--FKSYYFSGSLKL--GP 262

Query: 289 GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
             G  K+  +  TP  ++P   S      YYV    I VG   V  P  YL    +   G
Sbjct: 263 -VGQPKS--IRTTPLLRSPHRPS-----LYYVNFTGISVGRVLVPFPSEYLGFNPNTGSG 314

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
            I+DSG+  T    P++ AV +EF +Q+G                CF          P +
Sbjct: 315 TIIDSGTVITRFVEPVYNAVREEFRKQVG----GTTFTSIGAFDTCF--VKTYETLAPPI 368

Query: 409 ILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
            L F+ G  + LP EN  +L+      + CL +    AA P        ++ +FQ QN  
Sbjct: 369 TLHFE-GLDLKLPLEN--SLIHSSAGSLACLAM----AAAPDNVNSVLNVIANFQQQNLR 421

Query: 466 LEFDLANDRFGFAKQKC 482
           + FD+ N++ G A++ C
Sbjct: 422 ILFDIVNNKVGIAREVC 438


>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
          Length = 416

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 77/285 (27%), Positives = 125/285 (43%), Gaps = 32/285 (11%)

Query: 209 GFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKF 264
           G T G++ ++T    + T  +   GC + S       P+G+ G GR+  SL SQ+ + KF
Sbjct: 136 GHTLGIVATDTFAIGTATA-SLGFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKF 194

Query: 265 SYCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
           SYCL      D+  +S L+L +     G G+S T     TPF K   G      ++Y + 
Sbjct: 195 SYCLTPH---DSGKNSRLLLGSSAKLAGGGNSTT-----TPFVKTSPGDD--MSQYYPIQ 244

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L  I  G   + +P S       GN  V+V + +  +F+    ++A+ KE  + +G    
Sbjct: 245 LDGIKAGDAAIALPPS-------GN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPT 296

Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKF-KGGAKMALPPENYFALVGNE--VLCLIL 438
           A  ++       CF  +G  +   P+L+  F +G A + +PP  Y   VG E   +C+ +
Sbjct: 297 ATPLQP---FDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAI 353

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + +            ILG  Q +N +   DL      F    CA
Sbjct: 354 LSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCA 398


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 160/403 (39%), Gaps = 63/403 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS ++W  C S   C  C   +     +  + P+ S 
Sbjct: 88  GLYFTRIGIGTPAKRYYVQV-DTGSDILWVNCVS---CDGCPRKSNLGIELTMYDPRGSQ 143

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S +L+ C    C   +G  V   C   SP        C  Y + YG G  TAG  +++ L
Sbjct: 144 SGELVTCDQQFCVANYG-GVLPSCTSTSP--------C-EYSISYGDGSSTAGFFVTDFL 193

Query: 221 RF----------PSKTVPNFLAGCSILSDRQPA-----GIAGFGRSSESLPSQLGL---- 261
           ++          P+    +F  G  +  D   +     GI GFG+S+ S+ SQL      
Sbjct: 194 QYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKV 253

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K F++CL +          N+V            P +  TP   +           Y V
Sbjct: 254 RKMFAHCLDTVNGGGIFAIGNVV-----------QPKVKTTPLVPDM--------PHYNV 294

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I VG   + +P +    G+  + G I+DSG+T  ++     E V K     + +  
Sbjct: 295 ILKGIDVGGTALGLPTNIFDSGN--SKGTIIDSGTTLAYVP----EGVYKALFAMVFDKH 348

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
           +   V+       CF  SG      PE+   F+G   + + P +Y    G  + C+  F 
Sbjct: 349 QDISVQTLQDF-SCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCM-GFQ 406

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +       G+   ++LGD  L N  + +DL N   G+A   C+
Sbjct: 407 NGGVQTKDGK-DMVLLGDLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 105/394 (26%), Positives = 165/394 (41%), Gaps = 62/394 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP +  T  IFDTGS + W  C     CV   +      + P   P  S+
Sbjct: 69  GDYVVTVGLGTPKKEFT-LIFDTGSDITWTQCEP---CVKTCYKQ----KEPRLNPSTST 120

Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           S + I C +  C  +  G      C      + TC      Y +QYG G ++ G   +ET
Sbjct: 121 SYKNISCSSALCKLVASGKKFSQSCS-----SSTCL-----YQVQYGDGSYSIGFFATET 170

Query: 220 LRFPSKTV-PNFLAGCSILSDRQPAGIAGFG---RSSESLPSQLGL---KKFSYCLLSRK 272
           L   S  V  NFL GC   ++    G AG     R+  +LPSQ      K FSYCL    
Sbjct: 171 LTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL---- 226

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSK 330
               P SS+       G   SK+  + +TP        S+ F    FY + +  + VG +
Sbjct: 227 ----PASSSSKGYLSLGGQVSKS--VKFTPL-------SADFDSTPFYGLDITGLSVGGR 273

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  S        + G ++DSG+  T +    +  ++  F   M +Y   +     S 
Sbjct: 274 QLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGY---SI 324

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
              C+D S   +V +P++ + FKGG +M +      + + G + +CL    ++       
Sbjct: 325 FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGND------D 378

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                I G+ Q + + + +D A  R GFA   C+
Sbjct: 379 DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 105/394 (26%), Positives = 165/394 (41%), Gaps = 62/394 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP +  T  IFDTGS + W  C     CV   +      + P   P  S+
Sbjct: 129 GDYVVTVGLGTPKKEFT-LIFDTGSDITWTQCEP---CVKTCYKQ----KEPRLNPSTST 180

Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           S + I C +  C  +  G      C      + TC      Y +QYG G ++ G   +ET
Sbjct: 181 SYKNISCSSALCKLVASGKKFSQSCS-----SSTCL-----YQVQYGDGSYSIGFFATET 230

Query: 220 LRFPSKTV-PNFLAGCSILSDRQPAGIAGFG---RSSESLPSQLGL---KKFSYCLLSRK 272
           L   S  V  NFL GC   ++    G AG     R+  +LPSQ      K FSYCL    
Sbjct: 231 LTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL---- 286

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSK 330
               P SS+       G   SK+  + +TP        S+ F    FY + +  + VG +
Sbjct: 287 ----PASSSSKGYLSLGGQVSKS--VKFTPL-------SADFDSTPFYGLDITGLSVGGR 333

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  S        + G ++DSG+  T +    +  ++  F   M +Y   +     S 
Sbjct: 334 KLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGY---SI 384

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
              C+D S   +V +P++ + FKGG +M +      + + G + +CL    ++       
Sbjct: 385 FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGND------D 438

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                I G+ Q + + + +D A  R GFA   C+
Sbjct: 439 DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 472


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 105/394 (26%), Positives = 165/394 (41%), Gaps = 62/394 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++  GTP +  T  IFDTGS + W  C     CV   +      + P   P  S+
Sbjct: 117 GDYVVTVGLGTPKKEFT-LIFDTGSDITWTQCEP---CVKTCYKQ----KEPRLNPSTST 168

Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           S + I C +  C  +  G      C      + TC      Y +QYG G ++ G   +ET
Sbjct: 169 SYKNISCSSALCKLVASGKKFSQSCS-----SSTCL-----YQVQYGDGSYSIGFFATET 218

Query: 220 LRFPSKTV-PNFLAGCSILSDRQPAGIAGFG---RSSESLPSQLGL---KKFSYCLLSRK 272
           L   S  V  NFL GC   ++    G AG     R+  +LPSQ      K FSYCL    
Sbjct: 219 LTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL---- 274

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSK 330
               P SS+       G   SK+  + +TP        S+ F    FY + +  + VG +
Sbjct: 275 ----PASSSSKGYLSLGGQVSKS--VKFTPL-------SADFDSTPFYGLDITGLSVGGR 321

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            + I  S        + G ++DSG+  T +    +  ++  F   M +Y   +     S 
Sbjct: 322 KLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGY---SI 372

Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
              C+D S   +V +P++ + FKGG +M +      + + G + +CL    ++       
Sbjct: 373 FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGND------D 426

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                I G+ Q + + + +D A  R GFA   C+
Sbjct: 427 DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460


>gi|222822564|gb|ACM68431.1| xyloglucan-specific endoglucanase inhibitor protein [Capsicum
           annuum]
          Length = 437

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 166/411 (40%), Gaps = 77/411 (18%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T  Q  TP +      D G   +W         VDC+   V  S  PA    RS+
Sbjct: 44  TLQYLTQIQQRTPLVPVSLTLDLGGQFLW---------VDCDQGYVSSSYKPARC--RSA 92

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
              L G     C   F P       GC+  N TC L   + + +     T+G L S+ + 
Sbjct: 93  QCSLAGATG--CGECFSPPRP----GCN--NNTCGLFPDNTVTRTA---TSGELASDVVS 141

Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
             S    N           F+ G + L     +G+   AG GR+  SLPSQ        +
Sbjct: 142 VQSSNGKNPGRNVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 201

Query: 263 KFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG- 315
           KF+ CL S K      F D P     + +T   + D       YTP   NPV ++SAF  
Sbjct: 202 KFAVCLSSSKSKGVVLFGDGPYF--FLPNTEFSNND-----FQYTPLLINPVSTASAFSA 254

Query: 316 ----EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                 Y++G++ + +  K V I  + L   + G GG  + + + +T +E  L+ A+   
Sbjct: 255 GQPSSEYFIGVKSVKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTVLETSLYNAITNF 314

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKG-GAKMALPPENYF 426
           F++++ N +R A V        CFD     S      +P++ L  +       +   N  
Sbjct: 315 FVKELANVTRVASVAP---FGACFDSRNIGSTRVGPAVPQIDLVLQNENVIWTIFGANSM 371

Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
             V   VLCL  F D      +    +I++G   +++  L+ D+A  R GF
Sbjct: 372 VQVSENVLCL-GFVDG----GVNSRTSIVIGGHTIEDNLLQLDIARSRLGF 417


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 99/388 (25%), Positives = 150/388 (38%), Gaps = 50/388 (12%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           + + GTPPQ ++  I D    LVW  C+   RC   +        +P FIP  SS+ +  
Sbjct: 46  NFTIGTPPQPASAII-DVAGELVWTQCSRCSRCFKQD--------LPLFIPNASSTFRPE 96

Query: 167 GCQNPKCSWIFGPNVESRCKG--CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
            C    C         S C G  C+  + T         ++     T G++ +ET    +
Sbjct: 97  PCGTDACK----STPTSNCSGDVCTYESTTN--------IRLDRHTTLGIVGTETFAIGT 144

Query: 225 KTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
            T  +   GC + SD       +G  G GR+  SL +Q+ L KFSYCL  R       SS
Sbjct: 145 ATA-SLAFGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK---SS 200

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            L L  G  +  +     S  PF K      S    +Y + L  I  G+  +    S   
Sbjct: 201 RLFL--GSSAKLAGGESTSTAPFIKTSPDDDSH--HYYLLSLDAIRAGNTTIATAQS--- 253

Query: 341 PGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DIS 398
                 GG++V  + S F+ +    + A  K     +G  +             CF   +
Sbjct: 254 ------GGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAA 307

Query: 399 GKKSVYLPELILKFK-GGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAII 455
           G      P+L+  F+ GGA + +PP  Y   VG E    C  + +  A     G     +
Sbjct: 308 GFSRATAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILS-MARLNRTGLEGVSV 366

Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           LG  Q +N +  +DL  +   F    C+
Sbjct: 367 LGSLQQENVHFLYDLKKETLSFEPADCS 394


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 106/405 (26%), Positives = 162/405 (40%), Gaps = 76/405 (18%)

Query: 102 GGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           G Y +++S GTPP    P   I DTGS L+W       +C+ C  PN      P F PK 
Sbjct: 92  GAYLMNISLGTPP---VPMLGIADTGSDLIW------RQCLPC--PNCYEQVEPLFDPKE 140

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           S + + + C N  C  +         +G    + TC     +Y   YG   +T G L S+
Sbjct: 141 SETYKTLDCDNEFCQDL-------GQQGSCDDDNTC-----TYSYSYGDRSYTRGDLSSD 188

Query: 219 TLRFPSK-----TVPNFLAGC-----SILSDRQPAGIAGFGRSSE---SLPSQLGLKKFS 265
           TL   S      + P    GC        +++    I   G        L S++G  +FS
Sbjct: 189 TLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG-GQFS 247

Query: 266 YCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
           YCL+     D+ VSS +         GSG   TP +  TP              FYY+ L
Sbjct: 248 YCLVPLS-SDSTVSSKINFGKSGVVSGSGTVSTPLIKGTP------------DTFYYLTL 294

Query: 323 RQIIVGSKHVKIP---YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
             + VGS+ V       +   P +   G +I+DSG+T T +    +  V       +G  
Sbjct: 295 EGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQ 354

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
           +     +       C+  S   ++ +P +   F  GA + LPP N F  V  +++C  + 
Sbjct: 355 TT---TDPNGIFSLCY--SSVNNLEIPTITAHFT-GADVQLPPLNTFVQVQEDLVCFSMI 408

Query: 440 -TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + N A          I G+    NF + +DL N++  F +  C 
Sbjct: 409 PSSNLA----------IFGNLAQINFLVGYDLKNNKVSFKQTDCT 443


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 100/401 (24%), Positives = 164/401 (40%), Gaps = 82/401 (20%)

Query: 106 ISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           ++ S G P  PQ +   I DTGS+++W  C    RC   N P +DPS        +SS+ 
Sbjct: 101 VNFSMGQPATPQLA---IMDTGSNILWVRCAPCKRCTQQNGPLLDPS--------KSSTY 149

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             + C N  C   + P+       C+  N+        Y L Y  G  +AG+L +E L F
Sbjct: 150 ASLPCTNTMCH--YAPSAY-----CNRLNQC------GYNLSYATGLSSAGVLATEQLIF 196

Query: 223 PS-----KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
            S       VP+ + GCS       DR+  G+ G G+   S  +++G  KFSYCL     
Sbjct: 197 HSSDEGVNAVPSVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRMG-SKFSYCL----- 250

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPF----------YKNPVGSSSAFGEFYYVGLR 323
                            G+   P   Y             Y  P+         YYV L 
Sbjct: 251 -----------------GNIADPHYGYNQLVFGEKANFEGYSTPL---KVVNGHYYVTLE 290

Query: 324 QIIVGSKHVKIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
            I VG K + I   ++ + G++ +   ++DSG+  T++    F A+  E +RQ+ +    
Sbjct: 291 GISVGEKRLDIDSTAFSMKGNEKSA--LIDSGTALTWLAESAFRALDNE-VRQLLD---G 344

Query: 383 ADVEKKSGLRPCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
             +    G   C+     +  +  P +   F GGA + L  E+ F     ++LC+ +   
Sbjct: 345 VLMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQATPDILCIAVRQA 404

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           +A G         ++G    Q + + +DL +++  F +  C
Sbjct: 405 SAYGNDFKSFS--VIGLMAQQYYNMAYDLNSNKLFFQRIDC 443


>gi|32482806|gb|AAP84703.1| putative xyloglucanase inhibitor [Solanum tuberosum]
          Length = 437

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 167/411 (40%), Gaps = 77/411 (18%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T  Q  TP +      D G   +W         VDC+   V  S  PA    RS+
Sbjct: 44  TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPARC--RSA 92

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL- 220
              L G     C   F P       GC+  N TC L   + + +     T+G L S+ + 
Sbjct: 93  QCSLGGASG--CGECFSPPR----PGCN--NNTCGLLPDNTVTRTA---TSGELASDIVS 141

Query: 221 ------RFPSKTVPN----FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
                 + P ++V +    F+ G + L     +G+   AG GR+  SLPSQ        +
Sbjct: 142 VQSTNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 201

Query: 263 KFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
           KF+ CL S        F D P      L     S +       YTP + NPV ++SAF  
Sbjct: 202 KFALCLTSSNSKGVVLFGDGPY---FFLPNREFSNND----FQYTPLFINPVSTASAFSS 254

Query: 317 -----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                 Y++G++ I +  K V I  + L   + G GG  + + + +T +E  L+ A+   
Sbjct: 255 GQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNF 314

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKG-GAKMALPPENYF 426
           F++++ N +R A V      + CFD     S      +P + L  +       +   N  
Sbjct: 315 FVKELANVTRVAAVAP---FKVCFDSRNIGSTRVGPAVPSIDLVLQNENVVWTIFGANSM 371

Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
             V   VLCL +         +    +I++G   +++  L+FD A  R GF
Sbjct: 372 VQVSENVLCLGVLDG-----GVNSRTSIVIGGHTIEDNLLQFDHAASRLGF 417


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 142/378 (37%), Gaps = 57/378 (15%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P  + P   DT   L W       +C  C  P   P +   F P+RS +S  + C +  C
Sbjct: 158 PILAQPMSIDTSIDLPWI------QCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAAC 211

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFL 231
                  +     GCS  N  C      Y + YG G  T+G  + + L   PS  V NF 
Sbjct: 212 G-----ELGRYGAGCS--NNQC-----QYFVDYGDGRATSGTYMVDALTLNPSTVVMNFR 259

Query: 232 AGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVL 284
            GCS           +G    G   +SL SQ        FSYC+        P SS   L
Sbjct: 260 FGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD------PSSSGF-L 312

Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
             G  +        + TP  +NP    S     Y V LR I VG + + +P         
Sbjct: 313 SLGGPADGGGAGRFARTPLVRNP----SIIPTLYLVRLRGIEVGGRRLNVPPVVFA---- 364

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
             GG ++DS    T +    + A+   F   M  Y R A    ++GL  C+D     SV 
Sbjct: 365 --GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAG--GRAGLDTCYDFVRFTSVT 420

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           +P + L F GGA + L        +G  V   + F       ALG      +G+ Q Q  
Sbjct: 421 VPAVSLVFDGGAVVRLDA------MGVMVEGCLAFVPTPGDFALG-----FIGNVQQQTH 469

Query: 465 YLEFDLANDRFGFAKQKC 482
            + +D+     GF +  C
Sbjct: 470 EVLYDVGGGSVGFRRGAC 487


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 142/378 (37%), Gaps = 57/378 (15%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P  + P   DT   L W       +C  C  P   P +   F P+RS +S  + C +  C
Sbjct: 142 PILAQPMSIDTSIDLPWI------QCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAAC 195

Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFL 231
                  +     GCS  N  C      Y + YG G  T+G  + + L   PS  V NF 
Sbjct: 196 G-----ELGRYGAGCS--NNQC-----QYFVDYGDGRATSGTYMVDALTLNPSTVVMNFR 243

Query: 232 AGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVL 284
            GCS           +G    G   +SL SQ        FSYC+        P SS   L
Sbjct: 244 FGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD------PSSSGF-L 296

Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
             G  +        + TP  +NP    S     Y V LR I VG + + +P         
Sbjct: 297 SLGGPADGGGAGRFARTPLVRNP----SIIPTLYLVRLRGIEVGGRRLNVPPVVFA---- 348

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
             GG ++DS    T +    + A+   F   M  Y R A    ++GL  C+D     SV 
Sbjct: 349 --GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAG--GRAGLDTCYDFVRFTSVT 404

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           +P + L F GGA + L        +G  V   + F       ALG      +G+ Q Q  
Sbjct: 405 VPAVSLVFDGGAVVRLDA------MGVMVEGCLAFVPTPGDFALG-----FIGNVQQQTH 453

Query: 465 YLEFDLANDRFGFAKQKC 482
            + +D+     GF +  C
Sbjct: 454 EVLYDVGGGSVGFRRGAC 471


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 106/404 (26%), Positives = 155/404 (38%), Gaps = 68/404 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + L  GTPP      I DTGS+++W PC +   C DC   N   S    F P  SS
Sbjct: 96  GNYLMKLLIGTPPTEIHAAI-DTGSNVIWIPCIN---CKDCF--NQSSS---IFNPLASS 146

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           + Q   C + +C        E+    C   N  C  +C     ++ L    G +  +T+ 
Sbjct: 147 TYQDAPCDSYQC--------ETTSSSCQSDN-VCLYSCDE---KHQLNCPNGRIAVDTMT 194

Query: 222 FPSKT-------VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSR 271
             S           +F+ G SI       G+ G GR + SL S+   L   KFSYC L+ 
Sbjct: 195 LTSSDGRPFPLPYSDFVCGNSIYKTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYC-LAD 253

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
            +   P   N  L +     D +    +        +G     G  YYV L  I VG K 
Sbjct: 254 YYSKQPSKINFGLQSFISDDDLEVVSTT--------LGHHRHSGN-YYVTLEGISVGEKR 304

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFE--------AVAKEFIRQMGNYSRAA 383
             + Y    P +   G +++DSG+ FT +    ++        A+ +       N     
Sbjct: 305 QDL-YYVDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPF 363

Query: 384 DVEKKSGLRPCFDISGKKSVYLPEL----ILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            ++    L PCF        Y PEL    I      A + L  +N F  V  +V+C    
Sbjct: 364 SMDNTLKLSPCF-------WYYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCF--- 413

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               A  A   G + + G +Q  NF L +DL      F +  C+
Sbjct: 414 ----AFAATQPGQSTVYGSWQQMNFILGYDLKRGTVSFKRTDCS 453


>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 521

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 109/429 (25%), Positives = 161/429 (37%), Gaps = 90/429 (20%)

Query: 70  LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
           LK KT P+T           SL    L        ++SL+ G+PPQ  T  + DTGS L 
Sbjct: 13  LKVKTLPQT-----------SLSPRKLPFQHNVTLTVSLTVGSPPQRVT-MVLDTGSELS 60

Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
           W  C    +  + NF          F P  SSS     C +P C+               
Sbjct: 61  WLHCK---KLPNLNF---------IFNPLVSSSYTPTPCTSPICT-------------TQ 95

Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC------SILSDRQPA 243
            R+   P++C +  L + + F  G      + F          GC      S   D +  
Sbjct: 96  TRDLINPVSCDANKLCHIITFFVGGPAQRGMVF----------GCMDTGTSSGDEDSKTT 145

Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
           G+ G    S S  +Q+ L KFSYC+ ++      V  N+       +   +   L YTP 
Sbjct: 146 GLMGMDLGSLSFSNQMRLPKFSYCISNKDSTGVLVLENI-------ANPPRLGPLHYTPL 198

Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
            K     ++    F     R   +  K      S  +P   G G  +VDS + FTF+  P
Sbjct: 199 VKK----TTPLPYFN----RNCCLFQK------SAFLPDHTGAGQTMVDSATQFTFLRQP 244

Query: 364 LFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDIS-GKKSVYLPELILKFKGGAKMA 419
           ++ A+  EF  Q  N        K   +  +  CF +  G     LP + L F  GA++ 
Sbjct: 245 VYTALKNEFAIQTKNILTPLGDPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMF-DGAELR 303

Query: 420 LPPENYFALVGN------EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
           +  E     V N       + C      +  G       A I+G    +N ++E+DLAN 
Sbjct: 304 VTGERLLYKVSNVAKSNSWIYCFTFGNSDLLGIE-----AFIIGHHHQRNVWMEYDLANS 358

Query: 474 RFGFAKQKC 482
           R GF+   C
Sbjct: 359 RIGFSDTNC 367


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 149/387 (38%), Gaps = 49/387 (12%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
           + + GTPPQ ++  I D    LVW  C+   RC   +        +P FIP  SS+ +  
Sbjct: 46  NFTIGTPPQPASAII-DVAGELVWTQCSRCSRCFKQD--------LPLFIPNASSTFRPE 96

Query: 167 GCQNPKCSWIFGPNVESRCKG--CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
            C    C         S C G  C+  + T         ++     T G++ +ET    +
Sbjct: 97  PCGTDACK----STPTSNCSGDVCTYESTTN--------IRLDRHTTLGIVGTETFAIGT 144

Query: 225 KTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
            T  +   GC + SD       +G  G GR+  SL +Q+ L KFSYCL  R       SS
Sbjct: 145 ATA-SLAFGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK---SS 200

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
            L L  G  +  +     S  PF K      S    +Y + L  I  G+  +    S   
Sbjct: 201 RLFL--GSSAKLAGGESTSTAPFIKTSPDDDSH--HYYLLSLDAIRAGNTTIATAQS--- 253

Query: 341 PGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DIS 398
                 GG++V  + S F+ +    + A  K     +G  +             CF   +
Sbjct: 254 ------GGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAA 307

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIIL 456
           G      P+L+  F+G A + +PP  Y   VG E    C  + +  A     G     +L
Sbjct: 308 GFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS-MAWLNRTGLEGVSVL 366

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
           G  Q ++ +  +DL  +   F    C+
Sbjct: 367 GSLQQEDVHFLYDLKKETLSFEPADCS 393


>gi|350536487|ref|NP_001234249.1| xyloglucan-specific fungal endoglucanase inhibitor protein
           precursor [Solanum lycopersicum]
 gi|27372527|gb|AAN87262.1| xyloglucan-specific fungal endoglucanase inhibitor protein
           precursor [Solanum lycopersicum]
          Length = 438

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 104/410 (25%), Positives = 165/410 (40%), Gaps = 74/410 (18%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T  Q  TP +      D G   +W         VDC+   V  S  PA    R  
Sbjct: 44  TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPA----RCG 90

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL- 220
           S+Q        C   F P       GC+  N TC L   + +       T+G L S+ + 
Sbjct: 91  SAQCSLGGASGCGECFSPPR----PGCN--NNTCGLLPDNTVTGTA---TSGELASDVVS 141

Query: 221 ------RFPSKTVPN----FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
                 + P ++V +    F+ G + L     +G+   AG GR+  SLPSQ        +
Sbjct: 142 VESSNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTKISLPSQFSAEFSFPR 201

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE-- 316
           KF+ CL S       V    +   GP     +         YTP + NPV ++SAF    
Sbjct: 202 KFALCLTSSSNSKGVV----LFGDGPYFFLPNRQFSNNDFQYTPLFINPVSTASAFSSGQ 257

Query: 317 ---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
               Y++G++ I +  K V I  + L   + G GG  + + + +T +E  L+ A+   F+
Sbjct: 258 PSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNFFV 317

Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFA 427
           +++ N +R A V      R CFD     S  +       +L+L+        +   N   
Sbjct: 318 KELANVTRVAVVAP---FRVCFDSRDIGSTRVGPAVPSIDLVLQ-NANVVWTIFGANSMV 373

Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
            V   VLCL +         +    +I++G   +++  L+FD A  R GF
Sbjct: 374 QVSENVLCLGVLDG-----GVNARTSIVIGGHTIEDNLLQFDHAASRLGF 418


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 114/403 (28%), Positives = 165/403 (40%), Gaps = 96/403 (23%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPA 154
           V S G Y ++L  GTPP      I DTGS L W    PCT  Y+ V           +P 
Sbjct: 86  VPSAGEYLMNLYIGTPPVPVIA-IVDTGSDLTWTQCRPCTHCYKQV-----------VPL 133

Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
           F PK SS+ +   C    C  +       + + CS + K C     ++   Y  G FT G
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLAL------GKDRSCS-KEKKC-----TFRYSYADGSFTGG 181

Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQL----- 259
            L SETL   S      + P F  GC   S    D+  +GI G G    SL SQL     
Sbjct: 182 NLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTIN 241

Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
           GL  FSYCLL       PVS++  + +    G S               G  S +G    
Sbjct: 242 GL--FSYCLL-------PVSTDSSISSRINFGAS---------------GRVSGYGTV-- 275

Query: 320 VGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
                    S  +++PY  Y        G +IVDSG+T+TF+    +  + K     + N
Sbjct: 276 ---------STPLRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEK----SVAN 322

Query: 379 YSRAADVEKKSGL-RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
             +   V   +G+   C++ + +  +  P +   FK  A + L P N F  +  +++C  
Sbjct: 323 SIKGKRVRDPNGIFSLCYNTTAE--INAPIITAHFK-DANVELQPLNTFMRMQEDLVCFT 379

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
           +   +  G         +LG+    NF + FDL   R GF+K+
Sbjct: 380 VAPTSDIG---------VLGNLAQVNFLVGFDLRKKR-GFSKK 412



 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 60/139 (43%), Gaps = 16/139 (11%)

Query: 346 NGGVIVDSGSTFTFMEGPL-FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
            G +IVDSG+T+T++  PL F    +E +       R  D    S L  C++ +  + + 
Sbjct: 417 EGNIIVDSGTTYTYL--PLEFYVKLEESVAHSIKGKRVRDPNGISSL--CYNTTVDQ-ID 471

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
            P +   FK  A + L P N F  +  +++C  +   +  G         ILG+    NF
Sbjct: 472 APIITAHFK-DANVELQPWNTFLRMQEDLVCFTVLPTSDIG---------ILGNLAQVNF 521

Query: 465 YLEFDLANDRFGFAKQKCA 483
            + FDL   R  F    C 
Sbjct: 522 LVGFDLRKKRVSFKAADCT 540


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 96/406 (23%), Positives = 166/406 (40%), Gaps = 68/406 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G PP+     + DTGS ++W  C +   C  C   +    ++  + P+ S+
Sbjct: 80  GLYFAKIGLGNPPKDYYVQV-DTGSDILWVNCAN---CDKCPTKSDLGVKLTLYDPQSST 135

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S+  I C +  C+  +        +GC     T  L C  Y + YG G  TAG  + + L
Sbjct: 136 SATRIYCDDDFCAATY----NGVLQGC-----TKDLPC-QYSVVYGDGSSTAGFFVKDNL 185

Query: 221 RFP--------SKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGL---- 261
           +F         S    + + GC          S     GI GFG+++ S+ SQL      
Sbjct: 186 QFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKV 245

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
            + F++CL + K              G G    G+  +P ++ TP   N           
Sbjct: 246 KRVFAHCLDNVK--------------GGGIFAIGEVVSPKVNTTPMVPNQ--------PH 283

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y V +++I VG   +++P      G     G I+DSG+T  ++   ++E++  + + +  
Sbjct: 284 YNVVMKEIEVGGNVLELPTDIFDTGD--RRGTIIDSGTTLAYLPEVVYESMMTKIVSEQP 341

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
              +   VE++     CF  +G  +   P +   F G   + + P +Y   +  EV C  
Sbjct: 342 GL-KLHTVEEQ---FTCFQYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHEEVWCF- 396

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + ++      GR    +LGD  L N  + +DL N   G+    C+
Sbjct: 397 GWQNSGMQSKDGR-DMTLLGDLVLSNKLVLYDLENQAIGWTDYNCS 441


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 140/517 (27%), Positives = 212/517 (41%), Gaps = 115/517 (22%)

Query: 6   FSLICLFSLLILLF---------TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPL-KI 55
           FSL  LF  L ++F         T  +  G  +  +    +PLS  +    +  D L K 
Sbjct: 4   FSLKFLFYTLAVIFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKA 63

Query: 56  LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQ 115
            H     S+SRA H +             +  S + I++P+ + + G Y +++S GTPP 
Sbjct: 64  FHR----SISRANHFR------------ANGVSTNSIQSPV-ISNNGEYLMNISLGTPP- 105

Query: 116 ASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
            S   I DTGS L+W    PC S Y  ++           P F P +S + Q++ C+   
Sbjct: 106 VSMHGIADTGSDLLWRQCKPCDSCYEQIE-----------PIFDPAKSKTYQILSCEGKS 154

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT----- 226
           CS + G        GCS  N TC      Y   YG G  T+G L  +TL   S T     
Sbjct: 155 CSNLGGQG------GCSDDN-TCI-----YSYSYGDGSHTSGDLAVDTLTIGSTTGRPVS 202

Query: 227 VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVS 279
           VP  + GC   +    +   +G+ G G    S+ SQL      +FSYCL+    +D  VS
Sbjct: 203 VPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLG-NDPSVS 261

Query: 280 SNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
           S +   +     G+G   TP  S  P              FYY+ L  + VGSK  K+ Y
Sbjct: 262 SKMHFGSRGIVSGAGAVSTPLASRQP------------DTFYYLTLESMSVGSK--KLAY 307

Query: 337 -------SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
                  S L    +GN  +I+DSG+T T +    +  +    +  +G        +  +
Sbjct: 308 KGFSKVGSPLADADEGN--IIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR---DPNN 362

Query: 390 GLRPCF-DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC--LILFTDNAAGP 446
               C+ ++SG +   +P +   F  GA + L P N F  V  ++ C  +I  +D A   
Sbjct: 363 VFSLCYSNLSGLR---IPTITAHFV-GADLELKPLNTFVQVQEDLFCFAMIPVSDLA--- 415

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                   I G+    NF + +DL +    F    C 
Sbjct: 416 --------IFGNLAQMNFLVGYDLKSRTVSFKPTDCT 444


>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
          Length = 565

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 109/246 (44%), Gaps = 33/246 (13%)

Query: 244 GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKTPGL 298
           G+ GF R   S PSQ   +    FSYCL S K      SSN    L  GP     +   +
Sbjct: 344 GLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYK------SSNFSGTLRLGPAGQPKR---I 394

Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
             TP   NP   S      YYV +  I VG + V +P S L        G IVD+G+ FT
Sbjct: 395 KTTPLLSNPHRPS-----LYYVNMVGIRVGGRPVAVPASALAFDPASGHGTIVDAGTMFT 449

Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
            +  P++ AV   F  ++    RA       G   C+++    ++ +P +   F G   +
Sbjct: 450 RLSAPVYAAVCDVFRSRV----RAPVAGPLGGFDTCYNV----TISVPTVTFLFDGRVSV 501

Query: 419 ALPPENYFALVG-NEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFG 476
            LP EN       + + CL +    AAGP+      + ++   Q QN  + FD+AN R G
Sbjct: 502 TLPEENVVIRSSLDGIACLAM----AAGPSDSVDAVLNVMASMQQQNHRVLFDVANGRVG 557

Query: 477 FAKQKC 482
           F+++ C
Sbjct: 558 FSRELC 563


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/391 (25%), Positives = 150/391 (38%), Gaps = 62/391 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  ++  GTP    T  I DTGSSL W       +C  CN     P R+P F P  SSS 
Sbjct: 129 YVATVGLGTPAVPQT-LILDTGSSLTWV------QCKPCNSSQCYPQRLPLFDPNTSSSY 181

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRF 222
             + C + +C       ++    GC+         C +Y + YG G T AG   ++ L  
Sbjct: 182 SPVPCDSQECR-ALAAGIDG--DGCTSDGD---WGC-AYEIHYGSGATPAGEYSTDALTL 234

Query: 223 -PSKTVPNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGLKK----FSYCLLSRKF 273
            P   V  F  GC     R       G+ G GR  +SL  Q   ++    FS+CL     
Sbjct: 235 GPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV 294

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
               ++     DT              + F   P+ +      FY +    I V  + + 
Sbjct: 295 STGFLALGAPHDT--------------SAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLD 340

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
           IP +          GVI DSG+  + ++   + A+   F   M  Y  A  V     L  
Sbjct: 341 IPPAVF------REGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGH---LDT 391

Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT--DNAAGPALGRG 451
           CF+ +G  +V +P + L F+GGA + L   +   + G    CL  ++  D   G      
Sbjct: 392 CFNFTGYDNVTVPTVSLTFRGGATVHLDASSGVLMDG----CLAFWSSGDEYTG------ 441

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              ++G    +   + +D+   + GF    C
Sbjct: 442 ---LIGSVSQRTIEVLYDMPGRKVGFRTGAC 469


>gi|222822566|gb|ACM68432.1| xyloglucanase-specific endoglucanase inhibitor protein [Petunia x
           hybrida]
          Length = 436

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 111/412 (26%), Positives = 164/412 (39%), Gaps = 79/412 (19%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T     TP +      D G   +W         VDC+   V  S IPA    RS+
Sbjct: 43  TLQYLTQISQRTPLVPVSLTLDLGGQFLW---------VDCDQGYVSSSYIPARC--RSA 91

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
              L G     C   F P       GC+  N TC     + + +     T+G L S+ + 
Sbjct: 92  KCSLAGSSG--CGDCFSP----PSPGCN--NNTCGAFPDNSITRTA---TSGELASDIVS 140

Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
             S    N           F+ G + L +   +G+   AG GR+  SLPSQ        +
Sbjct: 141 VQSSNGKNPGRNVSDKDFLFVCGATFLLNGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 200

Query: 263 KFSYCLLSRK-------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG 315
           KF+ CL S         F D P S    L     S D      SYTP + NPV ++SAF 
Sbjct: 201 KFAVCLSSTSNSKGVVLFGDGPYS---FLPNREYSSDD----FSYTPLFINPVSTASAFS 253

Query: 316 E-----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
                  Y++G++ I +  K V I  + L   S G GG  + + + +T +E  ++ AV  
Sbjct: 254 SGTPSSEYFIGVKSIKINEKVVPINTTLLSIDSQGVGGTKISTVNPYTILETSIYNAVTN 313

Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKG-GAKMALPPENY 425
            F++++        V   +    CFD     S      +P + L  +       +   N 
Sbjct: 314 FFVKELA----IPTVPSVAPFGVCFDSRNITSTRVGPGVPSIDLVLQNENVFWRIFGANS 369

Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
             LV   VLCL  F D    P      +I++G   +++  L+FDLA  R GF
Sbjct: 370 MVLVSENVLCL-GFVDGGVNPR----TSIVIGGHTIEDNLLQFDLAASRLGF 416


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 108/393 (27%), Positives = 153/393 (38%), Gaps = 73/393 (18%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++S G+P  A T FI DTGS + W  C SR                  + P  SS+ 
Sbjct: 131 YVITVSIGSPAVAXTMFI-DTGSDVSWLRCKSRL-----------------YDPGTSSTY 172

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
               C  P C+      +  R  GCS    TC      Y ++YG G  T G   S+TL  
Sbjct: 173 APFSCSAPACA-----QLGRRGTGCS-SGSTC-----VYSVKYGDGSNTTGTYGSDTLTL 221

Query: 223 PSKTVP---NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
              + P    F  GCS +     +    G+ G G  ++S  SQ        FSYCL    
Sbjct: 222 AGTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCL---- 277

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
                 +S+  L  G           +   F   P+  S     FY + LR I VG K +
Sbjct: 278 --PPTWNSSGFLTLG------APSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTL 329

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
           +IP S    GS      IVDSG+  T +    + A++  F   M  Y +      +  L 
Sbjct: 330 EIPSSVFSAGS------IVDSGTVITRLPPTAYGALSAAFRDGMARY-QYQPAAPRGLLD 382

Query: 393 PCFDISGK---KSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
            CFD +G     +  +P + L   GGA + L P     +V +  L      D+       
Sbjct: 383 TCFDFTGHGEGNNFTVPSVALVLDGGAVVDLHPN---GIVQDGCLAFAATDDD------- 432

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G   I+G+ Q + F + +D+    FGF    C
Sbjct: 433 -GRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464


>gi|350536203|ref|NP_001234746.1| xyloglucan-specific fungal endoglucanase inhibitor protein
           precursor [Solanum lycopersicum]
 gi|68449754|gb|AAY97864.1| xyloglucan-specific fungal endoglucanase inhibitor protein
           precursor [Solanum lycopersicum]
          Length = 438

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 104/410 (25%), Positives = 164/410 (40%), Gaps = 74/410 (18%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T  Q  TP +      D G   +W         VDC+   V  S  PA    R  
Sbjct: 44  TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPA----RCG 90

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL- 220
           S+Q        C   F P       GC   N TC L   + +       T+G L S+ + 
Sbjct: 91  SAQCSLGGASGCGECFSPPR----PGCD--NNTCGLLPDNTVTGTA---TSGELASDVVS 141

Query: 221 ------RFPSKTVPN----FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
                 + P ++V +    F+ G + L     +G+   AG GR+  SLPSQ        +
Sbjct: 142 VESSNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTKISLPSQFSAEFSFPR 201

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE-- 316
           K + CL S       V    +   GP     +         YTP + NPV ++SAF    
Sbjct: 202 KSALCLTSSSNSKGVV----LFGDGPYFFLPNRQFSNNDFQYTPLFINPVSTASAFSSGQ 257

Query: 317 ---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
               Y++G++ I +  K V I  + L   + G GG  + + + +T +E  L+ A+   F+
Sbjct: 258 PSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNFFV 317

Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFA 427
           +++ N +R A V      R CFD     S  +       +L+L+        +   N   
Sbjct: 318 KELANVTRVAVVAP---FRVCFDSRDIGSTRVGPAVPSIDLVLQ-NANVVWTIFGANSMV 373

Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
            V   VLCL +         +  G +I++G   +++  L+FD A  R GF
Sbjct: 374 QVSENVLCLGVLDG-----GVNAGTSIVIGGHTIEDNLLQFDHAASRLGF 418


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 101/413 (24%), Positives = 172/413 (41%), Gaps = 94/413 (22%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           S G Y+  L  GTPPQ     I DTGS++ + PC+S   C  C        + P F P  
Sbjct: 73  SNGYYTTRLFIGTPPQEFA-LIVDTGSTVTYVPCSS---CEQCG-----KHQDPRFQPDL 123

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
           SS+ + + C NP C+             C    K C     +Y  +Y  +  ++G++  +
Sbjct: 124 SSTYRPVKC-NPSCN-------------CDDEGKQC-----TYERRYAEMSSSSGVIAED 164

Query: 219 TLRFPSKTV---PNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLK-----KFS 265
            + F +++       + GC       L  ++  GI G GR   S+  QL  K      FS
Sbjct: 165 VVSFGNESELKPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFS 224

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
            C                +D G G+      S  P + ++  + NP  S      +Y + 
Sbjct: 225 LCYGG-------------MDVGGGAMVLGQISPPPNMVFS--HSNPYRSP-----YYNIE 264

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           L+++ V  K +K+         D   G ++DSG+T+ +     F A+    ++++ +   
Sbjct: 265 LKELHVAGKPLKLKPKVF----DEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRH--- 317

Query: 382 AADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMALPPENYF--ALVG 430
              +++  G  P     CF  +G++  +L    PE+ + F  G K++L PENY       
Sbjct: 318 ---LKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKV 374

Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +   CL +F +       G     +LG   ++N  + +D  ND+ GF K  C+
Sbjct: 375 SGAYCLGIFQN-------GNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCS 420


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 115/407 (28%), Positives = 164/407 (40%), Gaps = 70/407 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP   T  I DTGS ++W  C S   C  C   +    ++  F    SS
Sbjct: 77  GLYFTKVKLGTPPMEFTVQI-DTGSDILWVNCNS---CNGCPRSSGLGIQLNFFDASSSS 132

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           SS L+ C +P C+  F    ++    C  ++  C     SY  QYG G  T+G  +SE++
Sbjct: 133 SSSLVSCSDPICNSAF----QTTATQCLTQSNQC-----SYTFQYGDGSGTSGYYVSESM 183

Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
            F      + +A        GCS         SD    GI GFG    S+ SQL      
Sbjct: 184 YFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGIT 243

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL      +      LVL      G+   PG+ Y+P   +           Y +
Sbjct: 244 PKVFSHCLKG----EGNGGGILVL------GEVLEPGIVYSPLVPSQ--------PHYNL 285

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I V  + + I  S  V  +  N G I+DSG+T  +    L E     F+  +    
Sbjct: 286 YLQSISVNGQTLPIDPS--VFATSINRGTIIDSGTTLAY----LVEEAYTPFVSAITAAV 339

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
             +     S    C+ +S       P + L F G A M L PE Y   +G        F 
Sbjct: 340 SQSVTPTISKGNQCYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLG--------FY 391

Query: 441 DNAAGPALG----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           D AA   +G    +    ILGD  +++    +DLA  R G+A   C+
Sbjct: 392 DGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQRIGWASYDCS 438


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 103/398 (25%), Positives = 157/398 (39%), Gaps = 69/398 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +   +G P Q   P  FDT   +    C        C+         PAF P RSSS 
Sbjct: 88  YRVLAGYGAPAQ-RFPVAFDTNFGVSVLRCKPCVGGAPCD---------PAFEPSRSSSF 137

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             I C +P+C+          C G S     CP     + +Q+G +    G L+ +TL  
Sbjct: 138 AAIPCGSPECAV--------ECTGAS-----CP-----FTIQFGNVTVANGTLVRDTLTL 179

Query: 223 -PSKTVPNFLAGC-SILSDRQ----PAGIAGFGRSSESLPSQL-------GLKKFSYCLL 269
            PS T   F  GC  + +D        G+    RSS SL S++           FSYCL 
Sbjct: 180 PPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLP 239

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
           S     +  SS   L  G    +     + Y P   NP   +S     Y+V L  I VG 
Sbjct: 240 S----SSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS-----YFVDLVGISVGG 290

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           + + +P     P      G ++++ + FTF+    + A+   F + M  Y  A       
Sbjct: 291 EDLPVP-----PAVFAAHGTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRV-- 343

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN--YFA---LVGNEVLCLILFTDNAA 444
            L  C++++G  S+ +P + L+F GG ++ L      YFA    V + V CL        
Sbjct: 344 -LDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLP 402

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              +      ++G    ++  + +DL   R GF   +C
Sbjct: 403 AFPVS-----VIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|384482417|pdb|3VLA|A Chain A, Crystal Structure Of Edgp
          Length = 413

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 105/407 (25%), Positives = 167/407 (41%), Gaps = 83/407 (20%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P  S   + D G   +W  C   Y                 + P R  +SQ     +  C
Sbjct: 31  PLVSENLVVDLGGRFLWVDCDQNYVS-------------STYRPVRCRTSQCSLSGSIAC 77

Query: 174 SWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK------- 225
              F GP       GC+  N TC +   + ++    G   G +  + +   S        
Sbjct: 78  GDCFNGPR-----PGCN--NNTCGVFPENPVINTATG---GEVAEDVVSVESTDGSSSGR 127

Query: 226 --TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKF 273
             TVP F+  C+  S  Q       G+AG GR+  +LPSQ        +KF+ CL     
Sbjct: 128 VVTVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL----- 182

Query: 274 DDAPVSSNLVLDTGPGSGDSKT---------PGLSYTPFYKNPVGSS--SAFGE---FYY 319
                SSN V+  G    D  T           L+YTP   NPV +S  S  GE    Y+
Sbjct: 183 -SGSTSSNSVIIFG---NDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYF 238

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           +G++ I + SK V +  S L   S G GG  + + + +T +E  +++AV + FI++    
Sbjct: 239 IGVKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKE---- 294

Query: 380 SRAADVEKKSGLRP---CFDISGKKSVYL----PELILKFKGGAKM-ALPPENYFALVGN 431
           S A ++ + + + P   CF      S  L    P + L  +  + +  +   N    + +
Sbjct: 295 SAARNITRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND 354

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            V+CL +      G  L    +I++G  QL++  ++FDLA  R GF+
Sbjct: 355 NVVCLGVVD---GGSNLRT--SIVIGGHQLEDNLVQFDLATSRVGFS 396


>gi|285741|dbj|BAA03413.1| EDGP precursor [Daucus carota]
          Length = 433

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 82/285 (28%), Positives = 131/285 (45%), Gaps = 50/285 (17%)

Query: 226 TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDD 275
           TVP F+  C+  S  Q       G+AG GR+  +LPSQ        +KF+ CL       
Sbjct: 150 TVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL------S 203

Query: 276 APVSSNLVLDTGPGSGDSKT---------PGLSYTPFYKNPVGSS--SAFGE---FYYVG 321
              SSN V+  G    D  T           L+YTP   NPV +S  S  GE    Y++G
Sbjct: 204 GSTSSNSVIIFG---NDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYFIG 260

Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
           ++ I + SK V +  S L   S G GG  + + + +T +E  +++AV + FI++    S 
Sbjct: 261 VKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKE----SA 316

Query: 382 AADVEKKSGLRP---CFDISGKKSVYL----PELILKFKGGAKM-ALPPENYFALVGNEV 433
           A ++ + + + P   CF      S  L    P + L  +  + +  +   N    + + V
Sbjct: 317 ARNITRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYINDNV 376

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           +CL +      G  L    +I++G  QL++  ++FDLA  R GF+
Sbjct: 377 VCLGVVD---GGSNLRT--SIVIGGHQLEDNLVQFDLATSRVGFS 416


>gi|356535355|ref|XP_003536212.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 444

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 78/287 (27%), Positives = 134/287 (46%), Gaps = 47/287 (16%)

Query: 226 TVPNFL--AGCSILSDRQPAGI---AGFGRSSESLPSQLG-----LKKFSYCLLSRKFDD 275
           +VP FL   G +++ +   +G+   AG GR+  SLPSQ       L+KF+ CL S    +
Sbjct: 151 SVPKFLFICGANVVQNGLASGVTGMAGLGRTKVSLPSQFSSAFSFLRKFAICLSSSTMTN 210

Query: 276 APVSSNLVLDTGP------GSGDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQ 324
                 +    GP       S  SK   L++TP   NPV ++ ++  GE    Y++G++ 
Sbjct: 211 GV----MFFGDGPYNFGYLNSDLSKV--LTFTPLITNPVSTAPSYFQGEPSVEYFIGVKS 264

Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
           I V  K+V +  + L    +G GG  + + + +T +E  +++AV++ F++ +G    A  
Sbjct: 265 IRVSDKNVPLNTTLLSIDRNGIGGTKISTVNPYTVLETTIYKAVSEAFVKAVG----APT 320

Query: 385 VEKKSGLRPCFDISGKKSVYL----PELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
           V   +    CF     +S  +    P++ L  +     ++   N      N+V+CL  F 
Sbjct: 321 VAPVAPFGTCFATKDIQSTRMGPAVPDINLVLQNEVVWSIIGANSMVYT-NDVICL-GFV 378

Query: 441 DNAAGPALGRG----------PAIILGDFQLQNFYLEFDLANDRFGF 477
           D  + P+  +            +I +G  QL+N  L+FDLA  R GF
Sbjct: 379 DAGSDPSTAQVGFVVGYSQPITSITIGAHQLENNMLQFDLATSRLGF 425


>gi|384482418|pdb|3VLB|A Chain A, Crystal Structure Of Xeg-Edgp
 gi|384482420|pdb|3VLB|C Chain C, Crystal Structure Of Xeg-Edgp
          Length = 413

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 105/407 (25%), Positives = 167/407 (41%), Gaps = 83/407 (20%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P  S   + D G   +W  C   Y                 + P R  +SQ     +  C
Sbjct: 31  PLVSENLVVDLGGRFLWVDCDQNYVS-------------STYRPVRCRTSQCSLSGSIAC 77

Query: 174 SWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK------- 225
              F GP       GC+  N TC +   + ++    G   G +  + +   S        
Sbjct: 78  GDCFNGPR-----PGCN--NNTCGVFPENPVINTATG---GEVAEDVVSVESTDGSSSGR 127

Query: 226 --TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKF 273
             TVP F+  C+  S  Q       G+AG GR+  +LPSQ        +KF+ CL     
Sbjct: 128 VVTVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL----- 182

Query: 274 DDAPVSSNLVLDTGPGSGDSKT---------PGLSYTPFYKNPVGSS--SAFGE---FYY 319
                SSN V+  G    D  T           L+YTP   NPV +S  S  GE    Y+
Sbjct: 183 -SGSTSSNSVIIFG---NDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYF 238

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           +G++ I + SK V +  S L   S G GG  + + + +T +E  +++AV + FI++    
Sbjct: 239 IGVKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKE---- 294

Query: 380 SRAADVEKKSGLRP---CFDISGKKSVYL----PELILKFKGGAKM-ALPPENYFALVGN 431
           S A ++ + + + P   CF      S  L    P + L  +  + +  +   N    + +
Sbjct: 295 SAARNITRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND 354

Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            V+CL +      G  L    +I++G  QL++  ++FDLA  R GF+
Sbjct: 355 NVVCLGVVD---GGSNLRT--SIVIGGHQLEDNLVQFDLATSRVGFS 396


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 53/170 (31%), Positives = 80/170 (47%), Gaps = 18/170 (10%)

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           +YYVGL  I VG + + IP +     S GNGG+IVDSG+  T ++  ++  V   F++  
Sbjct: 10  YYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFVKGT 69

Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
            +     +V   S    C+D+S K SV +P +   F  G  + LP +NY   V       
Sbjct: 70  KDLLATNEV---SLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPV------- 119

Query: 437 ILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
               D+         P +    I+G+ Q Q   + FDLAN   GF+  +C
Sbjct: 120 ----DSVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|291002742|gb|ADD71503.1| xyloglucanase inhibitor 1 [Humulus lupulus]
          Length = 443

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 104/412 (25%), Positives = 171/412 (41%), Gaps = 73/412 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y   ++  TPP      + D G   +W  C   Y+                     SS+ 
Sbjct: 48  YITQITQRTPP-VQLKVVLDVGGEFLWIDCEKGYK---------------------SSTK 85

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ--YGLGFTAGLLLSETLR 221
           + + C +P+C       V S    C+  +    +     +    +    T+G L  + L 
Sbjct: 86  RPVPCGSPQC-------VLSGSGACTTSDNPSDVGVCGVMPNNPFSSVGTSGDLFEDILY 138

Query: 222 FPSK---------TVPNFLAGC---SILSDRQPA--GIAGFGRSSESLPSQLGLKKFSYC 267
             S          +VPN L  C   S+L        G+AGFGR+  +LPS L    FS+ 
Sbjct: 139 IQSTNGFNPGKQVSVPNLLFSCAPNSLLEGLASGIIGMAGFGRNKVALPS-LFSSAFSF- 196

Query: 268 LLSRKFDDAPVSSNLVLDTG-------PGSGDSKTPGLSYTPFYKNPVGSSSAF----GE 316
              RKF     SSN V+  G       PG   S    L+YTP  +NP    S+F      
Sbjct: 197 --PRKFGVCLSSSNGVIFFGKEPYVLLPGIDVSDPTSLTYTPLIQNPRSLVSSFEGNPSA 254

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
            Y++G++ I V  K +++  + L   ++G +GG  + +   FT +E  +++AV   F++ 
Sbjct: 255 EYFIGVKSIKVDGKPLRLNTTLLTFDNEGGHGGTKISTVDPFTTLETSIYKAVVGAFVKA 314

Query: 376 MGNYSRAADVEKKSGLRPCFDIS--GKKSV--YLPELILKFKGGAKMALPPENYFALVGN 431
           +G   +   V+  +    CF+    G   V   +P++ L  +     ++   N    VG+
Sbjct: 315 LG--PKVPRVKAVAPFGACFNAKYIGNTRVGPAVPQIDLVLRNDKLWSIFGANSMVSVGD 372

Query: 432 EVLCLILFTDNAAGPALGRG-----PAIILGDFQLQNFYLEFDLANDRFGFA 478
           +VLCL  F D      +  G      A+++G  Q++N +L FDL   R GF+
Sbjct: 373 DVLCL-GFVDGGPLNFVDWGVKFTPTAVVIGGHQIENNFLLFDLGASRLGFS 423


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 105/398 (26%), Positives = 159/398 (39%), Gaps = 69/398 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +   +G P Q   P  FDT   +    C        C+         PAF P RSSS 
Sbjct: 176 YRVLAGYGAPAQ-RFPVAFDTNFGVSVLRCKPCVGGAPCD---------PAFEPSRSSSF 225

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
             I C +P+C+      VE  C G S     CP     + +Q+G +    G L+ +TL  
Sbjct: 226 AAIPCGSPECA------VE--CTGAS-----CP-----FTIQFGNVTVANGTLVRDTLTL 267

Query: 223 -PSKTVPNFLAGC-SILSDRQ----PAGIAGFGRSSESLPSQL-------GLKKFSYCLL 269
            PS T   F  GC  + +D        G+    RSS SL S++           FSYCL 
Sbjct: 268 PPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLP 327

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
           S     +  SS   L  G    +     + Y P   NP   +S     Y+V L  I VG 
Sbjct: 328 S----SSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS-----YFVDLVGISVGG 378

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           + + +P     P      G ++++ + FTF+    + A+   F + M  Y  A       
Sbjct: 379 EDLPVP-----PAVFAAHGTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRV-- 431

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN--YFA---LVGNEVLCLILFTDNAA 444
            L  C++++G  S+ +P + L+F GG ++ L      YFA    V + V CL        
Sbjct: 432 -LDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLP 490

Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              +      ++G    ++  + +DL   R GF   +C
Sbjct: 491 AFPVS-----VIGTLAQRSTEVVYDLRGGRVGFIPGRC 523


>gi|223974335|gb|ACN31355.1| unknown [Zea mays]
          Length = 91

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 42/84 (50%), Positives = 53/84 (63%), Gaps = 6/84 (7%)

Query: 405 LPELILKFKGGAKMALPPENYFALVGN---EVLCLILFTDNAAGPALGR---GPAIILGD 458
           LPEL  +F+GGA M LP ENYF + G    E +CL + TD + G   G    GPAIILG 
Sbjct: 3   LPELSFRFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGS 62

Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
           FQ QN+ +E+DL  +R GF +Q C
Sbjct: 63  FQQQNYLVEYDLEKERLGFRRQSC 86


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 95/388 (24%), Positives = 145/388 (37%), Gaps = 65/388 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + +  G+P       + D+GS +VW  C    +C +   P  +P+   +FI     
Sbjct: 127 GEYFVRIGIGSPAIYQY-MVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIG---- 181

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
               + C +  C+ +   +V  R   C             Y + YG G +T G L  ET+
Sbjct: 182 ----VACSSNVCNQL-DDDVACRKGRCG------------YQVAYGDGSYTKGTLALETI 224

Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE---SLPSQLGLKK---FSYCLLSRKFD 274
                 + +   GC   ++    G AG         S   QLG +    F YCL+SR   
Sbjct: 225 TIGRTVIQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAM- 283

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
             PV                  G  + P   NP      +  FYYV L  + VG   V I
Sbjct: 284 --PV------------------GAMWVPLIHNPF-----YPSFYYVSLSGLAVGGIRVPI 318

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
                     G GGV++D+G+  T +    + A    FI Q  N  RA  V   S    C
Sbjct: 319 SEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGV---SIFDTC 375

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           +D++G  +V +P +   F GG  +  P  N+     +       F  + +G +       
Sbjct: 376 YDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLS------- 428

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           I+G+ Q +   +  D  N   GF    C
Sbjct: 429 IIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 119/448 (26%), Positives = 175/448 (39%), Gaps = 75/448 (16%)

Query: 49  DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
           D   L+ +H    SS    RH + ++  +T             + + LS+ S G Y   +
Sbjct: 4   DEARLRWIHHRIQSS--DHRHRRGRSLLQTAQ-----------VSSGLSLGS-GEYFARM 49

Query: 109 SFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
             G+P Q S     DTGS + W    PC+S Y  VD           P + P  SSS + 
Sbjct: 50  GIGSP-QRSYYLELDTGSDVTWIQCAPCSSCYSQVD-----------PIYDPSNSSSYRR 97

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF-- 222
           + C +  C  +      S C+G         + C SY + YG    ++G L  E+     
Sbjct: 98  VYCGSALCQAL----DYSACQG---------MGC-SYRVVYGDSSASSGDLGIESFYLGP 143

Query: 223 -PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
             S  + N   GC   +    R  AG+ G G  + S  SQ+       FSYCL+ R    
Sbjct: 144 NSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQL 203

Query: 276 APVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
              SS L+       G +  P    +TP  KNP         FYY  L  I VG   + I
Sbjct: 204 QSRSSPLIF------GRTAIPFAARFTPLLKNP-----RIDTFYYAILTGISVGGTALPI 252

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P +      +G GG I+DSG++ T +    +  +   +     N   A  V     L  C
Sbjct: 253 PPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAPGVYL---LDTC 309

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           F+  G  +V +P L+L F     M LP  N    V       + F  ++        P  
Sbjct: 310 FNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSM-------PIS 362

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ++G+ Q Q F + FDL       A ++C
Sbjct: 363 VIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 112/434 (25%), Positives = 164/434 (37%), Gaps = 86/434 (19%)

Query: 95  PLSVHSY---GGYSISLSFGTPPQASTPFIF--DTGSSLVWFPC-----TSRYRCVDCNF 144
           PL+  +Y   G Y +    GTP Q   PF+   DTGS L W  C      +       + 
Sbjct: 83  PLTSAAYTGIGQYFVRFRVGTPAQ---PFLLVADTGSDLTWVKCRPAKAAAASTNSSSSA 139

Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
               P R  AF P++S +   I C +  CS           K       TCP   P    
Sbjct: 140 SASSPRR--AFRPEKSKTWAPIPCASDTCS-----------KSLPFSLSTCP--TPGSPC 184

Query: 205 QYGLGFTAGLLLSETLRFPSKTVP------------------NFLAGCSIL----SDRQP 242
            Y   +  G     T+   S T+                     + GC+      S    
Sbjct: 185 AYDYRYKDGSAARGTVGTESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEAS 244

Query: 243 AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS------ 293
            G+   G S+ S  S    +   +FSYCL+      +P ++   L  GP S  S      
Sbjct: 245 DGVLSLGYSNVSFASHAASRFGGRFSYCLVDHL---SPRNATSYLTFGPNSALSGPCPAA 301

Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
             PG   TP   +     S    FY V ++ I V  + +KIP    V   DG GGVIVDS
Sbjct: 302 AGPGARQTPLVLD-----SRMRPFYDVSIKAISVDGELLKIPRD--VWEVDGGGGVIVDS 354

Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG----KKSVYLPELI 409
           G++ T +  P + AV     +++  + R A          C++ +      +   LP+L 
Sbjct: 355 GTSLTVLAKPAYRAVVAALGKKLARFPRVA----MDPFEYCYNWTSPSRKDEGDDLPKLA 410

Query: 410 LKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEF 468
           + F G A++  P ++Y       V C+        G   G  P I ++G+   Q    EF
Sbjct: 411 VHFAGSARLEPPSKSYVIDAAPGVKCI--------GVQEGPWPGISVIGNILQQEHLWEF 462

Query: 469 DLANDRFGFAKQKC 482
           DL N R  F + +C
Sbjct: 463 DLKNRRLRFKRSRC 476


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 149/379 (39%), Gaps = 71/379 (18%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DT S + W       +C  C  P   P +   + P +SSSS +  C +P C+ + GP
Sbjct: 171 MVLDTASDVTWV------QCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GP 223

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGC--- 234
                  GC+  N+        Y ++Y  G  TAG  +S+ L   P+  V +F  GC   
Sbjct: 224 YA----NGCTNNNQC------QYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHG 273

Query: 235 ---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCL---LSRKFDDAPVSSNLVLD 285
              S       AGI   G   ESL SQ      + FS+C      R F          L 
Sbjct: 274 VQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF--------FTL- 324

Query: 286 TGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
                G  +     Y  TP  KNP    +    FY V L  I V  + + +P +      
Sbjct: 325 -----GVPRVAAWRYVLTPMLKNPAIPPT----FYMVRLEAIAVAGQRIAVPPTVFA--- 372

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
               G  +DS +  T +    ++A+ + F  +M  Y  A     K  L  C+D++G +S 
Sbjct: 373 ---AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPA---PPKGPLDTCYDMAGVRSF 426

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            LP + L F   A + L P       G      + FT   AGP   + P II G+ QLQ 
Sbjct: 427 ALPRITLVFDKNAAVELDPS------GVLFQGCLAFT---AGPN-DQVPGII-GNIQLQT 475

Query: 464 FYLEFDLANDRFGFAKQKC 482
             + +++     GF    C
Sbjct: 476 LEVLYNIPAALVGFRHAAC 494


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 149/379 (39%), Gaps = 71/379 (18%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DT S + W       +C  C  P   P +   + P +SSSS +  C +P C+ + GP
Sbjct: 146 MVLDTASDVTWV------QCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GP 198

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGC--- 234
                  GC+  N  C      Y ++Y  G  TAG  +S+ L   P+  V +F  GC   
Sbjct: 199 YA----NGCT-NNNQC-----QYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHG 248

Query: 235 ---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCL---LSRKFDDAPVSSNLVLD 285
              S       AGI   G   ESL SQ      + FS+C      R F          L 
Sbjct: 249 VQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF--------FTL- 299

Query: 286 TGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
                G  +     Y  TP  KNP    +    FY V L  I V  + + +P +      
Sbjct: 300 -----GVPRVAAWRYVLTPMLKNPAIPPT----FYMVRLEAIAVAGQRIAVPPTVFA--- 347

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
               G  +DS +  T +    ++A+ + F  +M  Y  A     K  L  C+D++G +S 
Sbjct: 348 ---AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPA---PPKGPLDTCYDMAGVRSF 401

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            LP + L F   A + L P       G      + FT   AGP   + P II G+ QLQ 
Sbjct: 402 ALPRITLVFDKNAAVELDPS------GVLFQGCLAFT---AGPN-DQVPGII-GNIQLQT 450

Query: 464 FYLEFDLANDRFGFAKQKC 482
             + +++     GF    C
Sbjct: 451 LEVLYNIPAALVGFRHAAC 469


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 104/404 (25%), Positives = 166/404 (41%), Gaps = 80/404 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y+  L  GTPPQ     I D+GS++ + PC S  +C +         + P F P  SS
Sbjct: 86  GYYTTRLHIGTPPQEFA-LIVDSGSTVTYVPCASCEQCGN--------HQDPRFQPDLSS 136

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   + C           NV+  C   S +N+       +Y  QY  +  ++G+L  + +
Sbjct: 137 TYSPVKC-----------NVDCTCD--SDKNQC------TYERQYAEMSSSSGVLGEDIV 177

Query: 221 RFPSKTV---PNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYC 267
            F +++       + GC       L  +   GI G GR   S+  QL  K      FS C
Sbjct: 178 SFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC 237

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
                         +VL   P       PG+ YT  + N V S      +Y + L+++ V
Sbjct: 238 YGGMDIG----GGAMVLGAMPAP-----PGMIYT--HSNAVRSP-----YYNIELKEMHV 281

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS--RAADV 385
             K +++         DG  G ++DSG+T+ ++    F A       Q+      R  D 
Sbjct: 282 AGKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDS 337

Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENY-FALVGNE-VLCLILF 439
             K     CF  +G+     S   P++ + F  G K++L PENY F     E   CL +F
Sbjct: 338 NYKD---ICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 394

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +       G+ P  +LG   ++N  + +D  N++ GF K  C+
Sbjct: 395 QN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 150/377 (39%), Gaps = 65/377 (17%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DT S + W       +C  C            + P +S SS+   C +P C  + GP
Sbjct: 184 MLLDTASDVAWV------QCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL-GP 236

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCS-- 235
                  GCS  + +       Y ++Y  G  T+G L+++ L   P+  VP F  GCS  
Sbjct: 237 YA----NGCSSSSNSAGQC--QYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHA 290

Query: 236 ---ILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSN---LVLDT 286
                S  + AGI   GR  +SL SQ   K    FSYC         P +S+    VL  
Sbjct: 291 ARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCF-------PPTASHKGFFVLGV 343

Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
            P    S+    + TP  K P+         Y V L  I V  + + +P +         
Sbjct: 344 -PRRSSSR---YAVTPMLKTPM--------LYQVRLEAIAVAGQRLDVPPTVFA------ 385

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
            G  +DS +  T +    ++A+   F  +M  Y  AA       L  C+D +G  S+ LP
Sbjct: 386 AGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAA---ANGQLDTCYDFTGVSSIMLP 442

Query: 407 ELILKF-KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
            + L F + GA + L P     L G+   CL   +      A G     I+G  QLQ   
Sbjct: 443 TISLVFDRTGAGVQLDPSG--VLFGS---CLAFASTAGDDRATG-----IIGFLQLQTIE 492

Query: 466 LEFDLANDRFGFAKQKC 482
           + +++A    GF +  C
Sbjct: 493 VLYNVAGGSVGFRRGAC 509


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 111/417 (26%), Positives = 172/417 (41%), Gaps = 82/417 (19%)

Query: 93  KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDP 149
           +TP+SVH Y  Y + LS GTPP  +   + DTGS L+W    PCT+ Y+ ++        
Sbjct: 49  QTPVSVHHYD-YLMELSIGTPPVKTYAQV-DTGSDLIWLQCIPCTNCYKQLN-------- 98

Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GL 208
              P F P+ SS+   I   +  CS ++          CSP    C     +Y   Y   
Sbjct: 99  ---PMFDPQSSSTYSNIAYGSESCSKLYS-------TSCSPDQNNC-----NYTYSYEDD 143

Query: 209 GFTAGLLLSETLRFPSKT-----VPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQ 258
             T G+L  ETL   S T     +   + GC      + +D++  GI G GR   SL SQ
Sbjct: 144 SITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKE-MGIIGLGRGPLSLVSQ 202

Query: 259 LGL----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           +G     K FS CL+    + +  S    +  G GS +    G+  TP       S +  
Sbjct: 203 IGSSFGGKMFSQCLVPFHTNPSITSP---MSFGKGS-EVLGNGVVSTPLV-----SKNTH 253

Query: 315 GEFYYVGLRQIIVGSKHVKIPY---SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
             FY+V L  I V  + + +P+   S L P + GN  +++DSG+  T +    +  + +E
Sbjct: 254 QAFYFVTLLGISV--EDINLPFNDGSSLEPITKGN--MVIDSGTPTTLLPEDFYHRLVEE 309

Query: 372 FIRQMGNYSRAAD---VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
              ++     A D   ++   G + C+      ++    L   F+ GA + L P   F  
Sbjct: 310 VRNKV-----ALDPIPIDPTLGYQLCYRT--PTNLKGTTLTAHFE-GADVLLTPTQIFIP 361

Query: 429 VGNEVLCLILFT--DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           V + + C    +   N  G         I G+    N+ + FDL      F    C 
Sbjct: 362 VQDGIFCFAFTSTFSNEYG---------IYGNHAQSNYLIGFDLEKQLVSFKATDCT 409


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP + S   + DTGSSL W  C+     V C+  +      P F PK SS
Sbjct: 125 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPCV--VSCHRQSG-----PVFNPKASS 176

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S   + C   +CS +    +      CS  N         Y   YG   F+ G L  +T+
Sbjct: 177 SYASVSCSAQQCSDLTTATLNP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 228

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F S +VPNF  GC   ++    Q AG+ G  R+  SL  QL       FSYCL +    
Sbjct: 229 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 288

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
            +   S    + G           SYT     P+ SSS     Y++ +  I V  K +  
Sbjct: 289 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 334

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
                  +P        I+DSG+  T +   ++ A++K     M    RA+     S L 
Sbjct: 335 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 384

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            CF     + + +PE+ + F GGA + L   N    V +   CL      A  PA     
Sbjct: 385 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 434

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A I+G+ Q Q F + +D+ N + GFA   C+
Sbjct: 435 AAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 153/378 (40%), Gaps = 71/378 (18%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DTGS + W  C    +C           +   F P  S++ + + C +  C      
Sbjct: 3   LLIDTGSDITWIQCDPCPQCYK--------QQDSLFQPAGSATYKPLPCNSTMCQ----- 49

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK-----TVPNFLAG 233
            ++S    C   N +C     +Y++ YG    T G    ETL   S      +VPNF  G
Sbjct: 50  QLQSFSHSC--LNSSC-----NYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFG 102

Query: 234 CSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNL---VL 284
           C   +       AG+ G G+SS   P+Q  +   K FSYCL S       VSS +   +L
Sbjct: 103 CGHANKGLFNGAAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPS-------VSSTIPSGIL 155

Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
             G      +   L Y   +   V SSS   + Y+V +  I VG + + I          
Sbjct: 156 HFG------EAAMLDYDVRFTPLVDSSSGPSQ-YFVSMTGINVGDELLPI---------- 198

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
            +  V+VDSG+  +  E   +E +   F + +     A  V        CF +S    + 
Sbjct: 199 -SATVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAP---FDTCFRVSTVDDIN 254

Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
           +P + L F+  A++ L P +    V + V+C        A  + GR    +LG+FQ QN 
Sbjct: 255 IPLITLHFRDDAELRLSPVHILYPVDDGVMCFAF-----APSSSGRS---VLGNFQQQNL 306

Query: 465 YLEFDLANDRFGFAKQKC 482
              +D+   R G +  +C
Sbjct: 307 RFVYDIPKSRLGISAFEC 324


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 149/382 (39%), Gaps = 65/382 (17%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           +  + ++AS    R ++L T    KT    I              V     Y + +  GT
Sbjct: 3   VNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGT 53

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           P Q     + DT +   W PC+    C  C+           F+P  S++   + C   +
Sbjct: 54  PGQQMF-MVLDTSNDAAWVPCSG---CTGCSSTT--------FLPNASTTLGSLDCSEAQ 101

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLL---QYGLGFT-AGLLLSETLRFPSKTVP 228
           CS +              R  +CP    S  L    YG   + A  L+ + +   +  +P
Sbjct: 102 CSQV--------------RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIP 147

Query: 229 NFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL 282
            F  GC + +S     P G+ G GR   SL SQ G      FSYCL S  F     S +L
Sbjct: 148 GFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSL 205

Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
            L  GP  G  K+  +  TP  +NP   S      YYV L  + VG   V IP   LV  
Sbjct: 206 KL--GP-VGQPKS--IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFD 255

Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
            +   G I+DSG+  T    P++ A+  EF +Q+        +        CF  + +  
Sbjct: 256 PNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAATNEAE 310

Query: 403 VYLPELILKFKGGAKMALPPEN 424
              P + L F+ G  + LP EN
Sbjct: 311 A--PAVTLHFE-GLNLVLPMEN 329


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 105/388 (27%), Positives = 151/388 (38%), Gaps = 58/388 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF-PNVDPSRIPAFIPKRSSS 162
           Y ++ S GTP  A T    DTGS L W       +C  C+  P+    + P F P +SSS
Sbjct: 140 YVVTASLGTPGVAQT-MEVDTGSDLSWV------QCKPCSAAPSCYSQKDPLFDPAQSSS 192

Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR 221
              + C  P C+   G  + +           C  A   Y++ YG G  T G+  S+TL 
Sbjct: 193 YAAVPCGGPVCA---GLGIYA--------ASACSAAQCGYVVSYGDGSNTTGVYSSDTLT 241

Query: 222 F-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
              S  V  F  GC            G+ G GR   SL  Q        FSYCL ++   
Sbjct: 242 LSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKP-- 299

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
               S+   L  G G      PG S T    +P   +     +Y V L  I VG + + +
Sbjct: 300 ----STAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPT-----YYVVMLTGISVGGQQLSV 350

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P S         GG +VD+G+  T +    + A+   F   M +Y           L  C
Sbjct: 351 PASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGY-PTAPSNGILDTC 403

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
           ++ +G  +V LP + L F  GA + L  +            ++ F   A  P+   G   
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVMLGADG-----------ILSFGCLAFAPSGSDGGMA 452

Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           ILG+ Q ++F +  D      GF    C
Sbjct: 453 ILGNVQQRSFEVRID--GTSVGFKPSSC 478


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 147/371 (39%), Gaps = 60/371 (16%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           + D+ S + W       +CV C  P   P     + P RS SS    C +P C+ + GP 
Sbjct: 162 VLDSASDVPWV------QCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTAL-GPY 214

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILS 238
                 GC+  N  C      YL++Y  G  T+G  +++ L   +   V  F  GCS   
Sbjct: 215 A----NGCA--NNQC-----QYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAE 263

Query: 239 ----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
               D + AGI   G   ESL SQ   +    FSYC+ +   D              G  
Sbjct: 264 QGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDS-------------GFF 310

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
               P  + + +   P+        FY V LR I VG + + +  +    GS      ++
Sbjct: 311 TLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGS------VL 364

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DS +  T +    ++A+   F   M  Y  A     K  L  C+D +G  ++ LP++ L 
Sbjct: 365 DSRTAITRLPPTAYQALRSAFRSSMTMYRSA---PPKGYLDTCYDFTGVVNIRLPKISLV 421

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F   A + L P     ++ N+ L    FT NA     G     +LG  Q Q   + +D+ 
Sbjct: 422 FDRNAVLPLDPS---GILFNDCLA---FTSNADDRMPG-----VLGSVQQQTIEVLYDVG 470

Query: 472 NDRFGFAKQKC 482
               GF +  C
Sbjct: 471 GGAVGFRQGAC 481


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 128/486 (26%), Positives = 200/486 (41%), Gaps = 83/486 (17%)

Query: 25  AGSSAATVTVPLT-----PLSTKHYLHHSDSDPLKILHS-LASSSLSRARHLKTKTKPKT 78
           A S +   T+P T     P S    L H DS P    +S +  S L R   +++ ++   
Sbjct: 9   AASCSLLATLPFTEPSKTPSSFTIDLIHHDSPPSPFYNSSMTRSQLIRNAAMRSISR-AN 67

Query: 79  KDSNIGSNYSNSLIKT---PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS 135
           + S   S+  N L ++   P+ + + G Y + +  GTP       I DTGS L W  C+ 
Sbjct: 68  QLSLSLSHSLNQLKESSPEPIIIPNNGNYLMRIYIGTP-SVERLAIADTGSDLTWVQCSP 126

Query: 136 RYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI-FGPNVESRCKGCSPRNKT 194
                 C+         P + P  SS+  L+ C +  C+ + +   V S    C      
Sbjct: 127 ------CDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGDCI----- 175

Query: 195 CPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTV---PNFLAGC----SILSDR--QPAG 244
                  Y   YG   ++ G L S+++R     +        GC       +D+  +  G
Sbjct: 176 -------YAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFGCGFQNKFTADKSGKTTG 228

Query: 245 IAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSY 300
           I G G    SL SQLG +   KFSYCLL       P SSN       G     +  G+  
Sbjct: 229 IVGLGAGPLSLVSQLGDEIGHKFSYCLL-------PFSSNSNSKLKFGEAAIVQGNGVVS 281

Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFM 360
           TP    P         FYY+ L  I VG+K VK         +DGN  +I+DSGST T++
Sbjct: 282 TPLIIKPDL------PFYYLNLEGITVGAKTVK------TGQTDGN--IIIDSGSTLTYL 327

Query: 361 EGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSVYL-PELILKFKGGAK 417
           E    E+   EF+  +        VE+   +   FD   + K+ +   P+++  F GG  
Sbjct: 328 E----ESFYNEFVSLV---KETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGG-D 379

Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           + L P N   L+ + ++C  +   +  G A       I G+    +F++ +D+   +  F
Sbjct: 380 VVLKPMNTLVLIEDNLICSTVVPSHFDGIA-------IFGNLGQIDFHVGYDIQGGKVSF 432

Query: 478 AKQKCA 483
           A   C+
Sbjct: 433 APTDCS 438


>gi|21537233|gb|AAM61574.1| EDGP precursor [Arabidopsis thaliana]
          Length = 433

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 76/260 (29%), Positives = 121/260 (46%), Gaps = 43/260 (16%)

Query: 243 AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDS 293
            G+AG GR +  LPSQ        +KF+ CL S K    F + P    + L   PG    
Sbjct: 174 VGMAGMGRHNIGLPSQFAAAFSFHRKFAVCLTSGKGVAFFGNGPY---VFL---PGI--- 224

Query: 294 KTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNG 347
           +   L  TP   NPV ++SAF +      Y++G+  I +  K V I P    +  S G G
Sbjct: 225 QISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGFG 284

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP---CFDISGKKSVY 404
           G  + S + +T +E  ++ A   EF++Q    + A  +++ + ++P   CF         
Sbjct: 285 GTKISSVNPYTVLESSIYNAFTSEFVKQ----ALARSIKRVASVKPFGACFSTKNVGVTR 340

Query: 405 LP------ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
           L       EL+L  K      +   N    V ++V+CL  F D           ++++G 
Sbjct: 341 LGYAVPEIELVLHSKD-VVWRIFGANSMVSVSDDVICL-GFVDGGVNAR----TSVVIGG 394

Query: 459 FQLQNFYLEFDLANDRFGFA 478
           FQL++  +EFDLA++RFGF+
Sbjct: 395 FQLEDNLIEFDLASNRFGFS 414


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 116/462 (25%), Positives = 193/462 (41%), Gaps = 104/462 (22%)

Query: 52  PLKILHSLASSSLSRAR-HLKTKTKPKTKDSNIGSNYSNSLIKTPL--SVHSYGGYSISL 108
           PL +    +S +LS +R HL+              ++S +  + PL   +  YG Y+  +
Sbjct: 48  PLTLSAPNSSRTLSHSRRHLQRS-----------ESHSTATARMPLYDDLIPYGYYTTRI 96

Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
             GTPPQ +   I DTGS+L + PC++  +C     PN        F P  SS+ Q + C
Sbjct: 97  WIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPN--------FQPDWSSTYQPLKC 147

Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF----- 222
            + +C+             C      C      Y  QY  +  ++G+L  + + F     
Sbjct: 148 -SMECT-------------CDSEMMHC-----VYDRQYAEMSSSSGVLGEDIVSFGKQSE 188

Query: 223 --PSKTV---PNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRK 272
             P +TV    N   G  I S R   GI G GR   S+  QL  K      FS C     
Sbjct: 189 LKPQRTVFGCENVETG-DIYSQRAD-GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGG-- 244

Query: 273 FDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
                      +D G G+    G S   G+ +T  + +P  S+     +Y + L++I + 
Sbjct: 245 -----------MDVGGGAMVLGGISPPAGMVFT--HSDPARSA-----YYNIDLKEIHIA 286

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
            K  ++P + +V   DG  G I+DSG+T+ ++  P F+A     ++++ +       ++ 
Sbjct: 287 GK--QLPINPMV--FDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRN 342

Query: 389 SGLRPCF-----DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTD 441
                CF     D+S + S   P + L F  G +++L PENY       +   CL +F +
Sbjct: 343 YN-DICFSGVGSDVS-QLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQN 400

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                        +LG   ++N  + +D  + + GF K  C+
Sbjct: 401 E-------NDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 104/404 (25%), Positives = 166/404 (41%), Gaps = 80/404 (19%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y+  L  GTPPQ     I D+GS++ + PC S  +C +         + P F P  SS
Sbjct: 86  GYYTTRLHIGTPPQEFA-LIVDSGSTVTYVPCASCEQCGN--------HQDPRFQPDLSS 136

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           +   + C           NV+  C   S +N+       +Y  QY  +  ++G+L  + +
Sbjct: 137 TYSPVKC-----------NVDCTCD--SDKNQC------TYERQYAEMSSSSGVLGEDIV 177

Query: 221 RFPSKTV---PNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYC 267
            F +++       + GC       L  +   GI G GR   S+  QL  K      FS C
Sbjct: 178 SFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC 237

Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
                         +VL   P       PG+ YT  + N V S      +Y + L+++ V
Sbjct: 238 YGGMDIG----GGAMVLGAMPAP-----PGMIYT--HSNAVRSP-----YYNIELKEMHV 281

Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS--RAADV 385
             K +++         DG  G ++DSG+T+ ++    F A       Q+      R  D 
Sbjct: 282 AGKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDP 337

Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENY-FALVGNE-VLCLILF 439
             K     CF  +G+     S   P++ + F  G K++L PENY F     E   CL +F
Sbjct: 338 NYKD---ICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 394

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +       G+ P  +LG   ++N  + +D  N++ GF K  C+
Sbjct: 395 QN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 100/407 (24%), Positives = 155/407 (38%), Gaps = 68/407 (16%)

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIP 157
            S G Y   +  GTP +       DTGS ++W  C    RC    +   + P  + A   
Sbjct: 80  ESIGLYFAKIGLGTPSR-DFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDA--- 135

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLL 216
             SS+++ + C +  CS++   N  S C   S    TC      Y++ YG G  T G L+
Sbjct: 136 --SSTAKSVSCSDNFCSYV---NQRSECHSGS----TCQ-----YVIMYGDGSSTNGYLV 181

Query: 217 SETL--------RFPSKTVPNFLAGCS-----ILSDRQPA--GIAGFGRSSESLPSQLG- 260
            + +        R    T    + GC       L + Q A  GI GFG+S+ S  SQL  
Sbjct: 182 KDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLAS 241

Query: 261 ----LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
                + F++CL                D   G G      +        P+ S SA   
Sbjct: 242 QGKVKRSFAHCL----------------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAH-- 283

Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
            Y V L  I VG+  +++  +    G D   GVI+DSG+T  ++   ++  +  E +   
Sbjct: 284 -YSVNLNAIEVGNSVLELSSNAFDSGDDK--GVIIDSGTTLVYLPDAVYNPLLNEILASH 340

Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
              +     E  +    CF  + K   + P +  +F     +A+ P  Y   V  +  C 
Sbjct: 341 PELTLHTVQESFT----CFHYTDKLDRF-PTVTFQFDKSVSLAVYPREYLFQVREDTWCF 395

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                N      G     ILGD  L N  + +D+ N   G+    C+
Sbjct: 396 GW--QNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 116/462 (25%), Positives = 193/462 (41%), Gaps = 104/462 (22%)

Query: 52  PLKILHSLASSSLSRAR-HLKTKTKPKTKDSNIGSNYSNSLIKTPL--SVHSYGGYSISL 108
           PL +    +S +LS +R HL+              ++S +  + PL   +  YG Y+  +
Sbjct: 48  PLTLSAPNSSRTLSHSRRHLQRS-----------ESHSTATARMPLYDDLIPYGYYTTRI 96

Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
             GTPPQ +   I DTGS+L + PC++  +C     PN        F P  SS+ Q + C
Sbjct: 97  WIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPN--------FQPDWSSTYQPLKC 147

Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF----- 222
            + +C+             C      C      Y  QY  +  ++G+L  + + F     
Sbjct: 148 -SMECT-------------CDSEMMHC-----VYDRQYAEMSSSSGVLGEDIVSFGKQSE 188

Query: 223 --PSKTV---PNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRK 272
             P +TV    N   G  I S R   GI G GR   S+  QL  K      FS C     
Sbjct: 189 LKPQRTVFGCENVETG-DIYSQRAD-GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGG-- 244

Query: 273 FDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
                      +D G G+    G S   G+ +T  + +P  S+     +Y + L++I + 
Sbjct: 245 -----------MDVGGGAMVLGGISPPAGMVFT--HSDPARSA-----YYNIDLKEIHIA 286

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
            K  ++P + +V   DG  G I+DSG+T+ ++  P F+A     ++++ +       ++ 
Sbjct: 287 GK--QLPINPMV--FDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRN 342

Query: 389 SGLRPCF-----DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTD 441
                CF     D+S + S   P + L F  G +++L PENY       +   CL +F +
Sbjct: 343 YN-DICFSGVGSDVS-QLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQN 400

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                        +LG   ++N  + +D  + + GF K  C+
Sbjct: 401 E-------NDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435


>gi|295646769|gb|ADG23123.1| xyloglucan specific endoglucanase inhibitor [Solanum melongena]
          Length = 437

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 106/412 (25%), Positives = 162/412 (39%), Gaps = 79/412 (19%)

Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           +L + T  Q  TP +      D G   +W         VDC+   V  S  PA    RS+
Sbjct: 44  TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPARC--RSA 92

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
              L G     C   F P       GC+  N TC L   + +     G   G L S+ + 
Sbjct: 93  QCSLAGAS--ACGECFSPPRP----GCN--NNTCSLFPDNTVTGTATG---GELASDIVS 141

Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
             S    N           F+ G + L     +G+   AG GR+  SLPSQ        +
Sbjct: 142 VQSSNGKNPGRNVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 201

Query: 263 KFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
           KF+ CL S        F D P      L     S +       YTP + NPV +++AF  
Sbjct: 202 KFALCLTSSNSKGVVLFGDGPY---FFLPNKEFSNND----FQYTPLFINPVSTAAAFSS 254

Query: 317 -----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                 Y++G++ I +  K V I  + L   + G GG  + + + +T ME  L+ A+   
Sbjct: 255 GQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKLSTVNPYTVMETSLYNAITNF 314

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENY 425
           F++++ N +R A V        CFD     S  +       +L+L+ +      +   N 
Sbjct: 315 FVKELANVTRVAPVTP---FGACFDSRNIGSTRVGPAVPWIDLVLQNQ-NVVWTIFGANS 370

Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
              V   VLCL +         +    +I++G   +++  L+FD A  R GF
Sbjct: 371 MVQVSENVLCLGIVDG-----GVNARTSIVIGGHTIEDNLLQFDHAASRLGF 417


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 149/382 (39%), Gaps = 65/382 (17%)

Query: 53  LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
           +  + ++AS    R ++L T    KT    I              V     Y + +  GT
Sbjct: 3   VNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGT 53

Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
           P Q     + DT +   W PC+    C  C+           F+P  S++   + C   +
Sbjct: 54  PGQQMF-MVLDTSNDAAWVPCSG---CTGCSSTT--------FLPNASTTLGSLDCSEAQ 101

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLL---QYGLGFT-AGLLLSETLRFPSKTVP 228
           CS +              R  +CP    S  L    YG   + A  L+ + +   +  +P
Sbjct: 102 CSQV--------------RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIP 147

Query: 229 NFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL 282
            F  GC + +S     P G+ G GR   SL SQ G      FSYCL S  F     S +L
Sbjct: 148 GFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSL 205

Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
            L  GP  G  K+  +  TP  +NP   S      YYV L  + VG   V IP   LV  
Sbjct: 206 KL--GP-VGQPKS--IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFD 255

Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
            +   G I+DSG+  T    P++ A+  EF +Q+        +        CF  + +  
Sbjct: 256 PNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAETNEAE 310

Query: 403 VYLPELILKFKGGAKMALPPEN 424
              P + L F+ G  + LP EN
Sbjct: 311 A--PAVTLHFE-GLNLVLPMEN 329


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP + S   + DTGSSL W  C+     V C+  +      P F PK SS
Sbjct: 127 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPC--VVSCHRQSG-----PVFNPKASS 178

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S   + C   +CS +    +      CS  N         Y   YG   F+ G L  +T+
Sbjct: 179 SYTSVSCSAQQCSDLTTATLNP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 230

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F S +VPNF  GC   ++    Q AG+ G  R+  SL  QL       FSYCL +    
Sbjct: 231 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 290

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
            +   S    + G           SYT     P+ SSS     Y++ +  I V  K +  
Sbjct: 291 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 336

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
                  +P        I+DSG+  T +   ++ A++K     M    RA+     S L 
Sbjct: 337 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 386

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            CF     + + +PE+ + F GGA + L   N    V +   CL      A  PA     
Sbjct: 387 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 436

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A I+G+ Q Q F + +D+ N + GFA   C+
Sbjct: 437 AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 148/371 (39%), Gaps = 60/371 (16%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           + D+ S + W       +CV C  P   P     + P RS +S    C +P C+ + GP 
Sbjct: 32  VLDSASDVPWV------QCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTAL-GPY 84

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILS 238
                 GC+  N  C      YL++Y  G  T+G  +++ L   +   V  F  GCS   
Sbjct: 85  A----NGCA--NNQC-----QYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAE 133

Query: 239 ----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
               D + AGI   G   ESL SQ   +    FSYC+ +   D    +  +         
Sbjct: 134 QGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGV--------- 184

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
               P  + + +   P+        FY V LR I VG + + +  +    GS      ++
Sbjct: 185 ----PRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGS------VL 234

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DS +  T +    ++A+   F   M  Y  A     K  L  C+D +G  ++ LP++ L 
Sbjct: 235 DSRTAITRLPPTAYQALRAAFRSSMTMYRSA---PPKGYLDTCYDFTGVVNIRLPKISLV 291

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F   A + L P     ++ N+ L    FT NA     G     +LG  Q Q   + +D+ 
Sbjct: 292 FDRNAVLPLDPSG---ILFNDCLA---FTSNADDRMPG-----VLGSVQQQTIEVLYDVG 340

Query: 472 NDRFGFAKQKC 482
               GF +  C
Sbjct: 341 GGAVGFRQGAC 351


>gi|15218740|ref|NP_171821.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|13272443|gb|AAK17160.1|AF325092_1 unknown protein [Arabidopsis thaliana]
 gi|3850579|gb|AAC72119.1| Strong similarity to gb|D14550 extracellular dermal glycoprotein
           (EDGP) precursor from Daucus carota. ESTs gb|H37281,
           gb|T44167, gb|T21813, gb|N38437, gb|Z26470, gb|R65072,
           gb|N76373, gb|F15470, gb|Z35182, gb|H76373, gb|Z34678
           and gb|Z35387 come from this gene [Arabidopsis thaliana]
 gi|14334706|gb|AAK59531.1| unknown protein [Arabidopsis thaliana]
 gi|16323420|gb|AAL15204.1| unknown protein [Arabidopsis thaliana]
 gi|332189425|gb|AEE27546.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 433

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/260 (28%), Positives = 122/260 (46%), Gaps = 43/260 (16%)

Query: 243 AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDS 293
            G+AG GR +  LPSQ        +KF+ CL S K    F + P    + L   PG    
Sbjct: 174 VGMAGMGRHNIGLPSQFAAAFSFHRKFAVCLTSGKGVAFFGNGPY---VFL---PGI--- 224

Query: 294 KTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNG 347
           +   L  TP   NPV ++SAF +      Y++G+  I +  K V I P    +  S G G
Sbjct: 225 QISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIG 284

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP---CFDISGKKSVY 404
           G  + S + +T +E  ++ A   EF++Q    + A  +++ + ++P   CF         
Sbjct: 285 GTKISSVNPYTVLESSIYNAFTSEFVKQ----AAARSIKRVASVKPFGACFSTKNVGVTR 340

Query: 405 LP------ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
           L       EL+L  K      +   N    V ++V+CL  F D      +    ++++G 
Sbjct: 341 LGYAVPEIELVLHSKD-VVWRIFGANSMVSVSDDVICL-GFVDGG----VNARTSVVIGG 394

Query: 459 FQLQNFYLEFDLANDRFGFA 478
           FQL++  +EFDLA+++FGF+
Sbjct: 395 FQLEDNLIEFDLASNKFGFS 414


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 102/386 (26%), Positives = 146/386 (37%), Gaps = 57/386 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTP Q +     DT S + W PC     C+ C+           F    S++ 
Sbjct: 36  YIVRAKIGTPAQ-TMLMAMDTSSDVAWIPCNG---CLGCSST--------LFNSPASTTY 83

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + +GCQ  +C  +  P              TC     S+ L YG    A  L  +T+   
Sbjct: 84  KSLGCQAAQCKQVPKP--------------TCGGGVCSFNLTYGGSSLAANLSQDTITLA 129

Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
           +  VP +  GC        L  +   G+     S  S    L    FSYCL S  F    
Sbjct: 130 TDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 187

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP     +   + YTP  KNP   S      Y+V L  + VG + V +P  
Sbjct: 188 FSGSLRL--GPVGQPKR---IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDVPPG 237

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                     G I DSG+ FT +  P + AV   F  ++G   R   V    G   C+ +
Sbjct: 238 SFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTV 294

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                +  P +   F  G  + LPP+N           CL +    AA P        ++
Sbjct: 295 ----PIAAPTITFMFT-GMNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVLNVI 345

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
            + Q QN  L +D+ N R G A++ C
Sbjct: 346 ANLQQQNHRLLYDVPNSRLGVARELC 371


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP + S   + DTGSSL W  C+     V C+  +      P F PK SS
Sbjct: 125 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPC--VVSCHRQSG-----PVFNPKASS 176

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S   + C   +CS +    +      CS  N         Y   YG   F+ G L  +T+
Sbjct: 177 SYASVSCSAQQCSDLTTATLNP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 228

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F S +VPNF  GC   ++    Q AG+ G  R+  SL  QL       FSYCL +    
Sbjct: 229 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 288

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
            +   S    + G           SYT     P+ SSS     Y++ +  I V  K +  
Sbjct: 289 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 334

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
                  +P        I+DSG+  T +   ++ A++K     M    RA+     S L 
Sbjct: 335 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 384

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            CF     + + +PE+ + F GGA + L   N    V +   CL      A  PA     
Sbjct: 385 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 434

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A I+G+ Q Q F + +D+ N + GFA   C+
Sbjct: 435 AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 106/407 (26%), Positives = 165/407 (40%), Gaps = 69/407 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  G+PP+     + DTGS ++W  C    RC   +   +D   +  + PK S 
Sbjct: 68  GLYFTKLGLGSPPKDYYVQV-DTGSDILWVNCVKCSRCPRKSDLGID---LTLYDPKGSE 123

Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           +S+LI C    CS  + GP       GC        + CP Y + YG G  T G  + + 
Sbjct: 124 TSELISCDQEFCSATYDGP-----IPGCKSE-----IPCP-YSITYGDGSATTGYYVQDY 172

Query: 220 LRFPS-----KTVPN---FLAGCSIL--------SDRQPAGIAGFGRSSESLPSQLGL-- 261
           L +       +T P     + GC  +        S+    GI GFG+S+ S+ SQL    
Sbjct: 173 LTYNHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASG 232

Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
              K FS+CL            N+        G+   P +S TP               Y
Sbjct: 233 KVKKIFSHCL-----------DNIRGGGIFAIGEVVEPKVSTTPLVPRMA--------HY 273

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLF-EAVAKEFIRQM 376
            V L+ I V +  +++P         GNG G I+DSG+T  ++   ++ E + K   RQ 
Sbjct: 274 NVVLKSIEVDTDILQLPSDIF---DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQ- 329

Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
               +   VE++     CF  +G      P + L F+    + + P +Y     + + C 
Sbjct: 330 -PRLKLYLVEQQ---FSCFQYTGNVDRGFPVVKLHFEDSLSLTVYPHDYLFQFKDGIWC- 384

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           I +  + A    G+    +LGD  L N  + +DL N   G+    C+
Sbjct: 385 IGWQKSVAQTKNGK-DMTLLGDLVLSNKLVIYDLENMAIGWTDYNCS 430


>gi|115442113|ref|NP_001045336.1| Os01g0937500 [Oryza sativa Japonica Group]
 gi|20160770|dbj|BAB89711.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
 gi|113534867|dbj|BAF07250.1| Os01g0937500 [Oryza sativa Japonica Group]
 gi|125573257|gb|EAZ14772.1| hypothetical protein OsJ_04701 [Oryza sativa Japonica Group]
 gi|215766348|dbj|BAG98576.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 443

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 106/408 (25%), Positives = 163/408 (39%), Gaps = 59/408 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y+IS+  G PP      + D   +LVW  C S +  V C     D   +    P+R    
Sbjct: 48  YTISVKNGAPP-----LVVDLAGALVWSTCPSTHSTVPCQSAACD--AVNRQQPRR---- 96

Query: 164 QLIGCQNPKCSWIF-GPNVESRCKGCS-----PRNKTCPLA-CPSYLLQYGLGFTAGLLL 216
               C+     W + G    SRC  C+     P    C      ++ +         LL 
Sbjct: 97  ----CRYVDGGWFWAGREPGSRC-ACTAHPFNPVTGECSTGDLTTFTMSANTTNGTDLLY 151

Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGF-GRSSESLPSQLGLKK-----FSYCL-L 269
            E+        P  L     L  +  AG+AGF G +  SLPSQL  ++     F+ CL +
Sbjct: 152 PESFTAVGACAPERLLASPSLP-QAAAGVAGFSGTTPLSLPSQLAAQRRFGSTFALCLPV 210

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-- 327
              F D PV    + +  P      T  L  TPF  NP  +       YY+ +++I V  
Sbjct: 211 FATFGDTPV---YLPNYNPYGPFDYTKMLRRTPFLTNPRRNGG-----YYLPVKRISVSW 262

Query: 328 ---GSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF--IRQMGNYSR 381
              G   V +P   L +    G GGV++ + + +  M   +F A  K F  +   G  SR
Sbjct: 263 RGPGDVPVSLPAGALDLNARTGRGGVVLSTTTPYAIMRTDVFRAFGKAFDTVVTRGTESR 322

Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE-------NYFALVGNEVL 434
            A V ++     C+  +G   +  P  ++K  G    A+  E       N+  L GN ++
Sbjct: 323 MARVARQKQFELCYGGAGDTMLSFP--MMKRTGFDAPAITLELDAGATGNWTILNGNYLV 380

Query: 435 ---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
              C+ +      G  +   PA++LG  QL+N  + FDL     GF++
Sbjct: 381 RETCVGVVEMGPEGMPVDGEPAVVLGGMQLENILMVFDLDKRTLGFSR 428


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 166/396 (41%), Gaps = 68/396 (17%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           + +++ FG+P Q  T  I DTGS + W       +C+ C+  +      P F P +S++ 
Sbjct: 161 FVVTVGFGSPAQNYTLSI-DTGSDVSWI------QCLPCS-GHCYKQHDPVFDPTKSATY 212

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
             + C +P+C+   G     +C      + TC      Y + YG G  TAG+L  ETL  
Sbjct: 213 SAVPCGHPQCAAAGG-----KCS----NSGTC-----LYKVTYGDGSSTAGVLSHETLSL 258

Query: 223 PS-KTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
            S + +P F  GC   +  +     G+ G GR + SLPSQ        FSYCL S     
Sbjct: 259 SSTRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYD--- 315

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
                 L + +   +  +    + YT   +        +   Y+V +  I +G   + +P
Sbjct: 316 -TTHGYLTMGSTTPAASNDDDDVQYTAMIQK-----EDYPSLYFVEVVSIDIGGYILPVP 369

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
                P      G + DSG+  T++    + ++   F   M  Y  A   +       C+
Sbjct: 370 -----PTVFTRDGTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDP---FDTCY 421

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG------ 449
           D +G  ++++P +  KF  GA   L P           + ++++ D+ A PA G      
Sbjct: 422 DFTGHNAIFMPAVAFKFSDGAVFDLSP-----------VAILIYPDDTA-PATGCLAFVP 469

Query: 450 ---RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                P  I+G+ Q +   + +D+A ++ GF +  C
Sbjct: 470 RPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP + S   + DTGSSL W  C+     V C+  +      P F PK SS
Sbjct: 127 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPC--VVSCHRQSG-----PVFNPKASS 178

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S   + C   +CS +    +      CS  N         Y   YG   F+ G L  +T+
Sbjct: 179 SYTSVSCSAQQCSDLTTATLSP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 230

Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
            F S +VPNF  GC   ++    Q AG+ G  R+  SL  QL       FSYCL +    
Sbjct: 231 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 290

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
            +   S    + G           SYT     P+ SSS     Y++ +  I V  K +  
Sbjct: 291 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 336

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
                  +P        I+DSG+  T +   ++ A++K     M    RA+     S L 
Sbjct: 337 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 386

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
            CF     + + +PE+ + F GGA + L   N    V +   CL      A  PA     
Sbjct: 387 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 436

Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A I+G+ Q Q F + +D+ N + GFA   C+
Sbjct: 437 AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|218189700|gb|EEC72127.1| hypothetical protein OsI_05116 [Oryza sativa Indica Group]
          Length = 443

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 106/408 (25%), Positives = 163/408 (39%), Gaps = 59/408 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y+IS+  G PP      + D   +LVW  C S +  V C     D   +    P+R    
Sbjct: 48  YTISVKNGAPP-----LVVDLAGALVWSTCPSTHSTVPCQSAACD--AVNRQQPRR---- 96

Query: 164 QLIGCQNPKCSWIF-GPNVESRCKGCS-----PRNKTCPLA-CPSYLLQYGLGFTAGLLL 216
               C+     W + G    SRC  C+     P    C      ++ +         LL 
Sbjct: 97  ----CRYVDGGWFWAGREPGSRC-ACTAHPFNPVTGECSTGDLTTFAMSANTTNGTDLLY 151

Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGF-GRSSESLPSQLGLKK-----FSYCL-L 269
            E+        P  L     L  +  AG+AGF G +  SLPSQL  ++     F+ CL +
Sbjct: 152 PESFTAVGACAPERLLASPSLP-QAAAGVAGFSGTTPLSLPSQLAAQRRFGSTFALCLPV 210

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-- 327
              F D PV    + +  P      T  L  TPF  NP  +       YY+ +++I V  
Sbjct: 211 FATFGDTPV---YLPNYNPYGPFDYTKMLRRTPFLTNPRRNGG-----YYLPVKRISVSW 262

Query: 328 ---GSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF--IRQMGNYSR 381
              G   V +P   L +    G GGV++ + + +  M   +F A  K F  +   G  SR
Sbjct: 263 RGPGDVPVSLPAGALDLNARTGRGGVVLSTTTPYAIMRTDVFRAFGKAFDTVVTRGTESR 322

Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE-------NYFALVGNEVL 434
            A V ++     C+  +G   +  P  ++K  G    A+  E       N+  L GN ++
Sbjct: 323 MARVARQKQFELCYGGAGDTMLSFP--MMKRTGFDAPAITLELDAGATGNWTILNGNYLV 380

Query: 435 ---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
              C+ +      G  +   PA++LG  QL+N  + FDL     GF++
Sbjct: 381 RETCVGVVEMGPEGMPVDGEPAVVLGGMQLENILMVFDLDKRTLGFSR 428


>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
          Length = 431

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 109/260 (41%), Gaps = 41/260 (15%)

Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
           G+ G  R + S  +Q G ++F+YC+       AP     VL  G   G    P L+YTP 
Sbjct: 181 GLLGMNRGTLSFVTQTGTRRFAYCI-------APGEGPGVLLLGDDGG--VAPPLNYTPL 231

Query: 304 YKNPVGSSSAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
            +     S     F    Y V L  I VG   + IP S L P   G G  +VDSG+ FTF
Sbjct: 232 IE----ISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTF 287

Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI--------SGKKSVYLPELILK 411
           +    + A+  EF  Q      A   E     +  FD             S  LPE+ L 
Sbjct: 288 LLADAYAALKAEFTSQ-ARLLLAPLGEPGFVFQGAFDACFRGPEARVAAASGLLPEVGLV 346

Query: 412 FKGGAKMALPPENYFALVGNE---------VLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
            + GA++A+  E    +V  E         V CL     + AG +     A ++G    Q
Sbjct: 347 LR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMS-----AYVIGHHHQQ 400

Query: 463 NFYLEFDLANDRFGFAKQKC 482
           N ++E+DL N R GFA  +C
Sbjct: 401 NVWVEYDLQNGRVGFAPARC 420


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 108/406 (26%), Positives = 160/406 (39%), Gaps = 92/406 (22%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSR-YRCVDCNFPNVDPSRIPAFIPKR 159
           + + + FGTP Q +   I DTGS L W    PC+   YR  D           P F P +
Sbjct: 137 FVVVVGFGTPAQTAA-IILDTGSDLSWIQCKPCSGHCYRQHD-----------PDFDPAK 184

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
           SSS   + C  P C+   G      C G      TC      Y +QYG G  T G+L  +
Sbjct: 185 SSSYAAVPCGTPVCAAAGG-----MCNG-----TTC-----LYGVQYGDGSSTTGVLSRD 229

Query: 219 TLRFPSKT-VPNFLAGCSILSDRQPAGIAGFGR--------------SSESLPSQLGLKK 263
           TL F S +    F  GC          I  FG                S++ PS  G+  
Sbjct: 230 TLTFNSSSKFTGFTFGCG------EKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGV-- 281

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
           FSYCL S        ++   L+ G     S  P + YT   K P      +  FY++ L 
Sbjct: 282 FSYCLPSYN------TTPGYLNIGATKPTSTVP-VQYTAMIKKP-----QYPSFYFIELV 329

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
            I +G   + +P S          G ++DSG+  T++  P + ++   F   M     A 
Sbjct: 330 SINIGGYILPVPPSVFT-----KTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAP 384

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP-------PENYFALVGNEVLCL 436
             E    L  C+D +G+ ++ +P +   F  GA   L        P++   L+G    CL
Sbjct: 385 PYEP---LDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIG----CL 437

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              +  AA P        I+G+ Q +   + +D+ + + GF    C
Sbjct: 438 AFVSRPAAMPFS------IVGNTQQRAAEVIYDVPSQKIGFIPISC 477


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 102/386 (26%), Positives = 145/386 (37%), Gaps = 57/386 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTP Q       DT S + W PC     C+ C+           F    S++ 
Sbjct: 101 YIVRAKIGTPAQTML-MAMDTSSDVAWIPCNG---CLGCSST--------LFNSPASTTY 148

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           + +GCQ  +C  +  P              TC     S+ L YG    A  L  +T+   
Sbjct: 149 KSLGCQAAQCKQVPKP--------------TCGGGVCSFNLTYGGSSLAANLSQDTITLA 194

Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
           +  VP +  GC        L  +   G+     S  S    L    FSYCL S  F    
Sbjct: 195 TDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 252

Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
            S +L L  GP     +   + YTP  KNP   S      Y+V L  + VG + V +P  
Sbjct: 253 FSGSLRL--GPVGQPKR---IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDVPPG 302

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                     G I DSG+ FT +  P + AV   F  ++G   R   V    G   C+ +
Sbjct: 303 SFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTV 359

Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
                +  P +   F  G  + LPP+N           CL +    AA P        ++
Sbjct: 360 ----PIAAPTITFMFT-GMNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVLNVI 410

Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
            + Q QN  L +D+ N R G A++ C
Sbjct: 411 ANLQQQNHRLLYDVPNSRLGVARELC 436


>gi|148907857|gb|ABR17052.1| unknown [Picea sitchensis]
          Length = 422

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/278 (29%), Positives = 124/278 (44%), Gaps = 41/278 (14%)

Query: 223 PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK-- 272
           P    P     C + S+R      G+AG   S+ +LPSQL       +KF+ CL S    
Sbjct: 145 PLARFPQLAFACDLSSNRVISGTVGVAGMTSSTLALPSQLSAAEGFSRKFAMCLPSGNAP 204

Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
               F D P    LV    PG   S    +  TP  KN     S + + +Y+G+++I VG
Sbjct: 205 GALFFGDEP----LVFLPPPGRDLSSQ--IIRTPLIKN-----SVYTDVFYLGVQRIEVG 253

Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF--IRQMGNYSRAADVE 386
             +V I    L    DG GG  + +   +T +  P++ ++   F  + +  N +R A V 
Sbjct: 254 GVNVAIDAEKLRFDKDGRGGTKLSTVVRYTQLASPIYNSLEGVFTSVAKKMNITRVASV- 312

Query: 387 KKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
             S    CFD SG  S  +       +++L+        +   N    V N+VLCL  F 
Sbjct: 313 --SPFGACFDSSGVGSTRVGPAVPTIDIVLQGNSTTTWRIFGANSMVRVNNKVLCL-GFV 369

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           D   G  L +  +I++G +Q+Q+  L+FDLA    GF+
Sbjct: 370 D--GGDNLQQ--SIVIGTYQMQDNLLQFDLATSTLGFS 403


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 103/423 (24%), Positives = 172/423 (40%), Gaps = 82/423 (19%)

Query: 89  NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD--CNFPN 146
           N+ ++    + + G Y+  L  GTP Q     I D+GS++ + PC +  +C +     PN
Sbjct: 77  NARMRLHDDLLTNGYYTTRLYIGTPSQEFA-LIVDSGSTVTYVPCATCEQCGNHQSESPN 135

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
           +  +  P F P  SS+   + C N  C+             C      C     +Y  QY
Sbjct: 136 IIEAHDPRFQPDLSSTYSPVKC-NVDCT-------------CDNERSQC-----TYERQY 176

Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNFLA-GCS-----ILSDRQPAGIAGFGRSSESLPS 257
             +  ++G+L  + + F   S+  P     GC       L  +   GI G GR   S+  
Sbjct: 177 AEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMD 236

Query: 258 QLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPV 308
           QL  K      FS C                +D G G+    G    P + ++  + NPV
Sbjct: 237 QLVEKGVISDSFSLCYGG-------------MDVGGGTMVLGGMPAPPDMVFS--HSNPV 281

Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
            S      +Y + L++I V  K +++         +   G ++DSG+T+ ++    F A 
Sbjct: 282 RSP-----YYNIELKEIHVAGKALRLDPKIF----NSKHGTVLDSGTTYAYLPEQAFVAF 332

Query: 369 AKEFIRQMGNYS--RAADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPP 422
                 ++ +    R  D   K     CF  +G+     S   P++ + F  G K++L P
Sbjct: 333 KDAVTNKVNSLKKIRGPDPNYKD---ICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSP 389

Query: 423 ENY-FALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
           ENY F     E   CL +F +       G+ P  +LG   ++N  + +D  N++ GF K 
Sbjct: 390 ENYLFRHSKVEGAYCLGVFQN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 442

Query: 481 KCA 483
            C+
Sbjct: 443 NCS 445


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 102/408 (25%), Positives = 168/408 (41%), Gaps = 73/408 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTPP+     + DTGS ++W  C S  +C   +   +D   +  + PK SS
Sbjct: 81  GLYYTEIEIGTPPKQYHVQV-DTGSDILWVNCISCNKCPRKSDLGID---LRLYDPKGSS 136

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S   + C    C+  +G     +  GC+ +N  C      Y + YG G  T G  +S++L
Sbjct: 137 SGSTVSCDQKFCAATYG----GKLPGCA-KNIPC-----EYSVMYGDGSSTTGYFVSDSL 186

Query: 221 RFPS--------KTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
           ++              + + GC          +++   GI GFG+S+ S+ SQL      
Sbjct: 187 QYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEV 246

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
            K FS+CL + K              G G    GD   P +  TP   +           
Sbjct: 247 KKIFSHCLDTIK--------------GGGIFAIGDVVQPKVKSTPLVPDM--------PH 284

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y V L  I VG   +++P      G     G I+DSG+T T++     E V K+ +  + 
Sbjct: 285 YNVNLESINVGGTTLQLPSHMFETGE--KKGTIIDSGTTLTYLP----ELVYKDVLAAV- 337

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLC 435
            +++  D    S ++    I   +SV    P++   F+    + + P +YF   G+ + C
Sbjct: 338 -FAKHPDTTFHS-VQDFLCIQYFQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYC 395

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              F +       G+   ++LGD  L N  + +DL N   G+    C+
Sbjct: 396 F-GFQNGGLQSKDGK-DMVLLGDLVLSNKVVVYDLENQVVGWTDYNCS 441


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 103/423 (24%), Positives = 172/423 (40%), Gaps = 82/423 (19%)

Query: 89  NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD--CNFPN 146
           N+ ++    + + G Y+  L  GTP Q     I D+GS++ + PC +  +C +     PN
Sbjct: 76  NARMRLHDDLLTNGYYTTRLYIGTPSQEFA-LIVDSGSTVTYVPCATCEQCGNHQSESPN 134

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
           +  +  P F P  SS+   + C N  C+             C      C     +Y  QY
Sbjct: 135 IIEAHDPRFQPDLSSTYSPVKC-NVDCT-------------CDNERSQC-----TYERQY 175

Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNFLA-GCS-----ILSDRQPAGIAGFGRSSESLPS 257
             +  ++G+L  + + F   S+  P     GC       L  +   GI G GR   S+  
Sbjct: 176 AEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMD 235

Query: 258 QLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPV 308
           QL  K      FS C                +D G G+    G    P + ++  + NPV
Sbjct: 236 QLVEKGVISDSFSLCYGG-------------MDVGGGTMVLGGMPAPPDMVFS--HSNPV 280

Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
            S      +Y + L++I V  K +++         +   G ++DSG+T+ ++    F A 
Sbjct: 281 RSP-----YYNIELKEIHVAGKALRLDPKIF----NSKHGTVLDSGTTYAYLPEQAFVAF 331

Query: 369 AKEFIRQMGNYS--RAADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPP 422
                 ++ +    R  D   K     CF  +G+     S   P++ + F  G K++L P
Sbjct: 332 KDAVTNKVNSLKKIRGPDPNYKD---ICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSP 388

Query: 423 ENY-FALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
           ENY F     E   CL +F +       G+ P  +LG   ++N  + +D  N++ GF K 
Sbjct: 389 ENYLFRHSKVEGAYCLGVFQN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 441

Query: 481 KCA 483
            C+
Sbjct: 442 NCS 444


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 105/405 (25%), Positives = 152/405 (37%), Gaps = 67/405 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP   +   + DTGS LVW  C+   RC           R   F P+RSS
Sbjct: 84  GEYFALVGVGTPSTKAM-LVIDTGSDLVWLQCSPCRRCY--------AQRGQVFDPRRSS 134

Query: 162 SSQLIGCQNPKCSWIFGPNVES---RCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLS 217
           + + + C +P+C  +  P  +S      GC             Y++ YG G ++ G L +
Sbjct: 135 TYRRVPCSSPQCRALRFPGCDSGGAAGGGCR------------YMVAYGDGSSSTGDLAT 182

Query: 218 ETLRFPSKT-VPNFLAGCSILSDRQPAGIAGFGRSSESL-PSQLGLKKFSYCLLSRKFDD 275
           + L F + T V N   GC              GR +E L  S  GL      L  R    
Sbjct: 183 DKLAFANDTYVNNVTLGC--------------GRDNEGLFDSAAGL------LGRRAAAR 222

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP------------------VGSSSAFGEF 317
            P        T P S  +   G       +                       + A   +
Sbjct: 223 YPSRRRWPRRTAPSSSTASATGRRAQRAARTSCSAARRSRRPRRSPPCCRTRGARACTTW 282

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
            + G      GS   + P S       G GGV+VDSG+  +      + A+   F  +  
Sbjct: 283 TWPGSASAARGSPGSRTPASRWT-RRRGRGGVVVDSGTAISRFARDAYAALRDAFDARAR 341

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
                    + S    C+D+ G+ +   P ++L F GGA MALPPENYF  V        
Sbjct: 342 AAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAA 401

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +       A   G ++I G+ Q Q F + FD+  +R GFA + C
Sbjct: 402 SYRRCLGFEAADDGLSVI-GNVQQQGFRVVFDVEKERIGFAPKGC 445


>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 556

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 104/401 (25%), Positives = 163/401 (40%), Gaps = 80/401 (19%)

Query: 108 LSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +  GTPP  +   + DTG++L +    PCT R     C+    D   I  F P +S S  
Sbjct: 210 IKLGTPPVWNLVAV-DTGATLSFVQCEPCTLR-----CH-KQTDAGEI--FDPSKSESFS 260

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLRF 222
            +GC   KC  +    +  + K C  +  +C      Y + +G    ++ G L+ + L  
Sbjct: 261 RVGCSENKCRTV-QRALHLQSKACMEKEDSCL-----YSMTFGGTSSYSVGKLVRDRLAI 314

Query: 223 ----PSKTVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLG----LKKFSYCLLSRK 272
                  + P+FL GCS+ ++  +  AG+ GF     S   Q+      K FSYC  S +
Sbjct: 315 GKYAKGYSFPDFLFGCSLDTEYHQYEAGLVGFADEPFSFFEQVAPLVNYKAFSYCFPSDR 374

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-GSKH 331
                +S           GD      +YTP +     S       Y + L +++V G   
Sbjct: 375 RKTGYLSI----------GDYTRVNSTYTPLFLARQQSR------YALKLDEVLVNGMAL 418

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF---EAVAKEFIRQMG---NYSRAADV 385
           V  P             +IVDSGS +T +    F   +A   E +R +G   NY R +D 
Sbjct: 419 VTTP-----------SEMIVDSGSRWTILLSDTFTQLDAAITEAMRPLGYNRNYYRGSDY 467

Query: 386 EKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
                   CF+ +  +       LP + LKF  G KM L P++ F    +  LC     D
Sbjct: 468 -------ICFEDAHFQQFSDWAALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRD 520

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            + G  +      +LG+   ++  + FD+   +FGF K  C
Sbjct: 521 ASLGSGVQ-----LLGNTMTRSVGITFDIQGGQFGFRKGDC 556


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 109/393 (27%), Positives = 161/393 (40%), Gaps = 79/393 (20%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y I++  G+P    T  I DTGS + W  C S                +  F P +S++ 
Sbjct: 129 YVITVGIGSPAVTQTMMI-DTGSDVSWVRCNST-------------DGLTLFDPSKSTTY 174

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
               C +  C+ + G N +    GCS  N  C      Y +QYG G  T G   S+TL  
Sbjct: 175 APFSCSSAACAQL-GNNGD----GCS--NSGC-----QYRVQYGDGSNTTGTYSSDTLAL 222

Query: 223 -PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
             S TV +F  GCS   +     +  G+ G G  ++SL SQ      K FSYCL      
Sbjct: 223 SASDTVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCL------ 276

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
             P ++        G+ +  + G   TP  + P   +      Y V L+ I VG   + I
Sbjct: 277 --PPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPT-----LYGVLLQDISVGGTPLGI 329

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGLR 392
             S L  GS      ++DSG+  T++    + A++  F   M    + RAA +     L 
Sbjct: 330 QPSVLSNGS------VMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGI---LD 380

Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL---CLILFTDNAAGPALG 449
            C+D +G  +V +P + L   GGA + L         GN ++   CL          A  
Sbjct: 381 TCYDFTGLVNVSIPAVSLVLDGGAVVDLD--------GNGIMIQDCLAF--------AAT 424

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            G +II G+ Q + F +  D+    FGF    C
Sbjct: 425 SGDSII-GNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
          Length = 205

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/179 (31%), Positives = 86/179 (48%), Gaps = 8/179 (4%)

Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
           + G GR   SL SQLG  +FSYCL S      P   N  +       ++ + GL   P  
Sbjct: 1   MVGLGRGLLSLVSQLGPSRFSYCLTSF-LSPEPSRLNFGVFATLNGTNASSSGL---PVQ 56

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
             P+  ++A    Y++ L+ I +G K + I         DG GGV +DSG++ T+++  +
Sbjct: 57  STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDV 116

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL--PELILKFKGGAKMALP 421
           ++AV +E +  +     A D E   GL  CF      +V +  P++ L F GGA M  P
Sbjct: 117 YDAVRRELVSVLRPLPPANDTEI--GLETCFPWPPPPTVTMTVPDMELHFDGGANMLHP 173


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 149/393 (37%), Gaps = 54/393 (13%)

Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
           YG          P   S     DT   + W       +C+ C  P   P R   F P+RS
Sbjct: 142 YGAVIDGDDDDDPMILSQTMAIDTTEDVPWI------QCLPCLIPQCYPQRNAFFDPRRS 195

Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET 219
           S+   + C +  C  + G        GCS  N T       Y ++Y     T G  +++T
Sbjct: 196 STGAPVRCGSRACRTLGG-----YANGCSKPNSTGDCL---YRIEYSDHRLTLGTYMTDT 247

Query: 220 LRF-PSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSR 271
           L   PS T  NF  GCS         Q +G    G   +SL SQ        FSYC+   
Sbjct: 248 LTISPSTTFLNFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCV--- 304

Query: 272 KFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
                P ++  +   GP +GD    +   + TP  ++   ++      Y V L+ I V  
Sbjct: 305 ---PGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRS---ANVINPTIYVVRLQGIEVAG 358

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
           + + +P          +GG ++DS +  T +    + A+   F   M  Y   A      
Sbjct: 359 RRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRA---PTG 409

Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
            L  CFD  G   V +P + L F GGA + L       L+   +   + F   AA  ALG
Sbjct: 410 NLDTCFDFVGVSKVTVPTVSLVFDGGAVIEL------GLLSVLLDSCLAFAPMAADFALG 463

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                 +G+ Q Q   + +D+A    GF    C
Sbjct: 464 -----FIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 148/385 (38%), Gaps = 73/385 (18%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            + DT S + W       +C  C  P+        + P +SSSS    C +P C      
Sbjct: 158 MVIDTASDVPWV------QCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACR----- 206

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF----PSKTVPNFLAGC 234
           N+     GC+P    C      Y +QY  G  +AG  +S+ L      P+  +  F  GC
Sbjct: 207 NLGPYANGCTPAGDQC-----QYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGC 261

Query: 235 S--ILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLD 285
           S  +L       + +GI   GR ++SLP+Q        FSYCL        PV S   + 
Sbjct: 262 SHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCL-----PPTPVHSGFFI- 315

Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
                     P ++ + +   P+  S A    Y V L  I V  K + +P +        
Sbjct: 316 -------LGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFA----- 363

Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS-----GK 400
             G ++DS +  T +    + A+   F+ +M  Y  AA    K  L  C+D S     G 
Sbjct: 364 -AGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAA---PKEHLDTCYDFSGAAPGGG 419

Query: 401 KSVYLPELILKFKGGAKMALPPENYFALVGNEVL---CLILFTDNAAGPALGRGPAIILG 457
             V LP++ L F G       P     L  + VL   CL  F  N      G     I+G
Sbjct: 420 GGVKLPKITLVFDG-------PNGAVELDPSGVLLDGCLA-FAPNTDDQMTG-----IIG 466

Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
           + Q Q   + +++     GF +  C
Sbjct: 467 NVQQQALEVLYNVDGATVGFRRGAC 491


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 99/406 (24%), Positives = 154/406 (37%), Gaps = 66/406 (16%)

Query: 99  HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
            S G Y   +  GTP +     + DTGS ++W  C    RC        D   +  +   
Sbjct: 80  ESIGLYFAKIGLGTPSRDFHVQV-DTGSDILWVNCAGCIRCP----RKSDLVELTPYDAD 134

Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLS 217
            SS+++ + C +  CS++   N  S C   S    TC      Y++ YG G  T G L+ 
Sbjct: 135 ASSTAKSVSCSDNFCSYV---NQRSECHSGS----TCQ-----YVILYGDGSSTNGYLVR 182

Query: 218 ETL--------RFPSKTVPNFLAGCS-----ILSDRQPA--GIAGFGRSSESLPSQLG-- 260
           + +        R    T    + GC       L + Q A  GI GFG+S+ S  SQL   
Sbjct: 183 DVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQ 242

Query: 261 ---LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
               + F++CL                D   G G      +        P+ S SA    
Sbjct: 243 GKVKRSFAHCL----------------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAH--- 283

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y V L  I VG+  +++       G D   GVI+DSG+T  ++     +AV    + Q+ 
Sbjct: 284 YSVNLNAIEVGNSVLQLSSDAFDSGDDK--GVIIDSGTTLVYLP----DAVYNPLMNQIL 337

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
              +  ++        CF    +   + P +  +F     +A+ P+ Y   V  +  C  
Sbjct: 338 ASHQELNLHTVQDSFTCFHYIDRLDRF-PTVTFQFDKSVSLAVYPQEYLFQVREDTWCFG 396

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
               N      G     ILGD  L N  + +D+ N   G+    C+
Sbjct: 397 W--QNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440


>gi|388493426|gb|AFK34779.1| unknown [Medicago truncatula]
          Length = 454

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 98/414 (23%), Positives = 160/414 (38%), Gaps = 64/414 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           YS S+  GTP       + D     +WF C   Y     N    +P +      K++  +
Sbjct: 50  YSTSIKLGTP-AVPLDLVIDIRERFLWFECDDSY-----NSTTYNPIQCGTKKCKQARGT 103

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
             I C N                GC+  N TC +        +G  F +G +  + L FP
Sbjct: 104 GCIDCTN-----------HPSKTGCT--NNTCGV---EPFNPFGGFFVSGDVGEDILSFP 147

Query: 224 SKT----------VPNFLAGCSILSDR------------QPAGIAGFGRSSESLPSQLGL 261
             T          VP F++ C +  D+               G+ G  R+  SLP+Q+  
Sbjct: 148 RVTSDGRRVTNVRVPRFISSC-VYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIAT 206

Query: 262 K-----KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSS 312
           +     KF+ CL S    +     +L +  GP     + D  +  L YTP   N   +  
Sbjct: 207 RFKLDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGP 266

Query: 313 AFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
            F  F    Y++ ++ I V +  V    + L     G GG  + +    T +   ++  +
Sbjct: 267 IFDNFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPL 326

Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFD-ISGKKSVY---LPELILKFKGGAKMALPPEN 424
              F+++     +   V+  +    CFD  +  KSV    +P + L  KGG +  +   N
Sbjct: 327 LNAFVKK-AEIRKIKRVKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385

Query: 425 YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
               V   VLCL  F D  +        +II+G  QL++  +EFDL + + GF+
Sbjct: 386 SMVKVNENVLCL-GFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFS 438


>gi|125575539|gb|EAZ16823.1| hypothetical protein OsJ_32295 [Oryza sativa Japonica Group]
          Length = 383

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 153/384 (39%), Gaps = 39/384 (10%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN--FPNVDPSRIPAFIPKRSSSSQ 164
           S + GTPPQ ++ FI D G  LVW  C+        N   P V P ++   +P       
Sbjct: 27  SFTIGTPPQPASAFI-DVGGLLVWTQCSQCSSSSCFNQGAPAVRPDQV---VPPTGPEP- 81

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
              C    C   F P     C G       C     + L ++    T+G + ++ +   +
Sbjct: 82  ---CGTALCE--FFPASIRNCSG-----DVCAYEASTQLFEH----TSGKIGTDAVAIGT 127

Query: 225 KTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
            T  +   GC + SD +     P+G  G  R+  SL +Q+ +  FS+CL          +
Sbjct: 128 ATAASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHCLAPHDGGGGK-N 186

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           S L L                TPF K+      +   +Y + L  I  G + +       
Sbjct: 187 SRLFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSL--YYLINLEGIKAGDEAI-----IT 239

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
           VP S     V++ + S  +F+   +++ + K     +G  +     + +S    CF   G
Sbjct: 240 VPQSGRT--VLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSIFDLCFKRGG 297

Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDF 459
                 P+++L F+G A + +PP NY   VG++ +C+ + +          G + ILG  
Sbjct: 298 VSGA--PDVVLTFQGAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMS-ILGGL 354

Query: 460 QLQNFYLEFDLANDRFGFAKQKCA 483
           Q QN +  +DL  +   F    C+
Sbjct: 355 QQQNVHFLYDLEKETLSFEAADCS 378


>gi|326496543|dbj|BAJ94733.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326511583|dbj|BAJ91936.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 98/402 (24%), Positives = 163/402 (40%), Gaps = 76/402 (18%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD-PSRIPAFIPKRSSSSQLIGCQNPK 172
           PQ     + D G + +W  C + Y  V  ++  V   S++       + ++  +G  +P 
Sbjct: 52  PQVPVTAVLDLGGASLWVDCDAGY--VSSSYAGVPCASKLCRLAKSVACATSCVGKPSPG 109

Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK------- 225
           C       +   C G  P N    ++            T G L+++ L  P+        
Sbjct: 110 C-------LNDTCSG-FPENTVTRVS------------TGGNLITDVLSVPTTFRPAPGP 149

Query: 226 --TVPNFL--AGCSILSDRQPAGIAGFG---RSSESLPSQLGL-----KKFSYCLLSRK- 272
             T P FL   G + L+D   AG  G     R+  +LP+QL       +KF+ CL S   
Sbjct: 150 LATAPAFLFTCGATFLTDGLAAGATGMASLSRARFALPTQLAATFRFSRKFALCLTSTSA 209

Query: 273 -----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGL 322
                F DAP +        PG   SK+  L+YTP   N V ++   G+      Y++G+
Sbjct: 210 AGVVVFGDAPYAFQ------PGVDLSKS--LTYTPLLVNNVSTAGVSGQKDKSNEYFIGV 261

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             I V  + V +  S L     G GG  + + + +T +E  + +AV   F  +     R 
Sbjct: 262 TAIKVNGRAVPLNASLLAIDKQGGGGTKLSTVAPYTVLETSIHKAVTDAFAAETAMIPR- 320

Query: 383 ADVEKKSGLRPCFDISGKKSVYL------PELILKFKGGAKMALPPENYFALVGNEVLCL 436
             V   +  + C+D S   S  +       EL+L+ +  + +     +  A  G   LCL
Sbjct: 321 --VRAVAPFKLCYDGSKVGSTRVGPAVPTVELVLQNEAASWVVFGANSMVAAKGGA-LCL 377

Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
            +  D  A P      ++++G   +++  LEFDL   R GF+
Sbjct: 378 GVV-DGGAAPRT----SVVIGGHTMEDNLLEFDLQRARLGFS 414


>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
          Length = 205

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/179 (31%), Positives = 86/179 (48%), Gaps = 8/179 (4%)

Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
           + G GR   SL SQLG  +FSYCL S      P   N  +       ++ + GL   P  
Sbjct: 1   MVGLGRGLLSLVSQLGPSRFSYCLTSF-LSPEPSRLNFGVFATLNGTNASSSGL---PVQ 56

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
             P+  ++A    Y++ L+ I +G K + I         DG GGV +DSG++ T+++  +
Sbjct: 57  STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDV 116

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL--PELILKFKGGAKMALP 421
           ++AV +E +  +     A D E   GL  CF      +V +  P++ L F GGA M  P
Sbjct: 117 YDAVRRELVSVLRPLPPANDTEI--GLETCFPWPPPPTVTMTVPDMELHFDGGANMLHP 173


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 101/408 (24%), Positives = 155/408 (37%), Gaps = 74/408 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP ++    + DTGS ++W  C     C  C   +     +  + P  SS
Sbjct: 79  GLYFTQIGIGTPAKSYYVQV-DTGSDILWVNCV---FCDTCPRKSGLGIELTLYDPSGSS 134

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           S   + C    C    G  + S            P A   Y + YG G  T G  +++ L
Sbjct: 135 SGTGVTCGQDFCVATHGGVIPS----------CVPAAPCQYSISYGDGSSTTGFFVTDFL 184

Query: 221 RFP----------SKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL---- 261
           ++           + T   F  G  I  D     +   GI GFG+S+ S+ SQL      
Sbjct: 185 QYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKV 244

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG-----DSKTPGLSYTPFYKNPVGSSSAFG 315
            K F++CL                DT  G G     D   P +S TP             
Sbjct: 245 RKVFAHCL----------------DTINGGGIFAIGDVVQPKVSTTPLVPGM-------- 280

Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
             Y V L  I VG   +++P +    G   + G I+DSG+T  ++ G ++ A+  +   Q
Sbjct: 281 PHYNVNLEAIDVGGVKLQLPTNIFDIGE--SKGTIIDSGTTLAYLPGVVYNAIMSKVFAQ 338

Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
            G+     D + +     CF  SG      P +   F+GG  + + P +Y  L  N  L 
Sbjct: 339 YGDMPLKNDQDFQ-----CFRYSGSVDDGFPIITFHFEGGLPLNIHPHDY--LFQNGELY 391

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            + F         G+   ++LGD    N  + +DL N   G+    C+
Sbjct: 392 CMGFQTGGLQTKDGK-DMVLLGDLAFSNRLVLYDLENQVIGWTDYNCS 438


>gi|413923981|gb|AFW63913.1| hypothetical protein ZEAMMB73_837345 [Zea mays]
          Length = 414

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 78/154 (50%), Gaps = 19/154 (12%)

Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
           P L YT F      +SS    FYYV L+ ++VG + +KI       G DG+GG I+DSG+
Sbjct: 33  PELKYTAF----TPTSSPADTFYYVKLKGVLVGGELLKISSDTWDVGKDGSGGTIIDSGT 88

Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
           T ++   P+++AV  +                  G  PC+++SG +   +PEL L F  G
Sbjct: 89  TLSYFVEPVYQAVPSD--------------PGLLGAEPCYNVSGMERPEVPELSLLFPDG 134

Query: 416 AKMALPPENYFALVG-NEVLCLILFTDNAAGPAL 448
           A    P ENYF  +  ++++CL +   +  G ++
Sbjct: 135 AVWDFPAENYFVRLDPDDIMCLAVLGTSRTGMSI 168


>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
 gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
          Length = 472

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 103/429 (24%), Positives = 163/429 (37%), Gaps = 82/429 (19%)

Query: 75  KPKTKDSNIGSNYS----NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
           K +   +NI +++S    +  ++T   V +   ++++L+ GTPP     F     S   W
Sbjct: 55  KQRRTLANITTDFSVRGGDKGLETSFYVDNGLNFAMNLNLGTPP-VQHNFTMALNSEFFW 113

Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSP 190
             C+    CVDCN    DP     F    S+S   I C +P CS    P   +   G S 
Sbjct: 114 AACSP---CVDCNVSTNDP----LFSSASSTSYTRIPCTSPFCS--TSPGFSTNACGSSA 164

Query: 191 RNKTCPLACPSYLLQYGLGFTAGLLLSET--LRFPSKTVPN----FLAGC-----SILSD 239
              T  L   SY   Y    +AG + S+   ++ P KT  N       GC     ++L  
Sbjct: 165 VGSTTCLYNFSYSTDYS---SAGEMASDVVAMKTPRKTRGNKSLRMSLGCGRESTTLLGI 221

Query: 240 RQPAGIAGFGRSSESLPSQLG----LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
              +G+ GF ++ +S   QL       KF YC+ S  F    V  N  +        S  
Sbjct: 222 LNTSGLVGFAKTDKSFIGQLAEMDYTSKFIYCVPSDTFSGKIVLGNYKI--------SSH 273

Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
             LSYTP   N           YY+GLR I + +  +  P   ++  +DG GG I+DS  
Sbjct: 274 SSLSYTPMIVNSTA-------LYYIGLRSISI-TDTLTFPVQGIL--ADGTGGTIIDSTF 323

Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS--GLRPCFDISGKKSVYLPELILKFK 413
            F++     +  + +       N ++ +  E  +  G   C+++S               
Sbjct: 324 AFSYFTPDSYTPLVQAIQNLNSNLTKVSSNETAALLGNDICYNVSVNDDD---------- 373

Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
                            N  +CL +      G +L      ++G +Q  +  +EFDL   
Sbjct: 374 ---------------AENATVCLAVGDSEKVGFSLN-----VIGTYQQLDVAVEFDLEKQ 413

Query: 474 RFGFAKQKC 482
             GF    C
Sbjct: 414 EIGFGTAGC 422


>gi|358347314|ref|XP_003637703.1| Basic 7S globulin [Medicago truncatula]
 gi|355503638|gb|AES84841.1| Basic 7S globulin [Medicago truncatula]
          Length = 454

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 98/414 (23%), Positives = 160/414 (38%), Gaps = 64/414 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           YS S+  GTP       + D     +WF C   Y     N    +P +      K++  +
Sbjct: 50  YSTSIKLGTP-AVPLDLVIDIRERFLWFECDDSY-----NSTTYNPIQCGTKKCKQARGT 103

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
             I C N                GC+  N TC +        +G  F +G +  + L FP
Sbjct: 104 GCIDCTNHPFK-----------TGCT--NNTCGV---EPFNPFGGFFVSGDVGEDILSFP 147

Query: 224 SKT----------VPNFLAGCSILSDR------------QPAGIAGFGRSSESLPSQLGL 261
             T          VP F++ C +  D+               G+ G  R+  SLP+Q+  
Sbjct: 148 RVTSDGRRVTNVRVPRFISSC-VYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIAT 206

Query: 262 K-----KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSS 312
           +     KF+ CL S    +     +L +  GP     + D  +  L YTP   N   +  
Sbjct: 207 RFKLDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGP 266

Query: 313 AFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
            F  F    Y++ ++ I V +  V    + L     G GG  + +    T +   ++  +
Sbjct: 267 IFDNFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPL 326

Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFD-ISGKKSVY---LPELILKFKGGAKMALPPEN 424
              F+++     +   V+  +    CFD  +  KSV    +P + L  KGG +  +   N
Sbjct: 327 LNAFVKK-AEIRKIKRVKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385

Query: 425 YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
               V   VLCL  F D  +        +II+G  QL++  +EFDL + + GF+
Sbjct: 386 SMVKVNENVLCL-GFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFS 438


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 95/385 (24%), Positives = 163/385 (42%), Gaps = 64/385 (16%)

Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
           DTGS  +W  C     C  C   +     +  + P  S +S+ + C +  C+  +   + 
Sbjct: 92  DTGSDTLWVNCVG---CTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYDGQIS 148

Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-----KTVPN---FLAG 233
              KG         ++CP Y + YG G  T+G  + + L F       +TVP+    + G
Sbjct: 149 GCTKG---------MSCP-YSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFG 198

Query: 234 C--------SILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
           C        S  +D    GI GFG+++ S+ SQL       + FS+CL S       +S 
Sbjct: 199 CGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDS-------ISG 251

Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
             +       G+   P +  TP  +            Y V L+ I V    +++P S ++
Sbjct: 252 GGIF----AIGEVVQPKVKTTPLLQGMA--------HYNVVLKDIEVAGDPIQLP-SDIL 298

Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
             S G G  I+DSG+T  ++   +++ + ++ + Q     +   VE +     CF  S +
Sbjct: 299 DSSSGRG-TIIDSGTTLAYLPVSIYDQLLEKILAQRSGM-KLYLVEDQ---FTCFHYSDE 353

Query: 401 KSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
           +SV    P +   F+ G  +   P +Y  L   ++ C + +  + A    G+   I+LGD
Sbjct: 354 ESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWC-VGWQKSMAQTKDGK-ELILLGD 411

Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
             L N  + +DL N   G+A   C+
Sbjct: 412 LVLANKLVVYDLDNMAIGWADYNCS 436


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 83/291 (28%), Positives = 135/291 (46%), Gaps = 44/291 (15%)

Query: 194 TCPLACP--SYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAG 247
            C  A P  +Y + YG G FT G L  E L+F +  V +F+ GC   +       +G+ G
Sbjct: 68  VCGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMG 127

Query: 248 FGRSSESLPSQL-GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
            GRS  SL SQ  G+    FSYCL S    +   S +L+L        + +P +SY    
Sbjct: 128 LGRSDLSLISQTSGIFGGVFSYCLPS---TERKGSGSLILGGNSSVYRNSSP-ISYAKMI 183

Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
           +NP         FY++ L  I +G   ++ P       S G   ++VDSG+  T +   +
Sbjct: 184 ENP-----QLYNFYFINLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTI 231

Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN 424
           ++A+  EF++Q   +  A      S L  CF++S  + V +P + + F+G A++ +    
Sbjct: 232 YKALKAEFLKQFTGFPPAPAF---SILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTG 288

Query: 425 YFALVGNEV--LCLIL----FTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
            F  V ++   +CL L    + D  A          ILG++Q +N  + +D
Sbjct: 289 VFYFVKSDASQVCLALASLEYQDEVA----------ILGNYQQKNLRVIYD 329


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 97/404 (24%), Positives = 160/404 (39%), Gaps = 64/404 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP      + DTGS ++W  C     C   +   VD   +  + PK SS
Sbjct: 71  GLYYARIGIGSPPNDFHVQV-DTGSDILWVNCVGCSNCPKKSDIGVD---LQLYNPKSSS 126

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +S LI C  P CS  +    ++   GC P      L C  Y + YG G  TAG  +++ +
Sbjct: 127 TSTLITCDQPFCSATY----DAPIPGCKP-----DLLC-QYKVIYGDGSATAGYFVNDYI 176

Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
           +          S+T  + + GC          S     GI GFG+++ S+ SQL      
Sbjct: 177 QLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKV 236

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K F++CL S       +S   +       G+   P L  TP   N           Y V
Sbjct: 237 KKIFAHCLDS-------ISGGGIF----AIGEVVEPKLKTTPVVPNQA--------HYNV 277

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  + VG   + +P       +    G I+DSG+T  ++   ++  + ++ +    +  
Sbjct: 278 VLNGVKVGDTALDLPLGLF--ETSYKRGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLK 335

Query: 381 -RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            R  D +       CF          P +  KF+    + + P  Y   + ++V C  + 
Sbjct: 336 LRTVDDQ-----FTCFVFDKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWC--VG 388

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             N+   +       +LGD  LQN  + ++L N   G+ +  C+
Sbjct: 389 WQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCS 432


>gi|356548993|ref|XP_003542883.1| PREDICTED: basic 7S globulin-like [Glycine max]
          Length = 473

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 95/398 (23%), Positives = 160/398 (40%), Gaps = 61/398 (15%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  S+  GTP + +   + D     +W+ C + Y        N    R  A   K+    
Sbjct: 87  YYTSVGIGTP-RHNFDLVIDLSGENLWYDCDTHY--------NSSSYRPIACGSKQCPEI 137

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
             +GC  P     F P       GC+  N TCP    + L ++     +G L  + +   
Sbjct: 138 GCVGCNGP-----FKP-------GCT--NNTCPANVINQLAKF---IYSGGLGEDFIFIR 180

Query: 224 SKTVPNFLAGC-------SILSDRQP--------AGIAGFGRSSESLPSQLGL-----KK 263
              V   L+ C       S   D  P         GI G  +S  +LP QL        K
Sbjct: 181 QNKVSGLLSSCIDTDAFPSFSDDELPLFGLPNNTKGIIGLSKSQLALPIQLASANKVPSK 240

Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPV--GSSSAFG---EFY 318
           FS CL S         +NL++  G       +  L  TP   N V  G+ S  G   + Y
Sbjct: 241 FSLCLPSLNNQGF---TNLLVRAGEEHPQGISKFLKTTPLIVNNVSTGAISVEGVPSKEY 297

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
           ++ ++ + +    V +  S L   + GNGG  + + S FT ++  +++   ++FI++  +
Sbjct: 298 FIDVKAVQIDGNVVNLKPSLLAIDNKGNGGTKLSTMSPFTELQTTVYKTFIRDFIKKASD 357

Query: 379 YSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVL 434
             R   V   +    C+D +  ++    + +P + L  +GG +  +   N   +    V 
Sbjct: 358 -RRLKRVASVAPFEACYDSTSIRNSSTGLVVPTIDLVLRGGVQWTIYGANSMVMAKKNVA 416

Query: 435 CLILFTDNAAGPALGRGPA-IILGDFQLQNFYLEFDLA 471
           CL +  D    P +    A I++G +QL++  LEFD+A
Sbjct: 417 CLAI-VDGGTEPRMSFVKASIVIGGYQLEDNLLEFDVA 453


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 101/404 (25%), Positives = 158/404 (39%), Gaps = 63/404 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS ++W  C S   C   +   +D   +  + P  S+
Sbjct: 87  GLYFTQIGIGTPSKGYYVQV-DTGSDILWVNCISCDSCPRKSGLGID---LTLYDPTASA 142

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           SS+ + C    C+      V   C   SP        C  Y + YG G  T G  +++ L
Sbjct: 143 SSKTVTCGQEFCATATNGGVPPSCAANSP--------C-QYSITYGDGSSTTGFFVADFL 193

Query: 221 RFP--SKTVPNFLAGCSIL-------------SDRQPAGIAGFGRSSESLPSQLG----- 260
           ++   S      LA  S+              S+    GI GFG+++ S+ SQL      
Sbjct: 194 QYDQVSGDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKV 253

Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL +          N+V            P +  TP               Y V
Sbjct: 254 TKIFSHCLDTVNGGGIFAIGNVV-----------QPKVKTTPLVPGM--------PHYNV 294

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L+ I VG   +++P +    G  G+ G I+DSG+T  ++   +++AV          +S
Sbjct: 295 VLKTIDVGGSTLQLPTNIFDIGG-GSRGTIIDSGTTLAYLPEVVYKAVLSAV------FS 347

Query: 381 RAADVEKKSGLR-PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
              DV  K+     CF  SG      PE+   F G   + + P +Y      +V C + F
Sbjct: 348 NHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVVYPHDYLFQNTEDVYC-VGF 406

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                    G+   ++LGD  L N  + +DL N   G+    C+
Sbjct: 407 QSGGVQSKDGK-DMVLLGDLALSNKLVVYDLENQVIGWTNYNCS 449


>gi|297818546|ref|XP_002877156.1| hypothetical protein ARALYDRAFT_484681 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322994|gb|EFH53415.1| hypothetical protein ARALYDRAFT_484681 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 420

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 124/271 (45%), Gaps = 41/271 (15%)

Query: 227 VPNFLAGCSILS-----DRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK---- 272
           +PN +  C   S      +   G+AG GR   SLPSQ        +KF+ CL S +    
Sbjct: 153 IPNIIFSCGSTSLLKGLAKGTVGMAGMGRHKISLPSQFAAAFSFNRKFAVCLTSGRGVTF 212

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
           F + P    + L   PG   S+   L  TP   NP       GE Y++G+R+I +  K V
Sbjct: 213 FGNGPY---VFL---PGIQISR---LQKTPLLINP-------GE-YFIGVREIKIVEKTV 255

Query: 333 KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKS 389
            I    L +    G GG  + S + +T +E  +F++    F+RQ    N +R A V+  S
Sbjct: 256 PINQMLLKINKETGFGGTKISSVNPYTVLESSIFKSFTSMFVRQATARNMTRVASVKPFS 315

Query: 390 GLRPCFDISGKKSVY-LPELILKFKGG-AKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
                 ++   +  Y +PE+ L          +   N    V ++V+CL  F D      
Sbjct: 316 ACFSTQNVGVTRLGYAVPEIQLVLHSNDVVWRIFGGNSMVSVSDDVICL-GFVDGGVNAR 374

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
                ++++G FQL++  +EFDLA++RFGF+
Sbjct: 375 ----TSVVIGGFQLEDNLIEFDLASNRFGFS 401


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 148/379 (39%), Gaps = 71/379 (18%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I D+GS + W       +C  C  P     R P F P  S++   + C +  C+ + GP 
Sbjct: 171 IIDSGSDVSWV------QCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPY 223

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGC 234
                +GCS  N  C         Q+G+ +  G   + T  F      P   +  F  GC
Sbjct: 224 R----RGCS-ANAQC---------QFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGC 269

Query: 235 SILS-----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL---V 283
           +        D   AG    G  S+SL  Q   +    FSYCL        P +S+L   V
Sbjct: 270 AHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCL-------PPTASSLGFLV 322

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
           L   P              F   P+ SSS    FY V LR IIV  + + +P +     S
Sbjct: 323 LGVPPERAQL------IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS 376

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
                 ++DS +  + +    ++A+   F   M  Y  A  V   S L  C+D +G +S+
Sbjct: 377 ------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPV---SILDTCYDFTGVRSI 427

Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
            LP + L F GGA + L  +    L+G+   CL       A  A  R P  I G+ Q + 
Sbjct: 428 TLPSIALVFDGGATVNL--DAAGILLGS---CLAF-----APTASDRMPGFI-GNVQQKT 476

Query: 464 FYLEFDLANDRFGFAKQKC 482
             + +D+      F    C
Sbjct: 477 LEVVYDVPAKAMRFRTAAC 495


>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 523

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 106/437 (24%), Positives = 170/437 (38%), Gaps = 85/437 (19%)

Query: 77  KTKDSNIGSNYSNSLIKTPLSVHSYGG-----YSISLSFGTPPQASTPFI--FDTGSSLV 129
           K +  NIGS Y          V  +G      +   +  GTP   S PF+   D GS L+
Sbjct: 71  KRRRLNIGSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTP---SVPFLVALDVGSDLL 127

Query: 130 WFPCTSRYRCVDC-----NFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
           W PC     C+ C     N+ +V    +  + P  SS+S+ + C +  C+W         
Sbjct: 128 WVPCD----CIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAW--------- 174

Query: 185 CKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT--------VPNFLAGC-- 234
              C   N  C      Y        T+G ++ + L+  S +          + + GC  
Sbjct: 175 STTCKSANDPCTYKRDYYSDNTS---TSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGR 231

Query: 235 ----SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
               S L    P G+ G G  + S+P+ L  +       S  FD+      L  D GP +
Sbjct: 232 KQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPAT 291

Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEF--YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
             +      + P           FGEF  Y++G+    VGS  ++               
Sbjct: 292 QQTT----QFLPL----------FGEFAAYFIGVESFCVGSSCLQ----------RSGFQ 327

Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
            +VDSGS+FT++   +++ +  EF +Q+  N +R   V ++     C++IS   S  +P 
Sbjct: 328 ALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRI--VLRELPWNYCYNISTLVSFNIPS 385

Query: 408 LILKFKGGAKMALPPENYF-ALVGNEVLCLIL-FTDNAAGPALGRGPAIILGDFQLQNFY 465
           + L F         P     A  G +V CL L  TD   G         ++G   +  + 
Sbjct: 386 MQLVFPLNQIFIHDPVYVLPANQGYKVFCLTLEETDEDYG---------VIGQNLMVGYR 436

Query: 466 LEFDLANDRFGFAKQKC 482
           + FD  N + G++K KC
Sbjct: 437 MVFDRENLKLGWSKSKC 453


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 161/403 (39%), Gaps = 63/403 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS ++W  C    RC   +   VD   +  +  K S+
Sbjct: 153 GLYFAKIGIGTPSKDYYVQV-DTGSDILWVNCAGCDRCPTKSDLGVD---LTLYDMKAST 208

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +S  +GC +  CS   GP       GC P      L C  Y + YG G  T G  + + +
Sbjct: 209 TSDAVGCDDNFCSLYDGP-----LPGCKP-----GLQCL-YSVLYGDGSSTTGYFVQDFV 257

Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
           ++       +T P     + GC          S     GI GFG+++ S+ SQL      
Sbjct: 258 QYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKV 317

Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL     D+        +      G+   P ++ TP  +N           Y V
Sbjct: 318 KKVFSHCL-----DNVDGGGIFAI------GEVVEPKVNITPLVQNQA--------HYNV 358

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            +++I VG   + +P      G     G I+DSG+T  +    ++  + ++ + Q  +  
Sbjct: 359 VMKEIEVGGDPLDVPSDAFESGD--RKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL- 415

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
           R   VE+      CFD +G      P + L F     + + P  Y   V     C I + 
Sbjct: 416 RLHTVEQAF---TCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWC-IGWQ 471

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           ++ A    G+    +LGD  L N  + +DL     G+ +  C+
Sbjct: 472 NSGAQTKDGK-DLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 513


>gi|296086729|emb|CBI32364.3| unnamed protein product [Vitis vinifera]
          Length = 400

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 103/395 (26%), Positives = 155/395 (39%), Gaps = 96/395 (24%)

Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
           P      + D G+  +W  C   Y                      SSS +   C++ +C
Sbjct: 53  PLVPVKLVVDLGAQFLWVDCEQNYV---------------------SSSYRPARCRSAQC 91

Query: 174 SWIFGPNVESRCKGC-----SPR----NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
           S        +R  GC     +PR    N TC LA     +Q   G   G ++S + +F  
Sbjct: 92  SL-------ARANGCGDCFSAPRPGCNNNTCGLAEDFVSVQSTDGSNPGRVVSVS-KFLF 143

Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
              P FL  G +        G+AG GR+  + PSQ        +KF+ CL S        
Sbjct: 144 SCAPTFLLEGLA----SSAMGMAGLGRTRIAFPSQFASAFSFHRKFATCLSSS------T 193

Query: 279 SSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
           ++N V+  G G          +  L YTP Y NP                 I +  K + 
Sbjct: 194 TANGVVFFGDGPYRLLPNIDASQSLIYTPLYINP----------------SIRINEKAIS 237

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGL 391
           +  S L   S+G GG  + + + +T ME  +++A  K FI      N +R A V      
Sbjct: 238 LNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKAFTKAFISAAAAINITRVAAVAP---F 294

Query: 392 RPCFDISGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLILFTDNA 443
             CF     K+VY       +P + L  +  +    +   N    V ++VLCL  F D  
Sbjct: 295 NVCFS---SKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCL-GFVDGG 350

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           A P      +I++G +QL++  L+FDLA  R GF+
Sbjct: 351 ANPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 381


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 161/403 (39%), Gaps = 63/403 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS ++W  C    RC   +   VD   +  +  K S+
Sbjct: 72  GLYFAKIGIGTPSKDYYVQV-DTGSDILWVNCAGCDRCPTKSDLGVD---LTLYDMKAST 127

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +S  +GC +  CS   GP       GC P      L C  Y + YG G  T G  + + +
Sbjct: 128 TSDAVGCDDNFCSLYDGP-----LPGCKP-----GLQCL-YSVLYGDGSSTTGYFVQDFV 176

Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
           ++       +T P     + GC          S     GI GFG+++ S+ SQL      
Sbjct: 177 QYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKV 236

Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL     D+        +      G+   P ++ TP  +N           Y V
Sbjct: 237 KKVFSHCL-----DNVDGGGIFAI------GEVVEPKVNITPLVQNQA--------HYNV 277

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            +++I VG   + +P      G     G I+DSG+T  +    ++  + ++ + Q  +  
Sbjct: 278 VMKEIEVGGDPLDVPSDAFESGD--RKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL- 334

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
           R   VE+      CFD +G      P + L F     + + P  Y   V     C I + 
Sbjct: 335 RLHTVEQA---FTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWC-IGWQ 390

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           ++ A    G+    +LGD  L N  + +DL     G+ +  C+
Sbjct: 391 NSGAQTKDGK-DLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 432


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 114/427 (26%), Positives = 172/427 (40%), Gaps = 54/427 (12%)

Query: 65  SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
           SR   + +K    +  S++ +  + +L     S+   G Y +++  GTP +     IFDT
Sbjct: 114 SRVDSIHSKLSKDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKK-DFSLIFDT 172

Query: 125 GSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
           GS L W  C     CV   +      +   F P +S+S   I C +  C  +   +    
Sbjct: 173 GSDLTWTQCEP---CVKSCYNQ----KEAIFNPSQSTSYANISCGSTLCDSL--ASATGN 223

Query: 185 CKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPN-FLAGC---SILSD 239
              C+  + TC      Y +QYG   F+ G    E L   +  V N F  GC   +    
Sbjct: 224 IFNCA--SSTCV-----YGIQYGDSSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLF 276

Query: 240 RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
              AG+ G GR   SL SQ      K FSYCL S       ++           G S + 
Sbjct: 277 GGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFLT----------FGGSTSK 326

Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
             S+TP      GSS     FY + L  I VG + + I      P      G I+DSG+ 
Sbjct: 327 SASFTPLATISGGSS-----FYGLDLTGISVGGRKLAIS-----PSVFSTAGTIIDSGTV 376

Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
            T +    + A++  F + M  Y  A  +   S L  CFD S   ++ +P++ L F GG 
Sbjct: 377 ITRLPPAAYSALSSTFRKLMSQYPAAPAL---SILDTCFDFSNHDTISVPKIGLFFSGGV 433

Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
            + +     F +     +CL  F  N+    +      I G+ Q +   + +D A  R G
Sbjct: 434 VVDIDKTGIFYVNDLTQVCLA-FAGNSDASDVA-----IFGNVQQKTLEVVYDGAAGRVG 487

Query: 477 FAKQKCA 483
           FA   C+
Sbjct: 488 FAPAGCS 494


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 162/403 (40%), Gaps = 64/403 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS ++W  C    RC   +   VD   +  +  K S+
Sbjct: 153 GLYFAKIGIGTPSKDYYVQV-DTGSDILWVNCAGCDRCPTKSDLGVD---LTLYDMKAST 208

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +S  +GC +  CS   GP       GC P      L C  Y + YG G  T G  + + +
Sbjct: 209 TSDAVGCDDNFCSLYDGP-----LPGCKP-----GLQCL-YSVLYGDGSSTTGYFVQDFV 257

Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
           ++       +T P     + GC          S     GI GFG+++ S+ SQL      
Sbjct: 258 QYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKV 317

Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL     D+        +      G+   P ++ TP  +N           Y V
Sbjct: 318 KKVFSHCL-----DNVDGGGIFAI------GEVVEPKVNITPLVQNQA--------HYNV 358

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            +++I VG   + +P      G     G I+DSG+T  +    ++  + ++ + Q  +  
Sbjct: 359 VMKEIEVGGDPLDVPSDAFESGD--RKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL- 415

Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
           R   VE+      CFD +G      P + L F     + + P  Y  L  +E    I + 
Sbjct: 416 RLHTVEQAF---TCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEY--LFQHEFEWCIGWQ 470

Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           ++ A    G+    +LGD  L N  + +DL     G+ +  C+
Sbjct: 471 NSGAQTKDGK-DLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 512


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 148/389 (38%), Gaps = 49/389 (12%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y +    GTP Q       DT S + W PC     C+ C+           F    S++ 
Sbjct: 101 YIVRAKIGTPAQTML-MAMDTSSDVAWIPCNG---CLGCSST--------LFNSPASTTY 148

Query: 164 QLIGCQNPKCSWIF---GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL 220
           + +GCQ  +C  +     P + S      P   TC     S+ L YG    A  L  +T+
Sbjct: 149 KSLGCQAAQCKQVLHLLSPLLTSPSVVPKP---TCGGGVCSFNLTYGGSSLAANLSQDTI 205

Query: 221 RFPSKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
              +  VP +  GC        L  +   G+     S  S    L    FSYCL S  F 
Sbjct: 206 TLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FK 263

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
               S +L L  GP     +   + YTP  KNP   S      Y+V L  + VG + V +
Sbjct: 264 SLNFSGSLRL--GPVGQPKR---IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDV 313

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
           P            G I DSG+ FT +  P + AV   F  ++G   R   V    G   C
Sbjct: 314 PPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTC 370

Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPA 453
           + +     +  P +   F  G  + LPP+N           CL +    AA P       
Sbjct: 371 YTV----PIAAPTITFMFT-GMNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVL 421

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            ++ + Q QN  L +D+ N R G A++ C
Sbjct: 422 NVIANLQQQNHRLLYDVPNSRLGVARELC 450


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 108/469 (23%), Positives = 179/469 (38%), Gaps = 72/469 (15%)

Query: 27  SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSN 86
           +S +T T+ LT L  +H    +   P++   S      +R R L  +      D    S 
Sbjct: 53  NSPSTSTIRLTILHREHPCAPASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEATAS- 111

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
               LI        Y  Y   +  GTP +     + DT SSL W  C           P 
Sbjct: 112 ---GLIFANGVPWDYYSYVTQVQLGTPAKTHNVLV-DTASSLSWVGCE----------PC 157

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
           ++   IP F P  SS+ +++GC +  C+ +  P+     K C    + C     SY   Y
Sbjct: 158 INACLIPTFNPNASSTYKVVGCGSALCNAV--PSATMARKSCMAPTEGC-----SYRQSY 210

Query: 207 -GLGFTAGLLLSETLRFPSKTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL- 261
                + G++ S+TL +   +   F+ GC  L      + +GI G   +  SL SQ+ + 
Sbjct: 211 HDYSLSVGVVSSDTLTYGLGS-QKFIFGCCNLFRGVGGRYSGILGMSVNKFSLFSQMTVG 269

Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
              +  SYC         P +   +     G  D     L +TP Y +        G  Y
Sbjct: 270 HRYRAMSYCF------PHPRNQGFL---QFGRYDEHKSLLRFTPLYID--------GNNY 312

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV--IVDSGSTFTFMEGPLFEAVAKEFIRQM 376
           +V +  ++V +  + +        S GN  +    D+G+ +T +   LF +++      +
Sbjct: 313 FVHVSNVMVETMSLDVQ-------SSGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLV 365

Query: 377 GNYSRAADVEKKSGLRPCFDISG---KKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
             Y R       S  + CF   G   +  +Y+P + ++F+ GA++ L  E+   +    V
Sbjct: 366 EGYYRVG----ASTGQTCFQADGNWIEGDLYMPTVKIEFQNGARITLNSEDLMFMEEPNV 421

Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            CL           +  G  I+LG   L   +   DL     G   Q C
Sbjct: 422 FCLAF--------KMNDGGDIVLGSRHLMGVHTVVDLEMMTMGLRGQGC 462


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 97/404 (24%), Positives = 160/404 (39%), Gaps = 64/404 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+PP      + DTGS ++W  C     C   +   VD   +  + PK SS
Sbjct: 71  GLYYARIGIGSPPNDFHVQV-DTGSDILWVNCVGCSNCPKKSDIGVD---LQLYNPKSSS 126

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
           +S LI C  P CS  +    ++   GC P      L C  Y + YG G  TAG  +++ +
Sbjct: 127 TSTLITCDQPFCSATY----DAPIPGCKP-----DLLC-QYKVIYGDGSATAGYFVNDYI 176

Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
           +          S+T  + + GC          S     GI GFG+++ S+ SQL      
Sbjct: 177 QLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKV 236

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K F++CL S       +S   +       G+   P L  TP   N           Y V
Sbjct: 237 KKIFAHCLDS-------ISGGGIF----AIGEVVEPKLXNTPVVPNQA--------HYNV 277

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
            L  + VG   + +P       +    G I+DSG+T  ++   ++  + ++ +    +  
Sbjct: 278 VLNGVKVGDTALDLPLGLF--ETSYKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLK 335

Query: 381 -RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
            R  D +       CF          P +  KF+    + + P  Y   + ++V C  + 
Sbjct: 336 LRTVDDQ-----FTCFVFDKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWC--VG 388

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             N+   +       +LGD  LQN  + ++L N   G+ +  C+
Sbjct: 389 WQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCS 432


>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
 gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 112/405 (27%), Positives = 170/405 (41%), Gaps = 76/405 (18%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +++S G PP  +   I DTGS+L W  C     C V C+  +      P F P RS +S+
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 54

Query: 165 LIGCQNPKCSWIFGPNVESRCK--GCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETL 220
            + C + KC     P  + R +   C  +  +C     +Y + YG G  ++ G ++++TL
Sbjct: 55  RVRCSSVKCG---EPRYDLRLQQANCMEKEDSC-----TYSVTYGNGWAYSVGKMVTDTL 106

Query: 221 RFPSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSR 271
           R    +  + + GCS  +      AGI GFG SS S   QL         K FSYCL + 
Sbjct: 107 RI-GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT- 164

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
              D      ++L    G  D       YTP ++      S     Y + +  +I   + 
Sbjct: 165 ---DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQR 211

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKS 389
           +    S +          IVDSG+  T +    F  + K   + M +  Y R +   ++S
Sbjct: 212 LVTSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQES 261

Query: 390 GLRPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
            +  C+    D SG             LP L + F GGA +ALPP N F    +  LC+ 
Sbjct: 262 YI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMT 319

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                A  PAL    + ILG+   ++F   FD+   +FGF    C
Sbjct: 320 F----AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 110/473 (23%), Positives = 171/473 (36%), Gaps = 89/473 (18%)

Query: 56  LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY---GGYSISLSFGT 112
           L  LA S   R   + +  + + +++  GS  S +  + PL+  +Y   G Y +    GT
Sbjct: 45  LADLARSDRQRMAFIASHGRRRARETAAGS--SAAAFEMPLTSGAYTGIGQYFVRFRVGT 102

Query: 113 PPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
           P Q   PF+   DTGS L W  C    R    N          AF P+ S +   I C +
Sbjct: 103 PAQ---PFLLVADTGSDLTWVKC----RRPAANSSESGSGSGRAFRPEDSRTWAPISCAS 155

Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVP-- 228
             C+           K       TCP   P     Y   +  G     T+   S T+   
Sbjct: 156 DTCT-----------KSLPFSLATCP--TPGSPCAYDYRYKDGSAARGTVGTESATIALS 202

Query: 229 ------------NFLAGCSI--------LSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
                         + GC+         +SD    G+   G S  S  S    +   +FS
Sbjct: 203 GRGREERKAKLKGLVLGCTSSYTGPSFEVSD----GVLSLGYSDVSFASHAASRFAGRFS 258

Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGD--------------SKTPGLSYTPFYKNPVGSS 311
           YCL+      +P ++   L  GP                  +           + P+   
Sbjct: 259 YCLVDHL---SPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLD 315

Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
                FY V ++ + V  + +KIP +  V   D  GGVI+DSG++ T +  P + AV   
Sbjct: 316 RRMRPFYDVAVKAVSVAGQFLKIPRA--VWDVDAGGGVILDSGTSLTVLAKPAYRAVVAA 373

Query: 372 FIRQMGNYSRAADVEKKSGLRPCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
               +    R            C++  S    V LP++ + F G A++  P ++Y     
Sbjct: 374 LSEGLAGLPRV----TMDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAA 429

Query: 431 NEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
             V C+        G   G  P I ++G+   Q    EFD+ N R  F + +C
Sbjct: 430 PGVKCI--------GLQEGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474


>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
 gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
          Length = 649

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 105/431 (24%), Positives = 165/431 (38%), Gaps = 92/431 (21%)

Query: 97  SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAF 155
           SV  +G Y  +++ G P   +   I DTGS+L + PC +  +C         DP      
Sbjct: 105 SVKEHGYYYANIALGDPSPRTFQVIVDTGSTLTYVPCATCAKCGTHTGGTRFDP------ 158

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
                 + + + CQ  +C    GP + +  +G +    T       Y   Y  G   +G 
Sbjct: 159 ------TGKWLTCQEKQCKAAGGPGICAGGRGAAANRCT-------YSRTYAEGSGVSGD 205

Query: 215 LLSETLRFPSKTVP------NFLAGCS-----ILSDRQPAGIAGFGRSS-ESLPSQL--- 259
           L+ + + F     P      + + GC+      + D++  G+ G G +   S+P+QL   
Sbjct: 206 LVRDKMHFGGDIAPATNGTLDVVFGCTNAESGTIHDQEADGLIGLGNNQFASIPNQLADT 265

Query: 260 -GLKK-FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
            GL + FS C  S  F+     S   L   P      TP L YT    N      A   +
Sbjct: 266 HGLPRVFSLCFGS--FEGGGALSFGRLPATP-----HTPPLVYTDMRVN-----EAHPAY 313

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
           Y V    + +G   V  P    V       G ++DSG+TFT++   +F A A      + 
Sbjct: 314 YVVSTAAMKIGDVAVATPSDLAV-----GYGTVMDSGTTFTYVPTKVFHATAAALDAAVT 368

Query: 378 NYSRAADVEKKSGLRP----------CFDISGKKSV-----------YLPELILKFKG-G 415
             ++    EKK    P          CF   G   +           Y P L + F G G
Sbjct: 369 TNAKP---EKKLAKVPGPDPSYPDDVCFQREGATEIEPIVTMANLGEYYPPLTIAFDGEG 425

Query: 416 AKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFD--LA 471
           A + LPP NY  + G +    CL +  +   G         ++G   +++  +E+D  + 
Sbjct: 426 ASLVLPPSNYLFVHGKKPGAFCLGVMDNKQQG--------TLIGGISVRDVLVEYDKTVG 477

Query: 472 NDRFGFAKQKC 482
             R GFA   C
Sbjct: 478 GGRIGFAATDC 488


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 154/381 (40%), Gaps = 70/381 (18%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G + + ++FGTPPQ  T  I DTGSS+ W  C    RC+  +  + DPS           
Sbjct: 160 GNFLVDVAFGTPPQKFT-LILDTGSSITWTQCKPCVRCLKASRRHFDPS----------- 207

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
                       ++  G  + S                 +Y + YG   T+ G    +T+
Sbjct: 208 ---------ASLTYSLGSCIPSTVGN-------------TYNMTYGDKSTSVGNYGCDTM 245

Query: 221 RFPSKTV-PNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLG---LKKFSYCLLSRK 272
                 V P F  GC   ++        G+ G G+   S  SQ      K FSYCL    
Sbjct: 246 TLEHSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLP--- 302

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
            ++  + S L  +       S++  L +T     P  S      +Y+V L  I VG+K +
Sbjct: 303 -EEDSIGSLLFGE----KATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRL 357

Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-L 391
            IP S        + G I+DSG+  T +    + A+   F + M  Y  +    KK   L
Sbjct: 358 NIPSSVFA-----SPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL 412

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALG 449
             C+++SG+K V LPE++L F  GA + L  +    + GN+   LCL            G
Sbjct: 413 DTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKR--VIWGNDASRLCLAF---------AG 461

Query: 450 RGPAIILGDFQLQNFYLEFDL 470
                I+G+ Q  +  + +D+
Sbjct: 462 NSELTIIGNRQQVSLTVLYDI 482


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 105/401 (26%), Positives = 161/401 (40%), Gaps = 68/401 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y +++S GTPP  S   I DTGS L+W  C     C DC +  V+    P F PK+S 
Sbjct: 92  GSYLMNISLGTPP-VSMLGIADTGSDLIWRQC---LPCDDC-YKQVE----PLFDPKKSK 142

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
           + + +GC N  C  +       +   C   N      C S        +T   L SET  
Sbjct: 143 TYKTLGCNNDFCQDL------GQQGSCGDDN-----TCTSSYSYGDQSYTRRDLSSETFT 191

Query: 222 FPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLL 269
             S      + P    GC   +    + + +G+ G G    SL  QL  K   +FSYCL+
Sbjct: 192 IGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLV 251

Query: 270 SRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
                D+  SS +         GSG   TP +  TP              FYY+ L  + 
Sbjct: 252 PLS-SDSTASSKINFGKSAVVSGSGTVSTPLIKGTP------------DTFYYLTLEGMS 298

Query: 327 VGSKHVKIP---YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           +GS+ V       +   P +     +I+DSG+T T +    +  +     + +G  +   
Sbjct: 299 LGSEKVAFKGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTT-- 356

Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDN 442
             + +     C+  SG K + +P +   F  GA + LPP N F     +++C  +  + N
Sbjct: 357 -TDPRGTFSLCY--SGVKKLEIPTITAHFI-GADVQLPPLNTFVQAQEDLVCFSMIPSSN 412

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            A          I G+    NF + +DL N++  F    C 
Sbjct: 413 LA----------IFGNLSQMNFLVGYDLKNNKVSFKPTDCT 443


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 104/426 (24%), Positives = 176/426 (41%), Gaps = 94/426 (22%)

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
           + N+ ++    + + G Y+  L  GTPPQ     I D+GS++ + PC+S  +C +     
Sbjct: 71  HPNARMRLHDDLLTNGYYTTRLYIGTPPQEFA-LIVDSGSTVTYVPCSSCEQCGN----- 124

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
               + P F P  SSS   + C N  C+             C    K C     +Y  QY
Sbjct: 125 ---HQDPRFQPDLSSSYSPVKC-NVDCT-------------CDSDKKQC-----TYERQY 162

Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNF-LAGCS-----ILSDRQPAGIAGFGRSSESLPS 257
             +  ++G+L  + + F   S+  P   + GC       L  +   GI G GR   S+  
Sbjct: 163 AEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMD 222

Query: 258 QLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPV 308
           QL  K      FS C                +D G G+    G    P + ++    +P+
Sbjct: 223 QLVEKGVISDSFSLCYGG-------------MDIGGGAMVLGGMLAPPDMIFS--NSDPL 267

Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
            S      +Y + L++I V  K +++         +   G ++DSG+T+ ++    F A 
Sbjct: 268 RSP-----YYNIELKEIHVAGKALRVESRIF----NSKHGTVLDSGTTYAYLPEQAFVAF 318

Query: 369 AKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMA 419
            KE +      S+   ++K  G  P     CF  +G+    L    P++ + F  G K++
Sbjct: 319 -KEAVT-----SKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLS 372

Query: 420 LPPENYFALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           L PENY       +   CL +F +       G+ P  +LG   ++N  + +D  N++ GF
Sbjct: 373 LTPENYLFRHSKVDGAYCLGVFQN-------GKDPTTLLGGIIVRNTLVTYDRHNEKIGF 425

Query: 478 AKQKCA 483
            K  C+
Sbjct: 426 WKTNCS 431


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 103/412 (25%), Positives = 170/412 (41%), Gaps = 96/412 (23%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y+  L  GTPPQ     I DTGS++ + PC++   C  C        + P F P  S 
Sbjct: 87  GYYTTRLWIGTPPQRFA-LIVDTGSTVTYVPCST---CEHCG-----RHQDPKFQPDLSE 137

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + Q + C  P C+             C      C      Y  QY  +  ++G+L  + +
Sbjct: 138 TYQPVKC-TPDCN-------------CDGDTNQC-----MYDRQYAEMSSSSGVLGEDVV 178

Query: 221 RFP--SKTVPN-FLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLKK-----FSYC 267
            F   S+  P   + GC       L  ++  GI G GR   S+  QL  KK     FS C
Sbjct: 179 SFGNLSELAPQRAVFGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLC 238

Query: 268 LLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
                           +D G G+    G S    + +T  + +P  S      +Y + L+
Sbjct: 239 YGG-------------MDVGGGAMILGGISPPEDMVFT--HSDPDRSP-----YYNINLK 278

Query: 324 QIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
           ++ V  K +++ P  +     DG  G ++DSG+T+ ++    F A  +  +++  +    
Sbjct: 279 EMHVAGKKLQLNPKVF-----DGKHGTVLDSGTTYAYLPETAFLAFKRAIMKERNS---- 329

Query: 383 ADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMALPPENYFALVGNE- 432
             +++ +G  P     CF  +G     L    P + + F+ G K++L PENY        
Sbjct: 330 --LKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVR 387

Query: 433 -VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              CL +F++       GR P  +LG   ++N  + +D  N + GF K  C+
Sbjct: 388 GAYCLGVFSN-------GRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNCS 432


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 101/420 (24%), Positives = 170/420 (40%), Gaps = 82/420 (19%)

Query: 87  YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
           + N+ ++    + + G Y+  L  GTPPQ     I D+GS++ + PC S  +C +     
Sbjct: 72  HPNARMRLHDDLLTNGYYTTRLYIGTPPQEFA-LIVDSGSTVTYVPCASCEQCGN----- 125

Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
               + P F P  SSS   + C N  C+             C    K C     +Y  QY
Sbjct: 126 ---HQDPRFQPDLSSSYSPVKC-NVDCT-------------CDSDKKQC-----TYERQY 163

Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNFLA-GCS-----ILSDRQPAGIAGFGRSSESLPS 257
             +  ++G+L  + + F   S+  P     GC       L  +   GI G GR   S+  
Sbjct: 164 AEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMD 223

Query: 258 QL---GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
           QL   G+   S+ L     D       +VL   P   D      S++   ++P       
Sbjct: 224 QLVEKGVISDSFSLCYGGMDIG--GGAMVLGGVPAPSDMV---FSHSDPLRSP------- 271

Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
             +Y + L++I V  K +++         +   G ++DSG+T+ ++    F A       
Sbjct: 272 --YYNIELKEIHVAGKALRVDSRVF----NSKHGTVLDSGTTYAYLPEQAFVAFKDAVT- 324

Query: 375 QMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMALPPENY 425
                S+   ++K  G  P     CF  +G+    L    P++ + F  G K++L PENY
Sbjct: 325 -----SKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENY 379

Query: 426 FALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
                  +   CL +F +       G+ P  +LG   ++N  + +D  N++ GF K  C+
Sbjct: 380 LFRHSKVDGAYCLGVFQN-------GKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCS 432


>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
          Length = 440

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 114/259 (44%), Gaps = 38/259 (14%)

Query: 244 GIAGFGRSSESLPSQLGL-----KKFSYCLLSRK-------FDDAPVSSNLVLDTGPGSG 291
           GI G G      P+QL       +KF+ CL S         F D+P    + L   PG  
Sbjct: 177 GILGLGNGYVGFPTQLANAFSVPRKFAICLTSSTTSRGVIFFGDSPY---VFL---PGMD 230

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
            SK   L YTP  KNPV +S ++ E      Y++G+  I +    V I  + L    DG 
Sbjct: 231 VSKR--LVYTPLLKNPVSTSGSYFEGEPSTDYFIGVTSIKINGNVVPINTTLLNITKDGK 288

Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
           GG  + +   +T +E  ++ A+ K F++ +    R   V   +  + C++ +   S  + 
Sbjct: 289 GGTKISTVDPYTKLETSIYNALTKAFVKSLAKVPRVKPV---APFKVCYNRTSLGSTRVG 345

Query: 407 ------ELILKFKGG-AKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDF 459
                 EL+L  K       +   N    + N+VLCL  F D   G       +I++G  
Sbjct: 346 RGVPPIELVLGNKNATTSWTIWGVNSMVAMNNDVLCL-GFLD--GGVEFEPTTSIVIGAH 402

Query: 460 QLQNFYLEFDLANDRFGFA 478
           Q+++  L+FD+AN R GF 
Sbjct: 403 QIEDNLLQFDIANKRLGFT 421


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 107/401 (26%), Positives = 153/401 (38%), Gaps = 71/401 (17%)

Query: 98  VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
           V + G Y + +  GTP Q     + DT +   W PC+    C  C+      +    +  
Sbjct: 91  VLNIGNYVVRVKLGTPGQFMF-MVLDTSNDAAWVPCSG---CTGCSSTTFSTNTSSTY-- 144

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL---QYG--LGFTA 212
                   + C   +C+ +              R  +CP    S  +    YG    F+A
Sbjct: 145 ------GSLDCSMAQCTQV--------------RGFSCPATGSSSCVFNQSYGGDSSFSA 184

Query: 213 GLLLSETLRFPSKTVPNFLAGC--SILSDRQPAGIAGFGR--------SSESLPSQLGLK 262
             L+ ++LR  +  +PNF  GC  SI     P                 S SL S L   
Sbjct: 185 -TLVEDSLRLVNDVIPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGL--- 240

Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
            FSYCL S  F     S +L L  GP +G  K+  + YTP  +NP   S      YYV L
Sbjct: 241 -FSYCLPS--FKSYYFSGSLKL--GP-AGQPKS--IRYTPLLRNPHRPS-----LYYVNL 287

Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
             + VG   V I    L    +   G I+DSG+  T    P++ A+  EF +Q+     A
Sbjct: 288 TGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIYTAIRDEFRKQV-----A 342

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTD 441
                      CF  +       P + L F  G  + LP EN         + CL +   
Sbjct: 343 GPFSSLGAFDTCF--AATNEAVAPAVTLHFT-GLNLVLPMENSLIHSSAGSLACLAM--- 396

Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            AA P        ++ + Q QN  L FD+ N R G A++ C
Sbjct: 397 -AAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 114/486 (23%), Positives = 192/486 (39%), Gaps = 73/486 (15%)

Query: 6   FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
           FS++ L    +  F   + A     ++ +     S K  L+H      +  +++   S++
Sbjct: 4   FSVLTLIFFYLCCFIYFSHASKKGLSIEMIHRDFS-KSPLYHPTVTKFQRAYNVVHRSIN 62

Query: 66  RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
           R  +        TK+ ++  N   S +   L     G Y IS S GTPP     F+ DTG
Sbjct: 63  RVNYF-------TKEFSLNKNQPVSTLTPEL-----GEYLISYSVGTPPFKVYGFM-DTG 109

Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
           S++VW  C     C +           P F P +SSS + I C +  C      +     
Sbjct: 110 SNIVWLQCQPCNTCFN--------QTSPIFNPSKSSSYKNIPCTSSTCK-----DTNDTH 156

Query: 186 KGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKT-----VPNFLAGC---SI 236
             CS     C      Y + Y G   + G L +++L   S +      PN + GC   ++
Sbjct: 157 ISCSNGGDVC-----EYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINV 211

Query: 237 LSDR-QPAGIAGFGRSSESLPSQLGL----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
           L D  Q +G+ G GR   SL  Q+G      KFSYCL+     D+  SS L+        
Sbjct: 212 LQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYN-SDSNSSSKLIF------- 263

Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
             +   +S       P+   +    +Y++ L    VG+   +I Y      S  N  +++
Sbjct: 264 -GEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNN--RIEYGERSNASTQN--ILI 318

Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
           DSG+  T +   LF +    ++ Q     R    +    L  C++ +GK+ + +P++   
Sbjct: 319 DSGTPLTMLPN-LFLSKLVSYVAQEVKLPRIEPPDHH--LSLCYNTTGKQ-LNVPDITAH 374

Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
           F  GA + L     F    + ++C    + N            I G+    N  +++DL 
Sbjct: 375 FN-GADVKLNSNGTFFPFEDGIMCFGFISSNGLE---------IFGNIAQNNLLIDYDLE 424

Query: 472 NDRFGF 477
            +   F
Sbjct: 425 KEIISF 430


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/395 (25%), Positives = 154/395 (38%), Gaps = 60/395 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
           G Y   +  GTPP+       DTGS L+W  C   + C+ C  P     +IP   +  K 
Sbjct: 34  GLYFTQVQLGTPPRTYN-LQVDTGSDLLWVNC---HPCIGC--PAFSDLKIPIVPYDVKA 87

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
           S+SS  + C +P C+ I     +    GC+ +N+        Y  QYG G  T G L+ +
Sbjct: 88  SASSSKVPCSDPSCTLI----TQISESGCNDQNQC------GYSFQYGDGSGTLGYLVED 137

Query: 219 TLRFPSKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
            L +        + GC          S+R   GI GFG S  S  SQL  +  +  + + 
Sbjct: 138 VLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAH 197

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
             D       +++      G+   P + YTP           +   Y V L+ I V + +
Sbjct: 198 CLDGGERGGGILV-----LGNVIEPDIQYTPLVP--------YMYHYNVVLQSISVNNAN 244

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + I        +D   G I DSG+T  ++    ++A           +++A  +     L
Sbjct: 245 LTIDPKLF--SNDVMQGTIFDSGTTLAYLPDEAYQA-----------FTQAVSLVVAPFL 291

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNE-VLCLILFTDNAAGPA 447
                +S       P ++L F+ GA M L P  Y    A   N  + C+      + G A
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYFE-GASMTLTPAEYLIRQASAANAPIWCMGW---QSMGSA 347

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                  I GD  L+N  + +DL   R G+    C
Sbjct: 348 ESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/395 (25%), Positives = 155/395 (39%), Gaps = 60/395 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
           G Y   +  GTPP+     + DTGS L+W  C   + C+ C  P     +IP   +  K 
Sbjct: 34  GLYFTQVQLGTPPRTYNLQV-DTGSDLLWVNC---HPCIGC--PAFSDLKIPIVPYDVKA 87

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
           S+SS  + C +P C+ I     +    GC+ +N+        Y  QYG G  T G L+ +
Sbjct: 88  SASSSKVPCSDPSCTLI----TQISESGCNDQNQC------GYSFQYGDGSGTLGYLVED 137

Query: 219 TLRFPSKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
            L +        + GC          S+R   GI GFG S  S  SQL  +  +  + + 
Sbjct: 138 VLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAH 197

Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
             D       +++      G+   P + YTP           +   Y V L+ I V + +
Sbjct: 198 CLDGGERGGGILV-----LGNVIEPDIQYTPLVP--------YMSHYNVVLQSISVNNAN 244

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
           + I        +D   G I DSG+T  ++    ++A           +++A  +     L
Sbjct: 245 LTIDPKLF--SNDVMQGTIFDSGTTLAYLPDEAYQA-----------FTQAVSLVVAPFL 291

Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNE-VLCLILFTDNAAGPA 447
                +S       P ++L F+ GA M L P  Y    A   N  + C+      + G A
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYFE-GASMTLTPAEYLIRQASAANAPIWCMGW---QSMGSA 347

Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                  I GD  L+N  + +DL   R G+    C
Sbjct: 348 ESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 162/392 (41%), Gaps = 78/392 (19%)

Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
            I DTGS+  + PC    RC +      D  R   F        + + C     + +   
Sbjct: 53  LIVDTGSARTYVPCKGCARCGEHAHGYYDYDRSMEF--------ERLDCGEASDATL--- 101

Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKTVPNFLA-GC--- 234
             E   KG    +  C     SY++ Y  G ++ G ++ + +R    T+   LA GC   
Sbjct: 102 -CEETMKGTCQSDGRC-----SYVVSYAEGSSSRGYVVRDRVRLGEGTLSAMLAFGCEEA 155

Query: 235 --SILSDRQPAGIAGFGRSSESLPSQL---GLKK--FSYCLLSRKFDDAPVSSNLVLDTG 287
             + + +++  G+ GFGR + ++ +QL   GL +  FS+C+      +   ++  VL  G
Sbjct: 156 ETNAIYEQKADGLFGFGRGTATVHAQLASAGLIENVFSFCV------EGFGANGGVLTLG 209

Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
                +  P L+ TP   +P   +     F+ V      +G   ++   SY         
Sbjct: 210 RFDFGADAPALARTPLVADPANPA-----FHNVRTSSWKLGDSLIEHLNSYTT------- 257

Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKK- 401
              +DSG+TFTF+      +V   F  ++   +  A +E  +G  P     C+ +S    
Sbjct: 258 --TLDSGTTFTFVP----RSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAM 311

Query: 402 ---------SVYLPELILKFKGGAKMALPPENY-FALVGNEV-LCLILFTDNAAGPALGR 450
                    S + P L + ++GG  + L PENY FA   N    C+ +F    A P    
Sbjct: 312 NMTLSQSTVSEWFPPLTIAYEGGVSLTLGPENYLFAHETNSAAFCVGIF----ANP---- 363

Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              I+LG   +++  +EFD+AN R G A   C
Sbjct: 364 NNQILLGQITMRDTLMEFDVANSRVGMAPANC 395


>gi|108707516|gb|ABF95311.1| hypothetical protein LOC_Os03g17280 [Oryza sativa Japonica Group]
          Length = 353

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 1/97 (1%)

Query: 197 LACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESL 255
           LA  +  + Y  G T  LL+S+TLR P +T+ NF+ GCS++S  +Q +G+ GF     S+
Sbjct: 3   LAADAIGVVYSSGSTTRLLISDTLRTPGRTIRNFVVGCSLMSVYQQSSGLTGFSCGVPSV 62

Query: 256 PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
           PSQLGL KF Y LL+R+FDD   +S+ ++  G G  D
Sbjct: 63  PSQLGLTKFFYFLLARRFDDNATASDELILGGAGGKD 99



 Score = 44.7 bits (104), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 11/67 (16%)

Query: 386 EKKSGLRPCFDISGK-KSVYLPELILKFKGGAKMALPPENYFALVG----------NEVL 434
           EK  GL P   +S + K++ LP++ L FKGG+ M LP ENYF + G           E +
Sbjct: 124 EKGLGLSPYIAMSSRTKTMELPKISLYFKGGSVMNLPVENYFMVAGPAPSASVPAMAEAI 183

Query: 435 CLILFTD 441
           CL + +D
Sbjct: 184 CLAVVSD 190


>gi|168008086|ref|XP_001756738.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691976|gb|EDQ78335.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 174

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 84/192 (43%), Gaps = 27/192 (14%)

Query: 298 LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTF 357
           L +TP  K+P+  +     FY+V L  + V    + I    L   S+GNGG I+D  + F
Sbjct: 2   LEFTPLLKHPLVET-----FYFVNLVAVAVNGAKLPISSKVLKMNSEGNGGAILDMSTRF 56

Query: 358 TFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP----CFDISGKKSVYLPELILKFK 413
           T      F+ + K            A +   + + P    C+      ++ +P + L F+
Sbjct: 57  TRFPNSAFDHLVKAL---------KALIRLPTMVVPRFQLCYSTVNTGTLIIPTVTLIFE 107

Query: 414 GGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
            G +M LP EN F  V  +  V+CL +   N        G A ++G  Q QNF +  D  
Sbjct: 108 NGVRMRLPMENTFVSVTEQGDVMCLAMVPGNP-------GTATVIGSAQQQNFLIVIDRE 160

Query: 472 NDRFGFAKQKCA 483
             R GFA  +CA
Sbjct: 161 ASRLGFAPLQCA 172


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 109/444 (24%), Positives = 172/444 (38%), Gaps = 71/444 (15%)

Query: 60  ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY--GGYSISLSFGTPPQAS 117
           AS   SRAR  + +   K + S I    SNS  K P+S  S     Y +  + G+PP   
Sbjct: 70  ASVRTSRARGDRIR---KIRSSGI----SNSR-KYPVSRISIIDKVYVMKFNIGSPP-VE 120

Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS--- 174
           T  I DTGS++VW  C S   C +C        +IP F P +SS+  +  C + +C    
Sbjct: 121 TYAIPDTGSNIVWIQCGSPI-CTNCY-----KQKIPLFNPTKSSTYAIRLCGHRECKQAL 174

Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVP----- 228
           W  G  +     GC    + C      Y + Y    F+ G + ++ + FP          
Sbjct: 175 WGLGEYL-----GCKSSVQVC-----RYHISYEDHSFSEGTISTDIITFPEHIAEFGNYS 224

Query: 229 -NFLAGCSILSDRQPA---------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
                GC   +   P          G+ G G    SL  QL L +FSYC+ +        
Sbjct: 225 LRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCISTPDVQK--- 281

Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IPYS 337
                    P        GL+ +    +   +++  G + +  +  I V    VK  P  
Sbjct: 282 ---------PNGTIEIRFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEW 332

Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
                  G GG+I+DSG+T+T +     +A+  E   Q+       D    S    C++ 
Sbjct: 333 VFQFAEGGIGGLIMDSGTTYTELYFSALDALIGELKEQIELAPDTQD-HSNSNYSLCYNA 391

Query: 398 SGKKSVYLPELILKFKGGAKMALP--PENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
           +     Y+P + LKF    +   P    N +   GN+  CL +F         G     I
Sbjct: 392 ANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMF---------GTSGISI 442

Query: 456 LGDFQLQNFYLEFDLANDRFGFAK 479
           +G +Q ++  + +DL  +   F +
Sbjct: 443 IGIYQHRDIKIGYDLKYNLVSFTE 466


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 95/390 (24%), Positives = 149/390 (38%), Gaps = 57/390 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  +L+ GTPPQ ++  I   G   VW  C+   RC   + P  + S    + P+     
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGE-FVWTQCSPCRRCFKQDLPLFNRSASSTYRPEP---- 82

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
               C    C  +      S C G    +  C     SY ++   G T+G+  ++T    
Sbjct: 83  ----CGTALCESV----PASTCSG----DGVC-----SYEVETMFGDTSGIGGTDTFAIG 125

Query: 224 SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
           + T  +   GC++ S+ +     +G+ G GR+  SL  Q+    FSYCL       A   
Sbjct: 126 TATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG--AAGKK 182

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           S L+L         K+   + TP       SS      Y + L  I  G   +  P    
Sbjct: 183 SALLLGASAKLAGGKSA--ATTPLVNTSDDSSD-----YMIHLEGIKFGDVIIAPP---- 231

Query: 340 VPGSDGNGGVI-VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF--- 395
                 NG V+ VD+    +F+    F+A+ K     +G    A   +       CF   
Sbjct: 232 -----PNGSVVLVDTIFGVSFLVDAAFQAIKKAVTVAVGAAPMATPTKP---FDLCFPKA 283

Query: 396 --DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
                   S+ LP+++L F+G A + +PP  Y    GN  +CL + +             
Sbjct: 284 AAAAGANSSLPLPDVVLTFQGAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELS-- 341

Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            ILG    +N +  FDL  +   F    C+
Sbjct: 342 -ILGRLHQENIHFLFDLDKETLSFEPADCS 370


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 110/403 (27%), Positives = 170/403 (42%), Gaps = 72/403 (17%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +++S G PP  +   I DTGS+L W  C     C V C+  +      P F P RS +S+
Sbjct: 116 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 169

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETLRF 222
            + C + KC  +   ++  +   C  +  +C     +Y + YG G  ++ G ++++TLR 
Sbjct: 170 RVRCSSVKCGEL-RYDLRLQQANCMEKENSC-----TYSVTYGNGWAYSVGKMVTDTLRI 223

Query: 223 PSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSRKF 273
              +  + + GCS  +      AGI GFG SS S   QL         K FSYCL +   
Sbjct: 224 -GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT--- 279

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            D      ++L    G  D       YTP ++      S     Y + +  +I   + + 
Sbjct: 280 -DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQRLV 328

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGL 391
              S +          IVDSG+  T +    F  + K   + M +  Y R +   ++S +
Sbjct: 329 TSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378

Query: 392 RPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
             C+    D SG             LP L + F GGA +ALPP N F    +  LC+   
Sbjct: 379 --CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF- 435

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              A  PAL    + ILG+   ++F   FD+   +FGF    C
Sbjct: 436 ---AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 112/452 (24%), Positives = 170/452 (37%), Gaps = 111/452 (24%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  G+P +     I DTGS ++W  C +   C   +   +D   +  F    SS
Sbjct: 69  GLYFTKVKMGSPAKEFYVQI-DTGSDILWLNCNTCNNCPKSSGLGID---LNYFDTASSS 124

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           ++ L+ C +P CS+     V++    CS +   C     SY  QYG G  T+G  + + +
Sbjct: 125 TAALVSCSDPVCSYA----VQTATSQCSSQANQC-----SYTFQYGDGSGTSGYYVYDAM 175

Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQL---GL- 261
            F         S +    + GCS         +++   GI GFG  + S+ SQ+   G+ 
Sbjct: 176 YFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMA 235

Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
            K FS+CL  +          LVL      G+   P + YTP               Y +
Sbjct: 236 PKVFSHCLKGQ----GSGGGILVL------GEILEPNIVYTPLVP--------LQPHYNL 277

Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV------------ 368
            L+ I V  + + I       G+  N G IVDSG+T  ++    ++              
Sbjct: 278 NLQSIAVNGQILPIDQDVFATGN--NRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTH 335

Query: 369 ----AKEFIRQMGNYSRAADVEK----KSGLRPCFD----------------ISGKKSVY 404
                     + GN +  + V++    +  LR                    IS     Y
Sbjct: 336 FNEPTNNIKYEDGNNNHQSRVKRHYYDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQCY 395

Query: 405 L---------PELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG----RG 451
           L         P + L F GGA M L PE Y    G        F D AA   +G    + 
Sbjct: 396 LVPTSLGDIFPLVSLNFMGGASMVLKPEQYLIHYG--------FLDGAAMWCIGFQKVQK 447

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
              ILGD  L++    +DLAN R G+    C+
Sbjct: 448 GYTILGDLVLKDKIFVYDLANQRIGWTDYDCS 479


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 105/393 (26%), Positives = 163/393 (41%), Gaps = 60/393 (15%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
           + +S GTP   +   I DTGS++ W  C  +Y  V C     D    P F    SS+ + 
Sbjct: 25  MGISLGTPAVFNLVTI-DTGSTISWVQC--QYCIVHC--YTQDQRAGPTFNTSSSSTYRR 79

Query: 166 IGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP 223
           +GC    C  +    N+ S   GC     +C      Y L+Y  G ++AG L  + L   
Sbjct: 80  VGCSAQVCHDMHVSQNIPS---GCVEEEDSCI-----YSLRYASGEYSAGYLSQDRLTLA 131

Query: 224 -SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLG----LKKFSYCLLSRKFD 274
            S ++  F+ GC   SD +     AGI GFG  S S  +Q+        FSYC  S + +
Sbjct: 132 NSYSIQKFIFGCG--SDNRYNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQEN 189

Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
           +        L  GP   DS    L+    Y        A    Y +    ++V    +++
Sbjct: 190 EG------FLSIGPYVRDSNKLILTQLFDY-------GAHLPVYALQQFDMMVNGMRLQV 236

Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLR 392
                 P        +VDSG+  TF+  P+F A+ +   + M    Y R +D ++     
Sbjct: 237 D-----PPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKE----- 286

Query: 393 PCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALG 449
            CF  +G    +  LP + +KF   + + LP EN F    ++  +C     D+A  P + 
Sbjct: 287 ICFHSNGDSVDWSKLPVVEIKFS-RSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQ 345

Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
                ILG+   ++F + FD+    FGF    C
Sbjct: 346 -----ILGNRATRSFRVVFDIQQRNFGFEAGAC 373


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/384 (25%), Positives = 155/384 (40%), Gaps = 52/384 (13%)

Query: 106 ISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           ++LS G P  PQ     + DTGS ++W  C     C +      DPS    F P   +  
Sbjct: 103 VNLSIGQPSIPQL---VVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCKTPC 159

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
              GC   KC  I  P   S     S                    F   +L+ ET    
Sbjct: 160 GFKGC---KCDPI--PFTISYVDNSSASGT----------------FGRDILVFETTDEG 198

Query: 224 SKTVPNFLAGCS----ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
           +  + + + GC       SD    GI G      SL +Q+G +KFSYC+ +      P  
Sbjct: 199 TSQISDVIIGCGHNIGFNSDPGYNGILGLNNGPNSLATQIG-RKFSYCIGNLA---DPYY 254

Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
           +   L  G G+      G S TPF          +  FYYV +  I VG K + I     
Sbjct: 255 NYNQLRLGEGA---DLEGYS-TPF--------EVYHGFYYVTMEGISVGEKRLDIALETF 302

Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC-FDIS 398
               +G GGVI+DSG+T T++     + +  E +R +  +S    + + +  + C + I 
Sbjct: 303 EMKRNGTGGVILDSGTTITYLVDSAHKLLYNE-VRNLLKWSFRQVIFENAPWKLCYYGII 361

Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
            +  V  P +   F  GA +AL   ++F+   +++ C+ +   +     +   P++I G 
Sbjct: 362 SRDLVGFPVVTFHFVDGADLALDTGSFFSQ-RDDIFCMTVSPASILNTTI--SPSVI-GL 417

Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
              Q++ + +DL N    F +  C
Sbjct: 418 LAQQSYNVGYDLVNQFVYFQRIDC 441


>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
 gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
          Length = 484

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 100/391 (25%), Positives = 153/391 (39%), Gaps = 63/391 (16%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y ++  FGTP Q  T   FDT ++       ++ +C  C     D     AF P  SSS 
Sbjct: 145 YHVTAGFGTPVQQFT-VGFDTTTT-----GATQLQCKPCA---ADEPCHHAFDPSASSSI 195

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
             + C +P C +          KGCS  + T  ++  + LL     FT  L L+     P
Sbjct: 196 AHVPCGSPDCPF---------NKGCSGHSCTLSVSINNTLLGNATFFTDKLTLT-----P 241

Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESL-----PSQLGLKKFSYCLLSRKFDD 275
              V +F   C       D    GI    R+S SL     PS      FSYCL S   D 
Sbjct: 242 WNIVDDFRFVCLEAGFRPDDDSTGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYPSDV 301

Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
                   L  G    +     +SYTP   N        G  Y V L  + +G   + +P
Sbjct: 302 G------FLSLGATKPELLGRKVSYTPLRSN-----RHNGNLYVVELVGLGLGGVDLPVP 350

Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
            + +       GG I++  +TFT+++  ++ A+  EF + M  Y  A     +  L  C+
Sbjct: 351 RAAIA-----GGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYPVA---PPQGSLDTCY 402

Query: 396 DISGKKSVYLPELILKFKGGAKMALPPEN--YFALVGN--EVLCLILFTDNAAGPALGRG 451
           + +   S  +P + LKF GGA+  L  +   YF   G+   V CL     +         
Sbjct: 403 NFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFVAQDGGA------ 456

Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              ++G     +  + +D+   + GF   +C
Sbjct: 457 ---VIGSMAQMSTEVVYDVRGGKVGFVPYRC 484


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 110/403 (27%), Positives = 170/403 (42%), Gaps = 72/403 (17%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +++S G PP  +   I DTGS+L W  C     C V C+  +      P F P RS +S+
Sbjct: 116 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 169

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETLRF 222
            + C + KC  +   ++  +   C  +  +C     +Y + YG G  ++ G ++++TLR 
Sbjct: 170 RVRCSSVKCGEL-RYDLRLQQANCMEKEDSC-----TYSVTYGNGWAYSVGKMVTDTLRI 223

Query: 223 PSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSRKF 273
              +  + + GCS  +      AGI GFG SS S   QL         K FSYCL +   
Sbjct: 224 -GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT--- 279

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            D      ++L    G  D       YTP ++      S     Y + +  +I   + + 
Sbjct: 280 -DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQRLV 328

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGL 391
              S +          IVDSG+  T +    F  + K   + M +  Y R +   ++S +
Sbjct: 329 TSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378

Query: 392 RPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
             C+    D SG             LP L + F GGA +ALPP N F    +  LC+   
Sbjct: 379 --CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF- 435

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              A  PAL    + ILG+   ++F   FD+   +FGF    C
Sbjct: 436 ---AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 100/408 (24%), Positives = 160/408 (39%), Gaps = 66/408 (16%)

Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
           S G Y   +  GTP +       DTG+ ++W  C    +C +C   +     +  +  K 
Sbjct: 69  SVGLYYAKIGIGTPSK-DYYLQVDTGTDMMWVNCI---QCKECPTRSNLGMDLTLYNIKE 124

Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR-NKTCPLACPSYLLQYGLGF-TAGLLLS 217
           SSS +L+ C    C  I G        GC+ + N +CP     YL  YG G  TAG  + 
Sbjct: 125 SSSGKLVPCDQELCKEING----GLLTGCTSKTNDSCP-----YLEIYGDGSSTAGYFVK 175

Query: 218 ETLRFPS-----KTVP---NFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLG- 260
           + + F       KT     + + GC        S  ++    GI GFG+++ S+ SQL  
Sbjct: 176 DVVLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSS 235

Query: 261 ----LKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFG 315
                K F++CL            N V   G    G    P ++ TP   +         
Sbjct: 236 SGKVKKMFAHCL------------NGVNGGGIFAIGHVVQPTVNTTPLLPDQ-------- 275

Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
             Y V +  I VG  H  +  S        + G I+DSG+T  ++   +++ +  + + Q
Sbjct: 276 PHYSVNMTAIQVG--HTFLNLSTDASEQRDSKGTIIDSGTTLAYLPDGIYQPLVYKILSQ 333

Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
             N      V+       CF  SG      P +   F+ G  + + P +Y  L  N + C
Sbjct: 334 QPNLK----VQTLHDEYTCFQYSGSVDDGFPNVTFYFENGLSLKVYPHDYLFLSEN-LWC 388

Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           +     N+   +       +LGD  L N  + +DL N   G+ +  C+
Sbjct: 389 IGW--QNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 434


>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
 gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
 gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
 gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
 gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
 gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
 gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
 gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
          Length = 357

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 110/403 (27%), Positives = 170/403 (42%), Gaps = 72/403 (17%)

Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
           +++S G PP  +   I DTGS+L W  C     C V C+  +      P F P RS +S+
Sbjct: 1   MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 54

Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETLRF 222
            + C + KC  +   ++  +   C  +  +C     +Y + YG G  ++ G ++++TLR 
Sbjct: 55  RVRCSSVKCGEL-RYDLRLQQANCMEKEDSC-----TYSVTYGNGWAYSVGKMVTDTLRI 108

Query: 223 PSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSRKF 273
              +  + + GCS  +      AGI GFG SS S   QL         K FSYCL +   
Sbjct: 109 -GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT--- 164

Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
            D      ++L    G  D       YTP ++      S     Y + +  +I   + + 
Sbjct: 165 -DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQRLV 213

Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGL 391
              S +          IVDSG+  T +    F  + K   + M +  Y R +   ++S +
Sbjct: 214 TSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 263

Query: 392 RPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
             C+    D SG             LP L + F GGA +ALPP N F    +  LC+   
Sbjct: 264 --CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF- 320

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
              A  PAL    + ILG+   ++F   FD+   +FGF    C
Sbjct: 321 ---AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 99/404 (24%), Positives = 161/404 (39%), Gaps = 64/404 (15%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   +  GTP +     + DTGS +VW  C    +C +C   +     +  +  + S+
Sbjct: 85  GLYYAKIGIGTPSKDYYVQV-DTGSDIVWVNCI---QCRECPRTSSLGMELTPYDLEEST 140

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
           + +L+ C    C  + G  +     GC+  N +CP     YL  YG G  TAG  + + +
Sbjct: 141 TGKLVSCDEQFCLEVNGGPL----SGCTT-NMSCP-----YLQIYGDGSSTAGYFVKDYV 190

Query: 221 RFP--SKTVPNFLAGCSI---LSDRQPA-----------GIAGFGRSSESLPSQLG---- 260
           ++   S  +    A  SI      RQ             GI GFG+S+ S+ SQL     
Sbjct: 191 QYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRK 250

Query: 261 -LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
             K F++CL      D      +        G    P ++ TP   N           Y 
Sbjct: 251 VKKMFAHCL------DGTNGGGIF-----AMGHVVQPKVNMTPLVPNQ--------PHYN 291

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
           V +  + VG  H+ +  S  V  +    G I+DSG+T  ++   ++E +  + + Q  N 
Sbjct: 292 VNMTGVQVG--HIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHN- 348

Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
               +V+   G   CF  S +     P +I  F+    + + P  Y  L   E L  I +
Sbjct: 349 ---LEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLKVYPHEY--LFQYENLWCIGW 403

Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             N+   +  R    + GD  L N  + +DL N   G+ +  C+
Sbjct: 404 -QNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCS 446


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 106/406 (26%), Positives = 167/406 (41%), Gaps = 67/406 (16%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y   L  G+PP+     + DTGS ++W  C    RC   +   +D   +  + PK S 
Sbjct: 68  GLYFTKLGLGSPPRDYYVQV-DTGSDILWVNCVECSRCPRKSDLGID---LTLYDPKGSE 123

Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           +S ++ C    CS  F GP       GC        + CP Y + YG G  T G  + + 
Sbjct: 124 TSDVVSCDQDFCSATFDGP-----IPGCKSE-----IPCP-YSITYGDGSATTGYYVQDY 172

Query: 220 LRFPS-----KTVPN---FLAGCSIL--------SDRQPAGIAGFGRSSESLPSQLGL-- 261
           L +       +T P     + GC  +        S+    GI GFG+++ S+ SQL    
Sbjct: 173 LTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASG 232

Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
              K FS+CL     D+        +      G+   P +S TP               Y
Sbjct: 233 KVKKIFSHCL-----DNVRGGGIFAI------GEVVEPKVSTTPLVPRMA--------HY 273

Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF-EAVAKEFIRQMG 377
            V L+ I V +  +++P S +    +G G VI DSG+T  ++   ++ E + K   RQ G
Sbjct: 274 NVVLKSIEVDTDILQLP-SDIFDSVNGKGTVI-DSGTTLAYLPDIVYDELIQKVLARQPG 331

Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
              +   VE++     CF  +G      P + L FK    + + P +Y     + + C I
Sbjct: 332 --LKLYLVEQQF---RCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWC-I 385

Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
            +  + A    G+    +LGD  L N  + +DL N   G+    C+
Sbjct: 386 GWQRSVAQTKNGK-DMTLLGDLVLSNKLVIYDLENMVIGWTDYNCS 430


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 96/408 (23%), Positives = 152/408 (37%), Gaps = 94/408 (23%)

Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD-CNFPNVDPSRIPAFIPKRSSSSQL 165
           ++S GTP  +S     DTGS L W PC    +CV               +  K SS+S+ 
Sbjct: 116 NVSVGTPA-SSYLVALDTGSDLFWLPCNCT-KCVHGIQLSTGQKIAFNIYDNKESSTSKN 173

Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK 225
           + C +  C        E + +  S    TCP     YL +     T G L+ + L   + 
Sbjct: 174 VACNSSLC--------EQKTQCSSSSGGTCPYQV-EYLSENTS--TTGFLVEDVLHLITD 222

Query: 226 T-------VPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYC 267
                    P    GC      + L    P G+ G G S  S+PS L  +      FS C
Sbjct: 223 NDDQTQHANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMC 282

Query: 268 LLSRKFDDAPV-SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
             +          +N  LD G             TPF   P  S+      Y + + QII
Sbjct: 283 FAADGLGRITFGDNNSSLDQGK------------TPFNIRPSHST------YNITVTQII 324

Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR----QMGNYSRA 382
           VG     + ++            I D+G++FT++  P ++ + + F      Q  ++S +
Sbjct: 325 VGGNSADLEFN-----------AIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNS 373

Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV--------GNEVL 434
            D+        C+D+   +++ +P + L  KGG       +NYF +          N VL
Sbjct: 374 DDLP----FEYCYDLRTNQTIEVPNINLTMKGG-------DNYFVMDPIITSGGGNNGVL 422

Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           CL +   N            I+G   +  + + FD  N   G+ +  C
Sbjct: 423 CLAVLKSNNVN---------IIGQNFMTGYRIVFDRENMTLGWKESNC 461


>gi|242044812|ref|XP_002460277.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
 gi|241923654|gb|EER96798.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
          Length = 369

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 83/185 (44%), Gaps = 17/185 (9%)

Query: 298 LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTF 357
           +  TP   NP  SS      YYV +  I VG K V IP   L        G ++DSG+ F
Sbjct: 199 IKTTPLLANPHRSS-----LYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMF 253

Query: 358 TFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAK 417
           T +  P + AV  E  R++G     A V    G   CF+ +   +V  P + L F  G +
Sbjct: 254 TRLVAPAYVAVRDEVRRRVG-----APVSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQ 304

Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
           + LP EN   +V +     I     AA P        ++   Q QN  + FD+ N R GF
Sbjct: 305 VTLPEEN---VVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGF 361

Query: 478 AKQKC 482
           A+++C
Sbjct: 362 ARERC 366


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 139/368 (37%), Gaps = 56/368 (15%)

Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
            DT S + W PC     C+ C+           F    S++ + +GCQ  +C  +  P  
Sbjct: 1   MDTSSDVAWIPCNG---CLGCS--------STLFNSPASTTYKSLGCQAAQCKQVPKP-- 47

Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC------S 235
                       TC     S+ L YG    A  L  +T+   +  VP +  GC       
Sbjct: 48  ------------TCGGGVCSFNLTYGGSSLAANLSQDTITLATDAVPGYSFGCIQKATGG 95

Query: 236 ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
            L  +   G+     S  S    L    FSYCL S  F     S +L L  GP     + 
Sbjct: 96  SLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGSLRL--GPVGQPKR- 150

Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
             + YTP  KNP   S      Y+V L  + VG + V +P            G I DSG+
Sbjct: 151 --IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGT 203

Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
            FT +  P + AV   F  ++G   R   V    G   C+ +     +  P +   F  G
Sbjct: 204 VFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTV----PIAAPTITFMFT-G 255

Query: 416 AKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
             + LPP+N           CL +    AA P        ++ + Q QN  L +D+ N R
Sbjct: 256 MNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVLNVIANLQQQNHRLLYDVPNSR 311

Query: 475 FGFAKQKC 482
            G A++ C
Sbjct: 312 LGVARELC 319


>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 533

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 110/470 (23%), Positives = 184/470 (39%), Gaps = 86/470 (18%)

Query: 45  LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNI-GSNYSNSLIKTPLSVHSYGG 103
           LHH  SDP+K +  L+   L     L        +D  I G    +    TPL+  S   
Sbjct: 45  LHHRYSDPVKGM--LSVDDLPEKGSLHYYASMAHRDILIHGRKLVSDNTSTPLTFFSGNE 102

Query: 104 ----------YSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVD-CNFPNVDPSR 151
                     +  ++S GTP   S     DTGS L W PC  +   CV    FP+ +   
Sbjct: 103 TYRFSSLGFLHYANVSIGTP-SLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQID 161

Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
              + P  SS+SQ I C N  CS       +SRC        TCP     Y +QY    T
Sbjct: 162 FNIYRPNASSTSQTIPCNNTLCSR------QSRCPSA---QSTCP-----YQVQYLSNGT 207

Query: 212 A--GLLLSETLRFPSKTVPN------FLAGC------SILSDRQPAGIAGFGRSSESLPS 257
           +  G+L+ + L   +    +       + GC      S L    P G+ G G ++ S+PS
Sbjct: 208 SSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPS 267

Query: 258 QLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
            L  + ++    S  F    +      DTG       + G   TPF    +  +      
Sbjct: 268 TLAREGYTSNSFSMCFGRDGIGRISFGDTG-------SSGQGETPFNLRQLHPT------ 314

Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI--RQ 375
           Y V + +I VG +   + +S            I DSG++FT++  P +  +++ F    +
Sbjct: 315 YNVSITKINVGGRDADLEFS-----------AIFDSGTSFTYLNDPAYTLISESFNIGAK 363

Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKS-VYLPELILKFKGGAKMALPPENYFALV--GNE 432
              YS  +D+        C+++S  ++ + +P + L  +GG++  +       ++  G  
Sbjct: 364 EKRYSSISDIP----FEYCYEMSSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGAS 419

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           + CL +         +  G   I+G   +  + + F+   +  G+    C
Sbjct: 420 IYCLAI---------VKSGDVNIIGQNFMTGYRIVFNRERNVLGWKASDC 460


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 126/317 (39%), Gaps = 60/317 (18%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I D+GS + W       +C  C  P     R P F P  S++   + C +  C+ + GP 
Sbjct: 171 IIDSGSDVSWV------QCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPY 223

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGC 234
                +GCS  N  C         Q+G+ +  G   + T  F      P   +  F  GC
Sbjct: 224 R----RGCS-ANAQC---------QFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGC 269

Query: 235 SILS-----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL---V 283
           +        D   AG    G  S+SL  Q   +    FSYCL        P +S+L   V
Sbjct: 270 AHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCL-------PPTASSLGFLV 322

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
           L   P              F   P+ SSS    FY V LR IIV  + + +P +     S
Sbjct: 323 LGVPPERAQL------IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS 376

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
                 ++DS +  + +    ++A+   F   M  Y  A  V   S L  C+D +G +S+
Sbjct: 377 ------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPV---SILDTCYDFTGVRSI 427

Query: 404 YLPELILKFKGGAKMAL 420
            LP + L F GGA + L
Sbjct: 428 TLPSIALVFDGGATVNL 444



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 76/181 (41%), Gaps = 21/181 (11%)

Query: 303 FYKNPVGSSSAFG-EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
           F   P+ SSS+    FY V LR IIV  + + +P +     S      ++ S +  + + 
Sbjct: 560 FVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLP 613

Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP 421
              ++A+   F R M  Y  A  V   S L  C+D +G +S+ LP + L F GGA + L 
Sbjct: 614 PTAYQALRAAFRRAMTMYRTAPPV---SILDTCYDFTGVRSITLPSIALVFDGGATVNLD 670

Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
                 L G    CL       A  A  R P  I G+ Q +   + +D+      F    
Sbjct: 671 AAGIL-LQG----CLAF-----APTATDRMPGFI-GNVQQRTLEVVYDVPGKAIRFRSAA 719

Query: 482 C 482
           C
Sbjct: 720 C 720


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 104/400 (26%), Positives = 156/400 (39%), Gaps = 68/400 (17%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y+  +  GTP Q     I DTGS++ + PC+S   C  C          P F P  SS
Sbjct: 97  GYYTSRVFIGTPAQ-EFALIVDTGSTVTYVPCSS---CTHCGHHQA--CFDPRFKPDNSS 150

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           S Q + C +P C            K C  R   C      Y   Y  +  + G+L  + L
Sbjct: 151 SYQTVSCNSPDCI----------TKMCDARVHQC-----KYERVYAEMSSSKGVLGKDLL 195

Query: 221 RF--PSKTVPN-FLAGCSI-----LSDRQPAGIAGFGRSSESLPSQL---GLKKFSYCLL 269
            F   S+  P+  L GC       L  +   GI G GR   S+  QL   G  + S+ L 
Sbjct: 196 GFGNGSRLQPHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLC 255

Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
               D+     ++VL   P       P + +         S      +Y + L +I V  
Sbjct: 256 YGGMDEG--GGSMVLGAIP-----PPPAMVFAK-------SDPNRSNYYNLELSEIQVQG 301

Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
             + +P        +G  G ++DSG+T+ ++    F+A      +Q+G+  +A      S
Sbjct: 302 VSLNVPSEVF----NGRLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSL-QAVPGPDPS 356

Query: 390 GLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEV---LCLILFTDN 442
               CF  +G  S     + P +   F G  K+ L PENY      +V    CL  F + 
Sbjct: 357 YPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLAPENYL-FKHTKVPGAYCLGFFKNQ 415

Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            A          +LG   ++N  + +D AN + GF K  C
Sbjct: 416 DA--------TTLLGGIVVRNTLVTYDRANHQIGFFKTNC 447


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 146/397 (36%), Gaps = 59/397 (14%)

Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
           Y  + + GTPPQA +  I D    LVW       +C  C         +P F P  S++ 
Sbjct: 62  YVANFTIGTPPQAVS-GIVDLSGELVW------TQCAACRSSGCFKQELPVFDPSASNTY 114

Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
           +   C +P C  I   N    C G    +  C    PS       G T G+  ++ +   
Sbjct: 115 RAEQCGSPLCKSIPTRN----CSG----DGECGYEAPSMF-----GDTFGIASTDAIAI- 160

Query: 224 SKTVPNFLAGCSILSDRQ-------PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
                    GC + SD         P+G  G GR+  SL  Q  +  FSYCL        
Sbjct: 161 GNAEGRLAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCLAPHGPGK- 219

Query: 277 PVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKH 331
              S L L       G+G S  P    TP       ++S  G   +Y V L  I  G   
Sbjct: 220 --KSALFLGASAKLAGAGKSNPP----TPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVA 273

Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTF---TFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
           V          S G G + +    TF   +++    ++A+ K     +G+ S A   E  
Sbjct: 274 VAA-------ASSGGGAITILQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEP- 325

Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGP 446
                CF  +    V  P+L+  F+GGA +  PP  Y      GN  +CL + +      
Sbjct: 326 --FDLCFQNAAVSGV--PDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDS 381

Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           A       ILG    +N +  FDL  +   F    C+
Sbjct: 382 A--DDGVSILGSLLQENVHFLFDLEKETLSFEPADCS 416


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 105/416 (25%), Positives = 166/416 (39%), Gaps = 85/416 (20%)

Query: 101 YGGYSISLSFGTPPQASTPFIFD--TGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIP 157
           YG Y +++  G P   S P+  D  +GS L W  C +   C+ C   P+      P +  
Sbjct: 76  YGLYYVTMLVGNP---SKPYFLDVDSGSELTWIQCDAP--CISCAKGPH------PLYKL 124

Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
           K+ S   L+  ++P C+ +          G    +K     C   +     G++ G L+ 
Sbjct: 125 KKGS---LVPSKDPLCAAV------QAGSGHYHNHKEASQRCDYDVAYADHGYSEGFLVR 175

Query: 218 ETLR--FPSKTV--PNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL---GLKK 263
           +++R    +KTV   N + GC         +SD +  GI G G    SLPSQ    GL K
Sbjct: 176 DSVRALLTNKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIK 235

Query: 264 --FSYCLLSRKFDDAPV--SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
               +C+     D   +    +LV           T  +++ P    P        + YY
Sbjct: 236 NVIGHCIFGAGRDGGYMFFGDDLV----------STSAMTWVPMLGRPSI------KHYY 279

Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGN--GGVIVDSGSTFTFMEGPLFEA---VAKEFIR 374
           VG  Q+  G+K        L    DG   GG+I DSGST+T+     + A   V KE + 
Sbjct: 280 VGAAQMNFGNKP-------LDKDGDGKKLGGIIFDSGSTYTYFTNQAYGAFLSVVKENLS 332

Query: 375 QMGNYSRAAD------VEKKSGLRPCFDISGKKSVYLPELILKFKG--GAKMALPPENYF 426
                  ++D        +K G R       + + Y   L LKF+     +M + PE Y 
Sbjct: 333 GKQLEQDSSDSFLSLCWRRKEGFRSV----AEAAAYFKPLTLKFRSTKTKQMEIFPEGYL 388

Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
            +     +CL +      G A+G     +LGD   Q   + +D   ++ G+A+  C
Sbjct: 389 VVNKKGNVCLGILN----GTAIGIVDTNVLGDISFQGQLVVYDNEKNQIGWARSDC 440


>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 350

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/145 (31%), Positives = 67/145 (46%), Gaps = 19/145 (13%)

Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP----CFDISG- 399
           GNGG +VDSG+T  F+  P + +V       +    R   +     L P    C ++SG 
Sbjct: 217 GNGGTVVDSGTTLAFLAEPAYRSV-------IAAVRRRVKLPIADALTPGFDLCVNVSGV 269

Query: 400 -KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
            K    LP L  +F GGA    PP NYF     ++ CL +    +  P +G     ++G+
Sbjct: 270 TKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAI---QSVDPKVGFS---VIGN 323

Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
              Q F  EFD    R GF+++ CA
Sbjct: 324 LMQQGFLFEFDRDRSRLGFSRRGCA 348



 Score = 46.2 bits (108), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 54/119 (45%), Gaps = 15/119 (12%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y + L  G PPQ S   I DTGS LVW  C++   C +C+  +  P+ +  F P+ SS
Sbjct: 82  GQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSA---CRNCS--HHSPATV--FFPRHSS 133

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
           +     C +P C  +  P+    C      N T   +   Y   Y  G  T+GL   ET
Sbjct: 134 TFSPAHCYDPVCRLVPKPDRAPIC------NHTRIHSTCHYEYGYADGSLTSGLFARET 186


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 105/399 (26%), Positives = 160/399 (40%), Gaps = 58/399 (14%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS------RIPAF 155
           G Y+  +  GTPP      I DTGS++ + PC+S   C  C       S      R P F
Sbjct: 38  GYYTSRVFIGTPPN-EFALIVDTGSTVTYVPCSS---CTHCGHHQASFSTHRLFCRDPRF 93

Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCK--GCSPRNKTCPLACPSYLLQYGLGFTAG 213
            P+ SSS Q IGC++  C      +   +CK         T        LL +G    A 
Sbjct: 94  KPENSSSYQKIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLGKDLLDFG---PAS 150

Query: 214 LLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQL---GLKKFSYCLLS 270
            L S+ L F  +T     A    L  +   GI G GR   S+  QL   G  + S+ L  
Sbjct: 151 RLQSQLLSFGCET-----AESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCY 205

Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
              D+     ++VL           P  S   F K+    S+    +Y + L +I V   
Sbjct: 206 GGMDEG--GGSMVL--------GAIPAPSGMVFAKSDPRRSN----YYNLELTEIQVQGA 251

Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
            +K+  +      +G  G I+DSG+T+ ++    FEA     + Q+G+  +A D    + 
Sbjct: 252 SLKLDSNVF----NGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLGSL-QAVDGPDPNY 306

Query: 391 LRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEV---LCLILFTDNA 443
              C+  +G  +     + P +   F    K++L PENY      +V    CL  F +  
Sbjct: 307 PDICYAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYL-FKHTKVPGAYCLGFFKNQD 365

Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
           A          +LG   ++N  + +D  N + GF K  C
Sbjct: 366 A--------TTLLGGIIVRNMLVTYDRYNHQIGFLKTNC 396


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 126/317 (39%), Gaps = 60/317 (18%)

Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
           I D+GS + W       +C  C  P     R P F P  S++   + C +  C+ + GP 
Sbjct: 80  IIDSGSDVSWV------QCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPY 132

Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGC 234
                +GCS  N  C         Q+G+ +  G   + T  F      P   +  F  GC
Sbjct: 133 R----RGCS-ANAQC---------QFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGC 178

Query: 235 SILS-----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL---V 283
           +        D   AG    G  S+SL  Q   +    FSYCL        P +S+L   V
Sbjct: 179 AHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCL-------PPTASSLGFLV 231

Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
           L   P              F   P+ SSS    FY V LR IIV  + + +P +     S
Sbjct: 232 LGVPPERAQL------IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS 285

Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
                 ++DS +  + +    ++A+   F   M  Y  A  V   S L  C+D +G +S+
Sbjct: 286 ------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPV---SILDTCYDFTGVRSI 336

Query: 404 YLPELILKFKGGAKMAL 420
            LP + L F GGA + L
Sbjct: 337 TLPSIALVFDGGATVNL 353



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 76/181 (41%), Gaps = 21/181 (11%)

Query: 303 FYKNPVGSSSAFG-EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
           F   P+ SSS+    FY V LR IIV  + + +P +     S      ++ S +  + + 
Sbjct: 469 FVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLP 522

Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP 421
              ++A+   F R M  Y  A  V   S L  C+D +G +S+ LP + L F GGA + L 
Sbjct: 523 PTAYQALRAAFRRAMTMYRTAPPV---SILDTCYDFTGVRSITLPSIALVFDGGATVNLD 579

Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
                 L G    CL       A  A  R P  I G+ Q +   + +D+      F    
Sbjct: 580 AAGIL-LQG----CLAF-----APTATDRMPGFI-GNVQQRTLEVVYDVPGKAIRFRSAA 628

Query: 482 C 482
           C
Sbjct: 629 C 629


>gi|18379072|ref|NP_563679.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12083230|gb|AAG48774.1|AF332411_1 unknown protein [Arabidopsis thaliana]
 gi|3850580|gb|AAC72120.1| Strong similarity to gb|D14550 extracellular dermal glycoprotein
           (EDGP) precursor from Daucus carota. ESTs gb|84105 and
           gb|AI100071 come from this gene [Arabidopsis thaliana]
 gi|332189426|gb|AEE27547.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 434

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 79/280 (28%), Positives = 128/280 (45%), Gaps = 46/280 (16%)

Query: 227 VPNFLAGCSILS-----DRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK---- 272
           +PN +  C   S      +   G+AG GR +  LP Q        +KF+ CL S +    
Sbjct: 154 IPNLIFSCGSTSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNRKFAVCLTSGRGVAF 213

Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQIIV 327
           F + P    + L   PG   S+   L  TP   NP  +   F  GE    Y++G+  I +
Sbjct: 214 FGNGPY---VFL---PGIQISR---LQKTPLLINPGTTVFEFSKGEKSPEYFIGVTAIKI 264

Query: 328 GSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
             K + I  + L +  S G GG  + S + +T +E  +++A   EFIRQ    + A  ++
Sbjct: 265 VEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQ----AAARSIK 320

Query: 387 KKSGLRP---CFDISGKKSVYL----PELILKFKG-GAKMALPPENYFALVGNEVLCLIL 438
           + + ++P   CF         L    PE+ L          +   N    V ++V+CL  
Sbjct: 321 RVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVICL-G 379

Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
           F D    P    G ++++G FQL++  +EFDLA+++FGF+
Sbjct: 380 FVDGGVNP----GASVVIGGFQLEDNLIEFDLASNKFGFS 415


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 113/480 (23%), Positives = 180/480 (37%), Gaps = 84/480 (17%)

Query: 12  FSLLILLFTTDAGAGSSAATVTVPLT-PLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
           F LL+  F   +   +      V L  P+S++   ++     ++ + S+ + S++R R+L
Sbjct: 7   FVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYL 66

Query: 71  KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
                     +++ S   N +   PLS     GY +S S GTPP      I DTG+  +W
Sbjct: 67  ----------NHVFSFSPNKIQDVPLSSFMGAGYVMSYSIGTPPFQLYSLI-DTGNDNIW 115

Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSP 190
           F C     C++           P F P +SS+ + I C +P C    G  +       + 
Sbjct: 116 FQCKPCKPCLN--------QTSPMFHPSKSSTYKTIPCTSPICKNADGHYLGVDTLTLNS 167

Query: 191 RNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILS----DRQPAGIA 246
            N T P++                              N + GC   +    +   +G  
Sbjct: 168 NNGT-PIS----------------------------FKNIVIGCGHRNQGPLEGYVSGNI 198

Query: 247 GFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
           G  R   S  SQL      KFSYCL+   F    VSS L        GD  T  +S    
Sbjct: 199 GLARGPLSFISQLNSSIGGKFSYCLVPL-FSKENVSSKLHF------GDKST--VSGLGT 249

Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
              P+   +     Y+V L    VG   +K+        SD  G  I+DSG+T T +   
Sbjct: 250 VSTPIKEENG----YFVSLEAFSVGDHIIKLE------NSDNRGNSIIDSGTTMTILPKD 299

Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
           ++  + +  +  M    R  D  ++  L  C+  +    +    +I     G+++ L   
Sbjct: 300 VYSRL-ESVVLDMVKLKRVKDPSQQFNL--CYQTTSTTLLTKVLIITAHFSGSEVHLNAL 356

Query: 424 NYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
           N F  + +EV+C   F       +L      I G+   QNF + FDL      F    C 
Sbjct: 357 NTFYPITDEVICF-AFVSGGNFSSLA-----IFGNVVQQNFLVGFDLNKKTISFKPTDCT 410


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 103/411 (25%), Positives = 162/411 (39%), Gaps = 94/411 (22%)

Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
           G Y+  L  GTPPQ     I DTGS++ + PC++   C  C        + P F P+ SS
Sbjct: 82  GYYTTRLWIGTPPQMFA-LIVDTGSTVTYVPCST---CEQCG-----RHQDPKFQPESSS 132

Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
           + Q + C    C+             C      C      Y  QY  +  ++G+L  + +
Sbjct: 133 TYQPVKC-TIDCN-------------CDSDRMQC-----VYERQYAEMSTSSGVLGEDLI 173

Query: 221 RF--PSKTVPN-FLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLKK-----FSYC 267
            F   S+  P   + GC       L  +   GI G GR   S+  QL  K      FS C
Sbjct: 174 SFGNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC 233

Query: 268 LLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
                           +D G G+    G S    +++   Y +PV S      +Y + L+
Sbjct: 234 YGG-------------MDVGGGAMVLGGISPPSDMAFA--YSDPVRSP-----YYNIDLK 273

Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
           +I V  K + +  +      DG  G ++DSG+T+ ++    F A     ++++       
Sbjct: 274 EIHVAGKRLPLNANVF----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKEL------Q 323

Query: 384 DVEKKSGLRP-----CFDISG----KKSVYLPELILKFKGGAKMALPPENYFALVGNE-- 432
            ++K SG  P     CF  +G    + S   P + + F+ G K  L PENY         
Sbjct: 324 SLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRG 383

Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
             CL +F +       G     +LG   ++N  + +D    + GF K  CA
Sbjct: 384 AYCLGVFQN-------GNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNCA 427


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.137    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,104,121,173
Number of Sequences: 23463169
Number of extensions: 361954812
Number of successful extensions: 733658
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 539
Number of HSP's successfully gapped in prelim test: 1693
Number of HSP's that attempted gapping in prelim test: 726736
Number of HSP's gapped (non-prelim): 3060
length of query: 483
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 336
effective length of database: 8,910,109,524
effective search space: 2993796800064
effective search space used: 2993796800064
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)