BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 040810
         (480 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  711 bits (1835), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/484 (76%), Positives = 408/484 (84%), Gaps = 19/484 (3%)

Query: 1   MEGKA-RNHLLLLFSF--FFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLP 57
           MEGKA RN  LL FSF  FF+ + SL YQT V N L +  TLSW +S S        P  
Sbjct: 1   MEGKAGRNAFLLFFSFTIFFSHSTSLNYQTLVANPLRSQPTLSWTDSES--------PTD 52

Query: 58  APDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRG 117
             ++ ++ S++LHHVD+LSFN TPE LF  R+QRD  RV++++  AE+A        +  
Sbjct: 53  TAESSATFSVQLHHVDALSFNSTPETLFTTRLQRDAARVEAISYLAETA-------GTGK 105

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
           R   GFSSSVISGLAQGSGEYFTR+GVGTPPRYVYMVLDTGSD+VWIQCAPCK+CY+Q+D
Sbjct: 106 RVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSD 165

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFR 236
           PVFDP KSRSFA++ CRSPLC +LDS GCN ++ TC+YQVSYGDGS T GDFSTETLTFR
Sbjct: 166 PVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFR 225

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP 296
            TRVARVALGCGHDNEGLFV AAGLLGLGRGRLSFP+QTGRRFN KFSYCLVDRS S+KP
Sbjct: 226 RTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKP 285

Query: 297 SSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
           SSMVFGDSAVSRTARFTPL++NPKLDTFYYVEL+GISVGG  V GITASLFKLD  GNGG
Sbjct: 286 SSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGG 345

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLH 416
           VIIDSGTSVTRLTRPAYIA RDAFRAGAS+LKRAP FSLFDTCFDLSGKTEVKVPTVVLH
Sbjct: 346 VIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLH 405

Query: 417 FRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
           FRGADVSLPA+NYLIPVD+SG FC AFAGTM GLSIIGNIQQQGFRVVYDLA SR+GFAP
Sbjct: 406 FRGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAP 465

Query: 477 RGCA 480
            GCA
Sbjct: 466 HGCA 469


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  690 bits (1780), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/495 (74%), Positives = 413/495 (83%), Gaps = 20/495 (4%)

Query: 1   MEGKARNHLLLLFSFFFTAA----------ASLQYQTFVLNSLPTPSTLSW----PESVS 46
           MEGKARN   LLF F FT             S Q+QT  +N LP   TLSW    PES  
Sbjct: 1   MEGKARNAPALLF-FSFTCVFLSLSTTTLSTSPQFQTLTVNPLPNKPTLSWADTEPESEP 59

Query: 47  VSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESA 106
            +++ +          +SLS++LHH+D+LS + TP+ LFN R+ RD  RVKSLT+ A +A
Sbjct: 60  ETQTLTDSTSTEASTTTSLSVQLHHLDALSSDETPQDLFNSRLARDASRVKSLTSLA-AA 118

Query: 107 VRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           V    R R+RG    GFSSSV SGLAQGSGEYFTRLGVGTP RYV+MVLDTGSDVVWIQC
Sbjct: 119 VGSTNRTRARGP---GFSSSVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQC 175

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITV 225
           APCKKCYSQTDPVF+P KSRSFA +PC SPLCR+LDS GC+ +++ CLYQVSYGDGS T 
Sbjct: 176 APCKKCYSQTDPVFNPTKSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTY 235

Query: 226 GDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
           G+FSTETLTFRGTRV RVALGCGHDNEGLF+ AAGLLGLGRGRLSFP+Q GRRF+RKFSY
Sbjct: 236 GEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSY 295

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CLVDRS S+KPS MVFGDSA+SRTARFTPL++NPKLDTFYYVEL+G+SVGG  V GITAS
Sbjct: 296 CLVDRSASSKPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITAS 355

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
           LFKLD  GNGGVIIDSGTSVTRLTRPAY+ALRDAFR GAS+LKRAP+FSLFDTCFDLSGK
Sbjct: 356 LFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGK 415

Query: 406 TEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVY 465
           TEVKVPTVVLHFRGADVSLPA+NYLIPVD+SG+FCFAFAGTMSGLSI+GNIQQQGFRVVY
Sbjct: 416 TEVKVPTVVLHFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVY 475

Query: 466 DLAASRIGFAPRGCA 480
           DLAASR+GFAPRGCA
Sbjct: 476 DLAASRVGFAPRGCA 490


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  675 bits (1741), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/492 (72%), Positives = 403/492 (81%), Gaps = 16/492 (3%)

Query: 1   MEGKARNHLLLLFSF---------FFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSES- 50
           MEGK RN   L FSF           T + SLQ+QT  LN LP   T+SW ++   +++ 
Sbjct: 1   MEGKTRNASTLFFSFTCIFLFLSTTTTLSTSLQFQTLTLNPLPNKPTISWADTEPGTQTF 60

Query: 51  -ESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRV 109
            + +   P+  A + LS++LHH+D+LS +++ + LFN R+ RD  RVKSL + A +   V
Sbjct: 61  TDQTTSEPSSSATTFLSVQLHHIDALSSDKSSQDLFNSRLVRDAARVKSLISLAAT---V 117

Query: 110 PPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
              N +R R  G FSSSVISGLAQGSGEYFTRLGVGTP RYVYMVLDTGSD+VWIQCAPC
Sbjct: 118 GGTNLTRARGPG-FSSSVISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPC 176

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDF 228
            KCYSQTDPVFDP KSRSFA +PC SPLCR+LD  GC+ ++  CLYQVSYGDGS TVG+F
Sbjct: 177 IKCYSQTDPVFDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEF 236

Query: 229 STETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           STETLTFRGTRV RV LGCGHDNEGLFV AAGLLGLGRGRLSFP+Q GRRFN KFSYCL 
Sbjct: 237 STETLTFRGTRVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLG 296

Query: 289 DRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
           DRS S++PSS+VFGDSA+SRT RFTPLL+NPKLDTFYYVEL+GISVGG  V GI+ASLFK
Sbjct: 297 DRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFK 356

Query: 349 LDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEV 408
           LD  GNGGVIIDSGTSVTRLTR AY+ALRDAF  GAS+LKRAP+FSLFDTCFDLSGKTEV
Sbjct: 357 LDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEV 416

Query: 409 KVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
           KVPTVVLHFRGADV LPA+NYLIPVD+SG+FCFAFAGT SGLSIIGNIQQQGFRVVYDLA
Sbjct: 417 KVPTVVLHFRGADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLA 476

Query: 469 ASRIGFAPRGCA 480
            SR+GFAPRGCA
Sbjct: 477 TSRVGFAPRGCA 488


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  671 bits (1732), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/466 (74%), Positives = 390/466 (83%), Gaps = 9/466 (1%)

Query: 19  AAASLQYQTFVLNSL----PTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDS 74
           A   L+YQ+ V+  L     T S LSW E+    E++ S  LP  + + ++++ L H D 
Sbjct: 29  ADKPLEYQSLVVRPLGENPTTKSQLSWTET----ETQIST-LPVSETDPTMTMHLEHRDV 83

Query: 75  LSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQG 134
           L+FN TPE LFNLR+QRD  RV++L+  A +A              GGFSSSV SGLAQG
Sbjct: 84  LAFNATPEALFNLRLQRDAFRVEALSKMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLAQG 143

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           SGEYFTRLGVGTPP+YVYMVLDTGSDVVWIQCAPC+KCYSQTDPVFDP KS SF+++ CR
Sbjct: 144 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCR 203

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           SPLC +LDS GCN R +CLYQV+YGDGS T G+FSTETLTFRGTRV +VALGCGHDNEGL
Sbjct: 204 SPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGL 263

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
           FV AAGLLGLGRGRLSFPTQTG RF RKFSYCLVDRS S+KPSS+VFG SAVSRTA FTP
Sbjct: 264 FVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTP 323

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           L+ NPKLDTFYY+EL GISVGGA V GITASLFKLD AGNGGVIIDSGTSVTRLTR AY+
Sbjct: 324 LITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYV 383

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           +LRDAFRAGA+ LKRAPD+SLFDTCFDLSGKTEVKVPTVV+HFRGADVSLPATNYLIPVD
Sbjct: 384 SLRDAFRAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVMHFRGADVSLPATNYLIPVD 443

Query: 435 SSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           ++G FCFAFAGTMSGLSIIGNIQQQGFRVV+D+AASRIGFA RGCA
Sbjct: 444 TNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGCA 489


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  664 bits (1712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/473 (72%), Positives = 392/473 (82%), Gaps = 29/473 (6%)

Query: 9   LLLLFSFFFTAAASLQYQTFVLNSLPTPSTLS-WPESVSVSESESSLPLPAPDAESSLSL 67
           L  L  FFF + A+ ++QT  L SLPTPS L  +P+S S+  S        PDA   L+L
Sbjct: 7   LKYLLLFFFISTAASEFQTLTLRSLPTPSPLPLFPDSQSLQSS--------PDAP--LTL 56

Query: 68  RLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSV 127
            LHH+DSLS N+TP  LFNLR+ RD LRV +L + A                  GFSSSV
Sbjct: 57  DLHHLDSLSLNKTPTDLFNLRLHRDTLRVHALNSRA-----------------AGFSSSV 99

Query: 128 ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRS 187
           +SGL+QGSGEYFTRLGVGTPPRY+YMVLDTGSDVVW+QC+PC+KCYSQ+DP+F+P KS+S
Sbjct: 100 VSGLSQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKS 159

Query: 188 FATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALG 246
           FA +PC SPLCR+LDSSGC+ RR+TCLYQVSYGDGS T GDF+TETLTFRG ++A+VALG
Sbjct: 160 FAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALG 219

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CGH NEGLFV AAGLLGLGRGRLSFP+QTG RFN KFSYCLVDRS S+KPSSMVFGD+A+
Sbjct: 220 CGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAI 279

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           SR ARFTPL+ NPKLDTFYYV L+GISVGG  VRG++ SLFKLD AGNGGVIIDSGTSVT
Sbjct: 280 SRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVT 339

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPA 426
           RLTRPAY ALRDAFR GA  LKR P+FSLFDTC+DLSG++ VKVPTVVLHFRGAD++LPA
Sbjct: 340 RLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRGADMALPA 399

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           TNYLIPVD +G+FCFAFAGT+SGLSIIGNIQQQGFRVVYDLA SRIGFAPRGC
Sbjct: 400 TNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  661 bits (1706), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/461 (74%), Positives = 391/461 (84%), Gaps = 8/461 (1%)

Query: 25  YQTFVLNS--LPTPSTLSW-PESVSVSESESSLPLPA-PDAESSLSLRLHHVDSLSFNRT 80
           +QT + NS  LP+ S +S+ PES   SES       +  D+ESS++L L H+D+LS N+T
Sbjct: 28  FQTLIPNSHSLPSASPISFQPESEPDSESLLGSEFESGSDSESSITLNLDHIDALSSNKT 87

Query: 81  PEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFT 140
           P+ LF+ R+QRD  RVKS+   A  A ++P RN +     GGFSSSV+SGL+QGSGEYFT
Sbjct: 88  PQELFSSRLQRDSRRVKSI---ATLAAQIPGRNVTHAPRTGGFSSSVVSGLSQGSGEYFT 144

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           RLGVGTP RYVYMVLDTGSD+VW+QCAPC++CYSQ+DP+FDP KS+++AT+PC SP CR+
Sbjct: 145 RLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRR 204

Query: 201 LDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAA 259
           LDS+GCN RR TCLYQVSYGDGS TVGDFSTETLTFR  RV  VALGCGHDNEGLFV AA
Sbjct: 205 LDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAA 264

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANP 319
           GLLGLG+G+LSFP QTG RFN+KFSYCLVDRS S+KPSS+VFG++AVSR ARFTPLL+NP
Sbjct: 265 GLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNP 324

Query: 320 KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
           KLDTFYYVEL+GISVGG  V G+ ASLFKLD  GNGGVIIDSGTSVTRL RPAYIA+RDA
Sbjct: 325 KLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDA 384

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF 439
           FR GA +LKRAPDFSLFDTCFDLS   EVKVPTVVLHFRGADVSLPATNYLIPVD++G F
Sbjct: 385 FRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKF 444

Query: 440 CFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           CFAFAGTM GLSIIGNIQQQGFRVVYDLA+SR+GFAP GCA
Sbjct: 445 CFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  661 bits (1705), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/473 (72%), Positives = 390/473 (82%), Gaps = 16/473 (3%)

Query: 9   LLLLFSFFFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLR 68
           L L  S   T   + Q QT +L++LP P TLSWPES +V           PD E + SL 
Sbjct: 16  LFLCISATSTNPHNSQTQTLLLHTLPDPPTLSWPESATVE----------PDPEPTTSLS 65

Query: 69  LHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVI 128
           LHH+D+LSFN+TP  LF+LR++RD  RVK+LT  A +  +  P N   G +     SSV+
Sbjct: 66  LHHIDALSFNKTPSQLFHLRLERDAARVKTLTHLAAATNKTRPANPGSGFS-----SSVV 120

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SGL+QGSGEYFTRLGVGTPP+Y+YMVLDTGSDVVW+QC PC KCYSQTD +FDP+KS+SF
Sbjct: 121 SGLSQGSGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSF 180

Query: 189 ATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
           A +PC SPLCR+LDS GC+ + N C YQVSYGDGS T GDFSTETLTFR   V RVA+GC
Sbjct: 181 AGIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRAAVPRVAIGC 240

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           GHDNEGLFV AAGLLGLGRG LSFPTQTG RFN KFSYCL DR+ SAKPSS+VFGDSAVS
Sbjct: 241 GHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFGDSAVS 300

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
           RTARFTPL+ NPKLDTFYYVEL+GISVGGA VRGI+AS F+LD  GNGGVIIDSGTSVTR
Sbjct: 301 RTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTR 360

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPAT 427
           LTRPAY++LRDAFR GAS LKRAP+FSLFDTC+DLSG +EVKVPTVVLHFRGADVSLPA 
Sbjct: 361 LTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRGADVSLPAA 420

Query: 428 NYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           NYL+PVD+SG+FCFAFAGTMSGLSIIGNIQQQGFRVV+DLA SR+GFAPRGCA
Sbjct: 421 NYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGCA 473


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  653 bits (1684), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/480 (73%), Positives = 393/480 (81%), Gaps = 18/480 (3%)

Query: 7   NHLLLLFSFFFTAAASL-----QYQTFVLNSLPT-PSTLSWPESVSVSESESSLPLPAPD 60
           N + L F FF     SL      +QT  L SLP+ PS L        S+S S L   A  
Sbjct: 4   NTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPSFLP-------SDSNSFLSSEATQ 56

Query: 61  AESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRAN 120
           +E  L L LHH+D+LSFNRTPE LF+LR+QRD +RVK L++   ++     RN S+    
Sbjct: 57  SELGLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATS-----RNLSKPGGT 111

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
            GFSSSVISGLAQGSGEYFTR+GVGTPP+YVYMVLDTGSD+VW+QCAPCK CYSQTDPVF
Sbjct: 112 TGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVF 171

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV 240
           +P KS SFA V CR+PLCR+L+S GCN+R TCLYQVSYGDGS T G+F TETLTFR T+V
Sbjct: 172 NPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV 231

Query: 241 ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 300
            +VALGCGHDNEGLFV AAGLLGLGRG LSFP+Q GR FN+KFSYCLVDRS S+KPSS+V
Sbjct: 232 EQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVV 291

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           FG+SAVSRTARFTPLL NP+LDTFYYVEL+GISVGG  V GITAS FKLD  GNGGVIID
Sbjct: 292 FGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIID 351

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
            GTSVTRL +PAYIALRDAFRAGASSLK AP+FSLFDTC+DLSGKT VKVPTVVLHFRGA
Sbjct: 352 CGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 411

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           DVSLPA+NYLIPVD SG FCFAFAGT SGLSIIGNIQQQGFRVVYDLA+SR+GF+PRGCA
Sbjct: 412 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  644 bits (1661), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/456 (76%), Positives = 385/456 (84%), Gaps = 19/456 (4%)

Query: 26  QTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLF 85
           QT  L+SLP P  +SWPES S  +            E +LSL LHH+D+LS N+TPE LF
Sbjct: 33  QTLPLHSLPHPPAISWPESESEPDP----------EEEALSLHLHHIDALSSNKTPEQLF 82

Query: 86  NLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGR-ANGGFSSSVISGLAQGSGEYFTRLGV 144
            LR+QRD  RV+ + A A         N+S  R +   FSSS+ISGLAQGSGEYFTR+GV
Sbjct: 83  QLRLQRDAKRVEGVVALAA-------LNQSHARRSGSSFSSSIISGLAQGSGEYFTRIGV 135

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS 204
           GTP RYVYMVLDTGSDVVW+QCAPC+KCY+Q DPVFDP KSR++A +PC +PLCR+LDS 
Sbjct: 136 GTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPLCRRLDSP 195

Query: 205 GCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLG 263
           GCN +N  C YQVSYGDGS T GDFSTETLTFR TRV RVALGCGHDNEGLF+ AAGLLG
Sbjct: 196 GCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVTRVALGCGHDNEGLFIGAAGLLG 255

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDT 323
           LGRGRLSFP QTGRRFN+KFSYCLVDRS SAKPSS+VFGDSAVSRTARFTPL+ NPKLDT
Sbjct: 256 LGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGDSAVSRTARFTPLIKNPKLDT 315

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           FYY+EL+GISVGG+ VRG++ASLF+LD AGNGGVIIDSGTSVTRLTRPAYIALRDAFR G
Sbjct: 316 FYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRVG 375

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF 443
           AS LKRA +FSLFDTCFDLSG TEVKVPTVVLHFRGADVSLPATNYLIPVD+SG+FCFAF
Sbjct: 376 ASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRGADVSLPATNYLIPVDNSGSFCFAF 435

Query: 444 AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           AGTMSGLSIIGNIQQQGFRV +DLA SR+GFAPRGC
Sbjct: 436 AGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/417 (77%), Positives = 366/417 (87%), Gaps = 4/417 (0%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           ++L L H+D+LS N+TP+ LF+ R+QRD  RVKS+   A  A ++P RN +     GGFS
Sbjct: 72  ITLNLDHIDALSSNKTPDELFSSRLQRDSRRVKSI---ATLAAQIPGRNVTHAPRPGGFS 128

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           SSV+SGL+QGSGEYFTRLGVGTP RYVYMVLDTGSD+VW+QCAPC++CYSQ+DP+FDP K
Sbjct: 129 SSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRK 188

Query: 185 SRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV 243
           S+++AT+PC SP CR+LDS+GCN RR TCLYQVSYGDGS TVGDFSTETLTFR  RV  V
Sbjct: 189 SKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGV 248

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
           ALGCGHDNEGLFV AAGLLGLG+G+LSFP QTG RFN+KFSYCLVDRS S+KPSS+VFG+
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           +AVSR ARFTPLL+NPKLDTFYYV L+GISVGG  V G+TASLFKLD  GNGGVIIDSGT
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 423
           SVTRL RPAYIA+RDAFR GA +LKRAPDFSLFDTCFDLS   EVKVPTVVLHFRGADVS
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVS 428

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LPATNYLIPVD++G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA+SR+GFAP GCA
Sbjct: 429 LPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  636 bits (1641), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 321/417 (76%), Positives = 365/417 (87%), Gaps = 4/417 (0%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           ++L L H+D+LS N+TP+ LF+ R+QRD  RV+S+   A  A ++P RN +     GGFS
Sbjct: 72  ITLNLDHIDALSSNKTPQELFSSRLQRDSRRVRSI---ATLAAQIPGRNVTHAPRPGGFS 128

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           SSV+SGL+QGSGEYFTRLGVGTP RYVYMVLDTGSD+VW+QCAPC++CYSQ+DP+FDP K
Sbjct: 129 SSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRK 188

Query: 185 SRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV 243
           S+++AT+PC SP CR+LDS+GCN RR TCLYQVSYGDGS TVGDFSTETLTFR  RV  V
Sbjct: 189 SKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGV 248

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
           ALGCGHDNEGLFV AAGLLGLG+G+LSFP QTG RFN+KFSYCLVDRS S+KPSS+VFG+
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           +AVSR ARFTPLL+NPKLDTFYYV L+GISVGG  V G+TASLFKLD  GNGGVIIDSGT
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 423
           SVTRL RPAYIA+RDAFR GA +LKRAP+FSLFDTCFDLS   EVKVPTVVLHFR ADVS
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRRADVS 428

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LPATNYLIPVD++G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA+SR+GFAP GCA
Sbjct: 429 LPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  624 bits (1610), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 344/479 (71%), Positives = 384/479 (80%), Gaps = 22/479 (4%)

Query: 2   EGKARNHLLLLFSFFFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDA 61
           EG     L L  S   + A  +Q +T  L++LP P  LS            +L  P    
Sbjct: 3   EGNWVLFLTLAISLCVSGAFQIQTETLPLHTLPEPHILS-----------ETLSEPQETL 51

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
             SL L LHH+D+LS N+TPE LF+LR+QRD  RV++L            +  +R  A  
Sbjct: 52  SLSLHLHLHHIDALSSNKTPEQLFHLRLQRDAKRVEALLN----------QIHARRSAGS 101

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
            FSSS+ISGLAQGSGEYFTR+GVGTP RYVYMVLDTGSDVVW+QCAPC+KCY+QTD VFD
Sbjct: 102 SFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFD 161

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTRV 240
           P KSR++A +PC +PLCR+LDS GC+ +N  C YQVSYGDGS T GDFSTETLTFR  RV
Sbjct: 162 PTKSRTYAGIPCGAPLCRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNRV 221

Query: 241 ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 300
            RVALGCGHDNEGLF  AAGLLGLGRGRLSFP QTGRRFN KFSYCLVDRS SAKPSS++
Sbjct: 222 TRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVI 281

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           FGDSAVSRTA FTPL+ NPKLDTFYY+EL+GISVGGA VRG++ASLF+LD AGNGGVIID
Sbjct: 282 FGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIID 341

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           SGTSVTRLTRPAYIALRDAFR GAS LKRAP+FSLFDTCFDLSG TEVKVPTVVLHFRGA
Sbjct: 342 SGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGA 401

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           DVSLPATNYLIPVD+SG+FCFAFAGTMSGLSIIGNIQQQGFR+ YDL  SR+GFAPRGC
Sbjct: 402 DVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  609 bits (1570), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 310/389 (79%), Positives = 342/389 (87%), Gaps = 5/389 (1%)

Query: 92  DVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYV 151
           D +RVK L++   ++     RN S+     GFSSSVISGLAQGSGEYFTR+GVGTPP+YV
Sbjct: 1   DAIRVKKLSSLGATS-----RNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYV 55

Query: 152 YMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNT 211
           YMVLDTGSD+VW+QCAPCK CYSQTDPVF+P KS SFA V CR+PLCR+L+S GCN+R T
Sbjct: 56  YMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQT 115

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           CLYQVSYGDGS T G+F TETLTFR T+V +VALGCGHDNEGLFV AAGLLGLGRG LSF
Sbjct: 116 CLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSF 175

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVG 331
           P+Q GR FN+KFSYCLVDRS S+KPSS+VFG+SAVSRTARFTPLL NP+LDTFYYVEL+G
Sbjct: 176 PSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLG 235

Query: 332 ISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP 391
           ISVGG  V GITAS FKLD  GNGGVIID GTSVTRL +PAYIALRDAFRAGASSLK AP
Sbjct: 236 ISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAP 295

Query: 392 DFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLS 451
           +FSLFDTC+DLSGKT VKVPTVVLHFRGADVSLPA+NYLIPVD SG FCFAFAGT SGLS
Sbjct: 296 EFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLS 355

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           IIGNIQQQGFRVVYDLA+SR+GF+PRGCA
Sbjct: 356 IIGNIQQQGFRVVYDLASSRVGFSPRGCA 384


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 324/490 (66%), Positives = 380/490 (77%), Gaps = 18/490 (3%)

Query: 1   MEGKARNHLLL-LFS-FFFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPA 58
           ME K  N L   +F+  FFT++AS QYQT V+N+LP+ +TLSWPES S+++   S     
Sbjct: 1   MERKVLNTLAFSVFAVLFFTSSASSQYQTLVVNTLPSSATLSWPESESLTDESLSE---- 56

Query: 59  PDAESSLSLRLHHVDSLSF--NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR 116
             + +SLS+ L HVD+LS   + +P  LFNLR+QRD LRVKS+T+ A  +       R+ 
Sbjct: 57  --STTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTP 114

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT 176
            R  GGFS +VISGL+QGSGEYF RLGVGTP   VYMVLDTGSDVVW+QC+PCK CY+QT
Sbjct: 115 -RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQT 173

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLD-SSGC--NRRNTCLYQVSYGDGSITVGDFSTETL 233
           D +FDP KS++FATVPC S LCR+LD SS C   R  TCLYQVSYGDGS T GDFSTETL
Sbjct: 174 DAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETL 233

Query: 234 TFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
           TF G RV  V LGCGHDNEGLFV AAGLLGLGRG LSFP+QT  R+N KFSYCLVDR++S
Sbjct: 234 TFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSS 293

Query: 294 AKPS----SMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
              S    ++VFG++AV +T+ FTPLL NPKLDTFYY++L+GISVGG+ V G++ S FKL
Sbjct: 294 GSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKL 353

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
           D  GNGGVIIDSGTSVTRLT+PAY+ALRDAFR GA+ LKRAP +SLFDTCFDLSG T VK
Sbjct: 354 DATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVK 413

Query: 410 VPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
           VPTVV HF G +VSLPA+NYLIPV++ G FCFAFAGTM  LSIIGNIQQQGFRV YDL  
Sbjct: 414 VPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 473

Query: 470 SRIGFAPRGC 479
           SR+GF  R C
Sbjct: 474 SRVGFLSRAC 483


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 311/465 (66%), Positives = 363/465 (78%), Gaps = 16/465 (3%)

Query: 24  QYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSF--NRTP 81
           QYQT V+N+LP+ +TLSWPES S S+   S      ++ +SLS+ L HVD+LS   + +P
Sbjct: 29  QYQTLVVNTLPSSATLSWPESKSFSDESVS------ESTTSLSVHLSHVDALSSFSDASP 82

Query: 82  EHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTR 141
             LF LR+QRD LRVKS+T+ A  +       R+  R+ GGFS +VISGL+QGSGEYF R
Sbjct: 83  VDLFKLRLQRDSLRVKSITSLAAVSTGRNATKRTP-RSAGGFSGAVISGLSQGSGEYFMR 141

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           LGVGTP   VYMVLDTGSDVVW+QC+PCK CY+Q+D +FDP KS++FATVPC S LCR+L
Sbjct: 142 LGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRL 201

Query: 202 D-SSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAA 258
           D SS C   R  TCLYQVSYGDGS T GDFSTETLTF G RV  V LGCGHDNEGLFV A
Sbjct: 202 DDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGA 261

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS----SMVFGDSAVSRTARFTP 314
           AGLLGLGRG LSFP+QT  R+N KFSYCLVDR++S   S    ++VFG+ AV +T+ FTP
Sbjct: 262 AGLLGLGRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTP 321

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NPKLDTFYY++L+GISVGG+ V G++ S FKLD  GNGGVIIDSGTSVTRLT+ AY+
Sbjct: 322 LLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYV 381

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           ALRDAFR GA+ LKRAP +SLFDTCFDLSG T VKVPTVV HF G +VSLPA+NYLIPV+
Sbjct: 382 ALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVN 441

Query: 435 SSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + G FCFAFAGTM  LSIIGNIQQQGFRV YDL  SR+GF  R C
Sbjct: 442 TEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  561 bits (1445), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 312/470 (66%), Positives = 362/470 (77%), Gaps = 28/470 (5%)

Query: 24  QYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRT--P 81
           QY T V+N+LP+   LS+PES S+        +   D+ +SLS+ L HVD+LS +    P
Sbjct: 29  QYNTLVVNTLPSSPILSFPESESL--------ISDSDSTTSLSVHLSHVDALSSSSDASP 80

Query: 82  EHLFNLRIQRDVLRVKSLTAFA-----ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSG 136
             LFNLR+QRD LRV+SLT+ A      +  + PPR      + GGFS  VISGL+QGSG
Sbjct: 81  AELFNLRLQRDSLRVESLTSLAAVSAGRNVTKRPPR------SAGGFSGVVISGLSQGSG 134

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EYF RLGVGTP   +YMVLDTGSDVVW+QC+PCK CY+Q+DPVF+PAKS++FATVPC S 
Sbjct: 135 EYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSR 194

Query: 197 LCRKLD-SSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
           LCR+LD SS C   R   CLYQVSYGDGS TVGDFSTETLTF G RV  VALGCGHDNEG
Sbjct: 195 LCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVALGCGHDNEG 254

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS----SMVFGDSAVSRT 309
           LFV AAGLLGLGRG LSFP+QT  R+N KFSYCLVDR++S   S    ++VFG+ AV +T
Sbjct: 255 LFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKT 314

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
           A FTPLL NPKLDTFYY++L+GISVGG+ V G++ S FKLD  GNGGVIIDSGTSVTRLT
Sbjct: 315 AVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLT 374

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
           + AY+ALRDAFR GA+ LKRAP +SLFDTCFDLSG T VKVPTVV HF G +VSLPA+NY
Sbjct: 375 QSAYVALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGEVSLPASNY 434

Query: 430 LIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LIPV++ G FCFAFAGTM  LSIIGNIQQQGFRV YDL  SR+GF  R C
Sbjct: 435 LIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  524 bits (1350), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 282/456 (61%), Positives = 332/456 (72%), Gaps = 25/456 (5%)

Query: 37  STLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTP---EHLFNLRIQRDV 93
           S   W E+V   E            ++S+ L++ H DSLS +      + +   R++RD 
Sbjct: 52  SAQEWSETVQGEE------------KNSIVLQVVHRDSLSSSSNTSLVKEILQERLKRDA 99

Query: 94  LRVKS------LTAFAESAVRVPPRNRSRGRAN---GGFSSSVISGLAQGSGEYFTRLGV 144
            RV S      L A   S   + P N S   A      FSSS+ISGLAQGSGEYFTRLGV
Sbjct: 100 ARVDSINARVQLAAMGVSKAEMKPLNGSSIDARFDAKDFSSSIISGLAQGSGEYFTRLGV 159

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS 204
           GTPPRY YMVLDTGSD++WIQC PC KCY QTDP+F+PA S ++  VPC +PLC+KLD S
Sbjct: 160 GTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPLCKKLDIS 219

Query: 205 GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGL 264
           GC  +  C YQVSYGDGS TVGDFSTETLTFRG  + RVALGCGHDNEGLF+ AAGLLGL
Sbjct: 220 GCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGL 279

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTF 324
           GRG LSFP+QTG +F+++FSYCLVDRS S   SS++FG +A+ ++A FTPLL+NPKLDTF
Sbjct: 280 GRGSLSFPSQTGAQFSKRFSYCLVDRSASGTASSLIFGKAAIPKSAIFTPLLSNPKLDTF 339

Query: 325 YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGA 384
           YYVELVGISVGG  +  I AS+F++D  GNGGVIIDSGTSVTRL   AY  +RDAFR G 
Sbjct: 340 YYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGT 399

Query: 385 SSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF 443
            +LK A  FSLFDTC+DLSG   VKVPT+V HF+ GA +SLPATNYLIPVDSS TFCFAF
Sbjct: 400 GNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAF 459

Query: 444 AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           AG   GLSIIGNIQQQG+RVV+D  A+R+GF    C
Sbjct: 460 AGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  492 bits (1267), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 277/474 (58%), Positives = 338/474 (71%), Gaps = 21/474 (4%)

Query: 20  AASLQYQTFVLNSL-PTPSTLSWPESVSVSESESSLPLPAPD--AESSLSLRLHHVDSLS 76
           A  +Q Q+ ++  L PTP + S   +    +   +  L A +    S++   + H D   
Sbjct: 28  AKPVQTQSLLVTPLSPTPFSASSELARGDDKDVFAGNLAAAEDATPSTVQFSVVHRDDFV 87

Query: 77  FNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSG 136
            N T   L   R+QRD  R   ++A A +A      N +R R   G  + V+SGLAQGSG
Sbjct: 88  VNATAAELLGHRLQRDGKRAARISAAAGAA------NGTR-RTGSGVVAPVVSGLAQGSG 140

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EYFT++GVGTP     MVLDTGSDVVW+QCAPC++CY Q+  VFDP +SRS+  V C +P
Sbjct: 141 EYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAP 200

Query: 197 LCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGL 254
           LCR+LDS GC+ RR  CLYQV+YGDGS+T GDF+TETLTF  G RVAR+ALGCGHDNEGL
Sbjct: 201 LCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGARVARIALGCGHDNEGL 260

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP----SSMVFGDSAVSRT- 309
           FVAAAGLLGLGRG LSFP Q  RR+ R FSYCLVDR++SA P    S++ FG  AV  T 
Sbjct: 261 FVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGSTV 320

Query: 310 -ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTR 367
            A FTP++ NP+++TFYYV+LVGISVGGA V G+  S  +LDP +G GGVI+DSGTSVTR
Sbjct: 321 AASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTR 380

Query: 368 LTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLP 425
           L RPAY ALRDAFRA A+ L+ +P  FSLFDTC+DLSG+  VKVPTV +HF  GA+ +LP
Sbjct: 381 LARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALP 440

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             NYLIPVDS GTFCFAFAGT  G+SIIGNIQQQGFRVV+D    R+GF P+GC
Sbjct: 441 PENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 276/470 (58%), Positives = 335/470 (71%), Gaps = 23/470 (4%)

Query: 26  QTFVLNSLP-TPSTLSWPESVSVSESESSLPLPAPDAE----SSLSLRLHHVDSLSFNRT 80
           QT  L + P +P  +S P  ++  + +S        AE    S++  RL H D  S N T
Sbjct: 30  QTQALLATPLSPDRVSAPSELARDDDDSVFAGNLASAEDAPASTVRFRLVHRDDFSVNAT 89

Query: 81  PEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFT 140
              L   R++RD  R   L+A A  A           R  GG  + V+SGLAQGSGEYFT
Sbjct: 90  AAELLAYRLERDAKRAARLSAAAGPA-------NGTRRGGGGVVAPVVSGLAQGSGEYFT 142

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           ++GVGTP     MVLDTGSDVVW+QCAPC++CY Q+  VFDP +SRS+  V C +PLCR+
Sbjct: 143 KIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPLCRR 202

Query: 201 LDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAA 258
           LDS GC+ RR+ CLYQV+YGDGS+T GDF+TETLTF  G RVARVALGCGHDNEGLFVAA
Sbjct: 203 LDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNEGLFVAA 262

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA----KPSSMVFGDSAVSRT--ARF 312
           AGLLGLGRG LSFPTQ  RR+ R FSYCLVDR++SA    + S++ FG  AV  T  + F
Sbjct: 263 AGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSGAVGSTVASSF 322

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRP 371
           TP++ NP+++TFYYV+L+GISVGGA V G+  S  +LDP +G GGVI+DSGTSVTRL RP
Sbjct: 323 TPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARP 382

Query: 372 AYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNY 429
           AY ALRDAFR  A+ L+ +P  FSLFDTC+DLSG+  VKVPTV +HF  GA+ +LP  NY
Sbjct: 383 AYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENY 442

Query: 430 LIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LIPVDS GTFCFAFAGT  G+SIIGNIQQQGFRVV+D    R+ F P+GC
Sbjct: 443 LIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 275/476 (57%), Positives = 334/476 (70%), Gaps = 20/476 (4%)

Query: 20  AASLQYQTFVLNSL-PTPSTLSWPESVSVSESESSLPLPAPD---AESSLSLRLHHVDSL 75
           A +++YQT V   L P P T +  E   + +      L A +   A S++ LR+ H D  
Sbjct: 29  AEAVRYQTLVATPLSPHPYTATAVEDDGLFQGS----LAADEGGAAASTVGLRVVHRDDF 84

Query: 76  SFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGS 135
           + N T   L   R++RD  R   ++A A  A          G    GF + V+SGLAQGS
Sbjct: 85  AVNATAAELLAHRLRRDKRRASRISAAAGGAAAANGTRVGGGGGGSGFVAPVVSGLAQGS 144

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEYFT++GVGTP     MVLDTGSDVVW+QCAPC++CY Q+  +FDP  S S+  V C +
Sbjct: 145 GEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAA 204

Query: 196 PLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEG 253
           PLCR+LDS GC+ RR  CLYQV+YGDGS+T GDF+TETLTF  G RV RVALGCGHDNEG
Sbjct: 205 PLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARVPRVALGCGHDNEG 264

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-----RSTSAKPSSMVFGDSAV-- 306
           LFVAAAGLLGLGRG LSFP+Q  RRF R FSYCLVD      S +++ S++ FG  AV  
Sbjct: 265 LFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAVGP 324

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTSV 365
           S  A FTP++ NP+++TFYYV+L+GISVGGA V G+  S  +LDP+ G GGVI+DSGTSV
Sbjct: 325 SAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGTSV 384

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           TRL RPAY ALRDAFRA A+ L+ +P  FSLFDTC+DLSG   VKVPTV +HF  GA+ +
Sbjct: 385 TRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAA 444

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LP  NYLIPVDS GTFCFAFAGT  G+SIIGNIQQQGFRVV+D    R+GF P+GC
Sbjct: 445 LPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  473 bits (1218), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 261/429 (60%), Positives = 317/429 (73%), Gaps = 18/429 (4%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRN-RSRGRANGGF 123
           +  R+ H D+ + N T   L   R+QRD  R   ++  A           RSRG   G  
Sbjct: 69  VHFRVVHRDAFAANATAAELLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRG---GAV 125

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           ++ V+SGLAQGSGEYFT++GVGTP     MVLDTGSDVVW+QCAPC++CY Q+ PVFDP 
Sbjct: 126 AAPVVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPR 185

Query: 184 KSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVA 241
           +S S+  V C +PLCR+LDS GC+ RR  CLYQV+YGDGS+T GDF+TETLTF  G RVA
Sbjct: 186 RSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGARVA 245

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS--------TS 293
           RVALGCGHDNEGLFVAAAGLLGLGRG LSFPTQ  RR+ + FSYCLVDR+        + 
Sbjct: 246 RVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASR 305

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA- 352
           ++ S++ FG  + S  A FTP++ NP+++TFYYV+LVGISVGGA V G+  S  +LDP+ 
Sbjct: 306 SRSSTVTFGPPSAS-AASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPST 364

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKTEVKVP 411
           G GGVI+DSGTSVTRL RP+Y ALRDAFRA A+ L+ +P  FSLFDTC+DL G+  VKVP
Sbjct: 365 GRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVP 424

Query: 412 TVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           TV +HF  GA+ +LP  NYLIPVDS GTFCFAFAGT  G+SIIGNIQQQGFRVV+D    
Sbjct: 425 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQ 484

Query: 471 RIGFAPRGC 479
           R+GFAP+GC
Sbjct: 485 RVGFAPKGC 493


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  466 bits (1200), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 263/458 (57%), Positives = 325/458 (70%), Gaps = 21/458 (4%)

Query: 35  TPSTLSWPESVSVSESESSLPLPAPDAE-SSLSLRLHHVDSLSFNRTPEHLFNLRIQRDV 93
           T S L+ P S      E  L L AP    S+L  RL H +  + N T   L        +
Sbjct: 26  TQSLLANPLSPDPITQEQQLSLAAPRTNASTLHFRLAHREHFALNATASDL--------L 77

Query: 94  LRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYM 153
             + +  A   +A+   P N +R R  GGF++ ++SGL QGSGEYF ++GVGTP     M
Sbjct: 78  AHLLARDAARAAALLAAPNNATRPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALM 137

Query: 154 VLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTC 212
           VLDTGSDVVW+QCAPC+ CY+Q+  VFDP +SRS+A V C +P+CR+LDS+GC+ RRN+C
Sbjct: 138 VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSC 197

Query: 213 LYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           LYQV+YGDGS+T GDF++ETLTF RG RV RVA+GCGHDNEGLF+AA+GLLGLGRGRLSF
Sbjct: 198 LYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSF 257

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSS-------MVFGDSAVSRTARFTPLLANPKLDTF 324
           PTQ  R F R FSYCLVDR++S +PSS          G  A +  A FTP+  NP++ TF
Sbjct: 258 PTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATF 317

Query: 325 YYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           YYV L+G SVGGA V+G++ S  +L+P  G GGVI+DSGTSVTRL RP Y A+RDAFRA 
Sbjct: 318 YYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAA 377

Query: 384 ASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF 441
           A  L+ +P  FSLFDTC++LSG+  VKVPTV +H   GA V+LP  NYLIPVD+SGTFCF
Sbjct: 378 AVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCF 437

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A AGT  G+SIIGNIQQQGFRVV+D  A R+GF P+ C
Sbjct: 438 AMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 258/428 (60%), Positives = 314/428 (73%), Gaps = 24/428 (5%)

Query: 67  LRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS 126
            R+ H D+ + N T   L   R+QRD  R   ++   E+A       R       G ++ 
Sbjct: 67  FRVVHRDTFAVNATAGELLKHRLQRDKRRAARIS---EAAGAGGGNGRK------GVAAP 117

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+SGLAQGSGEYFT++GVGTP     MVLDTGSDVVW+QCAPC++CY Q+ PVFDP +S 
Sbjct: 118 VVSGLAQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSS 177

Query: 187 SFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVA 244
           S+  V C + LCR+LDS GC+ RR  C+YQV+YGDGS+T GDF TETLTF  G RVARVA
Sbjct: 178 SYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGARVARVA 237

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA--------KP 296
           LGCGHDNEGLFVAAAGLLGLGRG LSFPTQ  RR+ R FSYCLVDR++S         + 
Sbjct: 238 LGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRS 297

Query: 297 SSMVFGDSAV-SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GN 354
           S++ FG  +V + +A FTP++ NP+++TFYYV+LVGISVGGA V G+  S  +LDP+ G 
Sbjct: 298 STVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGR 357

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAP-DFSLFDTCFDLSGKTEVKVPT 412
           GGVI+DSGTSVTRL R +Y ALRDAFRA A+  L+ +P  FSLFDTC+DL G+  VKVPT
Sbjct: 358 GGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPT 417

Query: 413 VVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           V +HF  GA+ +LP  NYLIPVDS GTFCFAFAGT  G+SIIGNIQQQGFRVV+D    R
Sbjct: 418 VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQR 477

Query: 472 IGFAPRGC 479
           +GFAP+GC
Sbjct: 478 VGFAPKGC 485


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 262/458 (57%), Positives = 325/458 (70%), Gaps = 21/458 (4%)

Query: 35  TPSTLSWPESVSVSESESSLPLPAPDAE-SSLSLRLHHVDSLSFNRTPEHLFNLRIQRDV 93
           T S L+ P S      E  L L AP    S+L  RL H +  + N T   L        +
Sbjct: 26  TQSLLANPLSPDPITQEQQLSLAAPRTNASTLHFRLAHREHFALNATASDL--------L 77

Query: 94  LRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYM 153
             + +  A   +A+   P N +R R  GGF++ ++SGL QGSGEYF ++GVGTP     M
Sbjct: 78  AHLLARDAARAAALLAAPNNATRPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALM 137

Query: 154 VLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTC 212
           VLDTGSDVVW+QCAPC+ CY+Q+  VFDP +SRS+A V C +P+CR+LDS+GC+ RRN+C
Sbjct: 138 VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSC 197

Query: 213 LYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           LYQV+YGDGS+T GDF++ETLTF RG RV RVA+GCGHDNEGLF+AA+GLLGLGRGRLSF
Sbjct: 198 LYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSF 257

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSS-------MVFGDSAVSRTARFTPLLANPKLDTF 324
           P+Q  R F R FSYCLVDR++S +PSS          G  A +  A FTP+  NP++ TF
Sbjct: 258 PSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATF 317

Query: 325 YYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           YYV L+G SVGGA V+G++ S  +L+P  G GGVI+DSGTSVTRL RP Y A+RDAFRA 
Sbjct: 318 YYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAA 377

Query: 384 ASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF 441
           A  L+ +P  FSLFDTC++LSG+  VKVPTV +H   GA V+LP  NYLIPVD+SGTFCF
Sbjct: 378 AVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCF 437

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A AGT  G+SIIGNIQQQGFRVV+D  A R+GF P+ C
Sbjct: 438 AMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 262/458 (57%), Positives = 325/458 (70%), Gaps = 21/458 (4%)

Query: 35  TPSTLSWPESVSVSESESSLPLPAPDAE-SSLSLRLHHVDSLSFNRTPEHLFNLRIQRDV 93
           T S L+ P S      E  L L AP    S+L  RL H +  + N T   L        +
Sbjct: 32  TQSLLANPLSPDPITQEQQLSLAAPRTNASTLHFRLAHREHFALNATASDL--------L 83

Query: 94  LRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYM 153
             + +  A   +A+   P N +R R  GGF++ ++SGL QGSGEYF ++GVGTP     M
Sbjct: 84  AHLLARDAARAAALLAAPNNATRPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALM 143

Query: 154 VLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTC 212
           VLDTGSDVVW+QCAPC+ CY+Q+  VFDP +SRS+A V C +P+CR+LDS+GC+ RRN+C
Sbjct: 144 VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSC 203

Query: 213 LYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           LYQV+YGDGS+T GDF++ETLTF RG RV RVA+GCGHDNEGLF+AA+GLLGLGRGRLSF
Sbjct: 204 LYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSF 263

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSS-------MVFGDSAVSRTARFTPLLANPKLDTF 324
           P+Q  R F R FSYCLVDR++S +PSS          G  A +  A FTP+  NP++ TF
Sbjct: 264 PSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATF 323

Query: 325 YYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           YYV L+G SVGGA V+G++ S  +L+P  G GGVI+DSGTSVTRL RP Y A+RDAFRA 
Sbjct: 324 YYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAA 383

Query: 384 ASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF 441
           A  L+ +P  FSLFDTC++LSG+  VKVPTV +H   GA V+LP  NYLIPVD+SGTFCF
Sbjct: 384 AVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCF 443

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A AGT  G+SIIGNIQQQGFRVV+D  A R+GF P+ C
Sbjct: 444 AMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  424 bits (1089), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 240/458 (52%), Positives = 298/458 (65%), Gaps = 17/458 (3%)

Query: 37  STLSWPESVSVSESESSLPLPAPDAE-SSLSLRLHHVDSLSFNRTPEHLFNL------RI 89
           STL    ++ V+  E   P      E    S+ L H D++  N    +  +       R+
Sbjct: 30  STLDVQATLRVARGEVVQPAKEETLEIKPWSIPLVHRDAMKGNSNKNNELSYAERMQQRL 89

Query: 90  QRDVLRVKSLTAFAESAVRVPPRNRSRGR-------ANGGFSSSVISGLAQGSGEYFTRL 142
           +RD  RV ++ +  E AV    R+  +         A   F S V+SG+ QGSGEYF+R+
Sbjct: 90  KRDAARVAAINSRLELAVNGIKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRI 149

Query: 143 GVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD 202
           GVG P R   MVLDTGSDV WIQC PC  CY Q+DP+++PA S S+  V C++ LC++LD
Sbjct: 150 GVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANLCQQLD 209

Query: 203 SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLL 262
            SGC+R  +CLYQVSYGDGS T G+F+TETLT  G  +  VA+GCGHDNEGLFV AAGLL
Sbjct: 210 VSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAPLQNVAIGCGHDNEGLFVGAAGLL 269

Query: 263 GLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLD 322
           GLG G LSFP+Q      + FSYCLVDR  S   S++ FG +AV   A   P+L N +LD
Sbjct: 270 GLGGGSLSFPSQLTDENGKIFSYCLVDRD-SESSSTLQFGRAAVPNGAVLAPMLKNSRLD 328

Query: 323 TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA 382
           TFYYV L GISVGG  +  I+ S+F +D +GNGGVI+DSGT+VTRL   AY +LRDAFRA
Sbjct: 329 TFYYVSLSGISVGGKML-SISDSVFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRA 387

Query: 383 GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF 441
           G  +L      SLFDTC+DLS K  V VPTVV HF  G  +SLPA NYL+PVDS GTFCF
Sbjct: 388 GTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCF 447

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           AFA T S LSI+GNIQQQG RV +D A +++GFA   C
Sbjct: 448 AFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 225/342 (65%), Positives = 269/342 (78%), Gaps = 15/342 (4%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNT 211
           MVLDTGSDVVW+QCAPC++CY Q+ PVFDP +S S+  V C + LCR+LDS GC+ RR  
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRG-TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
           C+YQV+YGDGS+T GDF TETLTF G  RVARVALGCGHDNEGLFVAAAGLLGLGRG LS
Sbjct: 61  CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLS 120

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSA--------KPSSMVFGDSAV-SRTARFTPLLANPKL 321
           FPTQ  RR+ R FSYCLVDR++S         + S++ FG  +V + +A FTP++ NP++
Sbjct: 121 FPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRM 180

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTSVTRLTRPAYIALRDAF 380
           +TFYYV+LVGISVGGA V G+  S  +LDP+ G GGVI+DSGTSVTRL R +Y ALRDAF
Sbjct: 181 ETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAF 240

Query: 381 RAGAS-SLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSG 437
           RA A+  L+ +P  FSLFDTC+DL G+  VKVPTV +HF  GA+ +LP  NYLIPVDS G
Sbjct: 241 RAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRG 300

Query: 438 TFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           TFCFAFAGT  G+SIIGNIQQQGFRVV+D    R+GFAP+GC
Sbjct: 301 TFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 233/429 (54%), Positives = 288/429 (67%), Gaps = 18/429 (4%)

Query: 61  AESSLSLRLHHVDSLSFNRTPEH--LFNLRIQRDVLRVKSLTAFAESAVR-------VPP 111
           + S L++ LH   S+   + P++  L   R++RD  RVKS+    + A+         P 
Sbjct: 59  SSSQLTMELHSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPL 118

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
              S+ RA       +ISG +QGSGEYF+R+G+G P   VYMVLDTGSDV WIQCAPC  
Sbjct: 119 DTDSQFRAED-LQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCAD 177

Query: 172 CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTE 231
           CY Q DP+F+PA S S++ + C +  C+ LD S C R NTCLY+VSYGDGS TVGDF TE
Sbjct: 178 CYHQADPIFEPASSTSYSPLSCDTKQCQSLDVSEC-RNNTCLYEVSYGDGSYTVGDFVTE 236

Query: 232 TLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
           T+T     V  VA+GCGH+NEGLF+ AAGLLGLG G+LSFP+Q        FSYCLVDR 
Sbjct: 237 TITLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINA---SSFSYCLVDRD 293

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           + +  S++ F +SA+   A   PLL N +LDTFYYV + G+SVGG  +  I  S+F++D 
Sbjct: 294 SDSA-STLEF-NSALLPHAITAPLLRNRELDTFYYVGMTGLSVGG-ELLSIPESMFEMDE 350

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
           +GNGG+IIDSGT+VTRL   AY ALRDAF  G   L    + +LFDTC+DLS KT V+VP
Sbjct: 351 SGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVP 410

Query: 412 TVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           TV  H  G  V  LPATNYLIPVDS GTFCFAFA T S LSIIGN+QQQG RV +DLA S
Sbjct: 411 TVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANS 470

Query: 471 RIGFAPRGC 479
            +GF PR C
Sbjct: 471 LVGFEPRQC 479


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 233/444 (52%), Positives = 285/444 (64%), Gaps = 24/444 (5%)

Query: 42  PESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTA 101
           P SV V   +S L   A +A +S   RL                   ++RD  RV+ L  
Sbjct: 113 PWSVQVVHRDSLLVKDAANATASYERRLEET----------------LRRDARRVRGLEQ 156

Query: 102 FAESAVRVPP----RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDT 157
             E  +R+       + +       F   V+SG+AQGSGEYFTR+GVGTP R  YMVLDT
Sbjct: 157 RIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDT 216

Query: 158 GSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVS 217
           GSDVVWIQC PC KCYSQ DP+F+P+ S SF+T+ C S +C  LD+  C+    CLY+VS
Sbjct: 217 GSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCH-GGGCLYKVS 275

Query: 218 YGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR 277
           YGDGS T+G F+TE LTF  T V  VA+GCGHDN GLFV AAGLLGLG G LSFP+Q G 
Sbjct: 276 YGDGSYTIGSFATEMLTFGTTSVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGT 335

Query: 278 RFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGA 337
           +  R FSYCLVDR + +   ++ FG  +V   +  TPLL NP L TFYYV L+ ISVGGA
Sbjct: 336 QTGRAFSYCLVDRFSESS-GTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGA 394

Query: 338 HVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF 396
            +  +   +F++D  +G GG I+DSGT+VTRL  P Y A+RDAF AG   L +A   S+F
Sbjct: 395 LLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIF 454

Query: 397 DTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGN 455
           DTC+DLSG   V VPTVV HF  GA + LPA NY+IP+D  GTFCFAFA   S LSI+GN
Sbjct: 455 DTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGN 514

Query: 456 IQQQGFRVVYDLAASRIGFAPRGC 479
           IQQQG RV +D A S +GFA R C
Sbjct: 515 IQQQGIRVSFDTANSLVGFALRQC 538


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 229/429 (53%), Positives = 286/429 (66%), Gaps = 14/429 (3%)

Query: 63  SSLSLRLHHVDSLSF----NRTP--EHLFNLRIQRDVLRVKSLTAFAESAVRVP--PRNR 114
           ++ S++L H DSL F    N T   E     +++R+  RV++L    E  +++   P   
Sbjct: 69  TAWSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGS 128

Query: 115 SRGRA--NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
               A     F S V+SG+ QGSGEYFTR+G+GTP R  YMVLDTGSDVVWIQC PC++C
Sbjct: 129 YENVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCREC 188

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
           YSQ DP+F+P+ S SF+TV C S +C +LD++ C+    CLY+VSYGDGS TVG ++TET
Sbjct: 189 YSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCH-GGGCLYEVSYGDGSYTVGSYATET 247

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
           LTF  T +  VA+GCGHDN GLFV AAGLLGLG G LSFP Q G +  R FSYCLVDR +
Sbjct: 248 LTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDS 307

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP- 351
            +   ++ FG  +V   + FTPL+ANP L TFYY+ +V ISVGG  +  + +  F++D  
Sbjct: 308 ESS-GTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDET 366

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
            G GG+IIDSGT+VTRL   AY ALRDAF AG   L RA   S+FDTC+DLS    V +P
Sbjct: 367 TGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIP 426

Query: 412 TVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
            V  HF  GA   LPA N LIP+DS GTFCFAFA   S LSI+GNIQQQG RV +D A S
Sbjct: 427 AVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANS 486

Query: 471 RIGFAPRGC 479
            +GFA   C
Sbjct: 487 LVGFAIDQC 495


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 225/408 (55%), Positives = 281/408 (68%), Gaps = 16/408 (3%)

Query: 85  FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRA----NGGFSSSVISGLAQGSGEYFT 140
            ++ I RD LRV S+       V    R+RSR R     +  F + V+SGL+ GSGEYF 
Sbjct: 1   MHVTISRDNLRVASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFI 60

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           R+ VGTPPR +Y+V+DTGSD++W+QCAPC  CY Q+D +FDP KS +++T+ C +  C  
Sbjct: 61  RISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLN 120

Query: 201 LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR------VARVALGCGHDNEGL 254
           LD   C + N CLYQV YGDGS T G+F T+ ++   T       + ++ LGCGHDNEG 
Sbjct: 121 LDIGTC-QANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGY 179

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST-SAKPSSMVFGDSAVSRT-ARF 312
           FV AAGLLGLG+G LSFP Q   +   +FSYCL DR T S + SS+VFG++AV    ARF
Sbjct: 180 FVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARF 239

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           TP  +N ++ TFYY+++ GISVGG  +  I  S F+LD  GNGGVIIDSGTSVTRL   A
Sbjct: 240 TPQDSNMRVPTFYYLKMTGISVGGT-ILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAA 298

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLI 431
           Y +LRDAFRAG S L     FSLFDTC+DLSG   V VPTV LHF+G  D+ LPA+NYLI
Sbjct: 299 YASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLI 358

Query: 432 PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           PVD+S TFC AFAGT +G SIIGNIQQQGFRV+YD   +++GF P  C
Sbjct: 359 PVDNSNTFCLAFAGT-TGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 208/362 (57%), Positives = 259/362 (71%), Gaps = 15/362 (4%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+SG+ QGSGEYF+R+G+G+P R +YMVLDTGSDV W+QCAPC  CY+Q+DP+FDPA S 
Sbjct: 185 VVSGVGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSS 244

Query: 187 SFATVPCRSPLCRKLDSSGC-----NRRNTCLYQVSYGDGSITVGDFSTETLTFRG---T 238
           S+ATVPC SP CR LD+S C     N  ++C+Y+V+YGDGS TVGDF+TETLT  G    
Sbjct: 245 SYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSA 304

Query: 239 RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
            V  VA+GCGHDNEGLFV AAGLL LG G LSFP+Q       +FSYCLVDR  S   S+
Sbjct: 305 AVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISA---TEFSYCLVDRD-SPSAST 360

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           + FG S  S      PL+ +P+ +TFYYV L GISVGG  +  I  + F +D  G+GGVI
Sbjct: 361 LQFGASDSSTVT--APLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVI 418

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           +DSGT+VTRL   AY ALRDAF  G  +L RA   SLFDTC+DL+G++ V+VP V L F 
Sbjct: 419 VDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFE 478

Query: 419 -GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
            G ++ LPA NYLIPVD +GT+C AFA T   +SI+GN+QQQG RV +D A + +GF+P 
Sbjct: 479 GGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPN 538

Query: 478 GC 479
            C
Sbjct: 539 KC 540


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 219/399 (54%), Positives = 263/399 (65%), Gaps = 14/399 (3%)

Query: 88  RIQRDVLRVKSLTAFAESAVR------VPPRNRSRGRANGGFSSSVISGLAQGSGEYFTR 141
           R+QRD  RVKSL    + A+       + P             S +ISG +QGSGEYF+R
Sbjct: 93  RLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSGEYFSR 152

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           +G+G PP   Y++LDTGSDV W+QCAPC  CY Q DP+F+PA S SF+T+ C +  CR L
Sbjct: 153 VGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRSL 212

Query: 202 DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGL 261
           D S C R +TCLY+VSYGDGS TVGDF TET+T     V  VA+GCGH+NEGLFV AAGL
Sbjct: 213 DVSEC-RNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAIGCGHNNEGLFVGAAGL 271

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKL 321
           LGLG G LSFP+Q        FSYCLVDR  S   S++ F +S +   A   PLL N  L
Sbjct: 272 LGLGGGSLSFPSQINA---TSFSYCLVDRD-SESASTLEF-NSTLPPNAVSAPLLRNHHL 326

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
           DTFYYV L G+SVGG  V  I  S F++D +GNGGVI+DSGT++TRL    Y +LRDAF 
Sbjct: 327 DTFYYVGLTGLSVGGELV-SIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFV 385

Query: 382 AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFC 440
                L      +LFDTC+DLS K  V+VPTV  HF  G ++ LPA NYL+P+DS GTFC
Sbjct: 386 KRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFC 445

Query: 441 FAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           FAFA T S LSIIGN+QQQG RVVYDL    +GF P  C
Sbjct: 446 FAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/361 (59%), Positives = 255/361 (70%), Gaps = 10/361 (2%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S V SGLA GSGEYF R+G+G+P +  Y+V+DTGSDV WIQC+PCK CY Q D VFDP  
Sbjct: 1   SQVTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRA 60

Query: 185 SRSFATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV 243
           S SF  + C +P C+ LD   C +  N CLYQVSYGDGS TVGD ++++ +    R + V
Sbjct: 61  SSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTSPV 120

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFG 302
             GCGHDNEGLFV AAGLLGLG G+LSFP+Q     +RKFSYCLV R    + SS ++FG
Sbjct: 121 VFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLS---SRKFSYCLVSRDNGVRASSALLFG 177

Query: 303 DSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVII 359
           DSA+  +A F  T LL NPKLDTFYY  L GIS+GG  +  I ++ FKL  + G GGVII
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGT-LLSIPSTAFKLSSSTGRGGVII 236

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR- 418
           DSGTSVTRL   AY  +RDAFR+    L RA DFSLFDTC+D S  T V +PTV  HF  
Sbjct: 237 DSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEG 296

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GA V LP +NYL+PVD+SGTFCFAF+ T   LSIIGNIQQQ  RV  DL +SR+GFAPR 
Sbjct: 297 GASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQ 356

Query: 479 C 479
           C
Sbjct: 357 C 357


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 238/473 (50%), Positives = 303/473 (64%), Gaps = 35/473 (7%)

Query: 20  AASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLH-HVDSLSFN 78
           AAS+Q    V    P  ST   P+  +VS+             SSLSL+L+  +  +  +
Sbjct: 36  AASIQRTQQVFAVEPKSST---PDETTVSD------------PSSLSLQLNSRISVMKAS 80

Query: 79  RTPEHLFNL-RIQRDVLRVKSLTAFAESAVR-------VPPRNRSRGRANGG---FSSSV 127
            +      L R++RD  RV+SLTA  + A+R        P  N   G +  G   F S +
Sbjct: 81  HSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPI 140

Query: 128 ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRS 187
           +SG +QGSGEYF+R+G+G PP  VYMVLDTGSDV W+QCAPC +CY QTDP+F+P  S S
Sbjct: 141 VSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSAS 200

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
           F ++ C +  C+ LD S C R  TCLY+VSYGDGS TVGDF TET+T   T +  +A+GC
Sbjct: 201 FTSLSCETEQCKSLDVSEC-RNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAIGC 259

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           GH+NEGLF+ AAGLLGLG G LSFP+Q        FSYCLVDR + +  S++ F +S ++
Sbjct: 260 GHNNEGLFIGAAGLLGLGGGSLSFPSQLNA---SSFSYCLVDRDSDST-STLDF-NSPIT 314

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
             A   PL  NP LDTF+Y+ L G+SVGGA V  I  + F++   GNGG+I+DSGT+VTR
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGA-VLPIPETSFQMSEDGNGGIIVDSGTAVTR 373

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPA 426
           L    Y  LRDAF      L+ A   +LFDTC+DLS K+ V+VPTV  HF  G ++ LPA
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            NYLIPVDS GTFCFAFA T S LSI+GN QQQG RV +DLA S +GF+P  C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 221/414 (53%), Positives = 280/414 (67%), Gaps = 17/414 (4%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTA--------FAESAVRVPPRNRSRGRANGGFSSSVIS 129
           N T   L   R+ RD LR+ S+++          +S++  P +N +       F + + S
Sbjct: 14  NATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKN-TNPFLQQDFETPLRS 72

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFA 189
           GL+ GSGEYF  LGVGTPPR V MV DTGSDV+W+QC PC+ CY QTDP+F+P+ S +F 
Sbjct: 73  GLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQ 132

Query: 190 TVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
           ++ C S LC++L   GC RRN CLYQVSYGDGS TVG+FSTETL+F    V  VA+GCGH
Sbjct: 133 SITCGSSLCQQLLIRGC-RRNQCLYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGH 191

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAVSR 308
           +N+GLF  AAGLLGLG+G LSFP+Q G+ +   FSYCL  R ST + P  ++FG+ AV+ 
Sbjct: 192 NNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVP--LIFGNQAVAS 249

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTSVTR 367
            A+FT LL NPKLDTFYYVE+VGI VGG  V  I A    LD + GNGGVI+DSGT+VTR
Sbjct: 250 NAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVN-IPAGSLSLDSSTGNGGVILDSGTAVTR 308

Query: 368 LTRPAYIALRDAFRAGA-SSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLP 425
           L   AY  +RDAFRAG  S  K    FSLFDTC+DLSG++ + +P V   F  GA ++LP
Sbjct: 309 LVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALP 368

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A N ++PVD+SGT+C AFA      SIIGNIQQQ FR+ +D   +R+G     C
Sbjct: 369 AQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 221/414 (53%), Positives = 280/414 (67%), Gaps = 17/414 (4%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTA--------FAESAVRVPPRNRSRGRANGGFSSSVIS 129
           N T   L   R+ RD LR+ S+++          +S++  P +N +       F + + S
Sbjct: 14  NATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKN-TNPFLQQDFETPLRS 72

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFA 189
           GL+ GSGEYF  LGVGTPPR V MV DTGSDV+W+QC PC+ CY QTDP+F+P+ S +F 
Sbjct: 73  GLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQ 132

Query: 190 TVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
           ++ C S LC++L   GC RRN CLYQVSYGDGS TVG+FSTETL+F    V  VA+GCGH
Sbjct: 133 SITCGSSLCQQLLIRGC-RRNQCLYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGH 191

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAVSR 308
           +N+GLF  AAGLLGLG+G LSFP+Q G+ +   FSYCL  R ST + P  ++FG+ AV+ 
Sbjct: 192 NNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVP--LIFGNQAVAS 249

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTSVTR 367
            A+FT LL NPKLDTFYYVE+VGI VGG  V  I A    LD + GNGGVI+DSGT+VTR
Sbjct: 250 NAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVS-IPAGSLSLDSSTGNGGVILDSGTAVTR 308

Query: 368 LTRPAYIALRDAFRAGA-SSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLP 425
           L   AY  +RDAFRAG  S  K    FSLFDTC+DLSG++ + +P V   F  GA ++LP
Sbjct: 309 LVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALP 368

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A N ++PVD+SGT+C AFA      SIIGNIQQQ FR+ +D   +R+G     C
Sbjct: 369 AQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 236/465 (50%), Positives = 296/465 (63%), Gaps = 27/465 (5%)

Query: 32  SLPTPSTLSWPESVSVSESESSLPLPAPD-----AESSLSLRLHHVDSLSFNRTPEH--- 83
           S  T S L+  +S+  ++  SS  L   +     A SS SL+LH   S+   R  EH   
Sbjct: 29  STTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSASSSFSLQLHSRVSV---RGTEHSDY 85

Query: 84  --LFNLRIQRDVLRVKSLTAFAESAVR------VPPRNRSRGRANGGFSSSVISGLAQGS 135
             L   R+ RD  RVKSL    + A+       + P +           + +ISG  QGS
Sbjct: 86  KSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGS 145

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEYFTR+G+G P R VYMVLDTGSDV W+QC PC  CY QT+P+F+P+ S S+  + C +
Sbjct: 146 GEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDT 205

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLF 255
           P C  L+ S C R  TCLY+VSYGDGS TVGDF+TETLT   T V  VA+GCGH NEGLF
Sbjct: 206 PQCNALEVSEC-RNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGCGHSNEGLF 264

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPL 315
           V AAGLLGLG G L+ P+Q        FSYCLVDR  S   S++ FG S +S  A   PL
Sbjct: 265 VGAAGLLGLGGGLLALPSQLN---TTSFSYCLVDRD-SDSASTVDFGTS-LSPDAVVAPL 319

Query: 316 LANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIA 375
           L N +LDTFYY+ L GISVGG  ++ I  S F++D +G+GG+IIDSGT+VTRL    Y +
Sbjct: 320 LRNHQLDTFYYLGLTGISVGGELLQ-IPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNS 378

Query: 376 LRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVD 434
           LRD+F  G   L++A   ++FDTC++LS KT V+VPTV  HF G  + +LPA NY+IPVD
Sbjct: 379 LRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVD 438

Query: 435 SSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           S GTFC AFA T S L+IIGN+QQQG RV +DLA S IGF+   C
Sbjct: 439 SVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 215/361 (59%), Positives = 254/361 (70%), Gaps = 10/361 (2%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S V SGLA GSGEYF R+G+G+P +  Y+V+DTGSDV WIQC+PCK CY Q D VFDP  
Sbjct: 1   SQVTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRA 60

Query: 185 SRSFATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV 243
           S SF  + C +P C+ LD   C +  N CLYQVSYGDGS TVGD ++++      R + V
Sbjct: 61  SSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTSPV 120

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFG 302
             GCGHDNEGLFV AAGLLGLG G+LSFP+Q     +RKFSYCLV R    + SS ++FG
Sbjct: 121 VFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLS---SRKFSYCLVSRDNGVRASSALLFG 177

Query: 303 DSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVII 359
           DSA+  +A F  T LL NPKLDTFYY  L GIS+GG  +  I ++ FKL  + G GGVII
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGT-LLSIPSTAFKLSSSTGRGGVII 236

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR- 418
           DSGTSVTRL   AY  +RDAFR+    L RA DFSLFDTC+D S  T V +PTV  HF  
Sbjct: 237 DSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEG 296

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GA V LP +NYL+PVD+SGTFCFAF+ T   LSIIGNIQQQ  RV  DL +SR+GFAPR 
Sbjct: 297 GASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQ 356

Query: 479 C 479
           C
Sbjct: 357 C 357


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 240/456 (52%), Positives = 297/456 (65%), Gaps = 36/456 (7%)

Query: 35  TPSTLSWPESVSVSESESSLPLPAPDAE-SSLSLRLHHVDSLSFNRTPEHLFNLRIQRDV 93
           T S L+ P S      E  L L AP    S+L  RL H +  + N T   L        +
Sbjct: 26  TQSLLANPLSPDPITQEQQLSLAAPRTNASTLHFRLAHREHFALNATASDL--------L 77

Query: 94  LRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYM 153
             + +  A   +A+   P N +R R  GGF++ ++SGL QG+GEYF ++GVGTP     M
Sbjct: 78  AHLLARDAARAAALLAAPNNATRPRRRGGFAAPLLSGLPQGTGEYFAQVGVGTPATTALM 137

Query: 154 VLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVP-----CRSPLCRKLDSSGCNR 208
           VLDTGSDVVW   AP +        V     S   A  P     C +P+CR+LDS+GC+R
Sbjct: 138 VLDTGSDVVW---APVRALPPLLRAVRQ-GSSTGAAPAPTPRWNCVAPICRRLDSAGCDR 193

Query: 209 R-NTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGR 266
           R N+CLYQV+YGDGS+T GDF++ETLTF RG RV RVA+GCGHDNEGLF+AA+GLLGLGR
Sbjct: 194 RRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGR 253

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYY 326
           GRLSFP+Q  R F R FSYCLVDR++S +                       P++ TFYY
Sbjct: 254 GRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRWG-------------GTPRMATFYY 300

Query: 327 VELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           V L+G SVGGA V+G++ S  +L+P  G GGVI+DSGTSVTRL RP Y A+RDAFRA A 
Sbjct: 301 VHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAV 360

Query: 386 SLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF 443
            L+ +P  FSLFDTC++LSG+  VKVPTV +H   GA V+LP  NYLIPVD+SGTFCFA 
Sbjct: 361 GLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM 420

Query: 444 AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           AGT  G+SIIGNIQQQGFRVV+D  A R+GF P+ C
Sbjct: 421 AGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 238/477 (49%), Positives = 297/477 (62%), Gaps = 34/477 (7%)

Query: 21  ASLQYQTFVLNSLPTPSTLSW------PESVSVSESESSLPLPAPDAESSLSLRLHHVDS 74
           +SLQ    +L+  PT S+L+       PES  V  + SS           LSL LH  D+
Sbjct: 42  SSLQQTQHILSVDPTRSSLTARIPEFKPESDPVFLNSSS----------PLSLELHSRDT 91

Query: 75  L--SFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVR------VPPRNRSRGRAN-GGFSS 125
           L  S ++  + L   R++RD  RV  + A    AV       + P +    R      ++
Sbjct: 92  LVASQHKDYKSLVLSRLERDSSRVAGIAAKIRFAVEGIDRSDLKPVDIDETRFQPEDLTT 151

Query: 126 SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKS 185
            V+SG +QGSGEYF+R+GVGTP + +Y+VLDTGSDV WIQC PC +CY Q+DP+FDP  S
Sbjct: 152 PVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSS 211

Query: 186 RSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVA 244
            +F ++ C  P C  LD S C R N CLYQVSYGDGS TVG+++T+T+TF    +V  VA
Sbjct: 212 STFKSLTCSDPKCASLDVSAC-RSNKCLYQVSYGDGSFTVGNYATDTVTFGESGKVNDVA 270

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           LGCGHDNEGLF  AAGLLGLG G LS   Q      + FSYCLVDR  SAK SS+ F   
Sbjct: 271 LGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKA---KSFSYCLVDRD-SAKSSSLDFNSV 326

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
            +       PLL N K+DTFYYV L G SVGG  V  I +SLF++D +G GGVI+D GT+
Sbjct: 327 QIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQV-SIPSSLFEVDASGAGGVILDCGTA 385

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFDLSGKTEVKVPTVVLHFRGAD-V 422
           VTRL   AY +LRDAF    +  K+     SLFDTC+D S  + VKVPTV  HF G   +
Sbjct: 386 VTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSL 445

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +LPA NYLIP+D +GTFCFAFA T S LSIIGN+QQQG R+ YDLA + IG +   C
Sbjct: 446 NLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 238/473 (50%), Positives = 302/473 (63%), Gaps = 35/473 (7%)

Query: 20  AASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLH-HVDSLSFN 78
           AAS+Q    V    P  ST   P+  +VS+             SSLSL+L+  +  +  +
Sbjct: 36  AASIQRTQQVFAVEPKSST---PDETTVSD------------PSSLSLQLNSRISVMKAS 80

Query: 79  RTPEHLFNL-RIQRDVLRVKSLTAFAESAVR-------VPPRNRSRGRANGG---FSSSV 127
            +      L R++RD  RV+SLTA  + A+R        P  N   G +  G   F S +
Sbjct: 81  HSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPI 140

Query: 128 ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRS 187
           +SG +QGSGEYF+R+G+G PP  VYMVLDTGSDV W+QCAPC +CY QTDP F+P  S S
Sbjct: 141 VSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSAS 200

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
           F ++ C +  C+ LD S C R  TCLY+VSYGDGS TVGDF TET+T   T +  +A+GC
Sbjct: 201 FTSLSCETEQCKSLDVSEC-RNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAIGC 259

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           GH+NEGLF+ AAGLLGLG G LSFP+Q        FSYCLVDR + +  S++ F +S ++
Sbjct: 260 GHNNEGLFIGAAGLLGLGGGSLSFPSQLNA---SSFSYCLVDRDSDST-STLDF-NSPIT 314

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
             A   PL  NP LDTF+Y+ L G+SVGGA V  I  + F++   GNGG+I+DSGT+VTR
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGA-VLPIPETSFQMSEDGNGGIIVDSGTAVTR 373

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPA 426
           L    Y  LRDAF      L+ A   +LFDTC+DLS K+ V+VPTV  HF  G ++ LPA
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            NYLIPVDS GTFCFAFA T S LSI+GN QQQG RV +DLA S +GF+P  C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 239/468 (51%), Positives = 302/468 (64%), Gaps = 25/468 (5%)

Query: 20  AASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSL--SF 77
           +ASLQ    VL   PT S +S+ + V +  S SS          S SL+LH  DSL  + 
Sbjct: 41  SASLQQANQVLKFDPTAS-ISFQQQVHLVPSNSSF---------SFSLQLHPRDSLHNAG 90

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG----GFSSSVISGLAQ 133
           ++  + L   R+ RD  RVKS+    E A+    R+              S+ +ISG +Q
Sbjct: 91  HKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQ 150

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           GSGEYF+R+GVG P +  YMVLDTGSD+ W+QC PC  CY QTDP+FDP  S SFA++PC
Sbjct: 151 GSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPC 210

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDNE 252
            S  C+ L++SGC R + CLYQVSYGDGS TVG+F TETLTF  +  +  VA+GCGHDNE
Sbjct: 211 ESQQCQALETSGC-RASKCLYQVSYGDGSFTVGEFVTETLTFGNSGMINDVAVGCGHDNE 269

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           GLFV +AGLLGLG G LS  +Q        FSYCLVDR +S+  S + F  +A S +   
Sbjct: 270 GLFVGSAGLLGLGGGPLSLTSQMKA---SSFSYCLVDRDSSSS-SDLEFNSAAPSDSVN- 324

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
            PLL + K+DTFYYV L G+SVGG  +  I  +LF++D +G GG+I+DSGT++TRL   A
Sbjct: 325 APLLKSGKVDTFYYVGLTGMSVGG-QLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQA 383

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATNYLI 431
           Y  LRDAF +    LK+   F+LFDTC+DLS ++ V +PTV   F G   + LP  NYLI
Sbjct: 384 YNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLI 443

Query: 432 PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           PVDS GTFCFAFA T S LSIIGN+QQQG RV YDLA S +GF+P  C
Sbjct: 444 PVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 209/356 (58%), Positives = 259/356 (72%), Gaps = 10/356 (2%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+SG+ QGSGEYF+R+GVG P R +YMVLDTGSDV W+QC PC  CY+Q+DPV+DP+ S 
Sbjct: 152 VVSGVGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVST 211

Query: 187 SFATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVA 244
           S+ATV C SP CR LD++ C N   +CLY+V+YGDGS TVGDF+TETLT   +  V+ VA
Sbjct: 212 SYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAPVSNVA 271

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           +GCGHDNEGLFV AAGLL LG G LSFP+Q        FSYCLVDR  S   S++ FGDS
Sbjct: 272 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS---ATTFSYCLVDRD-SPSSSTLQFGDS 327

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
              + A   PL+ +P+ +TFYYV L GISVGG  +  I +S F +D AG+GGVI+DSGT+
Sbjct: 328 --EQPAVTAPLIRSPRTNTFYYVALSGISVGGEALS-IPSSAFAMDDAGSGGVIVDSGTA 384

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           VTRL   AY ALR+AF  G  SL RA   SLFDTC+DL+G++ V+VP V L F  G ++ 
Sbjct: 385 VTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGELK 444

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LPA NYLIPVD++GT+C AFAGT   +SIIGN+QQQG RV +D A + +GF    C
Sbjct: 445 LPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 238/468 (50%), Positives = 301/468 (64%), Gaps = 25/468 (5%)

Query: 20  AASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSL--SF 77
           +ASLQ    VL   PT S +S+ + V +  S SS          S SL+LH  DSL  + 
Sbjct: 41  SASLQQANQVLKFDPTAS-ISFQQQVHLVPSNSSF---------SFSLQLHPRDSLHNAG 90

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG----GFSSSVISGLAQ 133
           ++  + L   R+ RD  RVKS+    E A+    R+              S+ +ISG +Q
Sbjct: 91  HKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQ 150

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           GSGEYF+R+GVG P +  YMVLDTGSD+ W+QC PC  CY QTDP+FDP  S SFA++PC
Sbjct: 151 GSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPC 210

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDNE 252
            S  C+ L++SGC R + CLYQVSYGDGS TVG+F  ETLTF  +  +  VA+GCGHDNE
Sbjct: 211 ESQQCQALETSGC-RASKCLYQVSYGDGSFTVGEFVIETLTFGNSGMINNVAVGCGHDNE 269

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           GLFV +AGLLGLG G LS  +Q        FSYCLVDR +S+  S + F  +A S +   
Sbjct: 270 GLFVGSAGLLGLGGGSLSLTSQMKA---SSFSYCLVDRDSSSS-SDLEFNSAAPSDSVN- 324

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
            PLL + K+DTFYYV L G+SVGG  +  I  +LF++D +G GG+I+DSGT++TRL   A
Sbjct: 325 APLLKSGKVDTFYYVGLTGMSVGG-QLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQA 383

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATNYLI 431
           Y  LRDAF +    LK+   F+LFDTC+DLS ++ V +PTV   F G   + LP  NYLI
Sbjct: 384 YNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLI 443

Query: 432 PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           PVDS GTFCFAFA T S LSIIGN+QQQG RV YDLA S +GF+P  C
Sbjct: 444 PVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 228/433 (52%), Positives = 279/433 (64%), Gaps = 20/433 (4%)

Query: 62  ESSLSLRLHHVDSL----SFNRTPEHLFNLRIQRDVLRVKSL----TAFAESAVRV---P 110
           E  L+LRLH  D L      + T   L   R++RD  R  ++    T  A+   R+   P
Sbjct: 79  EGGLTLRLHSRDFLPEEQGRHETYRSLVLSRLRRDSARAAAVSARATLAADGVTRLDLRP 138

Query: 111 PRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK 170
               +   A+      V+SG+ QGSGEYF+R+G+G+P R +YMVLDTGSDV W+QC PC 
Sbjct: 139 ANGSAVFAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCA 198

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFS 229
            CY Q+DPVFDP+ S S+A V C S  CR LD++ C N    CLY+V+YGDGS TVGDF+
Sbjct: 199 DCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFA 258

Query: 230 TETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           TETLT    T V  VA+GCGHDNEGLFV AAGLL LG G LSFP+Q        FSYCLV
Sbjct: 259 TETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS---ASTFSYCLV 315

Query: 289 DRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
           DR + A  S++ FGD A        PL+ +P+  TFYYV L GISVGG  +  I AS F 
Sbjct: 316 DRDSPAA-STLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLS-IPASAFA 373

Query: 349 LDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTE 407
           +D  +G+GGVI+DSGT+VTRL   AY ALRDAF  GA SL R    SLFDTC+DLS +T 
Sbjct: 374 MDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTS 433

Query: 408 VKVPTVVLHFRGAD-VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYD 466
           V+VP V L F G   + LPA NYLIPVD +GT+C AFA T + +SIIGN+QQQG RV +D
Sbjct: 434 VEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFD 493

Query: 467 LAASRIGFAPRGC 479
            A   +GF P  C
Sbjct: 494 TARGAVGFTPNKC 506


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 210/366 (57%), Positives = 263/366 (71%), Gaps = 12/366 (3%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
           F + VISGL+ GSGEYF R+ VGTPPR +Y+V+DTGSD++W+QCAPC  CY Q D VFDP
Sbjct: 22  FQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDP 81

Query: 183 AKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR--- 239
            KS +++T+ C S  C  LD  GC   N CLYQV YGDGS + G+F+T+ ++   T    
Sbjct: 82  YKSSTYSTLGCNSRQCLNLDVGGC-VGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGG 140

Query: 240 ---VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST-SAK 295
              + ++ LGCGHDNEG FV AAGLLGLG+G LSFP Q       +FSYCL  R T S +
Sbjct: 141 QVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTE 200

Query: 296 PSSMVFGDSAVSRTA-RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
            SS++FGD+AV     RFTP  +N ++ TFYY+++ GISVGG+ +  I  S F+LD  GN
Sbjct: 201 RSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGS-ILTIPTSAFQLDSLGN 259

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
           GGVIIDSGTSVTRL   AY +LR+AFRAG S L    +FSLFDTC++LS  + V VPTV 
Sbjct: 260 GGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVT 319

Query: 415 LHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
           LHF+ GAD+ LPA+NYL+PVD+S TFC AFAGT +G SIIGNIQQQGFRV+YD   +++G
Sbjct: 320 LHFQGGADLKLPASNYLVPVDNSSTFCLAFAGT-TGPSIIGNIQQQGFRVIYDNLHNQVG 378

Query: 474 FAPRGC 479
           F P  C
Sbjct: 379 FVPSQC 384


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 219/418 (52%), Positives = 272/418 (65%), Gaps = 19/418 (4%)

Query: 68  RLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG----GF 123
           ++HH D  S       L   R+ RD +R  SLTA  + A+    ++  +           
Sbjct: 94  KIHHKDYKS-------LVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDL 146

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+ V SG +QGSGEYFTR+GVG P R  YMVLDTGSD+ W+QC PC  CY QTDP+FDP 
Sbjct: 147 STPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPT 206

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVAR 242
            S ++A V C+S  C  L+ S C R   CLYQV+YGDGS T GDF+TE+++F  +  V  
Sbjct: 207 ASSTYAPVTCQSQQCSSLEMSSC-RSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKN 265

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
           VALGCGHDNEGLFV AAGLLGLG G LS   Q        FSYCLV+R  SA  S++ F 
Sbjct: 266 VALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKA---TSFSYCLVNRD-SAGSSTLDFN 321

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            + +   +   PL+ N K+DTFYYV L G+SVGG  V  I  S F+LD +GNGG+I+D G
Sbjct: 322 SAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMV-SIPESTFRLDESGNGGIIVDCG 380

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGAD 421
           T++TRL   AY  LRDAF     +LK     +LFDTC+DLSG+  V+VPTV  HF  G  
Sbjct: 381 TAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKS 440

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +LPA NYLIPVDS+GT+CFAFA T S LSIIGN+QQQG RV +DLA +R+GF+P  C
Sbjct: 441 WNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 221/440 (50%), Positives = 283/440 (64%), Gaps = 24/440 (5%)

Query: 58  APDAESSLSLRLHHVDSLSFN-----RTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPR 112
           +P    +LSL L H +SL         T E L    +QRD  RV+    + ES  ++  +
Sbjct: 49  SPRDGGTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVR----WIESKAQLAGK 104

Query: 113 NRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
            +    +    +  V SGL  GSGEYF RLGVGTP R ++MV+DTGSD+ W+QC PCK C
Sbjct: 105 KKDEASSTD-LNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSC 163

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN----RRNTCLYQVSYGDGSITVGDF 228
           Y Q DP+FDP  S SF  +PC SPLC+ L+   C+      + C YQV+YGDGS +VGDF
Sbjct: 164 YKQADPIFDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDF 223

Query: 229 STETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ-----TGRRFNRK 282
           S++  T   G++   VA GCG DNEGLF  AAGLLGLG G+LSFP+Q     T       
Sbjct: 224 SSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANS 283

Query: 283 FSYCLVDRST--SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
           FSYCLVDRS   +   SS++FG +A+  TA  +PLL NPKLDTFYY  ++G+SVGGA + 
Sbjct: 284 FSYCLVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLP 343

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF 400
            I+    +L  +G+GGVIIDSGTSVTR     Y  +RDAFR   ++L  AP +SLFDTC+
Sbjct: 344 -ISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCY 402

Query: 401 DLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
           + SGK  V VP +VLHF  GAD+ LP TNYLIP++++G+FC AFA T   L IIGNIQQQ
Sbjct: 403 NFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQ 462

Query: 460 GFRVVYDLAASRIGFAPRGC 479
            FR+ +DL  S + FAP+ C
Sbjct: 463 SFRIGFDLQKSHLAFAPQQC 482


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 234/466 (50%), Positives = 295/466 (63%), Gaps = 28/466 (6%)

Query: 32  SLPTPSTLSWPESVSVSESESSLPLPAPDAE-----SSLSLRLHHVDSLSFNRTPEH--- 83
           S+ T S L+  +S+  ++  SS  L   + +     SS SL+LH   S+   R  EH   
Sbjct: 31  SVTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSRSSSFSLQLHSRVSV---RGTEHSDY 87

Query: 84  --LFNLRIQRDVLRVKSLTAFAESAVR-------VPPRNRSRGRANGGFSSSVISGLAQG 134
             L   R+ RD  RVKSL    + A+         P              + +ISG  QG
Sbjct: 88  KSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTTMYTTTEEEDIEAPLISGTTQG 147

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           SGEYFTR+G+G P R VYMVLDTGSDV W+QC PC  CY QT+P+F+P+ S S+  + C 
Sbjct: 148 SGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCD 207

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C  L+ S C R  TCLY+VSYGDGS TVGDF+TETLT   T V  VA+GCGH NEGL
Sbjct: 208 TPQCNALEVSEC-RNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGCGHSNEGL 266

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
           FV AAGLLGLG G L+ P+Q        FSYCLVDR  S   S++ FG S +   A   P
Sbjct: 267 FVGAAGLLGLGGGLLALPSQLN---TTSFSYCLVDRD-SDSASTVEFGTS-LPPDAVVAP 321

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL N +LDTFYY+ L GISVGG  ++ I  S F++D +G+GG+IIDSGT+VTRL    Y 
Sbjct: 322 LLRNHQLDTFYYLGLTGISVGGELLQ-IPQSSFEMDESGSGGIIIDSGTAVTRLQTGIYN 380

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPV 433
           +LRD+F  G S L++A   ++FDTC++LS KT ++VPTV  HF G  + +LPA NY+IPV
Sbjct: 381 SLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKNYMIPV 440

Query: 434 DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           DS GTFC AFA T S L+IIGN+QQQG RV +DLA S IGF+   C
Sbjct: 441 DSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 232/471 (49%), Positives = 299/471 (63%), Gaps = 24/471 (5%)

Query: 21  ASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSL--SFN 78
           +SLQ    +L+  PT S+L+  +  S+S+     P+   ++ S LSL LH  D+L  S +
Sbjct: 42  SSLQQTQTILSLDPTRSSLTATKPESISD-----PVFF-NSSSPLSLELHSRDTLVASQH 95

Query: 79  RTPEHLFNLRIQRDVLRVKSLTAFAESAVR------VPPRNRSRGRAN-GGFSSSVISGL 131
           +  + L   R++RD  RV  + A    AV       + P N    R      ++ V+SG+
Sbjct: 96  KDYKSLVLSRLERDSSRVAGIAAKIRFAVEGIDRSDLKPVNNEDTRYQPEALTTPVVSGV 155

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATV 191
           +QGSGEYF+R+GVGTP + +Y+VLDTGSDV WIQC PC  CY Q+DPVF+P  S ++ ++
Sbjct: 156 SQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSL 215

Query: 192 PCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHD 250
            C +P C  L++S C R N CLYQVSYGDGS TVG+ +T+T+TF  + ++  VALGCGHD
Sbjct: 216 TCSAPQCSLLETSAC-RSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINDVALGCGHD 274

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           NEGLF  AAGLLGLG G LS   Q        FSYCLVDR  S K SS+ F    +    
Sbjct: 275 NEGLFTGAAGLLGLGGGALSITNQMKA---TSFSYCLVDRD-SGKSSSLDFNSVQLGSGD 330

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
              PLL N K+DTFYYV L G SVGG  V  +  ++F +D +G+GGVI+D GT+VTRL  
Sbjct: 331 ATAPLLRNQKIDTFYYVGLSGFSVGGQKVM-MPDAIFDVDASGSGGVILDCGTAVTRLQT 389

Query: 371 PAYIALRDAFRAGASSLKRA-PDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATN 428
            AY +LRDAF    ++LK+     SLFDTC+D S  + VKVPTV  HF G   + LPA N
Sbjct: 390 QAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKN 449

Query: 429 YLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           YLIPVD +GTFCFAFA T S LSIIGN+QQQG R+ YDLA   IG +   C
Sbjct: 450 YLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 224/430 (52%), Positives = 277/430 (64%), Gaps = 16/430 (3%)

Query: 63  SSLSLRLHHVDSLSFNRTP------EHLFNLRIQRDVLRVKSLTAFAESAVRV--PPRNR 114
           S  S+ + H D+L            E     +++R+ +RV+ L    E  + +   P NR
Sbjct: 72  SPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNR 131

Query: 115 SRGRA--NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
               A  +  F   V+SG+ QGSGEYFTR+GVGTP R  YMVLDTGSDV WIQC PC++C
Sbjct: 132 YENVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCREC 191

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
           YSQ DP+F+P+ S SF+TV C S +C +LD+  C+    CLY+ SYGDGS + G F+TET
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCH-SGGCLYEASYGDGSYSTGSFATET 250

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-S 291
           LTF  T VA VA+GCGH N GLF+ AAGLLGLG G LSFP Q G +    FSYCLVDR S
Sbjct: 251 LTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRES 310

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            S+ P  + FG  +V   + FTPL  NP L TFYY+ +  ISVGGA +  I   +F++D 
Sbjct: 311 DSSGP--LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDE 368

Query: 352 -AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
            +G+GG IIDSGT VTRL   AY A+RDAF AG   L R    S+FDTC+DLSG   V V
Sbjct: 369 TSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFDTCYDLSGLQFVSV 428

Query: 411 PTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
           PTV  HF  GA + LPA NYLIP+D+ GTFCFAFA   S +SI+GN QQQ  RV +D A 
Sbjct: 429 PTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSAN 488

Query: 470 SRIGFAPRGC 479
           S +GFA   C
Sbjct: 489 SLVGFAFDQC 498


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 208/351 (59%), Positives = 249/351 (70%), Gaps = 4/351 (1%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           + QGSGEYFTR+G+GTP R  YMVLDTGSDVVWIQC PC++CYSQ DP+F+P+ S SF+T
Sbjct: 1   MEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFST 60

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           V C S +C +LD++ C+    CLY+VSYGDGS TVG ++TETLTF  T +  VA+GCGHD
Sbjct: 61  VGCDSAVCSQLDANDCHGGG-CLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHD 119

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           N GLFV AAGLLGLG G LSFP Q G +  R FSYCLVDR  S    ++ FG  +V   +
Sbjct: 120 NVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRD-SESSGTLEFGPESVPIGS 178

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLT 369
            FTPL+ANP L TFYY+ +V ISVGG  +  + +  F++D   G GG+IIDSGT+VTRL 
Sbjct: 179 IFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQ 238

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATN 428
             AY ALRDAF AG   L RA   S+FDTC+DLS    V +P V  HF  GA   LPA N
Sbjct: 239 TSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKN 298

Query: 429 YLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            LIP+DS GTFCFAFA   S LSI+GNIQQQG RV +D A S +GFA   C
Sbjct: 299 CLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 209/423 (49%), Positives = 279/423 (65%), Gaps = 14/423 (3%)

Query: 62  ESSLSLRLHHVDSLS-FNRTP---EHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRG 117
           E    L+L H D ++ FN++     H F+ RIQRD  RV +L        R+ PR+ +  
Sbjct: 68  EGKWKLKLVHRDKITAFNKSSYDHSHNFHARIQRDKKRVATLIR------RLSPRDATSS 121

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
            +   F + V+SG+ QGSGEYF R+GVG+PPR  Y+V+D+GSD+VW+QC PC +CY QTD
Sbjct: 122 YSVEEFGAEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTD 181

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           PVFDPA S SF  VPC S +C +++++GC+    C Y+V YGDGS T G  + ETLTF  
Sbjct: 182 PVFDPADSASFMGVPCSSSVCERIENAGCH-AGGCRYEVMYGDGSYTKGTLALETLTFGR 240

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
           T V  VA+GCGH N G+FV AAGLLGLG G +S   Q G +    FSYCLV R T +   
Sbjct: 241 TVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSA-G 299

Query: 298 SMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
           S+ FG  A+   A + PL+ NP+  +FYY+ L G+ VGG  V  I+  +F+L+  GNGGV
Sbjct: 300 SLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVP-ISEDVFQLNEMGNGGV 358

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           ++D+GT+VTR+   AY+A RDAF     +L RA   S+FDTC++L+G   V+VPTV  +F
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYF 418

Query: 418 RGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
            G  + +LPA N+LIPVD  GTFCFAFA + SGLSIIGNIQQ+G ++ +D A   +GF P
Sbjct: 419 AGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGP 478

Query: 477 RGC 479
             C
Sbjct: 479 NVC 481


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 210/398 (52%), Positives = 277/398 (69%), Gaps = 13/398 (3%)

Query: 88  RIQRDVLRVKSLTAFAESAVRVPPRNRSR----GRANGGFSSSVISGLAQGSGEYFTRLG 143
           R++RD  RV+SL    + A+    ++  +            + ++SG +QGSGEYF+R+G
Sbjct: 101 RLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVSGASQGSGEYFSRVG 160

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS 203
           +G+PP++VYMV+DTGSDV W+QCAPC  CY Q DP+F+P+ S S+A + C +  C+ LD 
Sbjct: 161 IGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDV 220

Query: 204 SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDNEGLFVAAAGLL 262
           S C R ++CLY+VSYGDGS TVGDF+TET+T  G+  +  VA+GCGHDNEGLFV AAGLL
Sbjct: 221 SEC-RNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLL 279

Query: 263 GLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLD 322
           GLG G LSFP+Q        FSYCLV+R T +  S++ F +S +   +   PLL N +LD
Sbjct: 280 GLGGGSLSFPSQINA---SSFSYCLVNRDTDSA-STLEF-NSPIPSHSVTAPLLRNNQLD 334

Query: 323 TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA 382
           TFYY+ + GI VGG  +  I  S F++D +GNGG+I+DSGT+VTRL    Y +LRD+F  
Sbjct: 335 TFYYLGMTGIGVGG-QMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVR 393

Query: 383 GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCF 441
           G   L      +LFDTC+DLS ++ V+VPTV  HF  G  ++LPA NYLIPVDS+GTFCF
Sbjct: 394 GTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPVDSAGTFCF 453

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           AFA T S LSIIGN+QQQG RV YDL+ S +GF+P GC
Sbjct: 454 AFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 230/471 (48%), Positives = 296/471 (62%), Gaps = 24/471 (5%)

Query: 21  ASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSL--SFN 78
           +SLQ    +L+  PT S+L+  +  S+S+     P+   ++ S LSL LH  D+   S +
Sbjct: 42  SSLQQTQTILSLDPTRSSLTTTKPESLSD-----PVFF-NSSSPLSLELHSRDTFVASQH 95

Query: 79  RTPEHLFNLRIQRDVLRVKSLTAFAESAVR-------VPPRNRSRGRANGGFSSSVISGL 131
           +  + L   R++RD  RV  + A    AV         P  N          ++ V+SG 
Sbjct: 96  KDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGA 155

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATV 191
           +QGSGEYF+R+GVGTP + +Y+VLDTGSDV WIQC PC  CY Q+DPVF+P  S ++ ++
Sbjct: 156 SQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSL 215

Query: 192 PCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHD 250
            C +P C  L++S C R N CLYQVSYGDGS TVG+ +T+T+TF  + ++  VALGCGHD
Sbjct: 216 TCSAPQCSLLETSAC-RSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHD 274

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           NEGLF  AAGLLGLG G LS   Q        FSYCLVDR  S K SS+ F    +    
Sbjct: 275 NEGLFTGAAGLLGLGGGVLSITNQMKA---TSFSYCLVDRD-SGKSSSLDFNSVQLGGGD 330

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
              PLL N K+DTFYYV L G SVGG  V  +  ++F +D +G+GGVI+D GT+VTRL  
Sbjct: 331 ATAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQT 389

Query: 371 PAYIALRDAFRAGASSLKR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATN 428
            AY +LRDAF     +LK+ +   SLFDTC+D S  + VKVPTV  HF G   + LPA N
Sbjct: 390 QAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKN 449

Query: 429 YLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           YLIPVD SGTFCFAFA T S LSIIGN+QQQG R+ YDL+ + IG +   C
Sbjct: 450 YLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 230/471 (48%), Positives = 296/471 (62%), Gaps = 24/471 (5%)

Query: 21  ASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSL--SFN 78
           +SLQ    +L+  PT S+L+  +  S+S+     P+   ++ S LSL LH  D+   S +
Sbjct: 42  SSLQQTQTILSLDPTRSSLTTTKPESLSD-----PVFF-NSSSPLSLELHSRDTFVASQH 95

Query: 79  RTPEHLFNLRIQRDVLRVKSLTAFAESAVR-------VPPRNRSRGRANGGFSSSVISGL 131
           +  + L   R++RD  RV  + A    AV         P  N          ++ V+SG 
Sbjct: 96  KDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGA 155

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATV 191
           +QGSGEYF+R+GVGTP + +Y+VLDTGSDV WIQC PC  CY Q+DPVF+P  S ++ ++
Sbjct: 156 SQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSL 215

Query: 192 PCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHD 250
            C +P C  L++S C R N CLYQVSYGDGS TVG+ +T+T+TF  + ++  VALGCGHD
Sbjct: 216 TCSAPQCSLLETSAC-RSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHD 274

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           NEGLF  AAGLLGLG G LS   Q        FSYCLVDR  S K SS+ F    +    
Sbjct: 275 NEGLFTGAAGLLGLGGGVLSITNQMKA---TSFSYCLVDRD-SGKSSSLDFNSVQLGGGD 330

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
              PLL N K+DTFYYV L G SVGG  V  +  ++F +D +G+GGVI+D GT+VTRL  
Sbjct: 331 ATAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQT 389

Query: 371 PAYIALRDAFRAGASSLKR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATN 428
            AY +LRDAF     +LK+ +   SLFDTC+D S  + VKVPTV  HF G   + LPA N
Sbjct: 390 QAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKN 449

Query: 429 YLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           YLIPVD SGTFCFAFA T S LSIIGN+QQQG R+ YDL+ + IG +   C
Sbjct: 450 YLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 218/399 (54%), Positives = 272/399 (68%), Gaps = 14/399 (3%)

Query: 88  RIQRDVLRVKSLTAFAESAV-RVPPRNRSRGRANGGFSSS-----VISGLAQGSGEYFTR 141
           R+ RD  RVKSL    +  + RV   +     +N  F ++     V+SG +QGSGEYF R
Sbjct: 93  RLARDSARVKSLQTRLDLVLKRVSNSDLHPAESNAEFEANALQGPVVSGTSQGSGEYFLR 152

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           +G+G PP   Y+VLDTGSDV WIQCAPC +CY Q+DP+FDP  S S++ + C +P C+ L
Sbjct: 153 VGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAPQCKSL 212

Query: 202 DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGL 261
           D S C R  TCLY+VSYGDGS TVG+F+TET+T     V  VA+GCGH+NEGLFV AAGL
Sbjct: 213 DLSEC-RNGTCLYEVSYGDGSYTVGEFATETVTLGTAAVENVAIGCGHNNEGLFVGAAGL 271

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKL 321
           LGLG G+LSFP Q        FSYCLV+R + A  S++ F +S + R     PL  NP+L
Sbjct: 272 LGLGGGKLSFPAQVNA---TSFSYCLVNRDSDAV-STLEF-NSPLPRNVVTAPLRRNPEL 326

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
           DTFYY+ L GISVGG  +  I  S+F++D  G GG+IIDSGT+VTRL    Y ALRDAF 
Sbjct: 327 DTFYYLGLKGISVGGEAL-PIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFV 385

Query: 382 AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFC 440
            GA  + +A   SLFDTC+DLS +  V+VPTV  HF  G ++ LPA NYLIPVDS GTFC
Sbjct: 386 KGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDSVGTFC 445

Query: 441 FAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           FAFA T S LSI+GN+QQQG RV +D+A S +GF+   C
Sbjct: 446 FAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 209/420 (49%), Positives = 274/420 (65%), Gaps = 21/420 (5%)

Query: 67  LRLHHVDSL-SFNRTPEHL--FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGR---AN 120
           L+L H D + +FN + +H   FN R+QRD  RV +L            R+ + G+   A 
Sbjct: 68  LKLVHRDKVPTFNTSHDHRTRFNARMQRDTKRVAALR-----------RHLAAGKPTYAE 116

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
             F S V+SG+ QGSGEYF R+GVG+PPR  Y+V+D+GSD++W+QC PC +CY Q+DPVF
Sbjct: 117 EAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVF 176

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV 240
           +PA S S+A V C S +C  +D++GC+    C Y+VSYGDGS T G  + ETLTF  T +
Sbjct: 177 NPADSSSYAGVSCASTVCSHVDNAGCH-EGRCRYEVSYGDGSYTKGTLALETLTFGRTLI 235

Query: 241 ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 300
             VA+GCGH N+G+FV AAGLLGLG G +SF  Q G +    FSYCLV R   +    + 
Sbjct: 236 RNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSS-GLLQ 294

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           FG  AV   A + PL+ NP+  +FYYV L G+ VGG  V  I+  +FKL   G+GGV++D
Sbjct: 295 FGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVP-ISEDVFKLSELGDGGVVMD 353

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           +GT+VTRL   AY A RDAF A  ++L RA   S+FDTC+DL G   V+VPTV  +F G 
Sbjct: 354 TGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGG 413

Query: 421 DV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            + +LPA N+LIPVD  G+FCFAFA + SGLSIIGNIQQ+G  +  D A   +GF P  C
Sbjct: 414 PILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 212/413 (51%), Positives = 271/413 (65%), Gaps = 19/413 (4%)

Query: 80  TPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYF 139
           T E L    +QRD  RV+    + ES  ++  + +    +    +  V SGL  GSGEYF
Sbjct: 1   THEQLLLETLQRDERRVR----WIESKAKLAGKKKDEASSTD-LNGPVTSGLLYGSGEYF 55

Query: 140 TRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR 199
            RLG+GTP R ++MV+DTGSD+ W+QC PCK CY Q DP+FDP  S SF  +PC SPLC+
Sbjct: 56  VRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCK 115

Query: 200 KLDSSGCN----RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGL 254
            L+   C+      + C YQV+YGDGS +VGDFS++  T   G++   VA GCG DNEGL
Sbjct: 116 ALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGL 175

Query: 255 FVAAAGLLGLGRGRLSFPTQ-----TGRRFNRKFSYCLVDRST--SAKPSSMVFGDSAVS 307
           F  AAGLLGLG G+LSFP+Q     T       FSYCLVDRS   +   SS++FG +A+ 
Sbjct: 176 FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIP 235

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
            TA  +PLL NPKLDTFYY  ++G+SVGGA +  I+    +L  +G+GGVIIDSGTSVTR
Sbjct: 236 STAALSPLLKNPKLDTFYYAAMIGVSVGGAQLP-ISLKSLQLSQSGSGGVIIDSGTSVTR 294

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPA 426
                Y  +RDAFR    +L  AP +SLFDTC++ SGK  V VP +VLHF  GAD+ LP 
Sbjct: 295 FPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPP 354

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           TNYLIP++++G+FC AFA T   L IIGNIQQQ FR+ +DL  S + FAP+ C
Sbjct: 355 TNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 206/357 (57%), Positives = 247/357 (69%), Gaps = 9/357 (2%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+SG+ QGSGEYF+R+G+G+P R +YMVLDTGSDV W+QC PC  CY Q+DPVFDP+ S 
Sbjct: 158 VVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSA 217

Query: 187 SFATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVA 244
           S+A V C SP CR LD++ C N    CLY+V+YGDGS TVGDF+TETLT    T V  VA
Sbjct: 218 SYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVTNVA 277

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           +GCGHDNEGLFV AAGLL LG G LSFP+Q        FSYCLVDR + A  S++ FG  
Sbjct: 278 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS---ASTFSYCLVDRDSPAA-STLQFGAD 333

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGT 363
                    PL+ +P+  TFYYV L GISVGG     I +S F +D  +G+GGVI+DSGT
Sbjct: 334 GAEADTVTAPLVRSPRTGTFYYVALSGISVGG-QALSIPSSAFAMDATSGSGGVIVDSGT 392

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-V 422
           +VTRL   AY ALRDAF  G  SL R    SLFDTC+DLS +T V+VP V L F G   +
Sbjct: 393 AVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGAL 452

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            LPA NYLIPVD +GT+C AFA T + +SIIGN+QQQG RV +D A   +GF P  C
Sbjct: 453 RLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 205/359 (57%), Positives = 249/359 (69%), Gaps = 8/359 (2%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
            S+ V SG +QGSGEYFTR+GVG P R  YMVLDTGSD+ W+QC PC  CY QTDP+FDP
Sbjct: 5   LSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDP 64

Query: 183 AKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVA 241
             S ++A V C+S  C  L+ S C R   CLYQV+YGDGS T GDF+TE+++F  +  V 
Sbjct: 65  TASSTYAPVTCQSQQCSSLEMSSC-RSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVK 123

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
            VALGCGHDNEGLFV AAGLLGLG G LS   Q        FSYCLV+R  SA  S++ F
Sbjct: 124 NVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKA---TSFSYCLVNRD-SAGSSTLDF 179

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
             + +   +   PL+ N K+DTFYYV L G+SVGG  V  I  S F+LD +GNGG+I+D 
Sbjct: 180 NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVS-IPESTFRLDESGNGGIIVDC 238

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGA 420
           GT++TRL   AY  LRDAF     +LK     +LFDTC+DLSG+  V+VPTV  HF  G 
Sbjct: 239 GTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGK 298

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +LPA NYLIPVDS+GT+CFAFA T S LSIIGN+QQQG RV +DLA +R+GF+P  C
Sbjct: 299 SWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 187/369 (50%), Positives = 240/369 (65%), Gaps = 14/369 (3%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
           F + + SGLA G+GEYF  +GVGTP R +Y+V+DTGSD+ W+QCAPC  CY Q D +F+P
Sbjct: 1   FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60

Query: 183 AKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG----- 237
           + S SF  + C S LC  LD  GC   N CLYQ  YGDGS T+G+  T+ +         
Sbjct: 61  SSSSSFKVLDCSSSLCLNLDVMGC-LSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPG 119

Query: 238 -TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAK- 295
              +  + LGCGHDNEG F  AAG+LGLGRG LSFP          FSYCL DR +    
Sbjct: 120 QVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNH 179

Query: 296 PSSMVFGDSAVSRTA----RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            S++VFGD+A+  TA    +F P L NP++ T+YYV++ GISVGG  +  I AS+F+LD 
Sbjct: 180 KSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDS 239

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
            GNGG I DSGT++TRL   AY A+RDAFRA    L  A DF +FDTC+D +G   + VP
Sbjct: 240 HGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISVP 299

Query: 412 TVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           TV  HF+G  D+ LP +NY++PV ++  FCFAFA +M G S+IGN+QQQ FRV+YD    
Sbjct: 300 TVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASM-GPSVIGNVQQQSFRVIYDNVHK 358

Query: 471 RIGFAPRGC 479
           +IG  P  C
Sbjct: 359 QIGLLPDQC 367


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 243/479 (50%), Positives = 295/479 (61%), Gaps = 25/479 (5%)

Query: 20  AASLQYQTFVLNSL-PTPSTLSWPESVSVSESESSLPLPAPD---AESSLSLRLHHVDSL 75
           A +++YQT V   L P P T +  E   + +      L A +   A S++ LR+ H D  
Sbjct: 29  AEAVRYQTLVATPLSPHPYTATAVEDDGLFQGS----LAADEGGAAASTVGLRVVHRDDF 84

Query: 76  SFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGS 135
           + N T   L   R++RD  R   ++A A  A          G    GF + V+SGLAQGS
Sbjct: 85  AVNATAAELLAHRLRRDKRRASRISAAAGGAAAANGTRVGGGGGGSGFVAPVVSGLAQGS 144

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEYFT++GVGTP     MVLDTGSDVVW+QCAPC++CY Q+  +FDP  S S+  V C +
Sbjct: 145 GEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAA 204

Query: 196 PLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEG 253
           PLCR+LDS GC+ RR  CLYQV+YGDGS+T GDF+TETLTF  G RV RVALGCGHDNEG
Sbjct: 205 PLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARVPRVALGCGHDNEG 264

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-----RSTSAKPSSMVFGDSAVSR 308
           LFVAAAGLLGLGRG LSFP+Q  RRF R FSYCLVD      S +++ S++ FG  A   
Sbjct: 265 LFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGARGA 324

Query: 309 TAR--FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSG--- 362
             R    P    P+          G         G        DP+ G GGVI+DSG   
Sbjct: 325 LGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPDPSTGRGGVIVDSGRPS 384

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GA 420
            +  R  R    A R   RA A+ L+ +P  FSLFDTC+DLSG   VKVPTV +HF  GA
Sbjct: 385 PAWARAGRTPPCATRS--RAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGA 442

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + +LP  NYLIPVDS GTFCFAFAGT  G+SIIGNIQQQGFRVV+D    R+GF P+GC
Sbjct: 443 EAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 220/405 (54%), Positives = 264/405 (65%), Gaps = 47/405 (11%)

Query: 83  HLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRL 142
            L   R+ RD  R ++++  A          R+  RA GGFS+ V+SGLAQGSGEYF  +
Sbjct: 97  QLLAHRLARDAARAEAISVSA----------RNVTRAGGGFSAPVVSGLAQGSGEYFASV 146

Query: 143 GVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC---- 198
           GVGTPP    +VLDTGSDVVW+QCAPC++CY+Q+  VFDP +SRS+A V C +P C    
Sbjct: 147 GVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPPCRGLD 206

Query: 199 RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVA 257
                    RR TCLYQV+YGDGS+T GD +TETL F RG RV RVA+GCGHDNEGLFVA
Sbjct: 207 AGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARGARVPRVAVGCGHDNEGLFVA 266

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLA 317
           AAGLLGLGRGRLS PTQT RR+ R+FSYC                               
Sbjct: 267 AAGLLGLGRGRLSLPTQTARRYGRRFSYC-----------------------------FQ 297

Query: 318 NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTSVTRLTRPAYIAL 376
              LD    +  V   VGGA VRG+     +LDP+ G GGVI+DSGTSVTRL RP Y+A+
Sbjct: 298 GSDLDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAV 357

Query: 377 RDAFRAGASSLKRAP-DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVD 434
           R+AFRA A  L+ AP  FSLFDTC+DL G+  VKVPTV +H   GA+V+LP  NYLIPVD
Sbjct: 358 REAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVD 417

Query: 435 SSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + GTFC A AGT  G+SI+GNIQQQGFRVV+D    R+   P+ C
Sbjct: 418 TRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 205/356 (57%), Positives = 249/356 (69%), Gaps = 10/356 (2%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+SG+  GSGEYF+R+GVG+P R +YMVLDTGSDV W+QC PC  CY Q+DPVFDP+ S 
Sbjct: 152 VVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLST 211

Query: 187 SFATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVA 244
           S+A+V C +P C  LD++ C N    CLY+V+YGDGS TVGDF+TETLT   +  V+ VA
Sbjct: 212 SYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSSVA 271

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           +GCGHDNEGLFV AAGLL LG G LSFP+Q        FSYCLVDR  S   S++ FGD+
Sbjct: 272 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS---ATTFSYCLVDRD-SPSSSTLQFGDA 327

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A +      PL+ +P+  TFYYV L GISVGG  +  I  S F +D  G GGVI+DSGT+
Sbjct: 328 ADAEVT--APLIRSPRTSTFYYVGLSGISVGG-QILSIPPSAFAMDGTGAGGVIVDSGTA 384

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           VTRL   AY ALRDAF  G  SL R    SLFDTC+DLS +T V+VP V L F  G ++ 
Sbjct: 385 VTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELR 444

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LPA NYLIPVD +GT+C AFA T + +SIIGN+QQQG RV +D A S +GF    C
Sbjct: 445 LPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 224/426 (52%), Positives = 281/426 (65%), Gaps = 16/426 (3%)

Query: 63  SSLSLRLHHVDSLSFNRTPEH--LFNLRIQRDVLRVKSL-TAFAESAVRVPPRNRSRGRA 119
           SS  ++LH   S+  +   ++  L   R+ RD  RVK+L T       RV   +     +
Sbjct: 66  SSFGIQLHSRASIQKSSHSDYKSLTLSRLARDSARVKALQTRLDLFLKRVSNSDLHPAES 125

Query: 120 NGGFSSS-----VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
              F S+     V+SG +QGSGEYF R+G+G PP   Y+VLDTGSDV WIQCAPC +CY 
Sbjct: 126 KAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQ 185

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
           Q+DP+FDP  S S++ + C  P C+ LD S C R  TCLY+VSYGDGS TVG+F+TET+T
Sbjct: 186 QSDPIFDPISSNSYSPIRCDEPQCKSLDLSEC-RNGTCLYEVSYGDGSYTVGEFATETVT 244

Query: 235 FRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
                V  VA+GCGH+NEGLFV AAGLLGLG G+LSFP Q        FSYCLV+R + A
Sbjct: 245 LGSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNA---TSFSYCLVNRDSDA 301

Query: 295 KPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
             S++ F +S + R A   PL+ NP+LDTFYY+ L GISVGG  +  I  S F++D  G 
Sbjct: 302 V-STLEF-NSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEAL-PIPESSFEVDAIGG 358

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
           GG+IIDSGT+VTRL    Y ALRDAF  GA  + +A   SLFDTC+DLS +  V++PTV 
Sbjct: 359 GGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVS 418

Query: 415 LHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
             F  G ++ LPA NYLIPVDS GTFCFAFA T S LSIIGN+QQQG RV +D+A S +G
Sbjct: 419 FRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVG 478

Query: 474 FAPRGC 479
           F+   C
Sbjct: 479 FSVDSC 484


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 204/356 (57%), Positives = 249/356 (69%), Gaps = 10/356 (2%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+SG+  GSGEYF+R+GVG+P R +YMVLDTGSDV W+QC PC  CY Q+DPVFDP+ S 
Sbjct: 156 VVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLST 215

Query: 187 SFATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVA 244
           S+A+V C +P C  LD++ C N    CLY+V+YGDGS TVGDF+TETLT   +  V+ VA
Sbjct: 216 SYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSSVA 275

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           +GCGHDNEGLFV AAGLL LG G LSFP+Q        FSYCLVDR  S   S++ FGD+
Sbjct: 276 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS---ATTFSYCLVDRD-SPSSSTLQFGDA 331

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A +      PL+ +P+  TFYYV L G+SVGG  +  I  S F +D  G GGVI+DSGT+
Sbjct: 332 ADAEVT--APLIRSPRTSTFYYVGLSGLSVGG-QILSIPPSAFAMDSTGAGGVIVDSGTA 388

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           VTRL   AY ALRDAF  G  SL R    SLFDTC+DLS +T V+VP V L F  G ++ 
Sbjct: 389 VTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELR 448

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LPA NYLIPVD +GT+C AFA T + +SIIGN+QQQG RV +D A S +GF    C
Sbjct: 449 LPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 216/462 (46%), Positives = 289/462 (62%), Gaps = 32/462 (6%)

Query: 31  NSLPTPSTLSWPESVSVSESESSLPLPAPDAESS----LSLRLHHVDSLSFNRTPEHLFN 86
           +S PT   L+  E+++ +     +PL   +          +++ H D LSF  + +H   
Sbjct: 37  SSYPTFQHLNVKETIAGTRI---IPLEVSEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHR 93

Query: 87  L--RIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGG------FSSSVISGLAQGSGEY 138
           L  R++RD  RV SL              R      GG      F + VISG+ QGSGEY
Sbjct: 94  LDGRLKRDAKRVASLI-------------RRLSSGGGGSYRVDDFGTDVISGMEQGSGEY 140

Query: 139 FTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC 198
           F R+GVG+PPR  YMV+D+GSD+VW+QC PC +CY Q+DPVFDPA S SF  V C S +C
Sbjct: 141 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVC 200

Query: 199 RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAA 258
            +L+++GC+    C Y+VSYGDGS T G  + ETLTF  T V  VA+GCGH N G+FV A
Sbjct: 201 DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMFVGA 259

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLAN 318
           AGLLGLG G +SF  Q G +    FSYCLV R T +   S+VFG  A+   A + PL+ N
Sbjct: 260 AGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSS-GSLVFGREALPAGAAWVPLVRN 318

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
           P+  +FYY+ L G+ VGG  V  I+  +F+L   G+GGV++D+GT+VTRL   AY A RD
Sbjct: 319 PRAPSFYYIGLAGLGVGGIRVP-ISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRD 377

Query: 379 AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSG 437
           AF A  ++L RA   ++FDTC+DL G   V+VPTV  +F G  + +LPA N+LIP+D +G
Sbjct: 378 AFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAG 437

Query: 438 TFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           TFCFAFA + SGLSI+GNIQQ+G ++ +D A   +GF P  C
Sbjct: 438 TFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 219/428 (51%), Positives = 276/428 (64%), Gaps = 25/428 (5%)

Query: 65  LSLRLHHVDSLSFNRTPEH--LFNLRIQRDVLRVKSLTAFAESAVRVPPRN-----RSRG 117
            SL+LH  ++L   + P +  L   R+ RD  RV SL    + A+    R+      +  
Sbjct: 77  FSLQLHPRETLLNEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETEL 136

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
                 S+ V SG AQGSGEYF+R+GVG P +  YMVLDTGSDV W+QC PC  CY Q+D
Sbjct: 137 LRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSD 196

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           P+FDP  S S+  + C +  C+ L+ S C R   CLYQVSYGDGS TVG++ TET++F  
Sbjct: 197 PIFDPTASSSYNPLTCDAQQCQDLEMSAC-RNGKCLYQVSYGDGSFTVGEYVTETVSFGA 255

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
             V RVA+GCGHDNEGLFV +AGLLGLG G LS  +Q        FSYCLVDR  S K S
Sbjct: 256 GSVNRVAIGCGHDNEGLFVGSAGLLGLGGGPLSLTSQIKA---TSFSYCLVDRD-SGKSS 311

Query: 298 SMVF-----GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           ++ F     GDS V+      PLL N K++TFYYVEL G+SVGG  V  +    F +D +
Sbjct: 312 TLEFNSPRPGDSVVA------PLLKNQKVNTFYYVELTGVSVGGEIVT-VPPETFAVDQS 364

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
           G GGVI+DSGT++TRL   AY ++RDAF+   S+L+ A   +LFDTC+DLS    V+VPT
Sbjct: 365 GAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPT 424

Query: 413 VVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           V  HF G    +LPA NYLIPVD +GT+CFAFA T S +SIIGN+QQQG RV +DLA S 
Sbjct: 425 VSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSL 484

Query: 472 IGFAPRGC 479
           +GF+P  C
Sbjct: 485 VGFSPNKC 492


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 219/434 (50%), Positives = 270/434 (62%), Gaps = 22/434 (5%)

Query: 61  AESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPP-RNRSRGRA 119
           + S+L +RL H D  + N TP  L   R+QRDVLR   + + A +    PP    S  R 
Sbjct: 64  SSSTLHIRLLHRDRFAANATPAQLLARRLQRDVLRAAWIISKAAANGTPPPVAGLSSAR- 122

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV 179
             GF + V+S  A  SGEY  ++ VGTP     + LDT SD+ W+QC PC++CY Q+ PV
Sbjct: 123 --GFVAPVVS-RAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPV 179

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSGCN--RRNTCLYQVSYGDGSITVGDFSTETLTFR- 236
           FDP  S S+  +   +  C+ L  SG    +R TC+Y V YGDGS TVGDF  ETLTF  
Sbjct: 180 FDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAG 239

Query: 237 GTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD--RSTS 293
           G R+ R+++GCGHDN+GLF A AAG+LGLGRG +SFP Q     N  FSYCLVD      
Sbjct: 240 GVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPG 297

Query: 294 AKPSSMVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           +  S++ FG  AV  S    FTP + N  + TFYYV L GISVGG  V G+T    +LDP
Sbjct: 298 SLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDP 357

Query: 352 -AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP---DFSLFDTCFDLSGKTE 407
             G GGVI+DSGT+VTRL RPAY A RDAFRA A  L +         FDTC+ + G+  
Sbjct: 358 YTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGM 417

Query: 408 VKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVY 465
            KVPTV +HF G+ +V L   NYLIPVDS GT CFAFA T    +SIIGNIQQQGFR+VY
Sbjct: 418 KKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVY 477

Query: 466 DLAASRIGFAPRGC 479
           D+   R+GFAP  C
Sbjct: 478 DIGG-RVGFAPNSC 490


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 223/461 (48%), Positives = 287/461 (62%), Gaps = 23/461 (4%)

Query: 30  LNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEH--LFNL 87
           L+ L   S++S P   S    E +         SS SL LH  + L      ++  L   
Sbjct: 48  LDVLSHKSSVSKP---SDQRDEKTTSFSPTSLASSFSLELHPRELLHGGSHKDYRALMLS 104

Query: 88  RIQRDVLRVKSLTAFAESAVR-------VPPRNRSRGRANGGFSSSVISGLAQGSGEYFT 140
           R+ RD  RVK++    + AV        VP         +  FS+ V SG +QGSGEYF 
Sbjct: 105 RLARDSARVKAINTKLQLAVSGTDKSDLVPMDTEILHPQD--FSTPVTSGTSQGSGEYFL 162

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           R+G+G P +  YMV+DTGSDV W+QC PC  CY Q DP+FDPA S SF+ + C++P CR 
Sbjct: 163 RVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQCRN 222

Query: 201 LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDNEGLFVAAA 259
           LD   C R ++CLYQVSYGDGS TVGDF+TET++F  +  V +VA+GCGHDNEGLFV AA
Sbjct: 223 LDVFAC-RNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCGHDNEGLFVGAA 281

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANP 319
           GL+GLG G LS  +Q        FSYCLV+R  S   S++ F +SA    +   P+  N 
Sbjct: 282 GLIGLGGGPLSLTSQIKA---SSFSYCLVNRD-SVDSSTLEF-NSAKPSDSVTAPIFKNS 336

Query: 320 KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
           K+DTFYYV + G+SVGG  +  I  S+F++D +G GG+I+D GT+VTRL   AY ALRD 
Sbjct: 337 KVDTFYYVGITGMSVGGEKL-AIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALRDT 395

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATNYLIPVDSSGT 438
           F      L     F+LFDTC++LS +T V+VPTV   F G   + LP +NYLIPVDS+GT
Sbjct: 396 FVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSNYLIPVDSAGT 455

Query: 439 FCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           FC AFA T + LSIIGN+QQQG RV YDLA S++ F+ R C
Sbjct: 456 FCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  370 bits (949), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 203/419 (48%), Positives = 268/419 (63%), Gaps = 6/419 (1%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
            ++  L L H D LS        FN R++RD +RV +L            ++     AN 
Sbjct: 69  NNTFKLNLLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVAN- 127

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
            F++ VISG+  GSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PC +CY Q+DPVFD
Sbjct: 128 -FATDVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFD 186

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
           PA S SFA V C S +C +L+++GCN    C Y+VSYGDGS T G  + ETLT     + 
Sbjct: 187 PADSSSFAGVSCGSDVCDRLENTGCN-AGRCRYEVSYGDGSYTKGTLALETLTVGQVMIR 245

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
            VA+GCGH N+G+F+ AAGLLGLG G +SF  Q G +    FSYCLV R T +   ++ F
Sbjct: 246 DVAIGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGST-GALEF 304

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           G  A+   A +  L+ NP+  +FYY+ L GI VGG  V  +    F+L   G  GV++D+
Sbjct: 305 GRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVS-VPEETFQLTEYGTNGVVMDT 363

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GA 420
           GT+VTR    AY+A RD+F A  S+L RAP  S+FDTC+DL+G   V+VPTV  +F  G 
Sbjct: 364 GTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGP 423

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            ++LPA N+LIPVD  GTFC AFA + SGLSIIGNIQQ+G ++ +D A   +GF P  C
Sbjct: 424 VLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  368 bits (945), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 215/423 (50%), Positives = 268/423 (63%), Gaps = 28/423 (6%)

Query: 82  EHLFNLRIQRDVLRVKSL--------TAFAESAVRVPPRNRSRGRANGG----------- 122
           + L   R+++D LR K++          + +S +R P   +S   A  G           
Sbjct: 1   KQLLLARLRKDELRSKAIAATIALATNGWRKSDLRHPLPGQSESLAVAGLASGRGGRGHG 60

Query: 123 -----FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
                F+S +ISG+A GSG+YF R+GVGTP R VYMV DTGSDV W+QC+PC+KCY Q D
Sbjct: 61  GARRGFASPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQD 120

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           P+F+P+ S SF  + C S +C KL   GC+R+N C+YQVSYGDGS TVGDFSTETL+F  
Sbjct: 121 PIFNPSLSSSFKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGE 180

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
             V  VA+GCG +N+GLF  AAGLLGLGRG LSFP+QTG  +   FSYCL  R  SA  +
Sbjct: 181 HAVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCL-PRRESAIAA 239

Query: 298 SMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
           S+VFG SAV   ARFT LL N +LDT+YYV L  I V G+ V  I    F +   G GGV
Sbjct: 240 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVN-IPPDAFAMGSRGTGGV 298

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           I+DSGT+++RLT PAY ALRDAFR+   +   AP  SLFDTC+DLS      +P VVL F
Sbjct: 299 IVDSGTAISRLTTPAYTALRDAFRS-LVTFPSAPGISLFDTCYDLSSMKTATLPAVVLDF 357

Query: 418 R-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
             GA + LPA   L+ VD  GT+C AFA      SIIGN+QQQ FR+  D    ++G AP
Sbjct: 358 DGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAP 417

Query: 477 RGC 479
             C
Sbjct: 418 DQC 420


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  368 bits (944), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 212/416 (50%), Positives = 270/416 (64%), Gaps = 21/416 (5%)

Query: 76  SFNRTPEHLFNL----RIQRDVLRVKSLTAFAE------SAVRVPPRNRSRGRANGGFSS 125
           + ++TP   +      R+ RD  RV+++T   +      S   + P        +   S+
Sbjct: 89  TIHKTPHKDYKALVLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPLQTEIQPQD--LST 146

Query: 126 SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKS 185
            V SG +QGSGEYFTR+GVG P +  YMVLDTGSD+ WIQC PC  CY Q+DP+F PA S
Sbjct: 147 PVSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAAS 206

Query: 186 RSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVA 244
            S++ + C S  C  L  S C R   C YQV+YGDGS T GDF TET++F G+  V  +A
Sbjct: 207 SSYSPLTCDSQQCNSLQMSSC-RNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIA 265

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           LGCGHDNEGLFV AAGLLGLG G LS  +Q        FSYCLV+R  SA  S++ F  +
Sbjct: 266 LGCGHDNEGLFVGAAGLLGLGGGPLSLTSQLKA---TSFSYCLVNRD-SAASSTLDFNSA 321

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
            V  +    PLL + K+DTFYYV L G+SVGG  +R I   +FKLD +G+GGVI+D GT+
Sbjct: 322 PVGDSV-IAPLLKSSKIDTFYYVGLSGMSVGGELLR-IPQEVFKLDDSGDGGVIVDCGTA 379

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VS 423
           +TRL   AY +LRD+F + +  L+     +LFDTC+DLSG++ VKVPTV  HF G     
Sbjct: 380 ITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWD 439

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LPA NYLIPVDS+GT+CFAFA T S LSIIGN+QQQG RV +DLA +R+GF+   C
Sbjct: 440 LPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  364 bits (934), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 232/476 (48%), Positives = 291/476 (61%), Gaps = 33/476 (6%)

Query: 24  QYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEH 83
           +Y ++V+  L +P   S P +   + S SS         S+L + L H DS + N T   
Sbjct: 32  EYHSYVVTPL-SPHPYSAPAAADDNFSVSS--------SSALHIHLLHRDSFAVNATAAE 82

Query: 84  LFNLRIQRDVLRVKSLTAFAESAVRVPPR-NRSRGRANGGFSSSVISGLAQGSGEYFTRL 142
           L   R+QRD LR   + + A +    PP    S GR   G  + V+S  A  SGEY  ++
Sbjct: 83  LLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGR---GLVAPVVS-RAPTSGEYMAKI 138

Query: 143 GVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD 202
            VGTP     + LDT SD+ W+QC PC++CY Q+ PVFDP  S S+  +   +P C+ L 
Sbjct: 139 AVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALG 198

Query: 203 SSGCN--RRNTCLYQVSYGDG----SITVGDFSTETLTFR-GTRVARVALGCGHDNEGLF 255
            SG    +R TC+Y V YGDG    S +VGD   ETLTF  G R A +++GCGHDN+GLF
Sbjct: 199 RSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLF 258

Query: 256 VA-AAGLLGLGRGRLSFPTQTG-RRFNRKFSYCLVD-RSTSAKPSS-MVFGDSAV--SRT 309
            A AAG+LGLGRG++S P Q     +N  FSYCLVD  S    PSS + FG  AV  S  
Sbjct: 259 GAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPP 318

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRL 368
           A FTP + N  + TFYYV L+G+SVGG  V G+T    +LDP  G GGVI+DSGT+VTRL
Sbjct: 319 ASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVILDSGTTVTRL 378

Query: 369 TRPAYIALRDAFRAGASSLKRAPD---FSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSL 424
            RPAY+A RDAFRA A+SL +        LFDTC+ + G+  VKVP V +HF G  +VSL
Sbjct: 379 ARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSL 438

Query: 425 PATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              NYLIPVDS GT CFAFAGT    +S+IGNI QQGFRVVYDLA  R+GFAP  C
Sbjct: 439 QPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  364 bits (934), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 202/356 (56%), Positives = 244/356 (68%), Gaps = 4/356 (1%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S +ISG+A GSG+YF R+GVGTP R VYMV DTGSDV W+QC+PC+KCY Q DP+F+P+ 
Sbjct: 1   SPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSL 60

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           S SF  + C S +C KL   GC+R+N C+YQVSYGDGS TVGDFSTETL+F    V  VA
Sbjct: 61  SSSFKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVA 120

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           +GCG +N+GLF  AAGLLGLGRG LSFP+QTG  +   FSYCL  R  SA  +S+VFG S
Sbjct: 121 MGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCL-PRRESAIAASLVFGPS 179

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           AV   ARFT LL N +LDT+YYV L  I V G+ V  I    F +   G GGVI+DSGT+
Sbjct: 180 AVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVN-IPPDAFAMGSRGTGGVIVDSGTA 238

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           ++RLT PAY ALRDAFR+   +   AP  SLFDTC+DLS      +P VVL F  GA + 
Sbjct: 239 ISRLTTPAYTALRDAFRS-LVTFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMP 297

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LPA   L+ VD  GT+C AFA      SIIGN+QQQ FR+  D    ++G AP  C
Sbjct: 298 LPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  360 bits (925), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 191/360 (53%), Positives = 238/360 (66%), Gaps = 8/360 (2%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           G  S V+SGL +GSGEYF R+G+G+PP   Y+V+D+GSDV+W+QC PC +CY+Q DP+FD
Sbjct: 111 GSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFD 170

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
           PA S +F+ VPC S +CR L +SGC     C Y+VSYGDGS T G  + ETLT  GT V 
Sbjct: 171 PATSATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVE 230

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
            VA+GCGH N GLFV AAGLLGLG G +S   Q G      FSYCL  R       S+V 
Sbjct: 231 GVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGA----GSLVL 286

Query: 302 GDS-AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           G S AV   A + PL+ NP+  +FYYV L GI VG   +  +   LF+L   G GGV++D
Sbjct: 287 GRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLP-LQEDLFQLTEDGAGGVVMD 345

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG- 419
           +GT+VTRL + AY ALRDAF A   +L RAP  SL DTC+DLSG T V+VPTV  +F G 
Sbjct: 346 TGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGA 405

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A ++LPA N L+ VD  G +C AFA + SG SI+GNIQQ+G ++  D A   IGF P  C
Sbjct: 406 ATLTLPARNLLLEVD-GGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 190/365 (52%), Positives = 239/365 (65%), Gaps = 9/365 (2%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           G  S V+SGL +GSGEYF R+G+G+PP   Y+V+D+GSDV+W+QC PC +CY+Q DP+FD
Sbjct: 109 GSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFD 168

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
           PA S +F+ V C S +CR L +SGC     C Y+VSYGDGS T G  + ETLT  GT V 
Sbjct: 169 PASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTLGGTAVE 228

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS---- 297
            VA+GCGH N GLFV AAGLLGLG G +S   Q G      FSYCL  R  S   +    
Sbjct: 229 GVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAA 288

Query: 298 -SMVFGDS-AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
            S+V G S AV   A + PL+ NP+  +FYYV + GI VG   +  +   LF+L   G G
Sbjct: 289 GSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLP-LQDGLFQLTEDGGG 347

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVL 415
           GV++D+GT+VTRL + AY ALRDAF     +L RAP  SL DTC+DLSG T V+VPTV  
Sbjct: 348 GVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSF 407

Query: 416 HFRG-ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGF 474
           +F G A ++LPA N L+ VD  G +C AFA + SGLSI+GNIQQ+G ++  D A   IGF
Sbjct: 408 YFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGF 466

Query: 475 APRGC 479
            P  C
Sbjct: 467 GPATC 471


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  358 bits (920), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 208/453 (45%), Positives = 273/453 (60%), Gaps = 54/453 (11%)

Query: 39  LSWPESV---SVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNL--RIQRDV 93
           L WP  +    VSE          +      +++ H D LSF  + +H   L  R++RD 
Sbjct: 111 LWWPCQIIPLEVSEDHE-------EGGEKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDA 163

Query: 94  LRVKSLTAFAESAVRVPPRNRSRGRANGG------FSSSVISGLAQGSGEYFTRLGVGTP 147
            RV SL              R      GG      F + VISG+ QGSGEYF R+GVG+P
Sbjct: 164 KRVASLI-------------RRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSP 210

Query: 148 PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN 207
           PR  YMV+D+GSD+VW+QC PC +CY Q+DPVFDPA S SF  V C S +C +L+++GC+
Sbjct: 211 PRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLENAGCH 270

Query: 208 RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
               C Y+VSYGDGS T G  + ETLTF  T V  VA+GCGH N G+FV AAGLLGLG G
Sbjct: 271 -AGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGG 329

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYV 327
            +SF  Q G +    FSYCLV                    +A + PL+ NP+  +FYY+
Sbjct: 330 SMSFVGQLGGQTGGAFSYCLV--------------------SAAWVPLVRNPRAPSFYYI 369

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
            L G+ VGG  V  I+  +F+L   G+GGV++D+GT+VTRL   AY A RDAF A  ++L
Sbjct: 370 GLAGLGVGGIRVP-ISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANL 428

Query: 388 KRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGT 446
            RA   ++FDTC+DL G   V+VPTV  +F G  + +LPA N+LIP+D +GTFCFAFA +
Sbjct: 429 PRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPS 488

Query: 447 MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            SGLSI+GNIQQ+G ++ +D A   +GF P  C
Sbjct: 489 TSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score =  357 bits (917), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 211/341 (61%), Positives = 253/341 (74%), Gaps = 18/341 (5%)

Query: 1   MEGKARNHLLL-LFS-FFFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPA 58
           ME K  N L   +F+  FFT++AS QYQT V+N+LP+ +TLSWPES S+++   S     
Sbjct: 1   MERKVLNTLAFSVFAVLFFTSSASSQYQTLVVNTLPSSATLSWPESESLTDESLSE---- 56

Query: 59  PDAESSLSLRLHHVDSLSF--NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR 116
             + +SLS+ L HVD+LS   + +P  LFNLR+QRD LRVKS+T+ A  +       R+ 
Sbjct: 57  --STTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRT- 113

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT 176
            R  GGFS +VISGL+QGSGEYF RLGVGTP   VYMVLDTGSDVVW+QC+PCK CY+QT
Sbjct: 114 PRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQT 173

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKL-DSSGC--NRRNTCLYQVSYGDGSITVGDFSTETL 233
           D +FDP KS++FATVPC S LCR+L DSS C   R  TCLYQVSYGDGS T GDFSTETL
Sbjct: 174 DAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETL 233

Query: 234 TFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS-- 291
           TF G RV  V LGCGHDNEGLFV AAGLLGLGRG LSFP+QT  R+N KFSYCLVDR+  
Sbjct: 234 TFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSS 293

Query: 292 --TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELV 330
             +S  PS++VFG++AV +T+ FTPLL NPKLDTFYY   +
Sbjct: 294 GSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYCSFL 334


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  355 bits (911), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 206/424 (48%), Positives = 281/424 (66%), Gaps = 11/424 (2%)

Query: 60  DAESSLSLRLHHVD---SLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR 116
           ++ S  +LRL H D   S+++ R   H  + R++RD  RV ++      + +V P + SR
Sbjct: 54  ESSSKYTLRLLHRDRFPSVTY-RNHHHRLHARMRRDTDRVSAI--LRRISGKVIPSSDSR 110

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT 176
              N  F S ++SG+ QGSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PCK CY Q+
Sbjct: 111 YEVND-FGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQS 169

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR 236
           DPVFDPAKS S+  V C S +C ++++SGC+    C Y+V YGDGS T G  + ETLTF 
Sbjct: 170 DPVFDPAKSGSYTGVSCGSSVCDRIENSGCH-SGGCRYEVMYGDGSYTKGTLALETLTFA 228

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP 296
            T V  VA+GCGH N G+F+ AAGLLG+G G +SF  Q   +    F YCLV R T +  
Sbjct: 229 KTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST- 287

Query: 297 SSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
            S+VFG  A+   A + PL+ NP+  +FYYV L G+ VGG  +  +   +F L   G+GG
Sbjct: 288 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIP-LPDGVFDLTETGDGG 346

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLH 416
           V++D+GT+VTRL   AY+A RD F++  ++L RA   S+FDTC+DLSG   V+VPTV  +
Sbjct: 347 VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 406

Query: 417 F-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           F  G  ++LPA N+L+PVD SGT+CFAFA + +GLSIIGNIQQ+G +V +D A   +GF 
Sbjct: 407 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 466

Query: 476 PRGC 479
           P  C
Sbjct: 467 PNVC 470


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  355 bits (910), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 214/466 (45%), Positives = 286/466 (61%), Gaps = 24/466 (5%)

Query: 24  QYQTFVLNSLPTPSTLSWPESVSVSESESSLPL-PAPDAESS--LSLRLHHVDSL-SFNR 79
            +Q   +  +    T  +P     S+   +  L  A +A SS    L+L H D + +FN 
Sbjct: 24  HFQQLNVKQIILTETKLYPNPTQPSKHPHNKKLNSATEASSSAKYKLKLVHRDKVPTFNT 83

Query: 80  TPEHL--FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGR---ANGGFSSSVISGLAQG 134
             +H   FN R+QRD  R  SL            R  + G+   A   F S V+SG+ QG
Sbjct: 84  YHDHRTRFNARMQRDTKRAASLL-----------RRLAAGKPTYAAEAFGSDVVSGMEQG 132

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           SGEYF R+GVG+PPR  Y+V+D+GSD++W+QC PC +CY Q+DPVF+PA S SF+ V C 
Sbjct: 133 SGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCA 192

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           S +C  +D++ C+    C Y+VSYGDGS T G  + ET+TF  T +  VA+GCGH N+G+
Sbjct: 193 STVCSHVDNAACH-EGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAIGCGHHNQGM 251

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
           FV AAGLLGLG G +SF  Q G +    FSYCLV R   +    + FG  A+   A + P
Sbjct: 252 FVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESS-GLLEFGREAMPVGAAWVP 310

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           L+ NP+  +FYY+ L G+ VGG  V  I+  +FKL   G+GGV++D+GT+VTRL   AY 
Sbjct: 311 LIHNPRAQSFYYIGLSGLGVGGLRVS-ISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYE 369

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPV 433
           A RD F A  ++L RA   S+FDTC+DL G   V+VPTV  +F G  + +LPA N+LIPV
Sbjct: 370 AFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPV 429

Query: 434 DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           D  GTFCFAFA + SGLSIIGNIQQ+G ++  D A   +GF P  C
Sbjct: 430 DDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  354 bits (908), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 199/392 (50%), Positives = 255/392 (65%), Gaps = 11/392 (2%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           +QRDV RV SL     S         S G  +  F S V+SG+ QGSGEYF R+GVG+PP
Sbjct: 1   MQRDVKRVVSLIRRVSSG-----STASYGVED--FGSEVVSGMDQGSGEYFVRIGVGSPP 53

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           R  YMV+D+GSD+VW+QC PC +CY QTDP+FDPA S SF  V C S +C ++D++GCN 
Sbjct: 54  RSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCN- 112

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGR 268
              C Y+VSYGDGS T G  + ETLT   T V  VA+GCGH N+G+FV AAGLLGLG G 
Sbjct: 113 SGRCRYEVSYGDGSSTKGTLALETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGS 172

Query: 269 LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVE 328
           +SF  Q  R     FSYCLV R T++    + FG  A+   A + PL+ NP   ++YY+ 
Sbjct: 173 MSFVGQLSRERGNAFSYCLVSRVTNSN-GFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIG 231

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           L G+ VG   V  I+  +F+L   GNGGV++D+GT+VTR    AY A RDAF     +L 
Sbjct: 232 LSGLGVGDMKVP-ISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLP 290

Query: 389 RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTM 447
           RA   S+FDTC++L G   V+VPTV  +F G  + +LPA N+LIPVD +GTFCFAFA + 
Sbjct: 291 RASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSP 350

Query: 448 SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           SGLSI+GNIQQ+G ++  D A   +GF P  C
Sbjct: 351 SGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 212/466 (45%), Positives = 284/466 (60%), Gaps = 17/466 (3%)

Query: 17  FTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLS 76
           + A   L  +  +  +   PS L  P+ + + E+     L    ++S   L+L H D L 
Sbjct: 25  YPATQLLNVKDTIKEAETAPSRL--PQDLELHENYPIFELDNNSSQSQWKLKLFHRDKLP 82

Query: 77  FNRTPEH--LFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQG 134
            N  P+H   F  RI RD  RV SL     S       +         F S V+SG  QG
Sbjct: 83  LNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQVTD---------FGSDVVSGTEQG 133

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           SGEYF R+GVG+PPR  Y+V+D+GSD+VW+QC PC +CY Q+DPVFDPA S ++A + C 
Sbjct: 134 SGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCD 193

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           S +C +LD++GCN    C Y+VSYGDGS T G  + ETLTF    +  +A+GCGH N G+
Sbjct: 194 SSVCDRLDNAGCN-DGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMNRGM 252

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
           F+ AAGLLGLG G +SF  Q G +    FSYCLV R T +   ++ FG  A+   A + P
Sbjct: 253 FIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTEST-GTLEFGRGAMPVGAAWVP 311

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           L+ NP+  +FYYV L G+ VGG  V  I   +F+L   G GGV++D+GT+VTRL  PAY 
Sbjct: 312 LIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYE 370

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPV 433
           A RD F    ++L R+   S+FDTC++L+G   V+VPTV  +F G  + +LPA N+LIPV
Sbjct: 371 AFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPV 430

Query: 434 DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           D  GTFCFAFA + SGLSIIGNIQQ+G ++  D +   +GF P  C
Sbjct: 431 DGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 208/424 (49%), Positives = 278/424 (65%), Gaps = 10/424 (2%)

Query: 60  DAESSLSLRLHHVD---SLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR 116
           D+ S  +LRL H D   S+++ R   H  + R++RD  RV ++       V V   + SR
Sbjct: 54  DSNSKYTLRLLHRDRFPSVTY-RNHHHRLHARMRRDTDRVSAILRRISGKVVVASSD-SR 111

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT 176
              N  F S V+SG+ QGSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PCK CY Q+
Sbjct: 112 YEVND-FGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQS 170

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR 236
           DPVFDPAKS S+  V C S +C ++++SGC+    C Y+V YGDGS T G  + ETLTF 
Sbjct: 171 DPVFDPAKSGSYTGVSCGSSVCDRIENSGCH-SGGCRYEVMYGDGSYTKGTLALETLTFA 229

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP 296
            T V  VA+GCGH N G+F+ AAGLLG+G G +SF  Q   +    F YCLV R T +  
Sbjct: 230 KTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST- 288

Query: 297 SSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
            S+VFG  A+   A + PL+ NP+  +FYYV L G+ VGG  +  +   +F L   G+GG
Sbjct: 289 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIP-LPDGVFDLTETGDGG 347

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLH 416
           V++D+GT+VTRL   AY A RD F++  ++L RA   S+FDTC+DLSG   V+VPTV  +
Sbjct: 348 VVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 407

Query: 417 F-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           F  G  ++LPA N+L+PVD SGT+CFAFA + +GLSIIGNIQQ+G +V +D A   +GF 
Sbjct: 408 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 467

Query: 476 PRGC 479
           P  C
Sbjct: 468 PNVC 471


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  353 bits (905), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 186/372 (50%), Positives = 247/372 (66%), Gaps = 16/372 (4%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S V+SG+   SGEYF  +GVG PP +  +V+DTGSD++W+QC PC++CY Q  P++DP  
Sbjct: 79  SPVMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRN 138

Query: 185 SRSFATVPCRSPLCRK-LDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA 241
           S++   +PC SP CR  L   GC+ R   C+Y V YGDGS + GD +T+TL     TRV 
Sbjct: 139 SKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRVH 198

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS--M 299
            V LGCGHDNEGL  +AAGLLG GRG+LSFPTQ    +   FSYCL DR + A+ SS  +
Sbjct: 199 NVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYL 258

Query: 300 VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVI 358
           VFG +    +  FTPL  NP+  + YYV++VG SVGG  V G + +   L+PA G GGV+
Sbjct: 259 VFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVV 318

Query: 359 IDSGTSVTRLTRPAYIALRDAF--RAGASSLKRAPD-FSLFDTCFDLSGK---TEVKVPT 412
           +DSGT+++R TR AY A+RDAF   A A+ ++R  + FS+FDTC+D+ G    T V+VP+
Sbjct: 319 VDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPS 378

Query: 413 VVLHF-RGADVSLPATNYLIPV---DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
           +VLHF   AD++LP  NYLIPV   D    FC        GL+++GN+QQQGF VV+D+ 
Sbjct: 379 IVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVE 438

Query: 469 ASRIGFAPRGCA 480
             RIGF P GC+
Sbjct: 439 RGRIGFTPNGCS 450


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  348 bits (894), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 191/392 (48%), Positives = 253/392 (64%), Gaps = 11/392 (2%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           + RDV RV SL     S                 F S V+SG+ QGSGEYF R+G+G+PP
Sbjct: 1   MHRDVKRVASLIHRLSSGSAAKYEVED-------FGSDVVSGMNQGSGEYFVRIGLGSPP 53

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           R  YMV+D+GSD+VW+QC PC +CY QTDP+FDPA S SF  V C S +C +++++GCN 
Sbjct: 54  RSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCN- 112

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGR 268
              C Y+VSYGDGS T G  + ETLTF  T V  VA+GCGH N G+FV AAGLLGLG G 
Sbjct: 113 SGRCRYEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRGMFVGAAGLLGLGGGS 172

Query: 269 LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVE 328
           +SF  Q   +    FSYCLV R T+     + FG  A+   A + PL+ NP+  +FYY+ 
Sbjct: 173 MSFMGQLSGQTGNAFSYCLVSRGTNTN-GFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIR 231

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           L+G+ VG   V  ++  +F+L+  G+GGV++D+GT+VTR    AY A R+AF     +L 
Sbjct: 232 LLGLGVGDTRVP-VSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLP 290

Query: 389 RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTM 447
           RA   S+FDTC++L G   V+VPTV  +F G  + ++PA N+LIPVD +GTFCFAFA + 
Sbjct: 291 RASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSP 350

Query: 448 SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           SGLSI+GNIQQ+G ++  D A   +GF P  C
Sbjct: 351 SGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  348 bits (892), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 211/431 (48%), Positives = 265/431 (61%), Gaps = 37/431 (8%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           L +RL H DS + N +   L   R+QRD+ R   +    ++A    P N           
Sbjct: 66  LQVRLVHRDSFAVNASAADLLARRLQRDMRRAAWI--ITKAATPADPEN----------- 112

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPR-----YVYMVLDTGSDVVWIQCAPCKKCYSQTDPV 179
            +V++G A  SGEY  ++ VGTP          +  D GSDV W+QC PC +CY Q  PV
Sbjct: 113 GTVVTG-APTSGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPV 171

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSG-CNR-RNTCLYQVSYGDGSITVGDFSTETLTFR- 236
           ++  KS S + V C +P CR L SSG C +  N C Y+V YGDGS + GDF  ETLTF  
Sbjct: 172 YNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPP 231

Query: 237 GTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAK 295
           G RV  VA+GCG DN+GLF A AAG+LGLGRG LSFP+Q   R+ R FSYCL  + T  +
Sbjct: 232 GVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGR 291

Query: 296 PSSMVFGDSAVS-----RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
            S++ FG  A +         FTP+L N ++ TFYYV LVGISVGG  VRG+T S  +LD
Sbjct: 292 SSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLD 351

Query: 351 PA-GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD----FSLFDTCF-DLSG 404
           P+ G+GGVI+DSGT+VTRL+ PAY A RDAFR  A      P     F+ FDTC+  + G
Sbjct: 352 PSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRG 411

Query: 405 KTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSS-GTFCFAFAGTMS-GLSIIGNIQQQGF 461
           +   KVP V +HF G  +V LP  NYLIPVDS+ GT CFAFAG+   G+SIIGNIQ QGF
Sbjct: 412 RVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGF 471

Query: 462 RVVYDLAASRI 472
           RVVYD+   R+
Sbjct: 472 RVVYDVDGQRV 482


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  346 bits (887), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 224/483 (46%), Positives = 287/483 (59%), Gaps = 33/483 (6%)

Query: 24  QYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEH 83
           QY ++ +  L +P   S PE+           + A  + S++ +RL H DS + N T   
Sbjct: 31  QYHSYAVTPL-SPHAHSSPEAAEDGAHAHQEDMAA-SSSSAMHVRLLHRDSFAVNATGAE 88

Query: 84  LFNLRIQRDVLRVKSLTAFAESAVRVPPR--NRSRGRANGGFSSSVISGLAQGSGEYFTR 141
           L   R+QRD LR   + + A +    PP     S GR   G  + V+S  A  SG+Y  +
Sbjct: 89  LLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGR---GLVAPVVS-RAPTSGDYIAK 144

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           + VGTP     + LDT SD+ W+QC PC++CY Q+ PVFDP  S S+  +   +P C+ L
Sbjct: 145 IAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQAL 204

Query: 202 DSSGCN--RRNTCLYQVSYGDG------SITVGDFSTETLTFR-GTRVARVALGCGHDNE 252
             SG    +R TC+Y V YGDG      S +VGD   ETLTF  G R A +++GCGHDN+
Sbjct: 205 GRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNK 264

Query: 253 GLFVA-AAGLLGLGRGRLSFPTQTG-RRFNRKFSYCLVD-RSTSAKPSS-MVFGDSAV-- 306
           GLF A AAG+LGL RG++S P Q     +N  FSYCLVD  S    PSS + FG  AV  
Sbjct: 265 GLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDT 324

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSV 365
           S  A FTP + N  + TFYYV L+G+SVGG  V G+T    +LDP  G+GGVI+DSGT+V
Sbjct: 325 SPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGGVILDSGTTV 384

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPD---FSLFDTCFDLSGKTE----VKVPTVVLHFR 418
           TRL RPAY A RDAFRA A+ L +        LFDTC+ + G+      VKVP V +HF 
Sbjct: 385 TRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFA 444

Query: 419 GA-DVSLPATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAASRIGFAP 476
           G  ++SL   NYLI VDS GT CFAFAGT    +S+IGNI QQGFRVVYD+   R+GFAP
Sbjct: 445 GGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDIGGQRVGFAP 504

Query: 477 RGC 479
             C
Sbjct: 505 NSC 507


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  344 bits (883), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 202/449 (44%), Positives = 269/449 (59%), Gaps = 26/449 (5%)

Query: 39  LSWPESVSVSE--SESSL-PLPAPD---AESSLSLRLHHVDSLSFNRTPEHL-FNLRIQR 91
           LS+ + ++V    SE+ L PL   +    +     +L H D+++  +T     F  RI R
Sbjct: 26  LSYFQHLNVENAISETKLKPLKQQNHNTQQPQWKTKLFHRDNINLKKTTHKTRFISRINR 85

Query: 92  DVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYV 151
           D+ RV   T       +     ++       F S V+SG  +GSGEYF R+G+G+P  Y 
Sbjct: 86  DIKRV---TFLLNRLNKNTQEQQTTTATEASFGSDVVSGTEEGSGEYFVRIGIGSPAIYQ 142

Query: 152 YMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNT 211
           YMV+D+GSD+VWIQC PC +CY+QTDP+F+PA S SF  V C S +C +LD     R+  
Sbjct: 143 YMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSSNVCNQLDDDVACRKGR 202

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           C YQV+YGDGS T G  + ET+T   T +   A+GCGH NEG+FV AAGLLGLG G +SF
Sbjct: 203 CGYQVAYGDGSYTKGTLALETITIGRTVIQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSF 262

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVG 331
             Q G +    F YCLV R              A+   A + PL+ NP   +FYYV L G
Sbjct: 263 VGQLGAQTGGAFGYCLVSR--------------AMPVGAMWVPLIHNPFYPSFYYVSLSG 308

Query: 332 ISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP 391
           ++VGG  V  I+  +F+L   G GGV++D+GT++TRL   AY A RDAF A  ++L RAP
Sbjct: 309 LAVGGIRVP-ISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAP 367

Query: 392 DFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGL 450
             S+FDTC+DL+G   V+VPTV  +F G  + + PA N+LIP D  GTFCFAFA + SGL
Sbjct: 368 GVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGL 427

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           SIIGNIQQ+G +V  D     +GF P  C
Sbjct: 428 SIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 193/331 (58%), Positives = 227/331 (68%), Gaps = 9/331 (2%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC-NRRNT 211
           MVLDTGSDV W+QC PC  CY Q+DPVFDP+ S S+A V C S  CR LD++ C N    
Sbjct: 1   MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60

Query: 212 CLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
           CLY+V+YGDGS TVGDF+TETLT    T V  VA+GCGHDNEGLFV AAGLL LG G LS
Sbjct: 61  CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 120

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELV 330
           FP+Q        FSYCLVDR + A  S++ FGD A        PL+ +P+  TFYYV L 
Sbjct: 121 FPSQISAS---TFSYCLVDRDSPAA-STLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALS 176

Query: 331 GISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR 389
           GISVGG  +  I AS F +D  +G+GGVI+DSGT+VTRL   AY ALRDAF  GA SL R
Sbjct: 177 GISVGGQPLS-IPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPR 235

Query: 390 APDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATNYLIPVDSSGTFCFAFAGTMS 448
               SLFDTC+DLS +T V+VP V L F G   + LPA NYLIPVD +GT+C AFA T +
Sbjct: 236 TSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNA 295

Query: 449 GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +SIIGN+QQQG RV +D A   +GF P  C
Sbjct: 296 AVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 196/422 (46%), Positives = 254/422 (60%), Gaps = 20/422 (4%)

Query: 66  SLRLHHVDSLSFNRTPE--HLFNLRIQRDVLRVKSLTA--FAESAVRVPPRNRSRGRANG 121
           SL L H D++S    P   H     + RD  RV+ L     A ++  +P           
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPED--------- 114

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
              S V+ G+  GSGEYF R+GVG+PP   Y+V+D+GSDV+W+QC PC++CY+QTDP+FD
Sbjct: 115 -LVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFD 173

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNT---CLYQVSYGDGSITVGDFSTETLTFRGT 238
           PA S SF+ V C S +CR L  +GC        C Y V+YGDGS T G+ + ETLT  GT
Sbjct: 174 PAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT 233

Query: 239 RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
            V  VA+GCGH N GLFV AAGLLGLG G +S   Q G      FSYCL  R      S 
Sbjct: 234 AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSL 293

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           ++    AV   A + PL+ N +  +FYYV L GI VGG  +  +  SLF+L   G GGV+
Sbjct: 294 VLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLP-LQDSLFQLTEDGAGGVV 352

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF- 417
           +D+GT+VTRL R AY ALR AF     +L R+P  SL DTC+DLSG   V+VPTV  +F 
Sbjct: 353 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 412

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +GA ++LPA N L+ V  +  FC AFA + SG+SI+GNIQQ+G ++  D A   +GF P 
Sbjct: 413 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 471

Query: 478 GC 479
            C
Sbjct: 472 TC 473


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 210/430 (48%), Positives = 256/430 (59%), Gaps = 24/430 (5%)

Query: 59  PDAESSLSLRLHHVDSLSFNRTP--EHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR 116
           PD   SL+L   H D++S    P   H       RD  RV+ L            R  S 
Sbjct: 65  PDGRPSLALL--HRDAVSGRTYPSTRHAMLGLAARDGARVEYLQ-----------RRLSP 111

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT 176
                   S V+SG+++GSGEYF R+GVG+PP   Y+V+D+GSDV+WIQC PC +CY Q 
Sbjct: 112 TTMTTEVGSEVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQA 171

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
           DP+FDPA S SF  VPC S +CR L   SSGC     C YQVSYGDGS T G  + ETLT
Sbjct: 172 DPLFDPAASASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLT 231

Query: 235 F-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
           F   T V  VA+GCGH N GLFV AAGLLGLG G +S   Q G      FSYCL  R   
Sbjct: 232 FGDSTPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGAD 291

Query: 294 AKPSSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           A   S+VFG D A+   A + PLL N +  +FYYV L G+ VGG  +  +   LF L   
Sbjct: 292 AGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLP-LQDGLFDLTED 350

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAG-ASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
           G GGV++D+GT+VTRL   AY ALRDAF +     L RAP  SL DTC+DLSG   V+VP
Sbjct: 351 GGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVP 410

Query: 412 TVVLHF--RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
           TV L+F   GA ++LPA N L+ +   G +C AFA + SGLSI+GNIQQQG ++  D A 
Sbjct: 411 TVALYFGRDGAALTLPARNLLVEM-GGGVYCLAFAASASGLSILGNIQQQGIQITVDSAN 469

Query: 470 SRIGFAPRGC 479
             +GF P  C
Sbjct: 470 GYVGFGPSTC 479


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  340 bits (872), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 195/422 (46%), Positives = 253/422 (59%), Gaps = 20/422 (4%)

Query: 66  SLRLHHVDSLSFNRTPE--HLFNLRIQRDVLRVKSLTA--FAESAVRVPPRNRSRGRANG 121
           SL L H D++S    P   H     + RD  RV+ L     A ++  +P           
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPED--------- 114

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
              S V+ G+  GSGEYF R+GVG+PP   Y+V+D+GSDV+W+QC PC++CY+QTDP+FD
Sbjct: 115 -LVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFD 173

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNT---CLYQVSYGDGSITVGDFSTETLTFRGT 238
           PA S SF+ V C S +CR L  +GC        C Y V+YGDGS T G+ + ETLT  GT
Sbjct: 174 PAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT 233

Query: 239 RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
            V  VA+GCGH N GLFV AAGLLGLG G +S   Q G      FSYCL  R      S 
Sbjct: 234 AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGAGSL 293

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           ++    AV   A + PL+ N +  +FYYV L GI VGG  +  +   LF+L   G GGV+
Sbjct: 294 VLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLP-LQDGLFQLTEDGAGGVV 352

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF- 417
           +D+GT+VTRL R AY ALR AF     +L R+P  SL DTC+DLSG   V+VPTV  +F 
Sbjct: 353 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 412

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +GA ++LPA N L+ V  +  FC AFA + SG+SI+GNIQQ+G ++  D A   +GF P 
Sbjct: 413 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 471

Query: 478 GC 479
            C
Sbjct: 472 TC 473


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  337 bits (864), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 179/374 (47%), Positives = 238/374 (63%), Gaps = 18/374 (4%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S V+SG+   SGEYF  + VG PP    +V+DTGSD++W+QC PC+ CY Q  P++DP  
Sbjct: 75  SPVMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRS 134

Query: 185 SRSFATVPCRSPLCRK-LDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA 241
           S +   +PC SP CR  L   GC+ R   C+Y V YGDGS + GD +T+ L F   T V 
Sbjct: 135 SSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVH 194

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS--M 299
            V LGCGHDN GL  +AAGLLG+GRG+LSFPTQ    +   FSYCL DR + A+  S  +
Sbjct: 195 NVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYL 254

Query: 300 VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVI 358
           VFG +    +  FTPL  NP+  + YYV++VG SVGG  V G + +   L+PA G GG++
Sbjct: 255 VFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIV 314

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSL----KRAPDFSLFDTCFDLSGK----TEVKV 410
           +DSGT+++R  R AY A+RDAF + A++     K A  FS+FD C+DL G       V+V
Sbjct: 315 VDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRV 374

Query: 411 PTVVLHFR-GADVSLPATNYLIPV---DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYD 466
           P++VLHF  GAD++LP  NYLIPV   D    FC        GL+++GN+QQQGF +V+D
Sbjct: 375 PSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFD 434

Query: 467 LAASRIGFAPRGCA 480
           +   RIGF P GC+
Sbjct: 435 VERGRIGFTPNGCS 448


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 186/367 (50%), Positives = 232/367 (63%), Gaps = 12/367 (3%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
             S VISGL   SGEYF  +GVGTPP    +V+DTGSDVVW+QC PC  CY Q  P++DP
Sbjct: 84  LHSPVISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDP 143

Query: 183 AKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRV 240
             S ++A  PC  P CR   +  C+     C Y++ YGD S T G+ +T+ L F   T V
Sbjct: 144 RGSSTYAQTPCSPPQCRNPQT--CDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDTSV 201

Query: 241 ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-M 299
             V LGCGHDNEGLF +AAGLLG+ RG  SF TQ    + R F+YCL DR+ S   SS +
Sbjct: 202 GNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYL 261

Query: 300 VFGDSAVS-RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGV 357
           VFG +A    ++ FTPL +NP+  + YYV++VG SVGG  V G + +   LDPA G GGV
Sbjct: 262 VFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGV 321

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSL---KRAPDFSLFDTCFDLSGKTEVKVPTVV 414
           ++DSGTS+TR  R AY ALRDAF A A+ +   K     S+FD C+DL G      P VV
Sbjct: 322 VVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVV 381

Query: 415 LHFR-GADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRI 472
           LHF  GADV+LP  NYL+P +S    CFA  A    GLS+IGN+ QQ FRVV+D+   R+
Sbjct: 382 LHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERV 441

Query: 473 GFAPRGC 479
           GF P GC
Sbjct: 442 GFEPNGC 448


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 185/359 (51%), Positives = 231/359 (64%), Gaps = 8/359 (2%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           + SGL+ GSGEYF R+G+G P R  Y+ LDTGSDV WIQCAPC  CYSQ DP++DP+ S 
Sbjct: 1   ISSGLSLGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSS 60

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARV 243
           S+  V C S LC+ LD S C     C Y+V YGD S + GD   E+        T +  +
Sbjct: 61  SYRRVYCGSALCQALDYSACQGMG-CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNI 119

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST--SAKPSSMVF 301
           A GCGH N GLF   AGLLG+G G LSF +Q        FSYCLVDR +   ++ S ++F
Sbjct: 120 AFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIF 179

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           G +A+   ARFTPLL NP+++TFYY  L GISVGG  +  I  + F L   G GG I+DS
Sbjct: 180 GRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLP-IPPAQFALTGNGTGGAILDS 238

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGA 420
           GTSVTR+  PAY  LRDA+RA + +L  AP   L DTCF+  G   V++P++VLHF  G 
Sbjct: 239 GTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGV 298

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           D+ LP  N LIPVD SGTFC AFA +   +S+IGN+QQQ FR+ +DL  S I  APR C
Sbjct: 299 DMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  335 bits (859), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 192/397 (48%), Positives = 247/397 (62%), Gaps = 13/397 (3%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           ++RD  R++ +    +S+     R RS  +     ++ V SGL+ GSGEYF R+G+G+P 
Sbjct: 1   MERDEARLRWIHHRIQSSDHRHRRGRSLLQ-----TAQVSSGLSLGSGEYFARMGIGSPQ 55

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           R  Y+ LDTGSDV WIQCAPC  CYSQ DP++DP+ S S+  V C S LC+ LD S C  
Sbjct: 56  RSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQG 115

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDNEGLFVAAAGLLGLG 265
              C Y+V YGD S + GD   E+        T +  +A GCGH N GLF   AGLLG+G
Sbjct: 116 MG-CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMG 174

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRST--SAKPSSMVFGDSAVSRTARFTPLLANPKLDT 323
            G LSF +Q        FSYCLVDR +   ++ S ++FG +A+   ARFTPLL NP++DT
Sbjct: 175 GGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDT 234

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           FYY  L GISVGG  +  I  + F L   G GG I+DSGTSVTR+   AY  LRDA+RA 
Sbjct: 235 FYYAILTGISVGGTALP-IPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAA 293

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFA 442
           + +L  AP   L DTCF+  G   V++P++VLHF    D+ LP  N LIPVD SGTFC A
Sbjct: 294 SRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLA 353

Query: 443 FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           FA +   +S+IGN+QQQ FR+ +DL  S I  APR C
Sbjct: 354 FAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  330 bits (847), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 194/422 (45%), Positives = 250/422 (59%), Gaps = 29/422 (6%)

Query: 66  SLRLHHVDSLSFNRTPE--HLFNLRIQRDVLRVKSLTA--FAESAVRVPPRNRSRGRANG 121
           SL L H D++S    P   H     + RD  RV+ L     A ++  +P           
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPED--------- 114

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
              S V+ G+  GSGEYF R+GVG+PP   Y+V+D+GSDV+W+QC PC++CY+QTDP+FD
Sbjct: 115 -LVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFD 173

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNT---CLYQVSYGDGSITVGDFSTETLTFRGT 238
           PA S SF+ V C S +CR L  +GC        C Y V+YGDGS T G+ + ETLT  GT
Sbjct: 174 PAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT 233

Query: 239 RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
            V  VA+GCGH N GLFV AAGLLGLG G +S   Q G      FSYCL  R      S 
Sbjct: 234 AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSL 293

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           ++    AV R  R +         +FYYV L GI VGG  +  +  SLF+L   G GGV+
Sbjct: 294 VLGRTEAVPRGRRAS---------SFYYVGLTGIGVGGERLP-LQDSLFQLTEDGAGGVV 343

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF- 417
           +D+GT+VTRL R AY ALR AF     +L R+P  SL DTC+DLSG   V+VPTV  +F 
Sbjct: 344 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 403

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +GA ++LPA N L+ V  +  FC AFA + SG+SI+GNIQQ+G ++  D A   +GF P 
Sbjct: 404 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 462

Query: 478 GC 479
            C
Sbjct: 463 TC 464


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  328 bits (841), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 183/378 (48%), Positives = 237/378 (62%), Gaps = 17/378 (4%)

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP 178
           A G   S V SG+   SGEYF  +GVGTP     +V+DTGSD+VW+QC+PC++CY+Q   
Sbjct: 67  ATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQ 126

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNT----CLYQVSYGDGSITVGDFSTETLT 234
           VFDP +S ++  VPC SP CR L   GC+        C Y V+YGDGS + GD +T+ L 
Sbjct: 127 VFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLA 186

Query: 235 F-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-ST 292
           F   T V  V LGCG DNEGLF +AAGLLG+GRG++S  TQ    +   F YCL DR S 
Sbjct: 187 FANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSR 246

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           S + S +VFG +    +  FT LL+NP+  + YYV++ G SVGG  V G + +   LD A
Sbjct: 247 STRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTA 306

Query: 353 -GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP---DFSLFDTCFDLSGKTEV 408
            G GGV++DSGT+++R  R AY ALRDAF A A +        + S+FD C+DL G+   
Sbjct: 307 TGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAA 366

Query: 409 KVPTVVLHFR-GADVSLPATNYLIPVD------SSGTFCFAFAGTMSGLSIIGNIQQQGF 461
             P +VLHF  GAD++LP  NY +PVD      +S   C  F     GLS+IGN+QQQGF
Sbjct: 367 SAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGF 426

Query: 462 RVVYDLAASRIGFAPRGC 479
           RVV+D+   RIGFAP+GC
Sbjct: 427 RVVFDVEKERIGFAPKGC 444


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  325 bits (833), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 181/378 (47%), Positives = 236/378 (62%), Gaps = 17/378 (4%)

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP 178
           A G   S V SG+   SGEYF  +GVGTP     +V+DTGSD+VW+QC+PC++CY+Q   
Sbjct: 67  ATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQ 126

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNT----CLYQVSYGDGSITVGDFSTETLT 234
           VFDP +S ++  VPC SP CR L   GC+        C Y V+YGDGS + G+ +T+ L 
Sbjct: 127 VFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLA 186

Query: 235 F-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-ST 292
           F   T V  V LGCG DNEGLF +AAGLLG+ RG++S  TQ    +   F YCL DR S 
Sbjct: 187 FANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSR 246

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           S + S +VFG +    +  FT LL+NP+  + YYV++ G SVGG  V G + +   LD A
Sbjct: 247 STRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTA 306

Query: 353 -GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP---DFSLFDTCFDLSGKTEV 408
            G GGV++DSGT+++R  R AY ALRDAF A A +        + S+FD C+DL G+   
Sbjct: 307 TGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAA 366

Query: 409 KVPTVVLHFR-GADVSLPATNYLIPVD------SSGTFCFAFAGTMSGLSIIGNIQQQGF 461
             P +VLHF  GAD++LP  NY +PVD      +S   C  F     GLS+IGN+QQQGF
Sbjct: 367 SAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGF 426

Query: 462 RVVYDLAASRIGFAPRGC 479
           RVV+D+   RIGFAP+GC
Sbjct: 427 RVVFDVEKERIGFAPKGC 444


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  324 bits (831), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 167/265 (63%), Positives = 196/265 (73%), Gaps = 10/265 (3%)

Query: 19  AAASLQYQTFVLNSL----PTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDS 74
           A   L+YQ+ V+  L     T S LSW E+    E++ S  LP  + + ++++ L H D 
Sbjct: 56  ADKPLEYQSLVVRPLGENPTTKSQLSWTET----ETQIST-LPVSETDPTMTMHLEHRDV 110

Query: 75  LSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQG 134
           L+FN TPE LFNLR+QRD  RV++L+  A +A              GGFSSSV SGLAQG
Sbjct: 111 LAFNATPEALFNLRLQRDAFRVEALSKMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLAQG 170

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           SGEYFTRLGVGTPP+YVYMVLDTGSDVVWIQCAPC+KCYSQTDPVFDP KS SF+++ CR
Sbjct: 171 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCR 230

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           SPLC +LDS GCN R +CLYQV+YGDGS T G+FSTETLTFRGTRV +VALGCGHDNEGL
Sbjct: 231 SPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGL 290

Query: 255 FVAAAGLLGLGRG-RLSFPTQTGRR 278
           FV AAGLLGLGR  RL+ P   G R
Sbjct: 291 FVGAAGLLGLGRQPRLNRPPVGGAR 315



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/40 (90%), Positives = 36/40 (90%)

Query: 334 VGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           VGGA V GITASLFKLD AGNGGVIIDSGTSVTRLTR AY
Sbjct: 311 VGGARVAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAY 350


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  324 bits (830), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 228/489 (46%), Positives = 287/489 (58%), Gaps = 51/489 (10%)

Query: 24  QYQTFVLNSLPTPSTLSWP-----------ESVSVSESESSLPLPAPDAESSLSLRLHHV 72
           QY ++ +  L +P T S P           E V+VS S             +L +RL H 
Sbjct: 23  QYHSYAVTPL-SPHTYSVPAADDDGARARQEDVAVSPS-------------ALHVRLLHR 68

Query: 73  DSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLA 132
           DS + N TP  L   R+QRD LR   +   A  A            + G F + V+S   
Sbjct: 69  DSFAVNATPAQLLARRLQRDELRAAWIIKAAAPAAAANDTPVVGLSSGGAFVAPVVSRAP 128

Query: 133 QGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVP 192
             SGEY  ++ VGTP     + +DTGSD+ W+QC PC++CY Q+ PVFDP  S S+  + 
Sbjct: 129 TTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREMG 188

Query: 193 CRSPLCRKLDSSGCN--RRNTCLYQVSYG-DGSITVGDFSTETLTFR-GTRVARVALGCG 248
             +P C+ L  SG    +R TC+Y V YG DGS TVGDF  ETLTF  G +V  +++GCG
Sbjct: 189 YDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGGVQVPHMSIGCG 248

Query: 249 HDNEGLFVA-AAGLLGLGRGRLSFPTQTGR-RFN-RKFSYCLVD-------RSTSAKPSS 298
           HDN+GLF A AAG+LGLGRG++S P+Q     +N   FSYCL D       RS S   S+
Sbjct: 249 HDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSVS---ST 305

Query: 299 MVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNG 355
           +  GD A   S    FTP + N  + TFYYV LVG+SVGG  V G+T    KLDP  G G
Sbjct: 306 LTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLDPYTGRG 365

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP---DFSLFDTCFDLSGKTEVKVPT 412
           GVI+DSGT+VTRL R AYIA RDAFRA A  L +         FDTC+ + G+  +KVPT
Sbjct: 366 GVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGRA-MKVPT 424

Query: 413 VVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAAS 470
           V +HF G  +++LP  NYLIPVDS GT CFAFAGT    +SIIGNIQQQGFRVVY++   
Sbjct: 425 VSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVVYNIGGG 484

Query: 471 RIGFAPRGC 479
           R+GFAP  C
Sbjct: 485 RVGFAPNSC 493


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  323 bits (828), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 190/422 (45%), Positives = 243/422 (57%), Gaps = 42/422 (9%)

Query: 66  SLRLHHVDSLSFNRTPE--HLFNLRIQRDVLRVKSLTA--FAESAVRVPPRNRSRGRANG 121
           SL L H D++S    P   H     + RD  RV+ L     A ++  +P           
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPED--------- 114

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
              S V+ G+  GSGEYF R+GVG+PP   Y+V+D+GSDV+W+QC PC++CY+QTDP+FD
Sbjct: 115 -LVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFD 173

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNT---CLYQVSYGDGSITVGDFSTETLTFRGT 238
           PA S SF+ V C S +CR L  +GC        C Y V+YGDGS T G+ + ETLT  GT
Sbjct: 174 PAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT 233

Query: 239 RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
            V  VA+GCGH N GLFV AAGLLGLG G +S   Q G      FSYCL  R        
Sbjct: 234 AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGG---- 289

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
                             A     +FYYV L GI VGG  +  +  SLF+L   G GGV+
Sbjct: 290 ------------------AGSLASSFYYVGLTGIGVGGERLP-LQDSLFQLTEDGAGGVV 330

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF- 417
           +D+GT+VTRL R AY ALR AF     +L R+P  SL DTC+DLSG   V+VPTV  +F 
Sbjct: 331 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 390

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +GA ++LPA N L+ V  +  FC AFA + SG+SI+GNIQQ+G ++  D A   +GF P 
Sbjct: 391 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 449

Query: 478 GC 479
            C
Sbjct: 450 TC 451


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  318 bits (816), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 194/435 (44%), Positives = 251/435 (57%), Gaps = 25/435 (5%)

Query: 60  DAESSLSL-RLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGR 118
           D+  SL+L R   V   ++      + +L + RD  R + L      A R+ P  +  G 
Sbjct: 101 DSRPSLALVRRDEVTGSTYPSLRHAVLDL-VARDNARAEYL------ATRLSPAYQPPGF 153

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP 178
           +  G  S V+SGL +GSGEY  R+ VG+PP   Y+V+D+GSDV+W+QC PC +CY Q DP
Sbjct: 154 S--GSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADP 211

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFR 236
           +FDPA S +F+ V C S +CR L +S C       C Y+VSY DGS T G  + ETLT  
Sbjct: 212 LFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLG 271

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP 296
           GT V  V +GCGH N GLFV AAGL+GLG G +S   Q G      FSYCL  R      
Sbjct: 272 GTAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSG 331

Query: 297 SS------MVFGDS-AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
           ++      +V G S AV   A + PL+ NP+  +FYYV L GI VG   +  + A LF+L
Sbjct: 332 AADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLP-LQAGLFQL 390

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAF---RAGASSLKRAPDFSLFDTCFDLSGKT 406
              G G V++D+GT+VTRL + AY ALRDAF    AGA    +    S+ DTC+DLSG  
Sbjct: 391 TEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYA 450

Query: 407 EVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVY 465
            V+VPTV   F G A + L A N L+ VD  G +C AFA + SGLSI+GN QQ G ++  
Sbjct: 451 SVRVPTVSFCFDGDARLILAARNVLLEVD-MGIYCLAFAPSSSGLSIMGNTQQAGIQITV 509

Query: 466 DLAASRIGFAPRGCA 480
           D A   IGF P  C 
Sbjct: 510 DSANGYIGFGPANCG 524


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  310 bits (795), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 162/351 (46%), Positives = 212/351 (60%), Gaps = 10/351 (2%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G+G Y    G GTP +   +++DTGSDV WIQC PC  CYSQ DP+F+P +S S+  + C
Sbjct: 134 GTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSC 193

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
            S  C +L +    R   C+Y+++YGDGS + GDFS ETLT         A GCGH N G
Sbjct: 194 LSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDSFPSFAFGCGHTNTG 253

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFT 313
           LF  +AGLLGLGR  LSFP+QT  ++  +FSYCL D  +S    S   G  ++  TA F 
Sbjct: 254 LFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATATFV 313

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           PL++N    +FY+V L GISVGG  +    A L      G GG I+DSGT +TRL   AY
Sbjct: 314 PLVSNSNYPSFYFVGLNGISVGGERLSIPPAVL------GRGGTIVDSGTVITRLVPQAY 367

Query: 374 IALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIP 432
            AL+ +FR+   +L  A  FS+ DTC+DLS  ++V++PT+  HF+  ADV++ A   L  
Sbjct: 368 DALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADVAVSAVGILFT 427

Query: 433 VDSSGT-FCFAFAGTMSGLS--IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           + S G+  C AFA     +S  IIGN QQQ  RV +D  A RIGFAP  CA
Sbjct: 428 IQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSCA 478


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  301 bits (772), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 161/360 (44%), Positives = 216/360 (60%), Gaps = 14/360 (3%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SG   G+G Y    G GTP +   +++DTGSD+ WIQC PC  CYSQ D +F+P +S S+
Sbjct: 128 SGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSY 187

Query: 189 ATVPCRSPLCRKLDSSGCNRR----NTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
            T+PC S  C +L +S  N        C+Y+++YGDGS + GDFS ETLT         A
Sbjct: 188 KTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDSFQNFA 247

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GCGH N GLF  ++GLLGLG+  LSFP+Q+  ++  +F+YCL D  +S    S   G  
Sbjct: 248 FGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVGKG 307

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           ++  +A FTPL++N    TFY+V L GISVGG  +    A L      G G  I+DSGT 
Sbjct: 308 SIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL------GRGSTIVDSGTV 361

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           +TRL   AY AL+ +FR+    L  A  FS+ DTC+DLS  ++V++PT+  HF+  ADV+
Sbjct: 362 ITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHFQNNADVA 421

Query: 424 LPATNYLIPVDSSGT-FCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +     L+PV + G+  C AFA    M G +IIGN QQQ  RV +D  A RIGFA   CA
Sbjct: 422 VSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFASGSCA 481


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  301 bits (770), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 174/372 (46%), Positives = 238/372 (63%), Gaps = 17/372 (4%)

Query: 118 RANGGFSSS-----VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--- 169
           R NG  S++     V SG +QG+GEYF R+GVG P +  + V DTGSDV W+QC PC   
Sbjct: 159 RINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGE 218

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFS 229
             CY Q  P+FDP  S S++ + C S  C  LD + C+  N+C+Y+V YGDGS TVG+ +
Sbjct: 219 NGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDA-NSCIYEVEYGDGSFTVGELA 277

Query: 230 TETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           TET +FR +  +  + +GCGHDNEGLFV A GL+GLG G +S  +Q        FSYCLV
Sbjct: 278 TETFSFRHSNSIPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEAT---SFSYCLV 334

Query: 289 DRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
           D  + +  S++ F     S +   +PL+ N +  TF YV+++G+SVGG  +  I++S F+
Sbjct: 335 DLDSESS-STLDFNADQPSDSLT-SPLVKNDRFPTFRYVKVIGMSVGGKPLP-ISSSSFE 391

Query: 349 LDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEV 408
           +D +G+GG+I+DSGT++T +    Y  LRDAF     +L  AP  S FDTC+DLS ++ V
Sbjct: 392 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNV 451

Query: 409 KVPTVVLHFRGAD-VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDL 467
           +VPT+     G + + LPA N LI VDS+GTFC AF  +   LSIIGN+QQQG RV YDL
Sbjct: 452 EVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 511

Query: 468 AASRIGFAPRGC 479
           A S +GF+   C
Sbjct: 512 ANSLVGFSTDKC 523


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  301 bits (770), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 197/490 (40%), Positives = 284/490 (57%), Gaps = 37/490 (7%)

Query: 15  FFFTAAASLQYQTFVLNSLPTPSTLS---WPESVSVSESESSL---PLPA------PDAE 62
            F T   SLQ+ + +   L TPS+ S   +  S S +++  +L   P P       P++ 
Sbjct: 10  LFLTIFTSLQFPSILSRKL-TPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLPNSP 68

Query: 63  SSLSLR----LHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPP---RNRS 115
            SL L     LH+     +N     L   R+ RD  RV+ L    E ++        + +
Sbjct: 69  FSLPLYPRLALHNPSYKDYNT----LVRARLTRDAARVQFLNRNLERSLNGGTHFGESIN 124

Query: 116 RGRANGGFSSSVISGLAQGSG-EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC---KK 171
                   ++ V+SG ++GSG EY  ++GVG P +  Y+V DTGSDV W+QC PC     
Sbjct: 125 ESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENT 184

Query: 172 CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTE 231
           CY Q DP+FDP  S S++ + C S  C+ LD + CN  +TC+YQV YGDGS T G+ +TE
Sbjct: 185 CYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNS-DTCIYQVHYGDGSFTTGELATE 243

Query: 232 TLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
           TL+F     +  + +GCGHDNEGLF   AGL+GLG G +S  +Q        FSYCLV+ 
Sbjct: 244 TLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS---SFSYCLVNL 300

Query: 291 STSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
            + +  SS +  +S +   +  +PL+ N +  ++ YV++VGISVGG  +  I+ + F++D
Sbjct: 301 DSDS--SSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-ISPTRFEID 357

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
            +G GG+I+DSGT ++RL    Y +LR+AF    SSL  AP  S+FDTC++ SG++ V+V
Sbjct: 358 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 417

Query: 411 PTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
           PT+      G  + LPA NYLI +D++GT+C AF  T S LSIIG+ QQQG RV YDL  
Sbjct: 418 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 477

Query: 470 SRIGFAPRGC 479
           S +GF+   C
Sbjct: 478 SLVGFSTNKC 487


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  300 bits (769), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 197/436 (45%), Positives = 249/436 (57%), Gaps = 36/436 (8%)

Query: 66  SLRLHHVDSLSFNRTPE--HLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGF 123
           SL+L H D++S  + P   H       RD  RV  L            + R     +   
Sbjct: 58  SLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYL------------QRRLSPSPSPSS 105

Query: 124 SSSVISG---LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           +SSV SG   ++ GSGEY  R+G+G+PP   ++V DTGSDV+W+QC+PC  CY+Q DP+F
Sbjct: 106 TSSVESGGTIVSHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLF 165

Query: 181 DPAKSRSFATVPCRSPLCRK----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR 236
           DPA S SF+ VPC S +CR       SS       C Y+VSYGD S T G  + ETLT  
Sbjct: 166 DPANSASFSPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLD 225

Query: 237 -GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV--DRSTS 293
            GT V  VA+GCGH+N GLF  AAGLLGLG G +S   Q G      FSYCL        
Sbjct: 226 GGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEG 285

Query: 294 AKPSSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           +   S+V G + A    A + PL+ NP   +FYYV + G+ V G  ++ +   LF L   
Sbjct: 286 SGSGSLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQ-LQDGLFDLGDD 344

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGA--SSLKRAPDFSLFDTCFDLSGKTEVKV 410
           G GGV++D+GT+VTRL   AY ALR AF AGA      RAP  SLFDTC+DLSG   V+V
Sbjct: 345 GGGGVVMDTGTAVTRLPAEAYAALRGAF-AGAFEEGAPRAPGVSLFDTCYDLSGYASVRV 403

Query: 411 PTVVLHF-------RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
           PTV L+F         A ++LPA N L+PVD  GT+C AFA   SG SI+GNIQQQG  +
Sbjct: 404 PTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEI 463

Query: 464 VYDLAASRIGFAPRGC 479
             D A+  +GF P  C
Sbjct: 464 TVDSASGYVGFGPATC 479


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 200/498 (40%), Positives = 284/498 (57%), Gaps = 53/498 (10%)

Query: 15  FFFTAAASLQYQTFVLNSLPTPSTLS---WPESVSVSESESSL---PLPA------PDAE 62
            F T   SLQ+ + +   L TPS+ S   +  S S +++  +L   P P       P++ 
Sbjct: 10  LFLTIFTSLQFPSILSRKL-TPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSHLPNSP 68

Query: 63  SSLSLR----LHHVDSLSFNRTPEHLFNLRIQRDVLRVKSL-----------TAFAESAV 107
            SL L     LH+     +N     L   R+ RD  RV+ L           T F ES  
Sbjct: 69  FSLPLYPRLALHNPSYKDYNT----LVRARLTRDAARVQFLNRNLERSLNGGTHFGESI- 123

Query: 108 RVPPRNRSRGRANGGFSSSVISGLAQGSG-EYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
                  +        ++ V+SG ++GSG EY  ++GVG P +  Y+V DTGSDV W+QC
Sbjct: 124 -------NESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQC 176

Query: 167 APC---KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSI 223
            PC     CY Q DP+FDP  S S++ + C S  C+ LD + CN  +TC+YQV YGDGS 
Sbjct: 177 QPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNS-DTCIYQVHYGDGSF 235

Query: 224 TVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRK 282
           T G+ +TETL+F     +  + +GCGHDNEGLF   AGL+GLG G +S  +Q        
Sbjct: 236 TTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKAS---S 292

Query: 283 FSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGI 342
           FSYCLV+  + +  SS +  +S +   +  +PL+ N +  ++ YV++VGISVGG  +  I
Sbjct: 293 FSYCLVNLDSDS--SSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLP-I 349

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL 402
           + + F++D +G GG+I+DSGT ++RL    Y +LR+AF    SSL  AP  S+FDTC++ 
Sbjct: 350 SPTRFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNF 409

Query: 403 SGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGF 461
           SG++ V+VPT+      G  + LPA NYLI +D++GT+C AF  T S LSIIG+ QQQG 
Sbjct: 410 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGI 469

Query: 462 RVVYDLAASRIGFAPRGC 479
           RV YDL  S +GF+   C
Sbjct: 470 RVSYDLTNSIVGFSTNKC 487


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 174/372 (46%), Positives = 238/372 (63%), Gaps = 17/372 (4%)

Query: 118 RANGGFSSS-----VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--- 169
           R NG  S++     V SG +QG+GEYF R+GVG P +  + V DTGSDV W+QC PC   
Sbjct: 159 RINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGE 218

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFS 229
             CY Q  P+FDP  S S++ + C S  C  LD + C+  N+C+Y+V YGDGS TVG+ +
Sbjct: 219 NGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDA-NSCIYEVEYGDGSFTVGELA 277

Query: 230 TETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           TET +FR +  +  + +GCGHDNEGLFV AAGL+GLG G +S  +Q        FSYCLV
Sbjct: 278 TETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEAT---SFSYCLV 334

Query: 289 DRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
           D  + +  S++ F     S +   +PL+ N +  TF YV+++G+SVGG  +  I++S F+
Sbjct: 335 DLDSESS-STLDFNADQPSDSLT-SPLVKNDRFPTFRYVKVIGMSVGGKPLP-ISSSSFE 391

Query: 349 LDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEV 408
           +D +G+GG+I+DSGT++T +    Y  LRDAF     +L  AP  S FDTC+DLS ++ V
Sbjct: 392 IDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNV 451

Query: 409 KVPTVVLHFRGAD-VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDL 467
           +VPT+     G + + LPA N L  VDS+GTFC AF  +   LSIIGN+QQQG RV YDL
Sbjct: 452 EVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 511

Query: 468 AASRIGFAPRGC 479
           A S +GF+   C
Sbjct: 512 ANSLVGFSTDKC 523


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  298 bits (764), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 165/370 (44%), Positives = 227/370 (61%), Gaps = 21/370 (5%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           GF++ V    A   GEY   + +GTP R   +++DTGSD+ W+QC+PC KCYSQ D +F 
Sbjct: 1   GFTAPV----AAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFL 56

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT--- 238
           P  S SF  + C S LC  L    CN + TC+Y  SYGDGS+T GDF  +T+T  G    
Sbjct: 57  PNTSTSFTKLACGSALCNGLPFPMCN-QTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQ 115

Query: 239 --RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAK 295
             +V   A GCGHDNEG F  A G+LGLG+G LSF +Q    +N KFSYCLVD  +   +
Sbjct: 116 KQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQ 175

Query: 296 PSSMVFGDSAVS--RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
            S ++FGD+AV      ++ P+LANPK+ T+YYV+L GISVG  ++  I++++F +D  G
Sbjct: 176 TSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGD-NLLNISSTVFDIDSVG 234

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA-PDFSLFDTCFDLSGKTEVKVPT 412
             G I DSGT+VT+L   AY  +  A  A   +  R   D S  D C  LSG  + ++PT
Sbjct: 235 GAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLC--LSGFPKDQLPT 292

Query: 413 V---VLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
           V     HF G D+ LP +NY I ++SS ++CFA   +   ++IIG++QQQ F+V YD A 
Sbjct: 293 VPAMTFHFEGGDMVLPPSNYFIYLESSQSYCFAMTSS-PDVNIIGSVQQQNFQVYYDTAG 351

Query: 470 SRIGFAPRGC 479
            ++GF P+ C
Sbjct: 352 RKLGFVPKDC 361


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 159/372 (42%), Positives = 221/372 (59%), Gaps = 19/372 (5%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+   S +A G G+Y T + +GTP +   ++ DTGSD++WIQC PC+ C++Q DP+FDP 
Sbjct: 26  STDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPE 85

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRV 240
            S S+ T+ C   LC  L    C+    C Y   YGDGS T G  S+ET+T    +G ++
Sbjct: 86  GSSSYTTMSCGDTLCDSLPRKSCSPD--CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKL 143

Query: 241 A--RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPS 297
           A   +A GCGH N G F  A+GL+GLGRG LSF +Q G  F  KFSYCLV  R   +K S
Sbjct: 144 AAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTS 203

Query: 298 SMVFGDSAVSRTA------RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            M FGD + S ++       FTP++ NP +++FYYV+L  IS+ G  +R I A  F + P
Sbjct: 204 PMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALR-IPAGSFDIKP 262

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT---EV 408
            G+GG+I DSGT++T L    Y  +  A R+  S  K     +  D C+D+SG     ++
Sbjct: 263 DGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKM 322

Query: 409 KVPTVVLHFRGADVSLPATNYLIPVDSSGTF-CFAFAGTMSGLSIIGNIQQQGFRVVYDL 467
           K+P +V HF GAD  LP  NY I  + +GT  C A   +   + I GN+ QQ FRV+YD+
Sbjct: 323 KIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDI 382

Query: 468 AASRIGFAPRGC 479
            +S+IG+AP  C
Sbjct: 383 GSSKIGWAPSQC 394


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  291 bits (745), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 158/372 (42%), Positives = 220/372 (59%), Gaps = 19/372 (5%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+   S +A G G+Y T + +GTP +   ++ DTGSD++WIQC PC+ C++Q DP+FDP 
Sbjct: 26  STDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPE 85

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRV 240
            S S+ T+ C   LC  L    C+    C Y   YGDGS T G  S+ET+T    +G ++
Sbjct: 86  GSSSYTTMSCGDTLCDSLPRKSCSPN--CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKL 143

Query: 241 A--RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPS 297
           A   +A GCGH N G F  A+GL+GLGRG LSF +Q G  F  KFSYCLV  R   +K S
Sbjct: 144 AAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTS 203

Query: 298 SMVFGDSAVSRTA------RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            M FGD + S ++       FTP++ NP +++FYYV+L  IS+ G  +R I A  F + P
Sbjct: 204 PMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALR-IPAGSFDIKP 262

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT---EV 408
            G+GG+I DSGT++T L    Y  +  A R+  S  +     +  D C+D+SG     + 
Sbjct: 263 DGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKK 322

Query: 409 KVPTVVLHFRGADVSLPATNYLIPVDSSGTF-CFAFAGTMSGLSIIGNIQQQGFRVVYDL 467
           K+P +V HF GAD  LP  NY I  + +GT  C A   +   + I GN+ QQ FRV+YD+
Sbjct: 323 KIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDI 382

Query: 468 AASRIGFAPRGC 479
            +S+IG+AP  C
Sbjct: 383 GSSKIGWAPSQC 394


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  289 bits (740), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 211/353 (59%), Gaps = 5/353 (1%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           ++ GSGEY  ++ +GTPP+    ++DTGSD+ W+QCAPC +C+ Q DP+F P  S S++ 
Sbjct: 1   VSAGSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSN 60

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
             C   LC  L    C+ RNTC Y  SYGDGS T GDF+ ET+T  G+ +AR+  GCGH+
Sbjct: 61  ASCTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIGFGCGHN 120

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
            EG F  A GL+GLG+G LS P+Q    F   FSYCLVD+ST+   S + FG++A +  A
Sbjct: 121 QEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRA 180

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
            FTPLL N    ++YYV +  ISVG   V     S F++D  G GGVI+DSGT++T    
Sbjct: 181 SFTPLLQNEDNPSYYYVGVESISVGNRRVP-TPPSAFRIDANGVGGVILDSGTTITYWRL 239

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLS--GKTEVKVPTVVLHFRGADVSLPATN 428
            A+I +    R   S  +  P     + C+D+S    + + +P++ +H    D  +P +N
Sbjct: 240 AAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVDFEIPVSN 299

Query: 429 YLIPVDSSG-TFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             + VD+ G T C A + T    SIIGN+QQQ   +V D+A SR+GF    C+
Sbjct: 300 LWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 157/357 (43%), Positives = 216/357 (60%), Gaps = 17/357 (4%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY   + +GTP R   +++DTGSD+ W+QC+PC  CYSQ D +F P  S SF  + C +
Sbjct: 1   GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGT 60

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVALGCGHD 250
            LC  L    CN + TC+Y  SYGDGS++ GDF  +T+T  G      +V   A GCGHD
Sbjct: 61  ELCNGLPYPMCN-QTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHD 119

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAVSR- 308
           NEG F  A G+LGLG+G LSFP+Q    FN KFSYCLVD  +   + S ++FGD+AV   
Sbjct: 120 NEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTF 179

Query: 309 -TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
              ++  LL NPK+ T+YYV+L GISVGG  +  I+++ F +D  G  G I DSGT+VT+
Sbjct: 180 PGVKYISLLTNPKVPTYYYVKLNGISVGGKLLN-ISSTAFDIDSVGRAGTIFDSGTTVTQ 238

Query: 368 LTRPAYIALRDAFRAGASSL-KRAPDFSLFDTCFDLSGKTEVKVPTV---VLHFRGADVS 423
           L    +  +  A  A      +++ D S  D C  L G  E ++PTV     HF G D+ 
Sbjct: 239 LAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLC--LGGFAEGQLPTVPSMTFHFEGGDME 296

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LP +NY I ++SS ++CF+   +   ++IIG+IQQQ F+V YD    +IGF P+ C 
Sbjct: 297 LPPSNYFIFLESSQSYCFSMVSSPD-VTIIGSIQQQNFQVYYDTVGRKIGFVPKSCV 352


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 174/418 (41%), Positives = 225/418 (53%), Gaps = 33/418 (7%)

Query: 67  LRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS 126
           + L HVDS   N T        ++R  LR++ L+A   S                 F SS
Sbjct: 44  VSLRHVDS-GGNYTKFERLQRAMKRGKLRLQRLSAKTAS-----------------FESS 85

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V + +  G+GE+  +L +GTP      ++DTGSD++W QC PCK C+ Q  P+FDP KS 
Sbjct: 86  VEAPVHAGNGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSS 145

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALG 246
           SF+ +PC S LC  L  S C+  + C Y  SYGD S T G  +TET  F    V+++  G
Sbjct: 146 SFSKLPCSSDLCAALPISSCS--DGCEYLYSYGDYSSTQGVLATETFAFGDASVSKIGFG 203

Query: 247 CGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           CG DN+G  F   AGL+GLGRG LS  +Q G     KFSYCL     S   SS++ G  A
Sbjct: 204 CGEDNDGSGFSQGAGLVGLGRGPLSLISQLGE---PKFSYCLTSMDDSKGISSLLVGSEA 260

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
             + A  TPL+ NP   +FYY+ L GISVG   +  I  S F +   G+GG+IIDSGT++
Sbjct: 261 TMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLP-IEKSTFSIQNDGSGGLIIDSGTTI 319

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFS---LFDTCFDL-SGKTEVKVPTVVLHFRGAD 421
           T L   A+ AL+  F    S LK   D S     D CF L    + V VP +V HF GAD
Sbjct: 320 TYLEDSAFAALKKEF---ISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGAD 376

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + LPA NY+I     G  C    G+ SG+SI GN QQQ   V++DL    I FAP  C
Sbjct: 377 LKLPAENYIIADSGLGVICLTM-GSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  286 bits (731), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 162/341 (47%), Positives = 218/341 (63%), Gaps = 12/341 (3%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPC---KKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           VG P +  + VLDTGSDV W+QC PC     CY Q  P+FDP  S S+  V C S  C+ 
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 201 LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAA 259
           LD +GCN  N+C+Y+V YGDGS T+G+ +TETLTF     +  +++GCGHDNEGLFV A 
Sbjct: 63  LDEAGCNV-NSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGAD 121

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANP 319
           GL+GLG G +S  +Q        FSYCLVD   S   S++ F     S  +  +PL+ N 
Sbjct: 122 GLIGLGGGAISISSQLKAS---SFSYCLVDID-SPSFSTLDFNTDPPSD-SLISPLVKND 176

Query: 320 KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
           +  +F YV+++G+SVGG  +  I++S F++D +G GG+I+DSGT++T+L    Y  LR+A
Sbjct: 177 RFPSFRYVKVIGMSVGGKPLP-ISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREA 235

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD-VSLPATNYLIPVDSSGT 438
           F    ++L  AP+ S FDTC+DLS ++ V+VPT+     G + + LPA N LI VDS+GT
Sbjct: 236 FLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGT 295

Query: 439 FCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           FC AF      LSIIGN QQQG RV YDL  S +GF+   C
Sbjct: 296 FCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  285 bits (728), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 156/360 (43%), Positives = 216/360 (60%), Gaps = 16/360 (4%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKS 185
           V SG + GSG+Y   +G+GTP +   ++ DTGSD+ W QC PC K CY Q +P  DP KS
Sbjct: 122 VQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKS 181

Query: 186 RSFATVPCRSPLCRKLDSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARV 243
            S+  + C S  C+ LD+ G    +  TCLYQV YGDGS ++G F+TETLT   + V + 
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKN 241

Query: 244 AL-GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
            L GCG  N GLF  AAGLLGLGR +LS P+QT +++ + FSYCL   ++S+    + FG
Sbjct: 242 FLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCL--PASSSSKGYLSFG 299

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
              VS+T +FTPL  + K   FY +++  +SVGG  +  I AS+F        G +IDSG
Sbjct: 300 -GQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLS-IDASIFS-----TSGTVIDSG 352

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-D 421
           T +TRL   AY AL  AF+   +       +S+FDTC+D S    +K+P V + F+G  +
Sbjct: 353 TVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVE 412

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + +  +  L PV+     C AFAG    +  +I GN QQ+ ++VVYD A  R+GFAP GC
Sbjct: 413 MDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 166/424 (39%), Positives = 237/424 (55%), Gaps = 31/424 (7%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR---------GRANGGFSSSVI 128
           N+ P H F +R++  V  VK+LT F E   R   R ++R           AN      V 
Sbjct: 299 NKLPSHGFRVRLKH-VDHVKNLTRF-ERLRRGVARGKNRLHRLNAMVLAAANATVGDQVK 356

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           + +  G+GE+  +L +G+PPR    ++DTGSD++W QC PC++C+ Q+ P+FDP +S SF
Sbjct: 357 APVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSF 416

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL--- 245
             + C S LC  L +S C+  + C Y  +YGD S T G  + ET TF  +   ++++   
Sbjct: 417 YKISCSSELCGALPTSTCS-SDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGL 475

Query: 246 --GCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
             GCG+DN G  F   AGL+GLGRG LS  +Q      +KF+YCL     S KPSS++ G
Sbjct: 476 GFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDS-KPSSLLLG 531

Query: 303 DSA------VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
             A           + TPL+ NP   +FYY+ L GISVGG  +  I  S F+L   G+GG
Sbjct: 532 SLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQL-SIPKSTFELHDDGSGG 590

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVL 415
           VIIDSGT++T +   A+ +L++ F A  +           D CF+L +G  +V+VP +  
Sbjct: 591 VIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTF 650

Query: 416 HFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           HF+GAD+ LP  NY+I    +G  C A  G+  G+SI GN+QQQ F VV+DL    + F 
Sbjct: 651 HFKGADLELPGENYMIGDSKAGLLCLAI-GSSRGMSIFGNLQQQNFMVVHDLQEETLSFL 709

Query: 476 PRGC 479
           P  C
Sbjct: 710 PTQC 713


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 166/424 (39%), Positives = 237/424 (55%), Gaps = 31/424 (7%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR---------GRANGGFSSSVI 128
           N+ P H F +R+ + V  VK+LT F E   R   R ++R           AN      V 
Sbjct: 44  NKLPSHGFRVRL-KHVDHVKNLTRF-ERLRRGVARGKNRLHRLNAMVLAAANATVGDQVK 101

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           + +  G+GE+  +L +G+PPR    ++DTGSD++W QC PC++C+ Q+ P+FDP +S SF
Sbjct: 102 APVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSF 161

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL--- 245
             + C S LC  L +S C+  + C Y  +YGD S T G  + ET TF  +   ++++   
Sbjct: 162 YKISCSSELCGALPTSTCS-SDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGL 220

Query: 246 --GCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
             GCG+DN G  F   AGL+GLGRG LS  +Q      +KF+YCL     S KPSS++ G
Sbjct: 221 GFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDS-KPSSLLLG 276

Query: 303 DSA------VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
             A           + TPL+ NP   +FYY+ L GISVGG  +  I  S F+L   G+GG
Sbjct: 277 SLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLS-IPKSTFELHDDGSGG 335

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVL 415
           VIIDSGT++T +   A+ +L++ F A  +           D CF+L +G  +V+VP +  
Sbjct: 336 VIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTF 395

Query: 416 HFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           HF+GAD+ LP  NY+I    +G  C A  G+  G+SI GN+QQQ F VV+DL    + F 
Sbjct: 396 HFKGADLELPGENYMIGDSKAGLLCLAI-GSSRGMSIFGNLQQQNFMVVHDLQEETLSFL 454

Query: 476 PRGC 479
           P  C
Sbjct: 455 PTQC 458


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  281 bits (719), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 162/362 (44%), Positives = 207/362 (57%), Gaps = 19/362 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRS 187
           SG   GS  YF  +G+GTP R + +V DTGSD+ W QC PC   CY Q D +FDP+KS S
Sbjct: 127 SGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSS 186

Query: 188 FATVPCRSPLCRKLDSSGCNRR-----NTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
           +  + C S LC +L S+G   R       C+Y + YGD S +VG  S E LT   T +  
Sbjct: 187 YINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATDIVD 246

Query: 243 VAL-GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
             L GCG DNEGLF  +AGL+GLGR  +SF  QT   +N+ FSYCL   STS+    + F
Sbjct: 247 DFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL--PSTSSSLGHLTF 304

Query: 302 GDSAVSR-TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           G SA +    ++TPL      +TFY +++VGISVGG  +  +++S F       GG IID
Sbjct: 305 GASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSIID 359

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           SGT +TRL   AY ALR AFR G      A +  LFDTC+D SG  E+ VP +   F G 
Sbjct: 360 SGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFAGG 419

Query: 421 -DVSLPATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             V LP    LI   S+   C AFA  G  + ++I GN+QQ+   VVYD+   RIGF   
Sbjct: 420 VTVELPLVGILIG-RSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAA 478

Query: 478 GC 479
           GC
Sbjct: 479 GC 480


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 151/397 (38%), Positives = 226/397 (56%), Gaps = 19/397 (4%)

Query: 93  VLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVY 152
           +++ +S   F  S +     +  R R +        SG   GSG Y   +G+GTP +Y+ 
Sbjct: 86  LVKDQSRVDFIHSKIAGELESVDRLRGSKATKIPAKSGATIGSGNYIVSVGLGTPKKYLS 145

Query: 153 MVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-----GC 206
           ++ DTGSD+ W QC PC + CY+Q DPVF P++S +++ + C SP C +L+S      GC
Sbjct: 146 LIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLESGTGNQPGC 205

Query: 207 NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-GCGHDNEGLFVAAAGLLGLG 265
           +    C+Y + YGD S +VG F+ ETLT   T V    L GCG +N GLF +AAGL+GLG
Sbjct: 206 SAARACIYGIQYGDQSFSVGYFAKETLTLTSTDVIENFLFGCGQNNRGLFGSAAGLIGLG 265

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
           + ++S   QT +++ + FSYCL    TS+    + FG        ++TP+     +  FY
Sbjct: 266 QDKISIVKQTAQKYGQVFSYCL--PKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFY 323

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
            V++VG+ VGG  +  I++S+F        G IIDSGT +TRL   AY AL+ AF  G +
Sbjct: 324 GVDIVGMKVGGTQIP-ISSSVFS-----TSGAIIDSGTVITRLPPDAYSALKSAFEKGMA 377

Query: 386 SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA 444
              +AP+ S+ DTC+DLS  + +++P V   F+G  ++ L     +    +S   C AFA
Sbjct: 378 KYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTS-QVCLAFA 436

Query: 445 GTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           G    S ++IIGN+QQ+  +VVYD+   +IGF   GC
Sbjct: 437 GNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 164/415 (39%), Positives = 222/415 (53%), Gaps = 27/415 (6%)

Query: 67  LRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS 126
           + L HVDS   N T        ++R  LR++ L+A   S                 F  S
Sbjct: 44  VSLRHVDS-GGNYTKFERLQRAVKRGRLRLQRLSAKTAS-----------------FEPS 85

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V + +  G+GE+   L +GTP      ++DTGSD++W QC PCK C+ Q  P+FDP KS 
Sbjct: 86  VEAPVHAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSS 145

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALG 246
           SF+ +PC S LC  L  S C+  + C Y+ SYGD S T G  +TET TF    V+++  G
Sbjct: 146 SFSKLPCSSDLCVALPISSCS--DGCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFG 203

Query: 247 CGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           CG DN G  +   AGL+GLGRG LS  +Q G     KFSYCL     S   S+++ G  A
Sbjct: 204 CGEDNRGRAYSQGAGLVGLGRGPLSLISQLGV---PKFSYCLTSIDDSKGISTLLVGSEA 260

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
             ++A  TPL+ NP   +FYY+ L GISVG   +  I  S F +   G+GG+IIDSGT++
Sbjct: 261 TVKSAIPTPLIQNPSRPSFYYLSLEGISVGDT-LLPIEKSTFSIQDDGSGGLIIDSGTTI 319

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSL 424
           T L   A+ AL+  F +       A   +  + CF L    + V+VP +V HF G D+ L
Sbjct: 320 TYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLKL 379

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           P  NY+I   +    C    G+ SG+SI GN QQQ   V++DL    I FAP  C
Sbjct: 380 PKENYIIEDSALRVICLTM-GSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 172/426 (40%), Positives = 229/426 (53%), Gaps = 32/426 (7%)

Query: 84  LFNLRIQRDVLRVKSLTAFAESAVRV------PPRNRSRGRANGGFSSSVISGLAQGSGE 137
           +   + Q D+ R+K      E  ++        P +   G + G   +++ SG+  GSGE
Sbjct: 31  IIEKKNQNDISRLKKDKERPEKQIKTVVATAASPESYGTGLS-GQLMATLESGVTLGSGE 89

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           YF  + +GTPP++  ++LDTGSD+ WIQC PC  C+ Q  P +DP +S SF  + C  P 
Sbjct: 90  YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPR 149

Query: 198 CRKLDSSG----CNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT---------RVARV 243
           C  + S      C   N TC Y   YGD S T GDF+TET T   T         RV  V
Sbjct: 150 CHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENV 209

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFG 302
             GCGH N GLF  A+GLLGLGRG LSF +Q    +   FSYCLVDR++    SS ++FG
Sbjct: 210 MFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 269

Query: 303 ---DSAVSRTARFTPLLA---NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
              D        FT L+    NP +DTFYYV++  I VGG  V  I  S + +   G GG
Sbjct: 270 EDKDLLNHPELNFTTLVGGKENP-VDTFYYVQIKSIMVGG-EVLNIPESTWNMTSDGVGG 327

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV-VL 415
            I+DSGT+++  T PAY  ++DAF           DF + D C+++SG  ++ +P   +L
Sbjct: 328 TIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKIDLPDFGIL 387

Query: 416 HFRGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQQGFRVVYDLAASRIGF 474
              GA  + P  NY I +D     C A  GT  S LSIIGN QQQ F V+YD   SR+G+
Sbjct: 388 FADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYDTKKSRLGY 447

Query: 475 APRGCA 480
           AP  CA
Sbjct: 448 APMNCA 453


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 170/454 (37%), Positives = 236/454 (51%), Gaps = 34/454 (7%)

Query: 45  VSVSESESSL---PLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTA 101
           ++VS S  SL   PLP     S   L L HVDS   N T        I R   R+  L A
Sbjct: 23  IAVSSSRRSLIDRPLPKNLPRSGFRLSLRHVDS-GKNLTKIQKIQRGINRGFHRLNRLGA 81

Query: 102 FAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDV 161
            A  AV   P +          ++++ +    GSGE+   L +G P      ++DTGSD+
Sbjct: 82  VAVLAVASNPDD----------TNNIKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDL 131

Query: 162 VWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR-RNTCLYQVSYGD 220
           +W QC PC +C+ Q  P+FDP KS S++ V C S LC  L  S CN  +++C Y  +YGD
Sbjct: 132 IWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGD 191

Query: 221 GSITVGDFSTETLTFRGTR-VARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRR 278
            S T G  +TET TF     ++ +  GCG +NEG  F   +GL+GLGRG LS  +Q    
Sbjct: 192 YSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKE- 250

Query: 279 FNRKFSYCLVDRSTSAKPSSMVFGDSA---VSRTA--------RFTPLLANPKLDTFYYV 327
              KFSYCL     S   SS+  G  A   V++T         +   LL NP   +FYY+
Sbjct: 251 --TKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYL 308

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
           EL GI+VG   +  +  S F+L   G GG+IIDSGT++T L   A+  L++ F +  S  
Sbjct: 309 ELQGITVGAKRLS-VEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLP 367

Query: 388 KRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGT 446
                 +  D CF L +    + VP ++ HF+GAD+ LP  NY++   S+G  C A  G+
Sbjct: 368 VDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAM-GS 426

Query: 447 MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            +G+SI GN+QQQ F V++DL    + F P  C 
Sbjct: 427 SNGMSIFGNVQQQNFNVLHDLEKETVTFVPTECG 460


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 164/415 (39%), Positives = 221/415 (53%), Gaps = 27/415 (6%)

Query: 67  LRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS 126
           + L HVDS   N T        ++R  LR++ L+A   S                 F  S
Sbjct: 44  VSLRHVDS-GGNYTKFERLQRAVKRGRLRLQRLSAKTAS-----------------FEPS 85

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V + +  G+GE+   L +GTP      ++DTGSD++W QC PCK C+ Q  P+FDP KS 
Sbjct: 86  VEAPVHAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSS 145

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALG 246
           SF+ +PC S LC  L  S C+  + C Y+ SYGD S T G  +TET TF    V+++  G
Sbjct: 146 SFSKLPCSSDLCVALPISSCS--DGCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFG 203

Query: 247 CGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           CG DN G  +   AGL+GLGRG LS  +Q G     KFSYCL     S   S+++ G  A
Sbjct: 204 CGEDNRGRAYSQGAGLVGLGRGPLSLISQLGV---PKFSYCLTSIDDSKGISTLLVGSEA 260

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
             ++A  TPL+ NP   +FYY+ L GISVG   +  I  S F +   G+GG+IIDSGT++
Sbjct: 261 TVKSAIPTPLIQNPSRPSFYYLSLEGISVGDT-LLPIEKSTFSIQDDGSGGLIIDSGTTI 319

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSL 424
           T L   A+ AL+  F +       A   +  + CF L    + V VP +V HF G D+ L
Sbjct: 320 TYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKL 379

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           P  NY+I   +    C    G+ SG+SI GN QQQ   V++DL    I FAP  C
Sbjct: 380 PKENYIIEDSALRVICLTM-GSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 161/391 (41%), Positives = 221/391 (56%), Gaps = 14/391 (3%)

Query: 97  KSLTAFA--ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
           K+LT F   E AV    R   R  A     S V + +  G GEY   L +GTP +    +
Sbjct: 52  KNLTKFELLERAVERGSRRLQRLEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAI 111

Query: 155 LDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLY 214
           +DTGSD++W QC PC +C++Q+ P+F+P  S SF+T+PC S LC+ L S  C+  N+C Y
Sbjct: 112 MDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCS-NNSCQY 170

Query: 215 QVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPT 273
              YGDGS T G   TETLTF    +  +  GCG +N+G      AGL+G+GRG LS P+
Sbjct: 171 TYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPS 230

Query: 274 QTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR--FTPLLANPKLDTFYYVELVG 331
           Q       KFSYC+     S+  S+++ G  A S TA    T L+ + ++ TFYY+ L G
Sbjct: 231 QLDV---TKFSYCMTPIG-SSNSSTLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNG 286

Query: 332 ISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA 390
           +SVG   +  I  S+FKL+   G GG+IIDSGT++T     AY A+R AF +  +     
Sbjct: 287 LSVGSTPLP-IDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVN 345

Query: 391 PDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG 449
              S FD CF + S ++ +++PT V+HF G D+ LP+ NY I   S+G  C A   +  G
Sbjct: 346 GSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFIS-PSNGLICLAMGSSSQG 404

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +SI GNIQQQ   VVYD   S + F    C 
Sbjct: 405 MSIFGNIQQQNLLVVYDTGNSVVSFLSAQCG 435


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  271 bits (694), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 161/391 (41%), Positives = 221/391 (56%), Gaps = 14/391 (3%)

Query: 97  KSLTAFA--ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
           K+LT F   E AV    R   R  A     S V + +  G GEY   L +GTP +    +
Sbjct: 52  KNLTKFELLERAVERGSRRLQRLEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAI 111

Query: 155 LDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLY 214
           +DTGSD++W QC PC +C++Q+ P+F+P  S SF+T+PC S LC+ L S  C+  N+C Y
Sbjct: 112 MDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCS-NNSCQY 170

Query: 215 QVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPT 273
              YGDGS T G   TETLTF    +  +  GCG +N+G      AGL+G+GRG LS P+
Sbjct: 171 TYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPS 230

Query: 274 QTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR--FTPLLANPKLDTFYYVELVG 331
           Q       KFSYC+    +S   S+++ G  A S TA    T L+ + ++ TFYY+ L G
Sbjct: 231 QLDV---TKFSYCMTPIGSSTS-STLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNG 286

Query: 332 ISVGGAHVRGITASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA 390
           +SVG   +  I  S+FKL+   G GG+IIDSGT++T     AY A+R AF +  +     
Sbjct: 287 LSVGSTPLP-IDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVN 345

Query: 391 PDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG 449
              S FD CF + S ++ +++PT V+HF G D+ LP+ NY I   S+G  C A   +  G
Sbjct: 346 GSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFIS-PSNGLICLAMGSSSQG 404

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +SI GNIQQQ   VVYD   S + F    C 
Sbjct: 405 MSIFGNIQQQNLLVVYDTGNSVVSFLFAQCG 435


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  271 bits (694), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 155/358 (43%), Positives = 207/358 (57%), Gaps = 19/358 (5%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G+GE+   + +GTP      ++DTGSD+VW QC PC +C++Q+ PVFDP+ S ++A +PC
Sbjct: 98  GNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPC 157

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
            S LC  L SS C     C Y  +YGD S T G  + ET T   T++  VA GCG  NEG
Sbjct: 158 SSTLCSDLPSSKCTSAK-CGYTYTYGDSSSTQGVLAAETFTLAKTKLPDVAFGCGDTNEG 216

Query: 254 L-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV------ 306
             F   AGL+GLGRG LS  +Q G     KFSYCL     ++K S ++ G  A       
Sbjct: 217 DGFTQGAGLVGLGRGPLSLVSQLGL---NKFSYCLTSLDDTSK-SPLLLGSLATISESAA 272

Query: 307 -SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            + + + TPL+ NP   +FYYV L G++VG  H+  + +S F +   G GGVI+DSGTS+
Sbjct: 273 AASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHIT-LPSSAFAVQDDGTGGVIVDSGTSI 331

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFD--LSGKTEVKVPTVVLHFRGADV 422
           T L    Y AL+ AF A    L  A    +  DTCF+   SG  +V+VP +V H  GAD+
Sbjct: 332 TYLELQGYRALKKAF-AAQMKLPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHLDGADL 390

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            LPA NY++    SG  C    G+  GLSIIGN QQQ  + VYD+  + + FAP  CA
Sbjct: 391 DLPAENYMVLDSGSGALCLTVMGS-RGLSIIGNFQQQNIQFVYDVGENTLSFAPVQCA 447


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  271 bits (693), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 142/361 (39%), Positives = 201/361 (55%), Gaps = 11/361 (3%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
            ++S+  G+  G+  +  ++GVG PP+  YM+ D  +D  W+QC PC KCY Q D +FDP
Sbjct: 172 LNASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDP 231

Query: 183 AKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVA 241
           ++S S+  + C +  C  L +S C+    C Y ++Y DG+ T G    ET++F  +  V 
Sbjct: 232 SQSSSYTLLSCETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWVD 291

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
           RV+LGC + N+G FV + G  GLGRG LSFP++         SYCLV+       S++ F
Sbjct: 292 RVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINA---SSMSYCLVESKDGYSSSTLEF 348

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
                S + +   LL NPK +  YYV L GI VGG  +  +  S F +DP GNGG+I+ S
Sbjct: 349 NSPPCSGSVK-AKLLQNPKAENLYYVGLKGIKVGGEKID-VPNSTFTIDPYGNGGMIVSS 406

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR--- 418
            + +T L    Y  +RDAF A    L+R   F  FDTC++LS    V++P  +L F    
Sbjct: 407 SSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELP--ILEFEVND 464

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           G    LP  +YL  VD +GTFCFAFA +    SI+G +QQ G RV +DL  S +      
Sbjct: 465 GKSWLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFVYLHTLC 524

Query: 479 C 479
           C
Sbjct: 525 C 525


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  271 bits (692), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 164/383 (42%), Positives = 215/383 (56%), Gaps = 25/383 (6%)

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           G   +++ SG++ GSGEYF  + +GTPPR+  ++LDTGSD+ WIQC PC  C+ Q  P +
Sbjct: 175 GQLMATLESGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYY 234

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSG----CNRRN-TCLYQVSYGDGSITVGDFSTETLTF 235
           DP +S SF  + C  P C  + S      C   N TC Y   YGD S T GDF+ ET T 
Sbjct: 235 DPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTV 294

Query: 236 RGT---------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
             T         RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSYC
Sbjct: 295 NLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYC 354

Query: 287 LVDRSTSAKPSS-MVFG---DSAVSRTARFTPLLA---NPKLDTFYYVELVGISVGGAHV 339
           LVDR++    SS ++FG   D        FT L+A   NP +DTFYYV++  I VGG  V
Sbjct: 355 LVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENP-VDTFYYVQIKSIMVGG-EV 412

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC 399
             I    + L P G GG I+DSGT+++    P+Y  ++DAF           DF + D C
Sbjct: 413 LKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPC 472

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQ 457
           +++SG  ++++P   + F  GA  + P  NY I ++     C A  GT  S LSIIGN Q
Sbjct: 473 YNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQ 532

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
           QQ F ++YD   SR+G+AP  CA
Sbjct: 533 QQNFHILYDTKKSRLGYAPMKCA 555


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  270 bits (691), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 170/454 (37%), Positives = 233/454 (51%), Gaps = 34/454 (7%)

Query: 45  VSVSESESSL---PLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTA 101
           +SVS S  SL    LP     S   L L HVDS   N T        I R   R+  L A
Sbjct: 22  ISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDS-GKNLTKIQKIQRGINRGFHRLNRLGA 80

Query: 102 FAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDV 161
            A  AV   P +          ++++ +    GSGE+   L +G P      ++DTGSD+
Sbjct: 81  VAVLAVASKPDD----------TNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDL 130

Query: 162 VWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR-RNTCLYQVSYGD 220
           +W QC PC +C+ Q  P+FDP KS S++ V C S LC  L  S CN  ++ C Y  +YGD
Sbjct: 131 IWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGD 190

Query: 221 GSITVGDFSTETLTFRGTR-VARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRR 278
            S T G  +TET TF     ++ +  GCG +NEG  F   +GL+GLGRG LS  +Q    
Sbjct: 191 YSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKE- 249

Query: 279 FNRKFSYCLVDRSTSAKPSSMVFGDSA---VSRTA--------RFTPLLANPKLDTFYYV 327
              KFSYCL     S   SS+  G  A   V++T         +   LL NP   +FYY+
Sbjct: 250 --TKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYL 307

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
           EL GI+VG   +  +  S F+L   G GG+IIDSGT++T L   A+  L++ F +  S  
Sbjct: 308 ELQGITVGAKRLS-VEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLP 366

Query: 388 KRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGT 446
                 +  D CF L      + VP ++ HF+GAD+ LP  NY++   S+G  C A  G+
Sbjct: 367 VDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAM-GS 425

Query: 447 MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            +G+SI GN+QQQ F V++DL    + F P  C 
Sbjct: 426 SNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECG 459


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 155/355 (43%), Positives = 206/355 (58%), Gaps = 14/355 (3%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G+GEY   L +G+PP+   +++DTGSD+ W+QC PC+ CY Q  P FDP+KSRSF    C
Sbjct: 35  GNGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAAC 94

Query: 194 RSPLCR--KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR---GTR-VARVALGC 247
              LC    L    C   N C YQ +YGD S T GD + ET++     GT+ V   A GC
Sbjct: 95  TDNLCNVSALPLKAC-AANVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGC 153

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPSSMVFGDSAV 306
           G  N G F  AAGL+GLG+G LS  +Q    F  KFSYCLV   S SA P  + FG  A 
Sbjct: 154 GTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLSASP--LTFGSIAA 211

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTSV 365
           +   ++T ++ N +  T+YYV+L  I VGG  +  +  S+F +D + G GG IIDSGT++
Sbjct: 212 AANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLN-LAPSVFAIDQSTGRGGTIIDSGTTI 270

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLP 425
           T LT PAY A+  A+ +  +  +        D CF+++G +   VP +V  F+GAD  + 
Sbjct: 271 TMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMR 330

Query: 426 ATNYLIPVDSSG-TFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             N  + VD+S  T C A  G+  G SIIGNIQQQ   VVYDL A +IGFA   C
Sbjct: 331 GENLFVLVDTSATTLCLAMGGSQ-GFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 175/430 (40%), Positives = 232/430 (53%), Gaps = 30/430 (6%)

Query: 56  LPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRS 115
           L  P  +    +RL HVDS   N T        ++R   R++ L A A  A         
Sbjct: 31  LEHPKMQKGFRVRLKHVDS-GKNLTKLERIRHGVKRGRNRLQRLQAMALVASS------- 82

Query: 116 RGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQ 175
                   SS + + +  G+GE+  +L +GTPP     +LDTGSD++W QC PC +C+ Q
Sbjct: 83  --------SSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQ 134

Query: 176 TDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF 235
           + P+FDP KS SF+ + C S LC  L  S CN  N C Y  SYGD S T G  ++ETLTF
Sbjct: 135 STPIFDPKKSSSFSKLSCSSQLCEALPQSSCN--NGCEYLYSYGDYSSTQGILASETLTF 192

Query: 236 RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
               V  VA GCG DNEG  F   AGL+GLGRG LS  +Q       KFSYCL     + 
Sbjct: 193 GKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKE---PKFSYCLTTVDDT- 248

Query: 295 KPSSMVFGD----SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
           K S+++ G     +A S   + TPL+ +P   +FYY+ L GISVG   +  I  S F L 
Sbjct: 249 KTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLP-IKKSTFSLQ 307

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVK 409
             G+GG+IIDSGT++T L   A+  +   F A  +    +   +  D CF L SG T ++
Sbjct: 308 DDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIE 367

Query: 410 VPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
           VP +V HF GAD+ LPA NY+I   S G  C A  G+ SG+SI GN+QQQ   V++DL  
Sbjct: 368 VPKLVFHFDGADLELPAENYMIGDSSMGVACLAM-GSSSGMSIFGNVQQQNMLVLHDLEK 426

Query: 470 SRIGFAPRGC 479
             + F P  C
Sbjct: 427 ETLSFLPTQC 436


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 172/422 (40%), Positives = 227/422 (53%), Gaps = 29/422 (6%)

Query: 67  LRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS 126
           + L HVDS   N T        I+R   R++ L A   +A   P                
Sbjct: 49  VMLRHVDS-GKNLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSE-----------DQ 96

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           + + +  G+GEY   L +GTPP     VLDTGSD++W QC PC +CY Q  P+FDP KS 
Sbjct: 97  LEAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSS 156

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR----VAR 242
           SF+ V C S LC  L SS C+  + C Y  SYGD S+T G  +TET TF  ++    V  
Sbjct: 157 SFSKVSCGSSLCSALPSSTCS--DGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214

Query: 243 VALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
           +  GCG DNEG  F  A+GL+GLGRG LS  +Q      ++FSYCL     + K S ++ 
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE---QRFSYCLTPIDDT-KESVLLL 270

Query: 302 GDSAVSRTAR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           G     + A+    TPLL NP   +FYY+ L  ISVG   +  I  S F++   GNGGVI
Sbjct: 271 GSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLS-IEKSTFEVGDDGNGGVI 329

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHF 417
           IDSGT++T + + AY AL+  F +           +  D CF L SG T+V++P +V HF
Sbjct: 330 IDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHF 389

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +G D+ LPA NY+I   + G  C A  G  SG+SI GN+QQQ   V +DL    I F P 
Sbjct: 390 KGGDLELPAENYMIGDSNLGVACLAM-GASSGMSIFGNVQQQNILVNHDLEKETISFVPT 448

Query: 478 GC 479
            C
Sbjct: 449 SC 450


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 164/419 (39%), Positives = 226/419 (53%), Gaps = 31/419 (7%)

Query: 67  LRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS 126
           + L HVDS   N T   L    I+R   R++ L    E+ +  P              S 
Sbjct: 43  IMLEHVDS-GKNLTKFQLLERAIERGSRRLQRL----EAMLNGP--------------SG 83

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V + +  G GEY   L +GTP +    ++DTGSD++W QC PC +C++Q+ P+F+P  S 
Sbjct: 84  VETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSS 143

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALG 246
           SF+T+PC S LC+ L S  C+  N C Y   YGDGS T G   TETLTF    +  +  G
Sbjct: 144 SFSTLPCSSQLCQALSSPTCS-NNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFG 202

Query: 247 CGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           CG +N+G      AGL+G+GRG LS P+Q       KFSYC+     S+ PS+++ G  A
Sbjct: 203 CGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV---TKFSYCMTPIG-SSTPSNLLLGSLA 258

Query: 306 VSRTAR--FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP-AGNGGVIIDSG 362
            S TA    T L+ + ++ TFYY+ L G+SVG   +  I  S F L+   G GG+IIDSG
Sbjct: 259 NSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLP-IDPSAFALNSNNGTGGIIIDSG 317

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGAD 421
           T++T     AY ++R  F +  +        S FD CF   S  + +++PT V+HF G D
Sbjct: 318 TTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGD 377

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           + LP+ NY I   S+G  C A   +  G+SI GNIQQQ   VVYD   S + FA   C 
Sbjct: 378 LELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 163/406 (40%), Positives = 216/406 (53%), Gaps = 28/406 (6%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS-------VISGLAQGSGEYFTR 141
           IQR V          + A  V P  +     + G S+S         SG A  +G Y   
Sbjct: 107 IQRRVSATTGRDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTGNYVVT 166

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           +G+GTP     +V DTGSD  W+QC PC  KCY Q +P+FDPAKS ++A V C    C  
Sbjct: 167 VGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACAD 226

Query: 201 LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAG 260
           LD++GC   + CLY V YGDGS TVG F+ +TLT     +     GCG  N GLF   AG
Sbjct: 227 LDTNGCTGGH-CLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAG 285

Query: 261 LLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK 320
           L+GLGRG+ S   Q   ++   F+YCL   +T      + FG  +    AR TP+L + K
Sbjct: 286 LMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGT--GYLDFGPGSAGNNARLTPMLTD-K 342

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
             TFYYV + GI VGG  V  +  S+F        G ++DSGT +TRL   AY AL  AF
Sbjct: 343 GQTFYYVGMTGIRVGGQQVP-VAESVFS-----TAGTLVDSGTVITRLPATAYTALSSAF 396

Query: 381 RAG--ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIPVDS 435
                A   K+AP +S+ DTC+D +G ++V++PTV L F+G    DV +    Y I   S
Sbjct: 397 DKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAI---S 453

Query: 436 SGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               C AFA  G    ++I+GN QQ+ + V+YDL    +GFAP  C
Sbjct: 454 EAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 178/484 (36%), Positives = 255/484 (52%), Gaps = 30/484 (6%)

Query: 10  LLLFSFFFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLP-APDAESSLSLR 68
            LL+S   ++   L +Q     +L TPSTL      S+  S    P P   D  +SL + 
Sbjct: 13  FLLYSALLSSKRGLAFQG-RKTALSTPSTLHNVHITSLMPSSVCSPSPKGDDKRASLEVI 71

Query: 69  LHH--VDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGG-FSS 125
             H     LS ++         + +D  RV S+ +      R+       G+  G   + 
Sbjct: 72  HKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRS------RLAKNPADGGKLKGSKVTL 125

Query: 126 SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAK 184
              SG   G+G Y   +G+GTP R +  + DTGSD+ W QC PC + CY Q +P+F+P+K
Sbjct: 126 PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSK 185

Query: 185 SRSFATVPCRSPLCRKLDSSGCNR----RNTCLYQVSYGDGSITVGDFSTETLTFRGTRV 240
           S S+  + C SP C +L S   N      +TC+Y + YGD S +VG F+ + L    T V
Sbjct: 186 STSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDV 245

Query: 241 -ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
                 GCG +N GLFV  AGL+GLGR  LS  +QT +++ + FSYCL   STS+    +
Sbjct: 246 FNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCL--PSTSSSTGYL 303

Query: 300 VFGD-SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
            FG     S+  +FTP L N +  +FY++ L+ ISVGG  +   +AS+F        G I
Sbjct: 304 TFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLS-TSASVFS-----TAGTI 357

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           IDSGT ++RL   AY  LR +F+   S   +A   S+ DTC+D S    V VP + L+F 
Sbjct: 358 IDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKINLYFS 417

Query: 419 -GADVSLPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
            GA++ L  +     ++ S   C AFAG    + ++I+GN+QQ+ F VVYD+A  RIGFA
Sbjct: 418 DGAEMDLDPSGIFYILNIS-QVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFA 476

Query: 476 PRGC 479
           P GC
Sbjct: 477 PGGC 480


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 187/465 (40%), Positives = 253/465 (54%), Gaps = 41/465 (8%)

Query: 45  VSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLT--AF 102
           V+  E+      PA  +  SL LR+ H  S    RT +  F  + ++D +R++++   A 
Sbjct: 56  VAADEAGDEQKQPA-SSSPSLQLRMKH-RSAEGGRTRKESFLDKAEKDAVRIETMHRRAA 113

Query: 103 AESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVV 162
                R+P  +  R   +    ++V SG+A GSGEY   + VGTPPR   M++DTGSD+ 
Sbjct: 114 RSGVARMPASSSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLN 173

Query: 163 WIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS----GCNR--RNTCLYQV 216
           W+QCAPC  C+ Q  PVFDPA S S+  V C    C  +        C R   ++C Y  
Sbjct: 174 WLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYY 233

Query: 217 SYGDGSITVGDFSTETLTFRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
            YGD S T GD + E+ T   T      RV  V  GCGH N GLF  AAGLLGLGRG LS
Sbjct: 234 WYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLS 293

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKL--------- 321
           F +Q    +   FSYCLV+  + A  S +VFG+  +        +LA+P+L         
Sbjct: 294 FASQLRAVYGHTFSYCLVEHGSDAG-SKVVFGEDYL--------VLAHPQLKYTAFAPTS 344

Query: 322 ---DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
              DTFYYV+L G+ VGG  +  I++  + +   G+GG IIDSGT+++    PAY  +R 
Sbjct: 345 SPADTFYYVKLKGVLVGG-DLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQ 403

Query: 379 AFRAGASSL-KRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSS 436
           AF    S L    PDF + + C+++SG    +VP + L F  GA    PA NY + +D  
Sbjct: 404 AFVDLMSRLYPLIPDFPVLNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPD 463

Query: 437 GTFCFAFAGT-MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G  C A  GT  +G+SIIGN QQQ F VVYDL  +R+GFAPR CA
Sbjct: 464 GIMCLAVRGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCA 508


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 164/432 (37%), Positives = 228/432 (52%), Gaps = 29/432 (6%)

Query: 62  ESSLSLRLHHVDSLSFNR----TPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRG 117
           ESSL +   H      N     +P+H+  LR+  D  RV S+ +     +     + S+ 
Sbjct: 31  ESSLHVTHRHGTCSRLNNGKATSPDHVEILRL--DQARVNSIHSKLSKKLATDHVSESKS 88

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQT 176
                       G   GSG Y   +G+GTP   + ++ DTGSD+ W QC PC + CY Q 
Sbjct: 89  T-----DLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQK 143

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGDFSTE 231
           +P+F+P+KS S+  V C S  C  L S+      C+  N C+Y + YGD S +VG  + E
Sbjct: 144 EPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKE 202

Query: 232 TLTFRGTRVAR-VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
             T   + V   V  GCG +N+GLF   AGLLGLGR +LSFP+QT   +N+ FSYCL   
Sbjct: 203 KFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL--P 260

Query: 291 STSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
           S+++    + FG + +SR+ +FTP+       +FY + +V I+VGG  +  I +++F   
Sbjct: 261 SSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLP-IPSTVFSTP 319

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
                G +IDSGT +TRL   AY ALR +F+A  S        S+ DTCFDLSG   V +
Sbjct: 320 -----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTI 374

Query: 411 PTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLA 468
           P V   F G  V    +  +  V      C AFAG    S  +I GN+QQQ   VVYD A
Sbjct: 375 PKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGA 434

Query: 469 ASRIGFAPRGCA 480
             R+GFAP GC+
Sbjct: 435 GGRVGFAPNGCS 446


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  268 bits (686), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 163/406 (40%), Positives = 215/406 (52%), Gaps = 28/406 (6%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS-------VISGLAQGSGEYFTR 141
           IQR V          + A  V P  +     + G S+S         SG A  +G Y   
Sbjct: 107 IQRRVSATTGRDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTGNYVVT 166

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           +G+GTP     +V DTGSD  W+QC PC  KCY Q  P+FDPAKS ++A V C    C  
Sbjct: 167 VGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACAD 226

Query: 201 LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAG 260
           LD++GC   + CLY V YGDGS TVG F+ +TLT     +     GCG  N GLF   AG
Sbjct: 227 LDTNGCTGGH-CLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAG 285

Query: 261 LLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK 320
           L+GLGRG+ S   Q   ++   F+YCL   +T      + FG  +    AR TP+L + K
Sbjct: 286 LMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGT--GYLDFGPGSAGNNARLTPMLTD-K 342

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
             TFYYV + GI VGG  V  +  S+F        G ++DSGT +TRL   AY AL  AF
Sbjct: 343 GQTFYYVGMTGIRVGGQQVP-VAESVFS-----TAGTLVDSGTVITRLPATAYTALSSAF 396

Query: 381 RAG--ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIPVDS 435
                A   K+AP +S+ DTC+D +G ++V++PTV L F+G    DV +    Y I   S
Sbjct: 397 DKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAI---S 453

Query: 436 SGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               C AFA  G    ++I+GN QQ+ + V+YDL    +GFAP  C
Sbjct: 454 EAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  267 bits (683), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 164/436 (37%), Positives = 229/436 (52%), Gaps = 29/436 (6%)

Query: 58  APDAESSLSLRLHHVDSLSFNR----TPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRN 113
           A   +SSL +   H      N     +P+H+  LR+  D  RV S+ +     +     +
Sbjct: 55  ASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRL--DQARVNSIHSKLSKKLATDHVS 112

Query: 114 RSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKC 172
            S+             G   GSG Y   +G+GTP   + ++ DTGSD+ W QC PC + C
Sbjct: 113 ESKST-----DLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTC 167

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGD 227
           Y Q +P+F+P+KS S+  V C S  C  L S+      C+  N C+Y + YGD S +VG 
Sbjct: 168 YDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGF 226

Query: 228 FSTETLTFRGTRV-ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
            + E  T   + V   V  GCG +N+GLF   AGLLGLGR +LSFP+QT   +N+ FSYC
Sbjct: 227 LAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYC 286

Query: 287 LVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
           L   S+++    + FG + +SR+ +FTP+       +FY + +V I+VGG  +  I +++
Sbjct: 287 L--PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLP-IPSTV 343

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT 406
           F        G +IDSGT +TRL   AY ALR +F+A  S        S+ DTCFDLSG  
Sbjct: 344 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 398

Query: 407 EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVV 464
            V +P V   F G  V    +  +  V      C AFAG    S  +I GN+QQQ   VV
Sbjct: 399 TVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVV 458

Query: 465 YDLAASRIGFAPRGCA 480
           YD A  R+GFAP GC+
Sbjct: 459 YDGAGGRVGFAPNGCS 474


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 229/422 (54%), Gaps = 30/422 (7%)

Query: 67  LRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS 126
           + L HVDS   N T        I+R   R++ L A   +A  +   ++            
Sbjct: 50  VMLRHVDS-GKNLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQ------------ 96

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           + + +  G+GEY   L +GTPP     VLDTGSD++W QC PC +CY Q  P+FDP KS 
Sbjct: 97  LEAPIHAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSS 156

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR----VAR 242
           SF+ V C S LC  + SS C+  + C Y  SYGD S+T G  +TET TF  ++    V  
Sbjct: 157 SFSKVSCGSSLCSAVPSSTCS--DGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214

Query: 243 VALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
           +  GCG DNEG  F  A+GL+GLGRG LS  +Q       +FSYCL     + K S ++ 
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE---PRFSYCLTPMDDT-KESILLL 270

Query: 302 GDSAVSRTAR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           G     + A+    TPLL NP   +FYY+ L GISVG   +  I  S F++   GNGGVI
Sbjct: 271 GSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLS-IEKSTFEVGDDGNGGVI 329

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHF 417
           IDSGT++T + + A+ AL+  F +           +  D CF L SG T+V++P +V HF
Sbjct: 330 IDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHF 389

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +G D+ LPA NY+I   + G  C A  G  SG+SI GN+QQQ   V +DL    I F P 
Sbjct: 390 KGGDLELPAENYMIGDSNLGVACLAM-GASSGMSIFGNVQQQNILVNHDLEKETISFVPT 448

Query: 478 GC 479
            C
Sbjct: 449 SC 450


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 153/359 (42%), Positives = 202/359 (56%), Gaps = 16/359 (4%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRS 187
           SG   GS +Y+  +G+GTP R + ++ DTGS + W QC PC   CY Q DP+FDP+KS S
Sbjct: 131 SGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSS 190

Query: 188 FATVPCRSPLCRKLDSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL 245
           +  + C S LC +  S+GC+     +C+Y V YGD SI+ G  S E LT   T +    L
Sbjct: 191 YTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATDIVHDFL 250

Query: 246 -GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GCG DNEGLF   AGL+GL R  +SF  QT   +N+ FSYCL   ST +    + FG S
Sbjct: 251 FGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCL--PSTPSSLGHLTFGAS 308

Query: 305 AVSR-TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           A +    ++TP       ++FY +++VGISVGG  +  +++S F       GG IIDSGT
Sbjct: 309 AATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSIIDSGT 363

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DV 422
            +TRL   AY ALR AFR        A    L DTC+D SG  E+ VP +   F G   V
Sbjct: 364 VITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAGGVKV 423

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            LP    L   +S+   C AFA   +G  ++I GN+QQ+   VVYD+   RIGF   GC
Sbjct: 424 ELPLVGILYG-ESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 481


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 159/404 (39%), Positives = 225/404 (55%), Gaps = 25/404 (6%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           + RD  RV S+   A  A R          A+ G S     G+  G+  Y   +G+GTP 
Sbjct: 91  LDRDQDRVDSIHRLA--AARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGTPK 148

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           R + +V DTGSD+ W+QC PC  CY Q DP+FDP++S +++ VPC +  CR+LDS  C+ 
Sbjct: 149 RDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLDSGSCS- 207

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTF-------RGTRVARVALGCGHDNEGLFVAAAGL 261
              C Y+V YGD S T G+ + +TLT           ++     GCG D+ GLF  A GL
Sbjct: 208 SGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKADGL 267

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKL 321
            GLGR R+S  +Q   ++   FSYCL   ST+    S+    SA    ARFT ++     
Sbjct: 268 FGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYLSL---GSAAPPNARFTAMVTRSDT 324

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
            +FYY+ LVGI V G  VR ++ ++F+       G +IDSGT +TRL   AY ALR +F 
Sbjct: 325 PSFYYLNLVGIKVAGRTVR-VSPAVFR-----TPGTVIDSGTVITRLPSRAYAALRSSF- 377

Query: 382 AGAS---SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGT 438
           AG     S KRAP  S+ DTC+D +G+ +V++P+V L F G          ++ V +   
Sbjct: 378 AGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANKSQ 437

Query: 439 FCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            C AFA  G  + ++I+GN+QQ+ F VVYD+A  +IGF  +GC+
Sbjct: 438 ACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 146/349 (41%), Positives = 195/349 (55%), Gaps = 5/349 (1%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           +A G+GEY   +  G+PP+   +++DTGSD++W QC PC+ C +    +FDP KS ++ T
Sbjct: 73  VASGNGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDT 132

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           V C S  C  L    C    +C Y   YGDGS T G  STET+T     +  VA GCGH 
Sbjct: 133 VSCASNFCSSLPFQSCT--TSCKYDYMYGDGSSTSGALSTETVTVGTGTIPNVAFGCGHT 190

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           N G F  AAG++GLG+G LS  +Q     ++KFSYCLV    S K S M+ GDSA +   
Sbjct: 191 NLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLG-STKTSPMLIGDSAAAGGV 249

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
            +T LL N    TFYY +L GISV G  V       F +D +G GG I+DSGT++T L  
Sbjct: 250 AYTALLTNTANPTFYYADLTGISVSGKAVT-YPVGTFSIDASGQGGFILDSGTTLTYLET 308

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYL 430
            A+ AL  A +A     +        D CF  +G      PT+  HF+GAD  LP  N  
Sbjct: 309 GAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENVF 368

Query: 431 IPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + +D+ G+ C A A + +G SI+GNIQQQ   +V+DL   R+GF    C
Sbjct: 369 VALDTGGSICLAMAAS-TGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  265 bits (678), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 148/371 (39%), Positives = 219/371 (59%), Gaps = 15/371 (4%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           GF S V+SG   GSG+YF    +GTPP+   +++D+GSD++W+QC+PC++CY+Q  P++ 
Sbjct: 48  GFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYV 107

Query: 182 PAKSRSFATVPCRSPLCRKLDSSG---CNRR--NTCLYQVSYGDGSITVGDFSTETLTFR 236
           P+ S +F+ VPC S  C  + ++    C+ R    C Y+  Y D S + G F+ E+ T  
Sbjct: 108 PSNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVD 167

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAK 295
           G R+ +VA GCG DN+G F AA G+LGLG+G LSF +Q G  +  KF+YCLV+    ++ 
Sbjct: 168 GVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSV 227

Query: 296 PSSMVFGDSAVS--RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
            SS++FGD  +S     ++TP+++NPK  T YYV++  ++VGG  +  I+ S +++D  G
Sbjct: 228 SSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLP-ISDSAWEIDLLG 286

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
           NGG I DSGT++T     AY  +  AF +G     RA      D C +L+G  +   P+ 
Sbjct: 287 NGGSIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQGLDLCVELTGVDQPSFPSF 345

Query: 414 VLHFRGADVSLP-ATNYLIPVDSSGTFCFAFAGTMS---GLSIIGNIQQQGFRVVYDLAA 469
            + F    V  P A NY + V +    C A AG  S   G + IGN+ QQ F V YD   
Sbjct: 346 TIEFDDGAVFQPEAENYFVDV-APNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREE 404

Query: 470 SRIGFAPRGCA 480
           + IGFAP  C+
Sbjct: 405 NLIGFAPAKCS 415


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  264 bits (674), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 145/356 (40%), Positives = 206/356 (57%), Gaps = 15/356 (4%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G GE+   + +GTPP+   +++DTGSD+ WIQ  PC+ C+ Q DP+FDP+KS ++  + C
Sbjct: 21  GYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIAC 80

Query: 194 RSPLCRK-LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE 252
            S  C   L +  C+    C+Y   YGDGS+T G FS ET+T   T    V  G    N 
Sbjct: 81  SSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNT 140

Query: 253 GLF--VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAV-SR 308
           G F      G+LGLG+G +S P+Q G     KFSYCLVD  S  ++ S+M FGD+AV S 
Sbjct: 141 GTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSG 200

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
             ++TP++ N    T+YY+ + GISVGG+ +  I  S++++D  G+GG IIDSGT++T L
Sbjct: 201 EVQYTPIVPNADHPTYYYIAVQGISVGGSLLD-IDQSVYEIDSGGSGGTIIDSGTTITYL 259

Query: 369 TRPAYIALRDAFRAGASSLKRAPDFSL---FDTCFDLSGKTEVKVPTVVLHFRGADVSLP 425
            +  + AL  A+    +S  R P  +     D CF+  G      P + +H  G  + LP
Sbjct: 260 QQEVFNALVAAY----TSQVRYPTTTSATGLDLCFNTRGTGSPVFPAMTIHLDGVHLELP 315

Query: 426 ATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             N  I ++++   C AFA  +   ++I GNIQQQ F +VYDL   RIGFAP  CA
Sbjct: 316 TANTFISLETN-IICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCA 370


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  264 bits (674), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 162/436 (37%), Positives = 233/436 (53%), Gaps = 29/436 (6%)

Query: 58  APDAESSLSLRLHHVDSLSFNR----TPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRN 113
           A   +SSL +   H      N     +P+H+  LR+  D  RV S+   ++ + ++   +
Sbjct: 56  ASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRL--DQARVNSI--HSKLSKKLTTNH 111

Query: 114 RSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKC 172
            S+ ++          G   GSG Y   +G+GTP   + ++ DTGSD+ W QC PC + C
Sbjct: 112 VSQSQST---DLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTC 168

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGD 227
           Y Q +P+F+P+KS S+  V C S  C  L S+      C+  N C+Y + YGD S +VG 
Sbjct: 169 YDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGF 227

Query: 228 FSTETLTFRGTRV-ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
            + +  T   + V   V  GCG +N+GLF   AGLLGLGR +LSFP+QT   +N+ FSYC
Sbjct: 228 LAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYC 287

Query: 287 LVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
           L   S+++    + FG + +SR+ +FTP+       +FY + +V I+VGG  +  I +++
Sbjct: 288 L--PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLP-IPSTV 344

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT 406
           F        G +IDSGT +TRL   AY ALR +F+A  S        S+ DTCFDLSG  
Sbjct: 345 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 399

Query: 407 EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVV 464
            V +P V   F G  V    +  +         C AFAG    S  +I GN+QQQ   VV
Sbjct: 400 TVTIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVV 459

Query: 465 YDLAASRIGFAPRGCA 480
           YD A  R+GFAP GC+
Sbjct: 460 YDGAGGRVGFAPNGCS 475


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  263 bits (673), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 172/431 (39%), Positives = 227/431 (52%), Gaps = 31/431 (7%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
           +  L +RL HVD+   N +   L     +R   R+  L A A     V         A G
Sbjct: 37  KGGLRVRLTHVDAHG-NYSRLQLLQRAARRSHHRMSRLVARATGVKAV---------AGG 86

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           G    +   +  G+GE+   + +GTP      ++DTGSD+VW QC PC  C+ Q+ PVFD
Sbjct: 87  G---DLQVPVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFD 143

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--RGTR 239
           P+ S ++ATVPC S LC  L +S C   + C Y  +YGD S T G  ++ET T      +
Sbjct: 144 PSSSSTYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKK 203

Query: 240 VARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
           +  VA GCG  NEG  F   AGL+GLGRG LS  +Q G     KFSYCL         S 
Sbjct: 204 LPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDGDGKSP 260

Query: 299 MVFGDSAVSRT-------ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           ++ G SA + +        + TPL+ NP   +FYYV L G++VG   +  + AS F +  
Sbjct: 261 LLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRIT-LPASAFAIQD 319

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD--LSGKTEVK 409
            G GGVI+DSGTS+T L    Y AL+ AF A  +           D CF     G  EV+
Sbjct: 320 DGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDEVQ 379

Query: 410 VPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
           VP +VLHF  GAD+ LPA NY++   +SG  C   A +  GLSIIGN QQQ F+ VYD+A
Sbjct: 380 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPS-RGLSIIGNFQQQNFQFVYDVA 438

Query: 469 ASRIGFAPRGC 479
              + FAP  C
Sbjct: 439 GDTLSFAPVQC 449


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  263 bits (673), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 173/443 (39%), Positives = 236/443 (53%), Gaps = 36/443 (8%)

Query: 62  ESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLT---AFAESAVRVPPRNRSR 116
            S L L LHH  S  S    P  L F+  +  D  RV  L    A ++   R P   R +
Sbjct: 41  SSGLHLTLHHPQSPCSPAPLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQ 100

Query: 117 GRANGGFSSS------------VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
            +A GG S              +  G + G G Y T+LG+GTP     MV+DTGS + W+
Sbjct: 101 KKAAGGASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL 160

Query: 165 QCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRNTCLYQVSY 218
           QC+PC   C+ Q  P+FDP  S ++A+V C +  C +L +     S C+  N C+YQ SY
Sbjct: 161 QCSPCVVSCHRQVGPLFDPRASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASY 220

Query: 219 GDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRR 278
           GD S +VG  ST+T++F  TR      GCG DNEGLF  +AGL+GL R +LS   Q    
Sbjct: 221 GDSSFSVGSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPS 280

Query: 279 FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAH 338
               FSYCL    T+A    +  G         +TP+ ++    + Y++ L G+SVGG+ 
Sbjct: 281 LGYSFSYCL---PTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSP 337

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           +  ++ S +   P      IIDSGT +TRL    + AL  A     +  +RAP FS+ DT
Sbjct: 338 L-AVSPSEYSSLP-----TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDT 391

Query: 399 CFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQ 457
           CF+    ++++VPTV + F  GA + L   N LI VD S T C AFA T S  +IIGN Q
Sbjct: 392 CFE-GQASQLRVPTVAMAFAGGASMKLTTRNVLIDVDDS-TTCLAFAPTDS-TAIIGNTQ 448

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
           QQ F V+YD+A SRIGF+  GC+
Sbjct: 449 QQTFSVIYDVAQSRIGFSAGGCS 471


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 153/354 (43%), Positives = 204/354 (57%), Gaps = 16/354 (4%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSF 188
           GL  G+  Y   +G GTP +   ++ DTGS+V WIQC PC   CY Q +P+FDP  S ++
Sbjct: 8   GLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTY 67

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGC 247
             + C S  C  L S GC+  +TC+Y V+YGDGS TVG  +TET T   G        GC
Sbjct: 68  RNISCTSAACTGLSSRGCSG-STCVYGVTYGDGSSTVGFLATETFTLAAGNVFNNFIFGC 126

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G +N+GLF  AAGL+GLGR   S  +Q        FSYCL   STS+    +  G+    
Sbjct: 127 GQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCL--PSTSSATGYLNIGNPL-- 182

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
           RT  +T +L N +  T Y+++L+GISVGG  +  +++++F+     + G IIDSGT +TR
Sbjct: 183 RTPGYTAMLTNSRAPTLYFIDLIGISVGGTRL-ALSSTVFQ-----SVGTIIDSGTVITR 236

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPAT 427
           L   AY ALR AFRA  +   RA   S+ DTC+D S  T V  PT+ LH+ G DV++P  
Sbjct: 237 LPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGA 296

Query: 428 NYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +  V SS   C AFAG    + + IIGN+QQ+   V YD A  RIGFA   C
Sbjct: 297 G-VFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 150/369 (40%), Positives = 209/369 (56%), Gaps = 28/369 (7%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRS 187
           SGL  G+G Y   +G+GTP + + ++ DTGSD+ W QC PC K CY+Q  P+FDP+ S++
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKT 204

Query: 188 FATVPCRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
           ++ + C S  C  L S+     GC+  N C+Y + YGD S TVG F+ +TLT     V  
Sbjct: 205 YSNISCTSTACSGLKSATGNSPGCSSSN-CVYGIQYGDSSFTVGFFAKDTLTLTQNDVFD 263

Query: 243 -VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
               GCG +N GLF   AGL+GLGR  LS   QT ++F + FSYCL   ++      + F
Sbjct: 264 GFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL--PTSRGSNGHLTF 321

Query: 302 GD-------SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
           G+        AV     FTP  A+ +  TFY+++++GISVGG  +  I+  LF+     N
Sbjct: 322 GNGNGVKTSKAVKNGITFTPF-ASSQGATFYFIDVLGISVGGKALS-ISPMLFQ-----N 374

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
            G IIDSGT +TRL    Y +L+  F+   S    AP  SL DTC+DLS  T + +P + 
Sbjct: 375 AGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKIS 434

Query: 415 LHFRG-ADVSLPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASR 471
            +F G A+V L     LI  + +   C AFAG      + I GNIQQQ   VVYD+A  +
Sbjct: 435 FNFNGNANVDLEPNGILI-TNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQ 493

Query: 472 IGFAPRGCA 480
           +GF  +GC+
Sbjct: 494 LGFGYKGCS 502


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 148/369 (40%), Positives = 210/369 (56%), Gaps = 28/369 (7%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRS 187
           SGL  G+G Y   +G+GTP + + ++ DTGSD+ W QC PC K CY+Q  P+FDP+ S++
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKT 204

Query: 188 FATVPCRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
           ++ + C S  C  L S+     GC+  N C+Y + YGD S T+G F+ + LT     V  
Sbjct: 205 YSNISCTSAACSSLKSATGNSPGCSSSN-CVYGIQYGDSSFTIGFFAKDKLTLTQNDVFD 263

Query: 243 -VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
               GCG +N+GLF   AGL+GLGR  LS   QT ++F + FSYCL   ++      + F
Sbjct: 264 GFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL--PTSRGSNGHLTF 321

Query: 302 GD-------SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
           G+        AV     FTP  A+ +   +Y+++++GISVGG  +  I+  LF+     N
Sbjct: 322 GNGNGVKASKAVKNGITFTPF-ASSQGTAYYFIDVLGISVGGKALS-ISPMLFQ-----N 374

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
            G IIDSGT +TRL   AY +L+ AF+   S    AP  SL DTC+DLS  T + +P + 
Sbjct: 375 AGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKIS 434

Query: 415 LHFRG-ADVSLPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASR 471
            +F G A+V L     LI  + +   C AFAG      + I GNIQQQ   VVYD+A  +
Sbjct: 435 FNFNGNANVELDPNGILI-TNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQ 493

Query: 472 IGFAPRGCA 480
           +GF  +GC+
Sbjct: 494 LGFGYKGCS 502


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 149/362 (41%), Positives = 199/362 (54%), Gaps = 18/362 (4%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRS 187
           SG   GS  Y   +G+GTP R + +V DTGSD+ W QC PC   CY Q D +FDP+KS S
Sbjct: 37  SGSLIGSANYVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSS 96

Query: 188 FATVPCRSPLCRKLDSSGCNRR------NTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
           +  + C S LC +L S G           +C+Y   YGD S +VG  S E LT   T + 
Sbjct: 97  YTNITCTSSLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATDIV 156

Query: 242 RVAL-GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 300
              L GCG DNEGLF  +AGL+GLGR  +S   QT   +N+ FSYCL   +TS+    + 
Sbjct: 157 DDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL--PATSSSLGHLT 214

Query: 301 FGDSAVSRTAR-FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           FG SA +  +  +TPL      ++FY +++V ISVGG  +  +++S F       GG II
Sbjct: 215 FGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSA-----GGSII 269

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           DSGT +TRL    Y ALR AFR        A +  L DTC+DLSG  E+ VP +   F G
Sbjct: 270 DSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSG 329

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
                     ++ V+S    C AFA  G+ + +++ GN+QQ+   VVYD+   RIGF   
Sbjct: 330 GVTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAA 389

Query: 478 GC 479
           GC
Sbjct: 390 GC 391


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 170/439 (38%), Positives = 234/439 (53%), Gaps = 31/439 (7%)

Query: 48  SESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAV 107
           S S  +L  PA   ++   + L HVDS   N T        I+R   R++ L A   +A 
Sbjct: 27  STSRRALSYPA-QLKNGFRITLKHVDS-DKNLTKFQRIQHGIKRANHRLERLNAMVLAA- 83

Query: 108 RVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA 167
                      +N   +S V+SG    +GE+   L +GTPP     ++DTGSD++W QC 
Sbjct: 84  ----------SSNAEINSPVLSG----NGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCK 129

Query: 168 PCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGD 227
           PC +C+ Q  P+FDP KS SF+ + C S LC+ L  S C+  ++C Y  +YGD S T G 
Sbjct: 130 PCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCS--DSCEYLYTYGDYSSTQGT 187

Query: 228 FSTETLTFRGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
            +TET TF    +  V  GCG DNEG  F   +GL+GLGRG LS  +Q       KFSYC
Sbjct: 188 MATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKE---AKFSYC 244

Query: 287 LVDRSTSAKPSSMVFGDSA----VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGI 342
           L     + K S+++ G  A     S   R TPL+ NP   +FYY+ L GISVGG  +  I
Sbjct: 245 LTSIDDT-KTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLP-I 302

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL 402
             S F+L   G GG+IIDSGT++T L   A+  ++  F +           +  + C++L
Sbjct: 303 KESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNL 362

Query: 403 -SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGF 461
            S  +E++VP +VLHF GAD+ LP  NY+I   S G  C A  G+  G+SI GN+QQQ  
Sbjct: 363 PSDTSELEVPKLVLHFTGADLELPGENYMIADSSMGVICLAM-GSSGGMSIFGNVQQQNM 421

Query: 462 RVVYDLAASRIGFAPRGCA 480
            V +DL    + F P  C 
Sbjct: 422 FVSHDLEKETLSFLPTNCG 440


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  262 bits (670), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 161/398 (40%), Positives = 220/398 (55%), Gaps = 19/398 (4%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           +  D  RV S+     +A   P  +++RG+   G +     G++ G+G Y   +G+GTP 
Sbjct: 100 LNDDQARVDSIHRKIAAAAS-PVLDQARGKK--GVTLPAQRGISLGTGNYVVSMGLGTPA 156

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           R + +V DTGSD+ W+QC PC  CY Q DP+FDPA+S +++ VPC SP C+ LDS  C+R
Sbjct: 157 RDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPECQGLDSRSCSR 216

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGLFVAAAGLLGLGRG 267
              C Y+V YGD S T G  + +TLT   + V      GCG  + GLF  A GL+GLGR 
Sbjct: 217 DKKCRYEVVYGDQSQTDGALARDTLTLTQSDVLPGFVFGCGEQDTGLFGRADGLVGLGRE 276

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYV 327
           ++S  +Q   ++   FSYCL    ++A    +  G  A +  ARFT +       +FYYV
Sbjct: 277 KVSLSSQAASKYGAGFSYCLPSSPSAA--GYLSLGGPAPA-NARFTAMETRHDSPSFYYV 333

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGAS 385
            LVG+ V G  VR ++  +F        G +IDSGT +TRL    Y ALR AF    G  
Sbjct: 334 RLVGVKVAGRTVR-VSPIVFSA-----AGTVIDSGTVITRLPPRVYAALRSAFARSMGRY 387

Query: 386 SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFA 444
             KRAP  S+ DTC+D +G T V++P+V L F  GA V L  +  L  V      C AFA
Sbjct: 388 GYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVLY-VAKVSQACLAFA 446

Query: 445 --GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             G  +   IIGN QQ+   VVYD+A  +IGF   GC+
Sbjct: 447 PNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  262 bits (670), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 155/396 (39%), Positives = 218/396 (55%), Gaps = 20/396 (5%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           + RD  RV S+           P    +  A+ G S     GL  G+  Y   +G+GTP 
Sbjct: 144 LDRDQDRVDSIHRMTAG-----PWTAGQSSASKGVSLPAHRGLRLGTANYIVSVGLGTPR 198

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           R + +V DTGSD+ W+QC PC  CY Q DP+FDP++S +++ VPC +  C  LDS  C+ 
Sbjct: 199 RDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC--LDSGTCS- 255

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVARVALGCGHDNEGLFVAAAGLLGLGR 266
              C Y+V YGD S T G+ + +TLT      ++     GCG D+ GLF  A GL GLGR
Sbjct: 256 SGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRADGLFGLGR 315

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYY 326
            R+S  +Q   R+   FSYCL   S+      +  G +A    A+FT ++      +FYY
Sbjct: 316 DRVSLASQAAARYGAGFSYCL--PSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYY 373

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           ++LVGI V G  VR +  ++FK       G +IDSGT +TRL   AY ALR +F      
Sbjct: 374 LDLVGIKVAGRTVR-VAPAVFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMRR 427

Query: 387 LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFA-- 444
            KRAP  S+ DTC+D +G+T+V++P+V L F G          ++ V +    C AFA  
Sbjct: 428 YKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYVANRSQACLAFASN 487

Query: 445 GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G  + + I+GN+QQ+ F VVYDLA  +IGF  +GC+
Sbjct: 488 GDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  260 bits (665), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 160/421 (38%), Positives = 224/421 (53%), Gaps = 31/421 (7%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           L + L  VDS   N T   L    I+R   R++S+ A  +S                  S
Sbjct: 42  LRVDLEQVDS-GKNLTKYELIKRAIKRGERRMRSINAMLQS------------------S 82

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S + + +  G GEY   + +GTP      ++DTGSD++W QC PC +C+SQ  P+F+P  
Sbjct: 83  SGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQD 142

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           S SF+T+PC S  C+ L S  CN  N C Y   YGDGS T G  +TET TF  + V  +A
Sbjct: 143 SSSFSTLPCESQYCQDLPSETCN-NNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIA 201

Query: 245 LGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
            GCG DN+G      AGL+G+G G LS P+Q G     +FSYC+    +S+ PS++  G 
Sbjct: 202 FGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV---GQFSYCMTSYGSSS-PSTLALGS 257

Query: 304 SA--VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           +A  V   +  T L+ +    T+YY+ L GI+VGG ++ GI +S F+L   G GG+IIDS
Sbjct: 258 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNL-GIPSSTFQLQDDGTGGMIIDS 316

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGA 420
           GT++T L + AY A+  AF    +        S   TCF   S  + V+VP + + F G 
Sbjct: 317 GTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGG 376

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            ++L   N LI   + G  C A   +   G+SI GNIQQQ  +V+YDL    + F P  C
Sbjct: 377 VLNLGEQNILIS-PAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

Query: 480 A 480
            
Sbjct: 436 G 436


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 172/443 (38%), Positives = 235/443 (53%), Gaps = 36/443 (8%)

Query: 62  ESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLT---AFAESAVRVPPRNRSR 116
            S L L LHH  S  S    P  L F+  +  D  RV  L    A ++   R P   R +
Sbjct: 41  SSGLHLTLHHPQSPCSPAPLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQ 100

Query: 117 GRANGGFSSS------------VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
            +A GG S              +  G + G G Y T+LG+GTP     MV+DTGS + W+
Sbjct: 101 KKAAGGASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL 160

Query: 165 QCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRNTCLYQVSY 218
           QC+PC   C+ Q  P+FDP  S ++ +V C +  C +L +     S C+  N C+YQ SY
Sbjct: 161 QCSPCVVSCHRQVGPLFDPRASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASY 220

Query: 219 GDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRR 278
           GD S +VG  ST+T++F  T       GCG DNEGLF  +AGL+GL R +LS   Q    
Sbjct: 221 GDSSFSVGYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPS 280

Query: 279 FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAH 338
               FSYCL    T+A    +  G         +TP+ ++    + Y++ L G+SVGG+ 
Sbjct: 281 LGYSFSYCL---PTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSP 337

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           +  ++ S +   P      IIDSGT +TRL    + AL  A     +  +RAP FS+ DT
Sbjct: 338 L-AVSPSEYSSLP-----TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDT 391

Query: 399 CFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQ 457
           CF+    ++++VPTVV+ F  GA + L   N LI VD S T C AFA T S  +IIGN Q
Sbjct: 392 CFE-GQASQLRVPTVVMAFAGGASMKLTTRNVLIDVDDS-TTCLAFAPTDS-TAIIGNTQ 448

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
           QQ F V+YD+A SRIGF+  GC+
Sbjct: 449 QQTFSVIYDVAQSRIGFSAGGCS 471


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 151/361 (41%), Positives = 199/361 (55%), Gaps = 24/361 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY   +G+GTPPRY   +LDTGSD++W QCAPC  C  Q  P FDPA+S S+A +PC S
Sbjct: 87  GEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNS 146

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVA--RVALGCGHDN 251
           P+C  L    C  RN C+YQ  YGD + T G  S ET TF    TRV   R+A GCG+ N
Sbjct: 147 PMCNALYYPLC-YRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLN 205

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG--------D 303
            G     +G++G GRG LS  +Q G   + +FSYCL     S  PS + FG         
Sbjct: 206 AGSLFNGSGMVGFGRGPLSLVSQLG---SPRFSYCLTSF-MSPVPSRLYFGAYATLNSTS 261

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSG 362
           ++     + TP + NP L T YY+ + GISVGG  +  I  S+F ++ A G GGVIIDSG
Sbjct: 262 ASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGG-ELLPIDPSVFAINDADGTGGVIIDSG 320

Query: 363 TSVTRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDL--SGKTEVKVPTVVLHFR 418
           +++T L R AY  +  AF  + G           + DTCF      +  V +P +  HF 
Sbjct: 321 STITYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHFE 380

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GA++ LP  NY++    +G  C A A +  G SIIG+ Q Q F V+YD   S + F P  
Sbjct: 381 GANMELPLENYMLIDGDTGNLCLAIAASDDG-SIIGSFQHQNFHVLYDNENSLLSFTPAT 439

Query: 479 C 479
           C
Sbjct: 440 C 440


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 153/360 (42%), Positives = 207/360 (57%), Gaps = 20/360 (5%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G+GE+   + +GTP      ++DTGSD+VW QC PC +C++Q+ PVFDP+ S +++T+PC
Sbjct: 114 GNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPC 173

Query: 194 RSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE 252
            S LC  L +S C +    C Y  +YGD S T G  + ET T   T++  VA GCG  NE
Sbjct: 174 SSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDTNE 233

Query: 253 GL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-------DS 304
           G  F   AGL+GLGRG LS  +Q G     KFSYCL     ++K S ++ G       D+
Sbjct: 234 GDGFTQGAGLVGLGRGPLSLVSQLGL---GKFSYCLTSLDDTSK-SPLLLGSLAAISTDT 289

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A +   + TPL+ NP   +FYYV L  ++VG   +  +  S F +   G GGVI+DSGTS
Sbjct: 290 ASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIP-LPGSAFAVQDDGTGGVIVDSGTS 348

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFD--LSGKTEVKVPTVVLHFR-GA 420
           +T L    Y  L+ AF A    L  A   ++  D CF    SG  +V+VP +VLHF  GA
Sbjct: 349 ITYLELQGYRPLKKAF-AAQMKLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGA 407

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           D+ LPA NY++   +SG  C    G+  GLSIIGN QQQ  + VYD+    + FAP  CA
Sbjct: 408 DLDLPAENYMVLDSASGALCLTVMGS-RGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQCA 466


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 162/383 (42%), Positives = 216/383 (56%), Gaps = 25/383 (6%)

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           G   +++ SG++ GSGEYF  + VGTPP++  ++LDTGSD+ WIQC PC +C+ Q  P +
Sbjct: 164 GQLIATLESGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHY 223

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSG----CNRRN-TCLYQVSYGDGSITVGDFSTETLTF 235
           DP +S S+  + C    C  + S      C   N TC Y   YGD S T GDF+ ET T 
Sbjct: 224 DPGQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTV 283

Query: 236 RGT---------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
             T         RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSYC
Sbjct: 284 NLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYC 343

Query: 287 LVDRSTSAKPSS-MVFG---DSAVSRTARFTPLLA---NPKLDTFYYVELVGISVGGAHV 339
           LVDR++ A  SS ++FG   D        FT L+A   NP +DTFYYV++  I VGG  V
Sbjct: 344 LVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENP-VDTFYYVQIKSIVVGG-EV 401

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC 399
             I    +++   G+GG IIDSGT+++    PAY  +++AF A         DF + + C
Sbjct: 402 VNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPC 461

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQ 457
           ++++G  +  +P   + F  GA  + P  NY I ++     C A  GT  S LSIIGN Q
Sbjct: 462 YNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQ 521

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
           QQ F ++YD   SR+GFAP  CA
Sbjct: 522 QQNFHILYDTKKSRLGFAPTKCA 544


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 159/384 (41%), Positives = 214/384 (55%), Gaps = 26/384 (6%)

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           G   +++ SG++ GSGEYF  + +G+PP++  ++LDTGSD+ WIQC PC  C+ Q  P +
Sbjct: 179 GQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYY 238

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLTF 235
           DP  S SF  + C  P C+ + S    R       +C Y   YGD S T GDF+ ET T 
Sbjct: 239 DPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTV 298

Query: 236 RGT----------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
             T          RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSY
Sbjct: 299 NLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 358

Query: 286 CLVDR-STSAKPSSMVFGDSAVSRTA---RFTPLLA---NPKLDTFYYVELVGISVGGAH 338
           CLVDR S ++  S ++FG+     T     FT L+A   NP +DTFYY+++  I VGG  
Sbjct: 359 CLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENP-VDTFYYLQIKSIFVGGEK 417

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           ++ I    + L   G GG IIDSGT+++  + PAY  +++AF       K   DF +   
Sbjct: 418 LQ-IPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHP 476

Query: 399 CFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNI 456
           C+++SG  E+  P  ++ F  GA  + P  NY I +      C A  GT  S LSIIGN 
Sbjct: 477 CYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNY 536

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
           QQQ F ++YD   SR+G+AP  CA
Sbjct: 537 QQQNFHILYDTKNSRLGYAPMRCA 560


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/370 (39%), Positives = 211/370 (57%), Gaps = 15/370 (4%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
           F S V+SG   GSG+YF    +GTPP+   +++D+GSD++W+QCAPC +CY+Q  P++ P
Sbjct: 50  FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAP 109

Query: 183 AKSRSFATVPCRSPLCRKLDSSG---CNRR--NTCLYQVSYGDGSITVGDFSTETLTFRG 237
           + S +F  VPC SP C  + ++    C+      C Y+  Y D S++ G F+ E+ T   
Sbjct: 110 SNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDD 169

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
            R+ +VA GCG DN+G F AA G+LGLG+G LSF +Q G  +  KF+YCLV+       S
Sbjct: 170 VRIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVS 229

Query: 298 S-MVFGDSAVS--RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
           S ++FGD  +S     +FTP+++N +  T YYV++  + VGG  +  I+ S + LD  GN
Sbjct: 230 SWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLP-ISHSAWSLDFLGN 288

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
           GG I DSGT+VT    PAY  +  AF        RA      D C D++G  +   P+  
Sbjct: 289 GGSIFDSGTTVTYWLPPAYRNILAAFDKNV-RYPRAASVQGLDLCVDVTGVDQPSFPSFT 347

Query: 415 LHFRGADVSLPAT-NYLIPVDSSGTFCFAFAG---TMSGLSIIGNIQQQGFRVVYDLAAS 470
           +   G  V  P   NY + V +    C A AG   ++ G + IGN+ QQ F V YD   +
Sbjct: 348 IVLGGGAVFQPQQGNYFVDV-APNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREEN 406

Query: 471 RIGFAPRGCA 480
           RIGFAP  C+
Sbjct: 407 RIGFAPAKCS 416


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 149/358 (41%), Positives = 202/358 (56%), Gaps = 20/358 (5%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY   +G+GTP R+   +LDTGSD++W QCAPC  C  Q  P FDPA S ++ ++ C +
Sbjct: 90  GEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSA 149

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVA--RVALGCGHDN 251
           P C  L    C ++ TC+YQ  YGD + T G  + ET TF    TRV   R++ GCG+ N
Sbjct: 150 PACNALYYPLCYQK-TCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLN 208

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-----DSAV 306
            G     +G++G GRG LS  +Q G   + +FSYCL    +  + S + FG     +S  
Sbjct: 209 AGSLANGSGMVGFGRGSLSLVSQLG---SPRFSYCLTSFLSPVR-SRLYFGAYATLNSTN 264

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           + T + TP + NP L T Y++ + GISVGG  +    A L   D  G GG IIDSGT++T
Sbjct: 265 ASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTIT 324

Query: 367 RLTRPAYIALRDAFRAGASS---LKRAPDFSLFDTCFDL--SGKTEVKVPTVVLHFRGAD 421
            L  PAY A+R+AF    +S   L    + S+ DTCF      +  V +P +VLHF GAD
Sbjct: 325 YLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGAD 384

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             LP  NY++   S+G  C A A +  G SIIG+ Q Q F V+YDL  S + F P  C
Sbjct: 385 WELPLQNYMLVDPSTGGLCLAMATSSDG-SIIGSYQHQNFNVLYDLENSLLSFVPAPC 441


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 159/384 (41%), Positives = 214/384 (55%), Gaps = 26/384 (6%)

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           G   +++ SG++ GSGEYF  + +G+PP++  ++LDTGSD+ WIQC PC  C+ Q  P +
Sbjct: 179 GQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYY 238

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLTF 235
           DP  S SF  + C  P C+ + S    R       +C Y   YGD S T GDF+ ET T 
Sbjct: 239 DPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTV 298

Query: 236 RGT----------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
             T          RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSY
Sbjct: 299 NLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 358

Query: 286 CLVDR-STSAKPSSMVFGDSAVSRTA---RFTPLLA---NPKLDTFYYVELVGISVGGAH 338
           CLVDR S ++  S ++FG+     T     FT L+A   NP +DTFYY+++  I VGG  
Sbjct: 359 CLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENP-VDTFYYLQIKSIFVGGEK 417

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           ++ I    + L   G GG IIDSGT+++  + PAY  +++AF       K   DF +   
Sbjct: 418 LQ-IPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHP 476

Query: 399 CFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNI 456
           C+++SG  E+  P  ++ F  GA  + P  NY I +      C A  GT  S LSIIGN 
Sbjct: 477 CYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNY 536

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
           QQQ F ++YD   SR+G+AP  CA
Sbjct: 537 QQQNFHILYDTKNSRLGYAPMRCA 560


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  258 bits (660), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 163/429 (37%), Positives = 228/429 (53%), Gaps = 21/429 (4%)

Query: 61  AESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRAN 120
           + S++ L LHH        + +  F+  +  D  R+ S  A           + +   A 
Sbjct: 39  SSSAVHLPLHHPRGPCSPLSADIPFSAVLTHDAARIASFAARLAKKSSPSSASATTQAAG 98

Query: 121 GGFSSSVIS-GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDP 178
              +S  ++ G + G G Y TR+G+GTP +   MV+DTGS + W+QC+PC+  C+ Q+ P
Sbjct: 99  SSLASVPLTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGP 158

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSG-----CNRRNTCLYQVSYGDGSITVGDFSTETL 233
           VFDP  S S+A V C SP C  L ++      C+  N C+YQ SYGD S +VG  S +T+
Sbjct: 159 VFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTV 218

Query: 234 TFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
           +F    V     GCG DNEGLF  +AGL+GL R +LS   Q        FSYCL   S+S
Sbjct: 219 SFGANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSS 278

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
              S   +     S    +TP+++N   D+ Y++ L G++V G  +  +++S +   P  
Sbjct: 279 GYLSIGSYNPGGYS----YTPMVSNTLDDSLYFISLSGMTVAGKPL-AVSSSEYTSLP-- 331

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFDLSGKTEVKVPT 412
               IIDSGT +TRL    Y AL  A  A    S KRA  +S+ DTCF+        VP 
Sbjct: 332 ---TIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPA 388

Query: 413 VVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           V + F  GA + L A N L+ VD + T C AFA   S  +IIGN QQQ F VVYD+ ++R
Sbjct: 389 VSMAFSGGATLKLSAGNLLVDVDGA-TTCLAFAPARSA-AIIGNTQQQTFSVVYDVKSNR 446

Query: 472 IGFAPRGCA 480
           IGFA  GC+
Sbjct: 447 IGFAAAGCS 455


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 158/396 (39%), Positives = 218/396 (55%), Gaps = 14/396 (3%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           ++RD  RV S+      A   P        +  G S     G++ G+G Y   +G+GTP 
Sbjct: 100 LERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGTPA 159

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           +   ++ DTGSD+ W+QC PC  CY Q DP+FDP+ S ++A V C +P C++LD+SGC+ 
Sbjct: 160 KQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDASGCSS 219

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRG 267
            + C Y+V YGD S T G+   +TLT   +  +     GCG  N GLF    GL GLGR 
Sbjct: 220 DSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGRE 279

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYV 327
           ++S P+Q    +   F+YCL   S+S+    +  G  A    A+FT  LA+    +FYY+
Sbjct: 280 KVSLPSQGAPSYGPGFTYCL--PSSSSGRGYLSLG-GAPPANAQFT-ALADGATPSFYYI 335

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
           +LVGI VGG  +R     +     A  GG +IDSGT +TRL   AY  LR AF    +  
Sbjct: 336 DLVGIKVGGRAIR-----IPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQY 390

Query: 388 KRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGT 446
           K+AP  S+ DTC+D +G    ++PTV L F  GA VSL  T  L  V      C AFA  
Sbjct: 391 KKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLAFAPN 449

Query: 447 M--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              S ++I+GN QQ+ F V YD+A  RIGF  +GC+
Sbjct: 450 ADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  258 bits (659), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 158/396 (39%), Positives = 218/396 (55%), Gaps = 14/396 (3%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           ++RD  RV S+      A   P        +  G S     G++ G+G Y   +G+GTP 
Sbjct: 100 LERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGTPA 159

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           +   ++ DTGSD+ W+QC PC  CY Q DP+FDP+ S ++A V C +P C++LD+SGC+ 
Sbjct: 160 KQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDASGCSS 219

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRG 267
            + C Y+V YGD S T G+   +TLT   +  +     GCG  N GLF    GL GLGR 
Sbjct: 220 DSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGRE 279

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYV 327
           ++S P+Q    +   F+YCL   S+S+    +  G  A    A+FT  LA+    +FYY+
Sbjct: 280 KVSLPSQGAPSYGPGFTYCL--PSSSSGRGYLSLG-GAPPANAQFT-ALADGATPSFYYI 335

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
           +LVGI VGG  +R     +     A  GG +IDSGT +TRL   AY  LR AF    +  
Sbjct: 336 DLVGIKVGGRAIR-----IPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQY 390

Query: 388 KRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGT 446
           K+AP  S+ DTC+D +G    ++PTV L F  GA VSL  T  L  V      C AFA  
Sbjct: 391 KKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLAFAPN 449

Query: 447 M--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              S ++I+GN QQ+ F V YD+A  RIGF  +GC+
Sbjct: 450 ADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  258 bits (658), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 157/437 (35%), Positives = 242/437 (55%), Gaps = 28/437 (6%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQ--RDVLRVKSLTAFAE-----SAVRVPPRNR 114
           +SS+ L ++HV     + TP     L      D   VK+L+         S    PP++ 
Sbjct: 43  QSSIHLNIYHVHGHGSSLTPNSSSLLSDVLLHDEEHVKALSDRLANKGLGSGSAKPPKSG 102

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCY 173
                N   S  +  GL+ GSG Y+ +LG+GTPP+Y  M+LDTGS + W+QC PC   C+
Sbjct: 103 HLLEPNSA-SIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCH 161

Query: 174 SQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR------RNTCLYQVSYGDGSITVGD 227
           +Q DP++DP+ S+++  + C S  C +L ++  N        N CLY  SYGD S ++G 
Sbjct: 162 AQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGY 221

Query: 228 FSTETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
            S + LT   ++ + +   GCG DN+GLF  AAG++GL R +LS   Q   ++   FSYC
Sbjct: 222 LSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYC 281

Query: 287 LVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
           L   ++ +     +   S    + +FTP+L + K  + Y++ L  I+V G  +  + A++
Sbjct: 282 LPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLD-LAAAM 340

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDTCFDLSGK 405
           +++        +IDSGT +TRL    Y ALR AF +  ++   +AP +S+ DTCF  S K
Sbjct: 341 YRVP------TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLK 394

Query: 406 TEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFR 462
           +   VP + + F+ GAD++L A + LI  D  G  C AFAG+   + ++IIGN QQQ + 
Sbjct: 395 SISAVPEIKMIFQGGADLTLRAPSILIEAD-KGITCLAFAGSSGTNQIAIIGNRQQQTYN 453

Query: 463 VVYDLAASRIGFAPRGC 479
           + YD++ SRIGFAP  C
Sbjct: 454 IAYDVSTSRIGFAPGSC 470


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 164/457 (35%), Positives = 253/457 (55%), Gaps = 40/457 (8%)

Query: 44  SVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEH----LFNLRIQRDVLRVKSL 99
           +++ S  +S L    PD    + L+L+H+ SL   ++P +    LF     +D  R++  
Sbjct: 14  AIASSLKDSGLKHKQPD----MQLKLYHMTSL---KSPPNSTSLLFAYMFAKDEERIRYF 66

Query: 100 TAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGS 159
            +           ++  G    G    + SGL+ GSG Y+ ++G+G+P +Y  M++DTGS
Sbjct: 67  HSRLAKNSDANASSKKVGPKLAGIP--LKSGLSMGSGNYYVKMGLGSPTKYYTMIVDTGS 124

Query: 160 DVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRK-----LDSSGCNRR-NTC 212
              W+QC PC   C+ Q DPVF+P+ S+++ TVPC S  C       L+   C+++ N C
Sbjct: 125 SFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNAC 184

Query: 213 LYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           +Y+ SYGD S ++G  S + LT   ++ ++    GCG DN+GLF    G++GL    LS 
Sbjct: 185 VYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSM 244

Query: 272 PTQTGRRFNRKFSYCL---VDRSTSAKPSSMVFGDSAV--SRTARFTPLLANPKLDTFYY 326
            +Q   ++   FSYCL        S K   +  G S++  S + +FTPLL NP   + Y+
Sbjct: 245 LSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYF 304

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS- 385
           ++L  I+V G  + G+ AS +K+        IIDSGT +TRL  P Y  L++A+    S 
Sbjct: 305 IDLESITVAGRPL-GVAASSYKVP------TIIDSGTVITRLPTPVYTTLKNAYVTILSK 357

Query: 386 SLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFA 442
             ++AP  SL DTCF   L+G +EV  P + + F+ GAD+ L   N L+ ++ +G  C A
Sbjct: 358 KYQQAPGISLLDTCFKGSLAGISEV-APDIRIIFKGGADLQLKGHNSLVELE-TGITCLA 415

Query: 443 FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            AG+ S ++IIGN QQQ  +V YD+  SR+GFAP GC
Sbjct: 416 MAGS-SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  257 bits (657), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 185/458 (40%), Positives = 244/458 (53%), Gaps = 51/458 (11%)

Query: 64  SLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSL--TAFAESAVRVPPRNRSRGRANG 121
           SL LRL+H  +       E L +L  ++D +R++++   A      R+P  +  R   + 
Sbjct: 76  SLKLRLNHRAAEGGRTREESLLDL-AEKDAVRIETMYRRAARSGGGRMPASSSPRRALSE 134

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
              ++V SG+A GSGEY   + VGTPPR   M++DTGSD+ W+QCAPC  C+ Q  PVFD
Sbjct: 135 RMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFD 194

Query: 182 PAKSRSFATVPCRSPLCRKLDSSG---------CNR--RNTCLYQVSYGDGSITVGDFST 230
           PA S S+  V C    C  +             C R   + C Y   YGD S T GD + 
Sbjct: 195 PAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLAL 254

Query: 231 ETLTFRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           E+ T   T      RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FS
Sbjct: 255 ESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFS 314

Query: 285 YCLVDRSTSAKPSSMVFG--DSAVSRTARFTPLLANPKL---------------DTFYYV 327
           YCLVD  +    S +VFG  D A++       L A+P+L               DTFYYV
Sbjct: 315 YCLVDHGSDVG-SKVVFGEDDDALA-------LAAHPQLKYTAFAPASSSSSPADTFYYV 366

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-S 386
           +L G+ VGG  +  I++  + +   G+GG IIDSGT+++    PAY  +R AF    S S
Sbjct: 367 KLKGVLVGG-ELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRS 425

Query: 387 LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSG--TFCFAF 443
               P+F +   C+++SG    +VP + L F  GA    PA NY I +D  G    C A 
Sbjct: 426 YPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAV 485

Query: 444 AGT-MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            GT  +G+SIIGN QQQ F VVYDL  +R+GFAPR CA
Sbjct: 486 LGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCA 523


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  257 bits (657), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 151/361 (41%), Positives = 206/361 (57%), Gaps = 25/361 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY   +G+G+PPRY   ++DTGSD++W QCAPC  C  Q  P F+PAKS S+A++PC S
Sbjct: 86  GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 145

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVA--RVALGCGHDN 251
            +C  L S  C  +N C+YQ  YGD + + G  + ET TF    TRVA  RV+ GCG+ N
Sbjct: 146 AMCNALYSPLC-FQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMN 204

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG--------D 303
            G     +G++G GRG LS  +Q G   + +FSYCL    + A  S + FG        +
Sbjct: 205 AGTLFNGSGMVGFGRGALSLVSQLG---SPRFSYCLTSFMSPAT-SRLYFGAYATLNSTN 260

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSG 362
           ++ S   + TP + NP L T Y++ + GISV G  +  I  S+F ++   G GGVIIDSG
Sbjct: 261 TSSSGPVQSTPFIVNPALPTMYFLNMTGISVAG-DLLPIDPSVFAINETDGTGGVIIDSG 319

Query: 363 TSVTRLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDL--SGKTEVKVPTVVLHFR 418
           T+VT L +PAY  ++ AF A  G       P    FDTCF      +  V +P +VLHF 
Sbjct: 320 TTVTFLAQPAYAMVQGAFVAWVGLPRANATPS-DTFDTCFKWPPPPRRMVTLPEMVLHFD 378

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GAD+ LP  NY++    +G  C A   +  G SIIG+ Q Q F ++YDL  S + F P  
Sbjct: 379 GADMELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAP 437

Query: 479 C 479
           C
Sbjct: 438 C 438


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  257 bits (657), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 151/361 (41%), Positives = 206/361 (57%), Gaps = 25/361 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY   +G+G+PPRY   ++DTGSD++W QCAPC  C  Q  P F+PAKS S+A++PC S
Sbjct: 83  GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 142

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVA--RVALGCGHDN 251
            +C  L S  C  +N C+YQ  YGD + + G  + ET TF    TRVA  RV+ GCG+ N
Sbjct: 143 AMCNALYSPLC-FQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMN 201

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG--------D 303
            G     +G++G GRG LS  +Q G   + +FSYCL    + A  S + FG        +
Sbjct: 202 AGTLFNGSGMVGFGRGALSLVSQLG---SPRFSYCLTSFMSPAT-SRLYFGAYATLNSTN 257

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSG 362
           ++ S   + TP + NP L T Y++ + GISV G  +  I  S+F ++   G GGVIIDSG
Sbjct: 258 TSSSGPVQSTPFIVNPALPTMYFLNMTGISVAG-DLLPIDPSVFAINETDGTGGVIIDSG 316

Query: 363 TSVTRLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDL--SGKTEVKVPTVVLHFR 418
           T+VT L +PAY  ++ AF A  G       P    FDTCF      +  V +P +VLHF 
Sbjct: 317 TTVTFLAQPAYAMVQGAFVAWVGLPRANATPS-DTFDTCFKWPPPPRRMVTLPEMVLHFD 375

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GAD+ LP  NY++    +G  C A   +  G SIIG+ Q Q F ++YDL  S + F P  
Sbjct: 376 GADMELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAP 434

Query: 479 C 479
           C
Sbjct: 435 C 435


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 170/384 (44%), Positives = 212/384 (55%), Gaps = 38/384 (9%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           ++V SG+A GSGEY   L VGTPPR   M++DTGSD+ W+QCAPC  C+ Q  PVFDPA 
Sbjct: 139 ATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAA 198

Query: 185 SRSFATVPCRSPLCRKLDSS----GCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGT 238
           S S+  V C  P C  +        C R ++  C Y   YGD S T GD + E  T   T
Sbjct: 199 SLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLT 258

Query: 239 ------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
                 RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSYCLVD  +
Sbjct: 259 APGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGS 318

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKL-------------DTFYYVELVGISVGGAHV 339
           S   S +VFGD           LL +P+L             DTFYYV+L G+ VGG  +
Sbjct: 319 SVG-SKIVFGDDDA--------LLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKL 369

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDT 398
             I+ S + +   G+GG IIDSGT+++    PAY  +R AF      +     DF +   
Sbjct: 370 N-ISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP 428

Query: 399 CFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNI 456
           C+++SG   V+VP   L F  GA    PA NY + +D  G  C A  GT  S +SIIGN 
Sbjct: 429 CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNF 488

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
           QQQ F V+YDL  +R+GFAPR CA
Sbjct: 489 QQQNFHVLYDLQNNRLGFAPRRCA 512


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 170/384 (44%), Positives = 212/384 (55%), Gaps = 38/384 (9%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           ++V SG+A GSGEY   L VGTPPR   M++DTGSD+ W+QCAPC  C+ Q  PVFDPA 
Sbjct: 139 ATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAT 198

Query: 185 SRSFATVPCRSPLCRKLDSS----GCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGT 238
           S S+  V C  P C  +        C R ++  C Y   YGD S T GD + E  T   T
Sbjct: 199 SLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLT 258

Query: 239 ------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
                 RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSYCLVD  +
Sbjct: 259 APGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGS 318

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKL-------------DTFYYVELVGISVGGAHV 339
           S   S +VFGD           LL +P+L             DTFYYV+L G+ VGG  +
Sbjct: 319 SVG-SKIVFGDDDA--------LLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKL 369

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDT 398
             I+ S + +   G+GG IIDSGT+++    PAY  +R AF      +     DF +   
Sbjct: 370 N-ISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP 428

Query: 399 CFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNI 456
           C+++SG   V+VP   L F  GA    PA NY + +D  G  C A  GT  S +SIIGN 
Sbjct: 429 CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNF 488

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
           QQQ F V+YDL  +R+GFAPR CA
Sbjct: 489 QQQNFHVLYDLQNNRLGFAPRRCA 512


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 150/350 (42%), Positives = 204/350 (58%), Gaps = 20/350 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRS 187
           SG   GSG YF  +G+GTP R + ++ DTGSD+ W QC PC + CY Q D +FDP+KS S
Sbjct: 137 SGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTS 196

Query: 188 FATVPCRSPLCRKLDSS-----GCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
           ++ + C S LC +L ++     GC+     C+Y + YGD S +VG FS E LT   T V 
Sbjct: 197 YSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATDVV 256

Query: 242 RVAL-GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 300
              L GCG +N+GLF  +AGL+GLGR  +SF  QT  ++ + FSYCL   STS+    + 
Sbjct: 257 DNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCL--PSTSSSTGHLS 314

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           FG +A  R  ++TP     +  +FY +++  I+VGG  +  +++S F       GG IID
Sbjct: 315 FGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLP-VSSSTFS-----TGGAIID 368

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           SGT +TRL   AY ALR AFR G S    A + S+ DTC+DLSG     +PT+   F G 
Sbjct: 369 SGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTIEFSFAGG 428

Query: 421 -DVSLPATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDL 467
             V LP    L  V S+   C AFA  G  S ++I GN+QQ+   VVYD+
Sbjct: 429 VTVKLPPQGILF-VASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 141/337 (41%), Positives = 199/337 (59%), Gaps = 21/337 (6%)

Query: 151 VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS-SGCNRR 209
           +++++DTGSD+ WIQC PC +CY Q D +F PA S ++  +PC S +C++L S S     
Sbjct: 1   MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60

Query: 210 NTCLYQVSYGDGSITVGDFSTETLTFRG-----TRVARVALGCGHDNEGLFVAAAGLLGL 264
           ++C Y VSYGD S T GDF+ ETLT R        V   A GCGH N+GLF  AAGL+GL
Sbjct: 61  SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGL 120

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV-SRTARFTPLLANPKLDT 323
           G+  + FP QT   F + FSYCL   S++     + FG++A+     RFTPL+ +    +
Sbjct: 121 GKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPS 180

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
            Y+V + GI+VG   +  I+A+           V++DSGT ++R  + AY  LRDAF   
Sbjct: 181 QYFVSMTGINVGD-ELLPISAT-----------VMVDSGTVISRFEQSAYERLRDAFTQI 228

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFA 442
              L+ A   + FDTCF +S   ++ +P + LHFR  A++ L   + L PVD  G  CFA
Sbjct: 229 LPGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVD-DGVMCFA 287

Query: 443 FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           FA + SG S++GN QQQ  R VYD+  SR+G +   C
Sbjct: 288 FAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 164/434 (37%), Positives = 227/434 (52%), Gaps = 38/434 (8%)

Query: 56  LPAPDAESSLSLRLHHVDS----LSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPP 111
           L  P  ++    +L HVDS      F R    +   R +    +  +L A + S +  P 
Sbjct: 31  LEHPKVQNGFRAKLKHVDSGKNLTKFERIQHGVKRGRHRLQRFKAMALVASSNSEIDAP- 89

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
                              +  G+GE+  +L +GTPP     ++DTGSD++W QC PC +
Sbjct: 90  -------------------VLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQ 130

Query: 172 CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTE 231
           C+ Q  P+FDP KS SF+ + C S LC  L  S C+  + C Y   YGD S T G  ++E
Sbjct: 131 CFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCS--DGCEYLYGYGDYSSTQGMLASE 188

Query: 232 TLTFRGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
           TLTF    V  VA GCG DNEG  F   +GL+GLGRG LS  +Q       KFSYCL   
Sbjct: 189 TLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKE---PKFSYCLTSV 245

Query: 291 STSAKPSSMVFGDSAVSRTA----RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
             + K S+++ G  A  + +    + TPL+ N    +FYY+ L GISVG   +  I  S 
Sbjct: 246 DDT-KASTLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLP-IKKST 303

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGK 405
           F L   G+GG+IIDSGT++T L + A+  +   F +  +        +  + CF L SG 
Sbjct: 304 FSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGS 363

Query: 406 TEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVY 465
           T+++VP +V HF GAD+ LPA NY+I   S G  C A  G+ SG+SI GNIQQQ   V++
Sbjct: 364 TDIEVPKLVFHFDGADLELPAENYMIADASMGVACLAM-GSSSGMSIFGNIQQQNMLVLH 422

Query: 466 DLAASRIGFAPRGC 479
           DL    + F P  C
Sbjct: 423 DLEKETLSFLPTQC 436


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 160/421 (38%), Positives = 226/421 (53%), Gaps = 32/421 (7%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           L + L  VDS   N T   L    I+R   R++S+ A  +S                  S
Sbjct: 42  LRVVLEQVDS-GMNLTKYELIKRAIKRGERRMRSINAMLQS------------------S 82

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S + + +  GSGEY   + +GTP   +  ++DTGSD++W QC PC +C+SQ  P+F+P  
Sbjct: 83  SGIETPVYAGSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQD 142

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           S SF+T+PC S  C+ L S  C   N C Y   YGDGS T G  +TET TF  + V  +A
Sbjct: 143 SSSFSTLPCESQYCQDLPSESC--YNDCQYTYGYGDGSSTQGYMATETFTFETSSVPNIA 200

Query: 245 LGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
            GCG DN+G      AGL+G+G G LS P+Q G     +FSYC+    +S+  S++  G 
Sbjct: 201 FGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGV---GQFSYCMTSSGSSSP-STLALGS 256

Query: 304 SA--VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           +A  V   +  T L+ +    T+YY+ L GI+VGG ++ GI +S F+L   G GG+IIDS
Sbjct: 257 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNL-GIPSSTFQLQDDGTGGMIIDS 315

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGA 420
           GT++T L + AY A+  AF    +        S   TCF L S  + V+VP + + F G 
Sbjct: 316 GTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGG 375

Query: 421 DVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            ++L   N LI   + G  C A  + +  G+SI GNIQQQ  +V+YDL    + F P  C
Sbjct: 376 VLNLGEENVLIS-PAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434

Query: 480 A 480
            
Sbjct: 435 G 435


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 149/355 (41%), Positives = 201/355 (56%), Gaps = 17/355 (4%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVP 192
           GSG Y   +G+G+P R +  + DTGSD+ W QC PC   CY Q + +FDP+ S S++ V 
Sbjct: 143 GSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVS 202

Query: 193 CRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALG 246
           C SP C KL+S+     GC+  +TCLY + YGDGS ++G F+ E L+   T V      G
Sbjct: 203 CDSPSCEKLESATGNSPGCSS-STCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQFG 261

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG +N GLF   AGLLGL R  LS  +QT +++ + FSYCL   S+S    S   GD   
Sbjct: 262 CGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSSTGYLSFGSGDGD- 320

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           S+  +FTP   N    +FY++++VGISVG   +  I  S+F        G IIDSGT ++
Sbjct: 321 SKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLP-IPKSVFS-----TAGTIIDSGTVIS 374

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPA 426
           RL    Y +++  FR   S   R    S+ DTC+DLS    VKVP ++L+F G      A
Sbjct: 375 RLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGAEMDLA 434

Query: 427 TNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +I V      C AFAG      ++IIGN+QQ+   VVYD A  R+GFAP GC
Sbjct: 435 PEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 160/439 (36%), Positives = 244/439 (55%), Gaps = 29/439 (6%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNL--RIQRDVLRVKSLTA---FAESAVRVPPRNRSR 116
           +  + L L+HV  L  ++T    F+    I +D  RV+ L +     ES       ++ R
Sbjct: 32  QEGMQLNLYHVKGLDSSQTSTSPFSFSDMITKDEERVRFLHSRLTNKESVRNSATTDKLR 91

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQ 175
           G  +   ++ + SGL+ GSG Y+ ++G+GTP +Y  M++DTGS + W+QC PC   C+ Q
Sbjct: 92  GGPSLVSTTPLKSGLSIGSGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQ 151

Query: 176 TDPVFDPAKSRSFATVPCRSPLCRK-----LDSSGC-NRRNTCLYQVSYGDGSITVGDFS 229
            DP+F P+ S+++  +PC S  C       L++ GC N    C+Y+ SYGD S ++G  S
Sbjct: 152 VDPIFTPSTSKTYKALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLS 211

Query: 230 TETLTFRGTRV--ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
            + LT   +    +    GCG DN+GLF  ++G++GL   ++S   Q  +++   FSYCL
Sbjct: 212 QDVLTLTPSEAPSSGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCL 271

Query: 288 VDRSTSAKPSSM-----VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGI 342
               ++   SS+     +   S  S   +FTPL+ N K+ + Y+++L  I+V G  + G+
Sbjct: 272 PSSFSAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPL-GV 330

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFD 401
           +AS +      N   IIDSGT +TRL    Y AL+ +F    S    +AP FS+ DTCF 
Sbjct: 331 SASSY------NVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFK 384

Query: 402 LSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQG 460
            S K    VP + + FR GA + L A N L+ ++  GT C A A + + +SIIGN QQQ 
Sbjct: 385 GSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQT 443

Query: 461 FRVVYDLAASRIGFAPRGC 479
           F+V YD+A  +IGFAP GC
Sbjct: 444 FKVAYDVANFKIGFAPGGC 462


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 160/383 (41%), Positives = 206/383 (53%), Gaps = 25/383 (6%)

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           G   +++ SG++ GSGEYF  + VGTPP++  ++LDTGSD+ WIQC PC  C+ Q  P +
Sbjct: 178 GQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYY 237

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSG----CN-RRNTCLYQVSYGDGSITVGDFSTETLTF 235
           DP  S SF  + C  P C+ + S      C     +C Y   YGD S T GDF+ ET T 
Sbjct: 238 DPKDSSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTV 297

Query: 236 RGTR---------VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
             T          V  V  GCGH N GLF  AAGLLGLGRG LSF TQ    +   FSYC
Sbjct: 298 NLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYC 357

Query: 287 LVDR-STSAKPSSMVFGDSAV---SRTARFTPLLA---NPKLDTFYYVELVGISVGGAHV 339
           LVDR S S+  S ++FG+           FT  +    NP +DTFYYV +  I VGG  V
Sbjct: 358 LVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENP-VDTFYYVLIKSIMVGGE-V 415

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC 399
             I    + L   G GG IIDSGT++T    PAY  +++AF            F     C
Sbjct: 416 LKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPC 475

Query: 400 FDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQ 457
           +++SG  ++++P   + F  GA    P  NY I ++     C A  GT  S LSIIGN Q
Sbjct: 476 YNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQ 535

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
           QQ F ++YDL  SR+G+AP  CA
Sbjct: 536 QQNFHILYDLKKSRLGYAPMKCA 558


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 157/437 (35%), Positives = 230/437 (52%), Gaps = 39/437 (8%)

Query: 73  DSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLA 132
           D+ S +R  E   ++ IQ+      +  A  ES         S+G  +G   +++ SG +
Sbjct: 115 DTKSMSRKQEVKESITIQQQNNLANAFVASLES---------SKGEFSGNIMATLESGAS 165

Query: 133 QGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVP 192
            G+GEYF  + VGTPP++V+++LDTGSD+ WIQC PC  C+ Q    + P  S ++  + 
Sbjct: 166 LGTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNIS 225

Query: 193 CRSPLCRKLDSSG----CNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT--------- 238
           C  P C+ + SS     C   N TC Y   Y DGS T GDF++ET T   T         
Sbjct: 226 CYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFK 285

Query: 239 RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPS 297
           +V  V  GCGH N+G F  A+GLLGLGRG +SFP+Q    +   FSYCL D  S ++  S
Sbjct: 286 QVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSS 345

Query: 298 SMVFGDSAV---SRTARFTPLLANPKL--DTFYYVELVGISVGGAHVRGITASLFKLDPA 352
            ++FG+      +    FT LLA  +   +TFYY+++  I VGG  V  I+   +     
Sbjct: 346 KLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGG-EVLDISEQTWHWSSE 404

Query: 353 -----GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK-T 406
                  GG IIDSG+++T     AY  +++AF       + A D  +   C+++SG   
Sbjct: 405 GAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMM 464

Query: 407 EVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRV 463
           +V++P   +HF    V + PA NY    +     C A   T   S L+IIGN+ QQ F +
Sbjct: 465 QVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFHI 524

Query: 464 VYDLAASRIGFAPRGCA 480
           +YD+  SR+G++PR CA
Sbjct: 525 LYDVKRSRLGYSPRRCA 541


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  255 bits (651), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 173/442 (39%), Positives = 226/442 (51%), Gaps = 36/442 (8%)

Query: 62  ESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTAFAESAVRVPPR------- 112
            S L L LHH  S  S    P  L F+  +  D  R   L +   +    P R       
Sbjct: 42  SSGLHLTLHHPQSPCSPAPLPSDLPFSTVLTHDDARAAHLASRLATTSNAPSRRPTTSLR 101

Query: 113 --NRSRGRANGGFSSSVIS-----GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQ 165
               + G + G    S+ S     G + G G Y T LG+GTP     MV+DTGS + W+Q
Sbjct: 102 KPKAAAGASGGPLDDSLASVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQ 161

Query: 166 CAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRNTCLYQVSYG 219
           C+PC   C+ Q  P++DP  S ++ATVPC +  C +L +     S C+ RN C+YQ SYG
Sbjct: 162 CSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASYG 221

Query: 220 DGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
           D S +VG  S +T++F          GCG DNEGLF  +AGL+GL R +LS   Q     
Sbjct: 222 DSSFSVGYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSL 281

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
              FSYCL    T A    +  G    S    +TP+ ++    + Y+V L G+SVGG+ +
Sbjct: 282 GYSFSYCL---PTPASTGYLSIGP-YTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPL 337

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC 399
               A    L        IIDSGT +TRL    Y AL  A  A    ++ AP FS+ DTC
Sbjct: 338 AVSPAEYSSLP------TIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTC 391

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           F     ++++VP V + F  GA + L   N LI VD S T C AFA T S  +IIGN QQ
Sbjct: 392 FQ-GQASQLRVPAVAMAFAGGATLKLATQNVLIDVDDS-TTCLAFAPTDS-TTIIGNTQQ 448

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
           Q F VVYD+A SRIGFA  GC+
Sbjct: 449 QTFSVVYDVAQSRIGFAAGGCS 470


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 166/426 (38%), Positives = 221/426 (51%), Gaps = 28/426 (6%)

Query: 67  LRLH--HVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           LR+H  HVD+   N +   L     +R   R+  L A A       P   S+    G   
Sbjct: 41  LRVHLTHVDAHG-NYSRHQLLRRAARRSHHRMSRLVARATGV----PMTSSKAAGGGDLQ 95

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
             V +G    +GE+   + +GTP      ++DTGSD+VW QC PC  C+ Q+ PVFDP+ 
Sbjct: 96  VPVHAG----NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSS 151

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           S ++ATVPC S  C  L +S C   + C Y  +YGD S T G  +TET T   +++  V 
Sbjct: 152 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVV 211

Query: 245 LGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
            GCG  NEG  F   AGL+GLGRG LS  +Q G     KFSYCL     +   S ++ G 
Sbjct: 212 FGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDTNN-SPLLLGS 267

Query: 304 SA-------VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
            A        + + + TPL+ NP   +FYYV L  I+VG   +  + +S F +   G GG
Sbjct: 268 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQDDGTGG 326

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF--DLSGKTEVKVPTVV 414
           VI+DSGTS+T L    Y AL+ AF A  +           D CF     G  +V+VP +V
Sbjct: 327 VIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLV 386

Query: 415 LHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
            HF  GAD+ LPA NY++    SG  C    G+  GLSIIGN QQQ F+ VYD+    + 
Sbjct: 387 FHFDGGADLDLPAENYMVLDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLS 445

Query: 474 FAPRGC 479
           FAP  C
Sbjct: 446 FAPVQC 451


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 170/384 (44%), Positives = 213/384 (55%), Gaps = 40/384 (10%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           ++V SG+A GSGEY   + VGTPPR   M++DTGSD+ W+QCAPC  C+ Q  PVFDP  
Sbjct: 137 ATVESGVAVGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMA 196

Query: 185 SRSFATVPCRSPLCRKLDSSGC------NRRNTCLYQVSYGDGSITVGDFSTETLTFRGT 238
           S S+  V C    C  +           +R + C Y   YGD S T GD + E  T   T
Sbjct: 197 STSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLT 256

Query: 239 -----RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
                RV  V LGCGH N GLF  AAGLLGLGRG LSF +Q    +   FSYCLVD   S
Sbjct: 257 ASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDHG-S 315

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKL-----------DTFYYVELVGISVGGAHVRGI 342
           A  S +VFGD  V        LL++P+L           +TFYYV+L GI VGG  +   
Sbjct: 316 AVGSKIVFGDDNV--------LLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIP 367

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA----PDFSLFDT 398
           + +       G+GG IIDSGT+++    PAY A+R AF      + +A     DF +   
Sbjct: 368 SNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAF---VDRMDKAYPLIADFPVLSP 424

Query: 399 CFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNI 456
           C+++SG   V+VP   L F  GA    PA NY I +D+ G  C A  GT  S +SIIGN 
Sbjct: 425 CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNY 484

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
           QQQ F V+YDL  +R+GFAPR CA
Sbjct: 485 QQQNFHVLYDLHHNRLGFAPRRCA 508


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  254 bits (648), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 166/426 (38%), Positives = 221/426 (51%), Gaps = 28/426 (6%)

Query: 67  LRLH--HVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           LR+H  HVD+   N +   L     +R   R+  L A A       P   S+    G   
Sbjct: 31  LRVHLTHVDAHG-NYSRHQLLRRAARRSHHRMSRLVARATGV----PMTSSKAAGGGDLQ 85

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
             V +G    +GE+   + +GTP      ++DTGSD+VW QC PC  C+ Q+ PVFDP+ 
Sbjct: 86  VPVHAG----NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSS 141

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           S ++ATVPC S  C  L +S C   + C Y  +YGD S T G  +TET T   +++  V 
Sbjct: 142 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVV 201

Query: 245 LGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
            GCG  NEG  F   AGL+GLGRG LS  +Q G     KFSYCL     +   S ++ G 
Sbjct: 202 FGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDTNN-SPLLLGS 257

Query: 304 SA-------VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
            A        + + + TPL+ NP   +FYYV L  I+VG   +  + +S F +   G GG
Sbjct: 258 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQDDGTGG 316

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF--DLSGKTEVKVPTVV 414
           VI+DSGTS+T L    Y AL+ AF A  +           D CF     G  +V+VP +V
Sbjct: 317 VIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLV 376

Query: 415 LHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
            HF  GAD+ LPA NY++    SG  C    G+  GLSIIGN QQQ F+ VYD+    + 
Sbjct: 377 FHFDGGADLDLPAENYMVLDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLS 435

Query: 474 FAPRGC 479
           FAP  C
Sbjct: 436 FAPVQC 441


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  254 bits (648), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 148/357 (41%), Positives = 200/357 (56%), Gaps = 14/357 (3%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRS 187
           SG A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA+S +
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSST 229

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALG 246
           +A V C +P C  LD+ GC+  + CLY V YGDGS ++G F+ +TLT       +    G
Sbjct: 230 YANVSCAAPACFDLDTRGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFG 288

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG  NEGLF  AAGLLGLGRG+ S P QT  ++   F++CL  RS+         G  A 
Sbjct: 289 CGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 348

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     TP+L +    TFYYV + GI VGG  +  I  S+F        G I+DSGT +T
Sbjct: 349 AGARLTTPMLTD-NGPTFYYVGMTGIRVGG-QLLSIPQSVFA-----TAGTIVDSGTVIT 401

Query: 367 RLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSL 424
           RL  PAY +LR AF +   A   K+AP  SL DTC+D +G ++V +PTV L F+G  +  
Sbjct: 402 RLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILD 461

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              + ++   S    C  FA    G  + I+GN Q + F V YD+    +GF+P  C
Sbjct: 462 VDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 166/440 (37%), Positives = 246/440 (55%), Gaps = 33/440 (7%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNL--RIQRDVLRVKSLTA---FAESAVRVPPRNRSR 116
           +  + L L+HV  L  ++T    F+    I +D  RV+ L +     ESA      ++  
Sbjct: 28  QEGMQLNLYHVKGLDSSQTSTSPFSFSDMITKDEERVRFLHSRLTNKESASNSATTDKLG 87

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQ 175
           G +    S+ + SGL+ GSG Y+ ++GVGTP +Y  M++DTGS + W+QC PC   C+ Q
Sbjct: 88  GPSL--VSTPLKSGLSIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQ 145

Query: 176 TDPVFDPAKSRSF-----ATVPCRSPLCRKLDSSGC-NRRNTCLYQVSYGDGSITVGDFS 229
            DP+F P+ S+++     ++  C S     L++ GC N    C+Y+ SYGD S ++G  S
Sbjct: 146 VDPIFTPSVSKTYKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLS 205

Query: 230 TETLTFRGTRV--ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
            + LT   +    +    GCG DN+GLF  +AG++GL   +LS   Q   ++   FSYCL
Sbjct: 206 QDVLTLTPSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCL 265

Query: 288 VDRSTSAKPSSMVFGDSAVSRTA------RFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
              S SA+P+S V G  ++  ++      +FTPL+ NPK+ + Y++ L  I+V G  + G
Sbjct: 266 -PSSFSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPL-G 323

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCF 400
           ++AS +      N   IIDSGT +TRL    Y AL+ +F    S    +AP FS+ DTCF
Sbjct: 324 VSASSY------NVPTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCF 377

Query: 401 DLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
             S K    VP + + FR GA + L   N L+ ++  GT C A A + + +SIIGN QQQ
Sbjct: 378 KGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQ 436

Query: 460 GFRVVYDLAASRIGFAPRGC 479
            F V YD+A S+IGFAP GC
Sbjct: 437 TFTVAYDVANSKIGFAPGGC 456


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 149/357 (41%), Positives = 197/357 (55%), Gaps = 17/357 (4%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G+GE+   + +GTP      ++DTGSD+VW QC PC  C+ Q+ PVFDP+ S ++ATVPC
Sbjct: 70  GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPC 129

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
            S  C  L +S C   + C Y  +YGD S T G  +TET T   +++  V  GCG  NEG
Sbjct: 130 SSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEG 189

Query: 254 L-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA------- 305
             F   AGL+GLGRG LS  +Q G     KFSYCL     +   S ++ G  A       
Sbjct: 190 DGFSQGAGLVGLGRGPLSLVSQLGL---DKFSYCLTSLDDTNN-SPLLLGSLAGISEASA 245

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            + + + TPL+ NP   +FYYV L  I+VG   +  + +S F +   G GGVI+DSGTS+
Sbjct: 246 AASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQDDGTGGVIVDSGTSI 304

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF--DLSGKTEVKVPTVVLHFR-GADV 422
           T L    Y AL+ AF A  +           D CF     G  +V+VP +V HF  GAD+
Sbjct: 305 TYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL 364

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            LPA NY++    SG  C    G+  GLSIIGN QQQ F+ VYD+    + FAP  C
Sbjct: 365 DLPAENYMVLDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 420


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 152/389 (39%), Positives = 213/389 (54%), Gaps = 24/389 (6%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S+   +G   +++ SG + G+GEYF  + VGTPP++V+++LDTGSD+ WIQC PC  C+ 
Sbjct: 147 SKDEFSGNIMATLESGASLGTGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFE 206

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDS----SGCNRRN-TCLYQVSYGDGSITVGDFS 229
           Q  P ++P +S S+  + C  P C+ + S      C   N TC Y   Y DGS T GDF+
Sbjct: 207 QNGPHYNPNESSSYRNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFA 266

Query: 230 TETLTFRGT---------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN 280
            ET T   T          V  V  GCGH N+G F  A GLLGLGRG LSFP+Q    + 
Sbjct: 267 LETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYG 326

Query: 281 RKFSYCLVDR-STSAKPSSMVFGDSAV---SRTARFTPLLANPKL--DTFYYVELVGISV 334
             FSYCL D  S ++  S ++FG+           FT LLA  +   DTFYY+++  I V
Sbjct: 327 HSFSYCLTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVV 386

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
           GG  V  I    +     G GG IIDSG+++T     AY  +++AF       + A D  
Sbjct: 387 GGE-VLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDF 445

Query: 395 LFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT--MSGLS 451
           +   C+++SG  +V++P   +HF  GA  + PA NY    +     C A   T   S L+
Sbjct: 446 IMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLT 505

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           IIGN+ QQ F ++YD+  SR+G++PR CA
Sbjct: 506 IIGNLLQQNFHILYDVKRSRLGYSPRRCA 534


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 145/366 (39%), Positives = 208/366 (56%), Gaps = 22/366 (6%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           + SG+   +  Y   +G+G+  + + +++DTGSD+ W+QC PC+ CY+Q  P+F P+ S 
Sbjct: 111 LTSGIKFQTLNYIVTMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSP 168

Query: 187 SFATVPCRSPLCRKLDSSGC----NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
           S+  + C S  C+ L+   C    +   TC Y V+YGDGS T G+   E L F G  V+ 
Sbjct: 169 SYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGGISVSN 228

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
              GCG +N+GLF  A+GL+GLGR  LS  +QT   F   FSYCL     +    S+V G
Sbjct: 229 FVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASGSLVMG 288

Query: 303 D-SAVSRTAR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           + S V +      +T +L N +L  FY + L GI VGG  +  + AS F     GNGGVI
Sbjct: 289 NQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLH-VQASSF-----GNGGVI 342

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           +DSGT ++RL    Y AL+  F    S    AP FS+ DTCF+L+G  +V +PT+ ++F 
Sbjct: 343 LDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFE 402

Query: 419 G-ADVSLPATN--YLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIG 473
           G A++++ AT   YL+  D+S   C A A       + IIGN QQ+  RV+YD   S++G
Sbjct: 403 GNAELNVDATGIFYLVKEDAS-RVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVG 461

Query: 474 FAPRGC 479
           FA   C
Sbjct: 462 FAKEPC 467


>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
           thaliana]
          Length = 142

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 123/142 (86%), Positives = 130/142 (91%)

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           V G+TASLFKLD  GNGGVIIDSGTSVTRL RPAYIA+RDAFR GA +LKRAPDFSLFDT
Sbjct: 1   VPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDT 60

Query: 399 CFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           CFDLS   EVKVPTVVLHFRGADVSLPATNYLIPVD++G FCFAFAGTM GLSIIGNIQQ
Sbjct: 61  CFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQ 120

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
           QGFRVVYDLA+SR+GFAP GCA
Sbjct: 121 QGFRVVYDLASSRVGFAPGGCA 142


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 172/429 (40%), Positives = 231/429 (53%), Gaps = 41/429 (9%)

Query: 69  LHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVI 128
           L HVD+     T E L +  ++R   RV +L + A  A                 +++ I
Sbjct: 35  LRHVDA-DAGYTEEQLLSRALRRSSARVATLQSLAALA------------PGDAITAARI 81

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
             LA   GEY   +G+GTP RY   +LDTGSD++W QCAPC  C  Q  P FDPA+S ++
Sbjct: 82  LVLAS-DGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATY 140

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL--- 245
            ++ C SP C  L    C ++  C+YQ  YGD + T G  + ET TF GT   RV+L   
Sbjct: 141 RSLGCASPACNALYYPLCYQK-VCVYQYFYGDSASTAGVLANETFTF-GTNETRVSLPGI 198

Query: 246 --GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG- 302
             GCG+ N GL    +G++G GRG LS  +Q G   + +FSYCL     S  PS + FG 
Sbjct: 199 SFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLG---SPRFSYCLTSF-LSPVPSRLYFGV 254

Query: 303 ------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL-DPAGNG 355
                  +A S   + TP + NP L T Y++ + GISVGG ++  I  ++F + D  G G
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG-YLLPIDPAVFAINDTDGTG 313

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFDL--SGKTEVKVPT 412
           G IIDSGT++T L  PAY A+R AF +  +  L    D S+ DTCF      +  V +P 
Sbjct: 314 GTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQ 373

Query: 413 VVLHFRGADVSLPATNYLIPVDSS--GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           +VLHF GAD  LP  NY++ VD S  G  C A A + S  SIIG+ Q Q F V+YDL  S
Sbjct: 374 LVLHFDGADWELPLQNYML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENS 431

Query: 471 RIGFAPRGC 479
            + F P  C
Sbjct: 432 LMSFVPAPC 440


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 161/390 (41%), Positives = 218/390 (55%), Gaps = 35/390 (8%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           GF+S V++ L Q   EY+  L VGTP   V +++DTGSDV WIQC PCK C     P F+
Sbjct: 124 GFTSPVVT-LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFN 182

Query: 182 PAKSRSFATVPCRSPLCRKLDSSG---CNRRN-TCLYQVSYGDGSITVGDFSTETLT--- 234
           P  S SF  +PC S  C  +       C+    TCL+ + YGDGS++ G  + ET+    
Sbjct: 183 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNT 242

Query: 235 -----FRGTRVARVALGCGH-DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
                    +++ + LGC   D EGL   A+GLLG+ R  +SFP+Q   R+ RKFS+C  
Sbjct: 243 PNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFP 302

Query: 289 DRSTSAKPSSMV-FGDS-AVSRTARFTPLLANPKLDT----FYYVELVGISVGGAHVRGI 342
           D+      S +V FG+S  +S   R+TPL+ NP + +    +YYV LVGISV  + +  +
Sbjct: 303 DKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLP-L 361

Query: 343 TASLFKLDPA-GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD 401
           +   F +D   G+GG IIDSGT+ T L +PA+ A+R  F A  S L +  D S F  C++
Sbjct: 362 SHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYN 421

Query: 402 LSGKT----EVKVPTVVLHFRGA-DVSLPATNYLIPVDSSG---TFCFAFAGTMSG---L 450
           ++  T       +P++ LHFRG  DV LP  + LIPV SS    T C AF   MSG    
Sbjct: 422 ITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAF--LMSGDIPF 479

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +IIGN QQQ   V YDL   R+G AP  CA
Sbjct: 480 NIIGNYQQQNLWVEYDLEKLRLGIAPAQCA 509


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 148/357 (41%), Positives = 205/357 (57%), Gaps = 20/357 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSF 188
           G + G G Y TR+G+GTP +   MV+DTGS + W+QC+PC+  C+ Q+ PVFDP  S S+
Sbjct: 129 GTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSY 188

Query: 189 ATVPCRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV 243
           A V C +P C  L ++      C+  + C+YQ SYGD S +VG  S +T++F    V   
Sbjct: 189 AAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFGSNSVPNF 248

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
             GCG DNEGLF  +AGL+GL R +LS   Q        FSYCL   S+S   S   +  
Sbjct: 249 YYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSYNP 308

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
              S    +TP++++   D+ Y+++L G++V G  +  +++S +   P      IIDSGT
Sbjct: 309 GQYS----YTPMVSSTLDDSLYFIKLSGMTVAGKPL-AVSSSEYSSLP-----TIIDSGT 358

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADV 422
            +TRL    Y AL  A        KRA  +S+ DTCF +   + ++VP V + F  GA +
Sbjct: 359 VITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCF-VGQASSLRVPAVSMAFSGGAAL 417

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            L A N L+ VDSS T C AFA   S  +IIGN QQQ F VVYD+ ++RIGFA  GC
Sbjct: 418 KLSAQNLLVDVDSSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSNRIGFAAGGC 472


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 183/444 (41%), Positives = 238/444 (53%), Gaps = 31/444 (6%)

Query: 63  SSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGG 122
           SSL L + H       RT +  F    ++D +RV+++     S+   P R R+   +   
Sbjct: 72  SSLKLHMTHRRGAEGGRTRKGSFLDLAEKDAVRVEAMHRRVASSSSSPRRGRALSESER- 130

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
             ++V SG+A GS EY   + VGTPPR   M++DTGSD+ W+QCAPC  C+ Q  PVFDP
Sbjct: 131 VVATVESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDP 190

Query: 183 AKSRSFATVPCRSPLCRKL------DSSGCNR--RNTCLYQVSYGDGSITVGDFSTETLT 234
           A S S+  + C  P C  +          C R   + C Y   YGD S + GD + E+ T
Sbjct: 191 AASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFT 250

Query: 235 FRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF-NRKFSYCL 287
              T      RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +    FSYCL
Sbjct: 251 VNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCL 310

Query: 288 VDRSTSAKPSSMVFG-DSAVSRTAR-------FTPLLANPKLDTFYYVELVGISVGGAHV 339
           VD  +    S +VFG D A++  A        F P  A+   DTFYYV L G+ VGG  +
Sbjct: 311 VDHGSDVA-SKVVFGEDDALALAAHPRLKYTAFAP--ASSPADTFYYVRLTGVLVGG-EL 366

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDT 398
             I++  +     G+GG IIDSGT+++    PAY  +R AF    S S    PDF +   
Sbjct: 367 LNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSP 426

Query: 399 CFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNI 456
           C+++SG    +VP + L F  GA    PA NY I +D  G  C A  GT  +G+SIIGN 
Sbjct: 427 CYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNF 486

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
           QQQ F V YDL  +R+GFAPR CA
Sbjct: 487 QQQNFHVAYDLHNNRLGFAPRRCA 510


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 164/458 (35%), Positives = 253/458 (55%), Gaps = 42/458 (9%)

Query: 44  SVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRTPEH----LFNLRIQRDVLRVKSL 99
           +++ S  +S L    PD    + L+L+ + SL   ++P +    LF     +D  R++  
Sbjct: 14  AIASSLKDSGLKHKQPD----MQLKLYPMTSL---KSPPNSTSLLFAYMFAKDEERIR-- 64

Query: 100 TAFAESAVRVPPRNRSRGRANGGFSS-SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTG 158
             F     +    N S  +     +   + SGL+ GSG Y+ ++G+G+P +Y  M++DTG
Sbjct: 65  -YFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTMIVDTG 123

Query: 159 SDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRK-----LDSSGCNRR-NT 211
           S   W+QC PC   C+ Q DPVF+P+ S+++ TVPC S  C       L+   C+++ N 
Sbjct: 124 SSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNA 183

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
           C+Y+ SYGD S ++G  S + LT   ++ ++    GCG DN+GLF    G++GL    LS
Sbjct: 184 CVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELS 243

Query: 271 FPTQTGRRFNRKFSYCL---VDRSTSAKPSSMVFGDSAV--SRTARFTPLLANPKLDTFY 325
             +Q   ++   FSYCL        S K   +  G S++  S + +FTPLL NP   + Y
Sbjct: 244 MLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLY 303

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           +++L  I+V G  + G+ AS +K+        IIDSGT +TRL  P Y  L++A+    S
Sbjct: 304 FIDLESITVAGRPL-GVAASSYKVP------TIIDSGTVITRLPTPVYTTLKNAYVTILS 356

Query: 386 -SLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF 441
              ++AP  SL DTCF   L+G +EV  P + + F+ GAD+ L   N L+ ++ +G  C 
Sbjct: 357 KKYQQAPGISLLDTCFKGSLAGISEV-APDIRIIFKGGADLQLKGHNSLVELE-TGITCL 414

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A AG+ S ++IIGN QQQ  +V YD+  SR+GFAP GC
Sbjct: 415 AMAGS-SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 147/355 (41%), Positives = 199/355 (56%), Gaps = 17/355 (4%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSF 188
           GL  GSG Y   +G GTP R   +V DTGSDV W+QC PC  +CY+Q +P+FDP+ S ++
Sbjct: 8   GLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTY 67

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGC 247
             V C  P C  L + GC+  +TCLY V YGDGS T+G  + +T       +      GC
Sbjct: 68  RNVSCTEPACVGLSTRGCSS-STCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKNFIFGC 126

Query: 248 GHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           G +N GLF   AGL+GLGR    S  +Q        FSYCL   STS+    +  G+   
Sbjct: 127 GQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCL--PSTSSATGYLNIGNP-- 182

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
             T  +T +L + ++ T Y+++L+GISVGG  +  +++++F+     + G IIDSGT +T
Sbjct: 183 QNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLS-LSSTVFQ-----SVGTIIDSGTVIT 236

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPA 426
           RL   AY AL+ A RA  +    AP  ++ DTC+D S  T V  P +VLHF G DV +PA
Sbjct: 237 RLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPA 296

Query: 427 TNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           T      +SS   C AFAG      + IIGN+QQ    V YD    RIGF+   C
Sbjct: 297 TGVFFVFNSS-QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 198/360 (55%), Gaps = 20/360 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRS 187
           SG A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA+S +
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALG 246
           +A + C +P C  LD+ GC+  N CLY V YGDGS ++G F+ +TLT       +    G
Sbjct: 231 YANISCAAPACSDLDTRGCSGGN-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFG 289

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG  NEGLF  AAGLLGLGRG+ S P QT  ++   F++CL  RS+         G  A 
Sbjct: 290 CGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 349

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     TP+L +    TFYYV + GI VGG  +  I  S+F        G I+DSGT +T
Sbjct: 350 AGARLTTPMLTD-NGPTFYYVGMTGIRVGG-QLLSIPQSVFT-----TAGTIVDSGTVIT 402

Query: 367 RLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---D 421
           RL   AY +LR AF +   A   K+AP  SL DTC+D +G ++V +PTV L F+G    D
Sbjct: 403 RLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLD 462

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           V      Y   V      C  FA    G  + I+GN Q + F V YD+    +GF+P  C
Sbjct: 463 VDASGIMYAASVSQ---VCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  251 bits (642), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 160/390 (41%), Positives = 218/390 (55%), Gaps = 35/390 (8%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           GF+S V++ L Q   EY+  L +GTP   V +++DTGSDV WIQC PCK C     P F+
Sbjct: 123 GFTSPVVT-LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFN 181

Query: 182 PAKSRSFATVPCRSPLCRKLDSSG---CNRRN-TCLYQVSYGDGSITVGDFSTETLT--- 234
           P  S SF  +PC S  C  +       C+    TCL+ + YGDGS++ G  + ET+    
Sbjct: 182 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNT 241

Query: 235 -----FRGTRVARVALGCGH-DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
                    +++ + LGC   D EGL   A+GLLG+ R  +SFP+Q   R+ RKFS+C  
Sbjct: 242 PNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFP 301

Query: 289 DRSTSAKPSSMV-FGDS-AVSRTARFTPLLANPKLDT----FYYVELVGISVGGAHVRGI 342
           D+      S +V FG+S  +S   R+TPL+ NP + +    +YYV LVGISV  + +  +
Sbjct: 302 DKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLP-L 360

Query: 343 TASLFKLDPA-GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD 401
           +   F +D   G+GG IIDSGT+ T L +PA+ A+R  F A  S L +  D S F  C++
Sbjct: 361 SHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYN 420

Query: 402 LSGKT----EVKVPTVVLHFRGA-DVSLPATNYLIPVDSSG---TFCFAFAGTMSG---L 450
           ++  T       +P++ LHFRG  DV LP  + LIPV SS    T C AF   MSG    
Sbjct: 421 ITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAF--QMSGDIPF 478

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +IIGN QQQ   V YDL   R+G AP  CA
Sbjct: 479 NIIGNYQQQNLWVEYDLEKLRLGIAPAQCA 508


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  251 bits (641), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 140/355 (39%), Positives = 194/355 (54%), Gaps = 20/355 (5%)

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
            L +G P      ++DTGSD++W QC PC +C+ Q  P+FDP KS S++ V C S LC  
Sbjct: 2   ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNA 61

Query: 201 LDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGL-FVA 257
           L  S CN  ++ C Y  +YGD S T G  +TET TF     ++ +  GCG +NEG  F  
Sbjct: 62  LPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQ 121

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA---VSRTA---- 310
            +GL+GLGRG LS  +Q       KFSYCL     S   SS+  G  A   V++T     
Sbjct: 122 GSGLVGLGRGPLSLISQLKE---TKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLD 178

Query: 311 ----RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
               +   LL NP   +FYY+EL GI+VG   +  +  S F+L   G GG+IIDSGT++T
Sbjct: 179 GEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLS-VEKSTFELAEDGTGGMIIDSGTTIT 237

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSLP 425
            L   A+  L++ F +  S        +  D CF L      + VP ++ HF+GAD+ LP
Sbjct: 238 YLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELP 297

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             NY++   S+G  C A  G+ +G+SI GN+QQQ F V++DL    + F P  C 
Sbjct: 298 GENYMVADSSTGVLCLAM-GSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECG 351


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  251 bits (641), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 151/357 (42%), Positives = 204/357 (57%), Gaps = 18/357 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRSF 188
           G A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA+S ++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGC 247
           A V C +P C  LD+ GC+  + CLY V YGDGS ++G F+ +TLT       +    GC
Sbjct: 231 ANVSCAAPACSDLDTRGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 289

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G  NEGLF  AAGLLGLGRG+ S P QT  ++   F++CL  RST      + FG  + +
Sbjct: 290 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGT--GYLDFGAGSPA 347

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
                TP+L +    TFYYV L GI VGG  +  I  S+F        G I+DSGT +TR
Sbjct: 348 ARLTTTPMLVD-NGPTFYYVGLTGIRVGG-RLLYIPQSVFA-----TAGTIVDSGTVITR 400

Query: 368 LTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSL 424
           L   AY +LR AF A  S+   K+AP  SL DTC+D +G ++V +PTV L F+ GA + +
Sbjct: 401 LPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDV 460

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            A+  +    +S   C AFA    G  + I+GN Q + F V YD+    + F+P  C
Sbjct: 461 DASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  251 bits (641), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 147/359 (40%), Positives = 215/359 (59%), Gaps = 20/359 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSF 188
           GL+ GSG Y+ +LG+G+PP+Y  M+LDTGS + W+QC PC   C+SQ DP+F+P+ S ++
Sbjct: 112 GLSIGSGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTY 171

Query: 189 ATVPCRSPLCRKLDSSG-----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VAR 242
             + C S  C  L ++      C     C+Y  SYGD S ++G  S + LT   ++ +  
Sbjct: 172 RPLYCSSSECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPS 231

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
              GCG DNEGLF  AAG++GL R +LS   Q   ++   FSYCL   STS+    +  G
Sbjct: 232 FTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCL-PTSTSSGGGFLSIG 290

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
             + S + +FTP++ N +  + Y++ L  I+V G  V G+ A+ +++        IIDSG
Sbjct: 291 KISPS-SYKFTPMIRNSQNPSLYFLRLAAITVAGRPV-GVAAAGYQVP------TIIDSG 342

Query: 363 TSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GA 420
           T VTRL    Y ALR+AF +  +   ++AP +S+ DTCF  S K+    P + + F+ GA
Sbjct: 343 TVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGA 402

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           D+SL A N LI  D  G  C AFA + + ++IIGN QQQ + + YD++AS+IGFAP GC
Sbjct: 403 DLSLRAPNILIEAD-KGIACLAFASS-NQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  251 bits (640), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 157/366 (42%), Positives = 204/366 (55%), Gaps = 25/366 (6%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G+GE+   L VGTP      ++DTGSD+VW QC PC +C++QT PVFDPA S ++A +PC
Sbjct: 112 GNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPC 171

Query: 194 RSPLCRKLDSSGCNRRNTCL-------YQVSYGDGSITVGDFSTETLTFRGTRVARVALG 246
            S LC  L +S C   ++         Y  +YGD S T G  +TET T    +V  VA G
Sbjct: 172 SSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQKVPGVAFG 231

Query: 247 CGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           CG  NEG  F   AGL+GLGRG LS  +Q G     +FSYCL     +A  S ++ G +A
Sbjct: 232 CGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGI---DRFSYCLTSLDDAAGRSPLLLGSAA 288

Query: 306 VSRT------ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
                     A+ TPL+ NP   +FYYV L G++VG   +  + +S F +   G GGVI+
Sbjct: 289 GISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRL-ALPSSAFAIQDDGTGGVIV 347

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD-----LSGKTEVKVPTVV 414
           DSGTS+T L   AY ALR AF A  S           D CF      +    +V+VP +V
Sbjct: 348 DSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKLV 407

Query: 415 LHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
           LHF  GAD+ LPA NY++   +SG  C     +  GLSIIGN QQQ F+ VYD+A   + 
Sbjct: 408 LHFDGGADLDLPAENYMVLDSASGALCLTVMAS-RGLSIIGNFQQQNFQFVYDVAGDTLS 466

Query: 474 FAPRGC 479
           FAP  C
Sbjct: 467 FAPAEC 472


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 148/350 (42%), Positives = 203/350 (58%), Gaps = 21/350 (6%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRS 187
           SG   GSG YF  +G+GTP R + ++ DTGSD+ W QC PC + CY Q D +FDP+KS S
Sbjct: 136 SGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTS 195

Query: 188 FATVPCRSPLCRKLDSS-----GCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
           ++ + C S LC +L ++     GC+     C+Y + YGD S +VG FS E L+   T + 
Sbjct: 196 YSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATDIV 255

Query: 242 RVAL-GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 300
              L GCG +N+GLF  +AGL+GLGR  +SF  QT   + + FSYCL   +TS+    + 
Sbjct: 256 DNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL--PATSSSTGRLS 313

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           FG +  S   ++TP     +  +FY +++ GISVGGA +  +++S F       GG IID
Sbjct: 314 FGTTTTSY-VKYTPFSTISRGSSFYGLDITGISVGGAKLP-VSSSTFS-----TGGAIID 366

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           SGT +TRL   AY ALR AFR G S    A + S+ DTC+DLSG     +P +   F G 
Sbjct: 367 SGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFAGG 426

Query: 421 -DVSLPATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDL 467
             V LP    L  V S+   C AFA  G  S ++I GN+QQ+   VVYD+
Sbjct: 427 VTVQLPPQGILY-VASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 171/429 (39%), Positives = 230/429 (53%), Gaps = 41/429 (9%)

Query: 69  LHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVI 128
           L HVD+     T E L +  ++R   RV +L + A  A                 +++ I
Sbjct: 35  LRHVDA-DAGYTEEQLLSRALRRSSARVATLQSLAALA------------PGDAITAARI 81

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
             LA   GEY   +G+GTP RY   +LDTGSD++W QCAPC  C  Q  P FDPA+S ++
Sbjct: 82  LVLAS-DGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATY 140

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL--- 245
            ++ C SP C  L    C ++  C+YQ  YGD + T G  + ET TF GT   RV+L   
Sbjct: 141 RSLGCASPACNALYYPLCYQK-VCVYQYFYGDSASTAGVLANETFTF-GTNETRVSLPGI 198

Query: 246 --GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG- 302
             GCG+ N G     +G++G GRG LS  +Q G   + +FSYCL     S  PS + FG 
Sbjct: 199 SFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLG---SPRFSYCLTSF-LSPVPSRLYFGV 254

Query: 303 ------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL-DPAGNG 355
                  +A S   + TP + NP L T Y++ + GISVGG ++  I  ++F + D  G G
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG-YLLPIDPAVFAINDTDGTG 313

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFDL--SGKTEVKVPT 412
           G IIDSGT++T L  PAY A+R AF +  +  L    D S+ DTCF      +  V +P 
Sbjct: 314 GTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQ 373

Query: 413 VVLHFRGADVSLPATNYLIPVDSS--GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           +VLHF GAD  LP  NY++ VD S  G  C A A + S  SIIG+ Q Q F V+YDL  S
Sbjct: 374 LVLHFDGADWELPLQNYML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENS 431

Query: 471 RIGFAPRGC 479
            + F P  C
Sbjct: 432 LMSFVPAPC 440


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 148/372 (39%), Positives = 206/372 (55%), Gaps = 17/372 (4%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
           F + ++SG   GSG+YF    +GTP +  ++++DTGSD+ ++QCAPC  CY Q  P++ P
Sbjct: 19  FRTPLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQP 78

Query: 183 AKSRSFATVPCRSPLCRKLDS---SGCNR-------RNTCLYQVSYGDGSITVGDFSTET 232
           + S +F  VPC S  C  + +   + C+        +  C Y+  YGD S TVG F+ ET
Sbjct: 79  SNSSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYET 138

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-S 291
            T  G RV  VA GCG+ N+G FV+A G+LGLG+G LSF +Q G  F  KF+YCL    S
Sbjct: 139 ATVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLS 198

Query: 292 TSAKPSSMVFGDSAVS--RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
            ++  SS++FGD  +S     +FTPL++NP   + YYV++V I  GG  +  I  S +K+
Sbjct: 199 PTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLL-IPDSAWKI 257

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
           D  GNGG I DSGT+VT  +  AY  +  AF       +  P       C ++SG     
Sbjct: 258 DSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPI 317

Query: 410 VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDL 467
            P+  + F +GA       NY I V S    C A   + S G ++IGNI QQ + V YD 
Sbjct: 318 YPSFTIEFDQGATYRPNQGNYFIEV-SPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDR 376

Query: 468 AASRIGFAPRGC 479
              RIGFA   C
Sbjct: 377 EEHRIGFAHANC 388


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 176/455 (38%), Positives = 246/455 (54%), Gaps = 39/455 (8%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKS--------LTAFAESAVRVPPRN 113
           ++SL + L H D     R    L    ++RD+ R++S        LTA A     +   N
Sbjct: 80  KTSLKMELKHRDHGQPTRNRRSLLLESLKRDITRLQSFQKRVSEKLTASANPEAYLEMTN 139

Query: 114 RSRGRANGG-------FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
            S  ++            S+V SG   G+GEYF  + VG PPR+  +++DTGSD+ W+QC
Sbjct: 140 SSSTKSPPSPSSSWEEVDSTVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQC 199

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL------DSSGCNRRNTCLYQVSYGD 220
            PCK C+ Q+ PVFDP++S SF  +PC +  C  +      D+S      TC Y   YGD
Sbjct: 200 KPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGD 259

Query: 221 GSITVGDFSTETLTF------RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ 274
            S T GD + E+L+           +  + +GCGH N+GLF  A GLLGLG+G LSFP+Q
Sbjct: 260 SSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQ 319

Query: 275 T-GRRFNRKFSYCLVDRSTSAKPSSMV-FGDS-AVSR---TARFTPLL-ANPKLDTFYYV 327
                  + FSYCLVDR+ +   SS + FG   A+SR     RFTP +  N  ++TFYY+
Sbjct: 320 LRSSPIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYL 379

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
            + GI +    +  I A  F + P G+GG IIDSGT++T L R AY A+  AF A   S 
Sbjct: 380 GIQGIKI-DQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI-SY 437

Query: 388 KRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVD-SSGTFCFAFAG 445
            RA  F +   C++ +G+T V  PT+ + F+ GA++ LP  NY I  D      C A   
Sbjct: 438 PRADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILP 497

Query: 446 TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           T  G+SIIGN QQQ    +YD+  +R+GFA   C+
Sbjct: 498 T-DGMSIIGNFQQQNIHFLYDVQHARLGFANTDCS 531


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 155/381 (40%), Positives = 204/381 (53%), Gaps = 40/381 (10%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SG++ GSGEYF  + +GTPP++  ++LDTGSD+ WIQC PC  C+ Q+ P +DP +S SF
Sbjct: 183 SGVSLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSF 242

Query: 189 ATVPCRSPLCRKLDSSG----CNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT----- 238
             + C  P C+ + S      C   N TC Y   YGD S T GDF+ ET T   T     
Sbjct: 243 ENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGK 302

Query: 239 ----RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
                V  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSYCLVDR++  
Sbjct: 303 SEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDT 362

Query: 295 KPSS-MVFGDSAVSRTARFTPLLANPKL-------------DTFYYVELVGISVGGAHVR 340
             SS ++FG+           LL++P L             DTFYYV +  I V G  V 
Sbjct: 363 SVSSKLIFGEDK--------ELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDG-EVL 413

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF 400
            I    + L   G GG IIDSGT++T    PAY  +++AF       +    F     C+
Sbjct: 414 KIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCY 473

Query: 401 DLSGKTEVKVPTV-VLHFRGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQ 458
           ++SG  ++++P   +L   GA    P  NY I ++     C A  GT  S LSIIGN QQ
Sbjct: 474 NVSGIEKMELPDFGILFSDGAMWDFPVENYFIQIEPD-LVCLAILGTPKSALSIIGNYQQ 532

Query: 459 QGFRVVYDLAASRIGFAPRGC 479
           Q F ++YD+  SR+G+AP  C
Sbjct: 533 QNFHILYDMKKSRLGYAPMKC 553


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 161/423 (38%), Positives = 224/423 (52%), Gaps = 47/423 (11%)

Query: 88  RIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP 147
           R+Q++  +      FA +A    P        +G   +++ SG++ GSGEYF  + VGTP
Sbjct: 152 RLQKEQPKQSFKPVFAPAASSTSP-------VSGQLVATLESGVSLGSGEYFMDVFVGTP 204

Query: 148 PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS---- 203
           P++  ++LDTGSD+ WIQC PC  C+ Q+ P +DP  S SF  + C  P C+ + S    
Sbjct: 205 PKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSSPDPP 264

Query: 204 SGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT---------RVARVALGCGHDNEG 253
           + C   N +C Y   YGDGS T GDF+ ET T   T          V  V  GCGH N G
Sbjct: 265 NPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRG 324

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSAVSRTARF 312
           LF  AAGLLGLG+G LSF +Q    + + FSYCLVDR+++A  SS ++FG+         
Sbjct: 325 LFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFGEDK------- 377

Query: 313 TPLLANPKL-------------DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
             LL++P L             DTFYYV++  + V    V  I    + L   G GG II
Sbjct: 378 -ELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDD-EVLKIPEETWHLSSEGAGGTII 435

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV-VLHFR 418
           DSGT++T    PAY  +++AF       +          C+++SG  ++++P   +L   
Sbjct: 436 DSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILFAD 495

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           GA  + P  NY I +D     C A  G   S LSIIGN QQQ F ++YD+  SR+G+AP 
Sbjct: 496 GAVWNFPVENYFIQIDPD-VVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAPM 554

Query: 478 GCA 480
            CA
Sbjct: 555 KCA 557


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 150/358 (41%), Positives = 201/358 (56%), Gaps = 19/358 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSF 188
           G+A G+G Y   + +GTP     +V DTGSD  W+QC PC   CY Q +P+FDP KS ++
Sbjct: 153 GVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATY 212

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           A + C S  C  L  SGC+  + CLY + YGDGS T+G ++ +TLT     +     GCG
Sbjct: 213 ANISCSSSYCSDLYVSGCSGGH-CLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFRFGCG 271

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSR 308
             N GLF  AAGLLGLGRG+ S P Q   ++   F+YCL   +TSA    +  G  A + 
Sbjct: 272 EKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL--PATSAGTGFLDLGPGAPAA 329

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
            AR TP+L + +  TFYYV + GI VGG HV  I  S+F        G ++DSGT +TRL
Sbjct: 330 NARLTPMLVD-RGPTFYYVGMTGIKVGG-HVLPIPGSVFS-----TAGTLVDSGTVITRL 382

Query: 369 TRPAYIALRDAFRAGASSL--KRAPDFSLFDTCFDLSGKT--EVKVPTVVLHFR-GADVS 423
              AY  LR AF      L    AP FS+ DTC+DL+G     + +P V L F+ GA + 
Sbjct: 383 PPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLD 442

Query: 424 LPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + A+  L   D S   C AFA     + ++I+GN QQ+   V+YD+    +GFAP  C
Sbjct: 443 VDASGILYVADVS-QACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  248 bits (633), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 157/421 (37%), Positives = 227/421 (53%), Gaps = 47/421 (11%)

Query: 85  FNLRIQR----DVLRVKSLTAFAESAVRVPPRNRSRGRA---NGGFSSSVI---SGLAQG 134
           +N R+Q+    D LRV+S+            +NR R  A   N   S + I   SG+   
Sbjct: 14  WNRRLQKQLILDDLRVRSM------------QNRIRRVASTHNVEASQTQIPLSSGINLQ 61

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +  Y   +G+G+  + + +++DTGSD+ W+QC PC  CY+Q  P+F P+ S S+ +V C 
Sbjct: 62  TLNYIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCN 119

Query: 195 SPLCRKL-----DSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
           S  C+ L     ++  C   N  TC Y V+YGDGS T G+   E L+F G  V+    GC
Sbjct: 120 SSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVSDFVFGC 179

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G +N+GLF   +GL+GLGR  LS  +QT   F   FSYCL      +  S ++  +S+V 
Sbjct: 180 GRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNESSVF 239

Query: 308 RTAR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           + A    +T +L+NP+L  FY + L GI VGG  ++   +        GNGG++IDSGT 
Sbjct: 240 KNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLS-------FGNGGILIDSGTV 292

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---D 421
           +TRL    Y AL+  F    +    AP FS+ DTCF+L+G  EV +PT+ L F G    +
Sbjct: 293 ITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLN 352

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           V    T Y++  D+S   C A A        +IIGN QQ+  RV+YD   S++GFA   C
Sbjct: 353 VDATGTFYVVKEDAS-QVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPC 411

Query: 480 A 480
           +
Sbjct: 412 S 412


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  248 bits (633), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 145/361 (40%), Positives = 211/361 (58%), Gaps = 23/361 (6%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSF 188
           G + GSG Y+ ++G+G+P RY  M++DTGS + W+QC PC   C+ Q DP+FDP+ S+++
Sbjct: 5   GASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTY 64

Query: 189 ATVPCRSPLCRKLDSSGCNR------RNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VA 241
            ++ C S  C  L  +  N        N C+Y  SYGD S ++G  S + LT   ++ + 
Sbjct: 65  KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLP 124

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
               GCG D+EGLF  AAG+LGLGR +LS   Q   +F   FSYCL  R        +  
Sbjct: 125 GFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGG---FLSI 181

Query: 302 GDSAVSRTA-RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
           G ++++ +A +FTP+  +P   + Y++ L  I+VGG    G+ A+ +++        IID
Sbjct: 182 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGG-RALGVAAAQYRVP------TIID 234

Query: 361 SGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR- 418
           SGT +TRL    Y   + AF +  +S   RAP FS+ DTCF  + K    VP V L F+ 
Sbjct: 235 SGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIFQG 294

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GAD++L   N L+ VD  G  C AFAG  +G++IIGN QQQ F+V +D++ +RIGFA  G
Sbjct: 295 GADLNLRPVNVLLQVD-EGLTCLAFAGN-NGVAIIGNHQQQTFKVAHDISTARIGFATGG 352

Query: 479 C 479
           C
Sbjct: 353 C 353


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  248 bits (633), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 150/358 (41%), Positives = 201/358 (56%), Gaps = 19/358 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSF 188
           G+A G+G Y   + +GTP     +V DTGSD  W+QC PC   CY Q +P+FDP KS ++
Sbjct: 88  GVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATY 147

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           A + C S  C  L  SGC+  + CLY + YGDGS T+G ++ +TLT     +     GCG
Sbjct: 148 ANISCSSSYCSDLYVSGCSGGH-CLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFRFGCG 206

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSR 308
             N GLF  AAGLLGLGRG+ S P Q   ++   F+YCL   +TSA    +  G  A + 
Sbjct: 207 EKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL--PATSAGTGFLDLGPGAPAA 264

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
            AR TP+L + +  TFYYV + GI VGG HV  I  S+F        G ++DSGT +TRL
Sbjct: 265 NARLTPMLVD-RGPTFYYVGMTGIKVGG-HVLPIPGSVFS-----TAGTLVDSGTVITRL 317

Query: 369 TRPAYIALRDAFRAGASSL--KRAPDFSLFDTCFDLSGKT--EVKVPTVVLHFR-GADVS 423
              AY  LR AF      L    AP FS+ DTC+DL+G     + +P V L F+ GA + 
Sbjct: 318 PPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLD 377

Query: 424 LPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + A+  L   D S   C AFA     + ++I+GN QQ+   V+YD+    +GFAP  C
Sbjct: 378 VDASGILYVADVS-QACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  248 bits (632), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 144/346 (41%), Positives = 197/346 (56%), Gaps = 25/346 (7%)

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN- 207
           R + +++DTGSD+ W+QC PCK+CY+Q DPVF+P+ S S+ TV C SP C+ L S+  N 
Sbjct: 144 RKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNL 203

Query: 208 -----RRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVAAAGL 261
                   +C Y V+YGDGS T G+  TE L     T V     GCG +N+GLF  A+GL
Sbjct: 204 GVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCGRNNQGLFGGASGL 263

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA---RFTPLLAN 318
           +GLGR  LS  +QT   F   FSYCL    T A  S ++ G+S+V +      +T ++ N
Sbjct: 264 VGLGRSSLSLISQTSAMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIPN 323

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
           P+L  FY++ L GI+VG   V+   A  F     G  G++IDSGT +TRL    Y AL+D
Sbjct: 324 PQL-PFYFLNLTGITVGSVAVQ---APSF-----GKDGMMIDSGTVITRLPPSIYQALKD 374

Query: 379 AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIPVDS 435
            F    S    AP F + DTCF+LSG  EV++P + +HF G    +V +    Y +  D+
Sbjct: 375 EFVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDA 434

Query: 436 SGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           S   C A A     + + IIGN QQ+  RV+YD   S +GFA   C
Sbjct: 435 S-QVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  248 bits (632), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 161/443 (36%), Positives = 228/443 (51%), Gaps = 36/443 (8%)

Query: 53  SLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPR 112
           SLP+   +      L+L HVD+ +    P+ L +  I R   RV +L + A S   V   
Sbjct: 16  SLPVARCNDNVGFQLKLTHVDAGTSYTKPQ-LLSRAIARSKARVAALQSAAVSPAPV--- 71

Query: 113 NRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
                 A+   ++ V+  +   SGEY   L +GTPP Y   ++DTGSD++W QCAPC  C
Sbjct: 72  ------ADPITAARVL--VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC 123

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
            +Q  P FD  +S ++  +PCRS  C  L S  C ++  C+YQ  YGD + T G  + ET
Sbjct: 124 AAQPTPYFDVKRSATYRALPCRSSRCAALSSPSCFKK-MCVYQYYYGDTASTAGVLANET 182

Query: 233 LTFRG-----TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
            TF        R A ++ GCG  N G    ++G++G GRG LS  +Q G     +FSYCL
Sbjct: 183 FTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGP---SRFSYCL 239

Query: 288 VDRSTSAKPSSMVFG--------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
                S  PS + FG        +++     + TP + NP L   Y++ + GIS+G   +
Sbjct: 240 TSY-LSPTPSRLYFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRL 298

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDT 398
             I   +F ++  G GGVIIDSGTS+T L + AY A+R    A    L    D  +  DT
Sbjct: 299 P-IDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGL-ASTIPLPAMNDTDIGLDT 356

Query: 399 CFDL--SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNI 456
           CF         V VP  V HF GA+++LP  NY++   ++G  C A A T  G +IIGN 
Sbjct: 357 CFQWPPPPNVTVTVPDFVFHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVG-TIIGNY 415

Query: 457 QQQGFRVVYDLAASRIGFAPRGC 479
           QQQ   ++YD+A S + F P  C
Sbjct: 416 QQQNLHLLYDIANSFLSFVPAPC 438


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  248 bits (632), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 154/422 (36%), Positives = 228/422 (54%), Gaps = 49/422 (11%)

Query: 85  FNLRIQR----DVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSS--------VISGLA 132
           +N ++Q+    D LRV+S+            +NR R + +G  SS         + SG+ 
Sbjct: 80  WNRKLQKQLIFDDLRVRSM------------QNRIRAKVSGHNSSEQSSEIQIPLASGIN 127

Query: 133 QGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVP 192
             +  Y   +G+G     V  ++DTGSD+ W+QC PC  CYSQ  PVF+P+ S S+ ++ 
Sbjct: 128 LETLNYIVTIGLGNQNMTV--IIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLL 185

Query: 193 CRSPLCRKL-----DSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL 245
           C S  C+ L     ++  C   N  +C + VSYGDGS T G+   E L+F G  V+    
Sbjct: 186 CNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNFVF 245

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           GCG +N+GLF   +G++GLGR  LS  +QT   F   FSYCL    + A  S ++  +S+
Sbjct: 246 GCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGASGSLVIGNESS 305

Query: 306 VSRT---ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
           + +      +T +++NP+L  FY + L GI VGG  ++  +         GNGG++IDSG
Sbjct: 306 LFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTS--------FGNGGILIDSG 357

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GAD 421
           T +TRL    Y AL+  F    S    AP  S+ DTCF+L+G  EV +PT+ +HF    D
Sbjct: 358 TVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENNVD 417

Query: 422 VSLPATNYL-IPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           +++ A   L +P D S   C A A     + ++IIGN QQ+  RV+YD   S+IGFA   
Sbjct: 418 LNVDAVGILYMPKDGS-QVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFARED 476

Query: 479 CA 480
           C+
Sbjct: 477 CS 478


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  248 bits (632), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 156/405 (38%), Positives = 216/405 (53%), Gaps = 40/405 (9%)

Query: 106 AVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQ 165
            V  P  +R+    +G   +++ SG++ GSGEYF  + VGTPP++  ++LDTGSD+ WIQ
Sbjct: 165 VVAAPAASRTTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQ 224

Query: 166 CAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG----CNRRN-TCLYQVSYGD 220
           C PC  C+ Q+ P +DP  S SF  + C  P C+ + +      C   N +C Y   YGD
Sbjct: 225 CVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGD 284

Query: 221 GSITVGDFSTETLTFRGT---------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           GS T GDF+ ET T   T          V  V  GCGH N GLF  AAGLLGLG+G LSF
Sbjct: 285 GSNTTGDFALETFTVNLTTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSF 344

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSAVSRTARFTPLLANPKL--------- 321
            +Q    + + FSYCLVDR+++A  SS ++FG+           LL++P L         
Sbjct: 345 ASQMQSLYGQSFSYCLVDRNSNASVSSKLIFGEDK--------ELLSHPNLNFTSFGGGK 396

Query: 322 ----DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
               DTFYYV++  + V    V  I    + L   G GG IIDSGT++T    PAY  ++
Sbjct: 397 DGSVDTFYYVQIKSVMVDD-EVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIK 455

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSS 436
           +AF       +          C+++SG  ++++P   + F    V + P  NY I +D  
Sbjct: 456 EAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPE 515

Query: 437 GTFCFAFAGT-MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              C A  G   S LSIIGN QQQ F ++YD+  SR+G+AP  CA
Sbjct: 516 -VVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCA 559


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  247 bits (631), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 141/366 (38%), Positives = 208/366 (56%), Gaps = 25/366 (6%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SG+   +  Y   + +G   R + +++DTGSD+ W+QC PC+ CY+Q DP+F+P+ S S+
Sbjct: 58  SGVRLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSY 115

Query: 189 ATVPCRSPLCRKLDSSGCN------RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
            T+ C S  C+ L  +  N         TC Y V+YGDGS T GD   E L    T V+ 
Sbjct: 116 QTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSN 175

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
              GCG +N+GLF  A+GL+GLG+  LS  +QT   F   FSYCL   +  A  S ++ G
Sbjct: 176 FIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLILGG 235

Query: 303 DSAVSRTA---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           +S+V +      +T ++ANP+L TFY++ L GIS+GG  ++   A  ++       G++I
Sbjct: 236 NSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQ---APNYR-----QSGILI 287

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           DSGT +TRL  P Y  L+  F    S    AP FS+ DTCF+L+G  EV +PT+ + F G
Sbjct: 288 DSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEG 347

Query: 420 -ADVSLPATN--YLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGF 474
            A++++  T   Y +  D+S   C A A       + IIGN QQ+  RV+Y+   S++GF
Sbjct: 348 NAELTVDVTGIFYFVKTDAS-QVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGF 406

Query: 475 APRGCA 480
           A   C+
Sbjct: 407 AAEACS 412


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 168/434 (38%), Positives = 229/434 (52%), Gaps = 36/434 (8%)

Query: 65  LSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTA-FAESAVRVPPRNRSR---GR 118
           L L LHH  S  S    P  L F   +  D  R+ SL A  A++     P  R+      
Sbjct: 43  LHLELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKT-----PSARATSLDAD 97

Query: 119 ANGGFSSSVIS-----GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKC 172
           A+ G + S+ S     G + G G Y TR+G+GTP     MV+DTGS + W+QC+PC   C
Sbjct: 98  ADAGLAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSC 157

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRNTCLYQVSYGDGSITVGD 227
           + Q+ PVF+P  S ++A+V C +  C  L S     S C+  N C+YQ SYGD S +VG 
Sbjct: 158 HRQSGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGY 217

Query: 228 FSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
            S +T++F  T +     GCG DNEGLF  +AGL+GL R +LS   Q        F+YCL
Sbjct: 218 LSKDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL 277

Query: 288 VDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
              S+S   S   +     S    +TP++++   D+ Y+++L G++V G  +   +++  
Sbjct: 278 PSSSSSGYLSLGSYNPGQYS----YTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYS 333

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTE 407
            L        IIDSGT +TRL    Y AL  A  A      RA  +S+ DTCF     + 
Sbjct: 334 SLP------TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFK-GQASR 386

Query: 408 VKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYD 466
           V  P V + F  GA + L A N L+ VD S T C AFA   S  +IIGN QQQ F VVYD
Sbjct: 387 VSAPAVTMSFAGGAALKLSAQNLLVDVDDS-TTCLAFAPARSA-AIIGNTQQQTFSVVYD 444

Query: 467 LAASRIGFAPRGCA 480
           + +SRIGFA  GC+
Sbjct: 445 VKSSRIGFAAGGCS 458


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 163/406 (40%), Positives = 231/406 (56%), Gaps = 30/406 (7%)

Query: 86  NLRI-QRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGV 144
           N+ I  RD  RV S+ A   S    P +  +        +  V SG + G+G+Y   +G+
Sbjct: 74  NMEIFLRDQNRVDSIHARLSSRGMFPEKQAT--------TLPVQSGASIGAGDYVVTVGL 125

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCRSPLCR---- 199
           GTP +   ++ DTGSD+ W QC PC K CY Q +P  +P+ S S+  + C S LC+    
Sbjct: 126 GTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVAS 185

Query: 200 -KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-GCGHDNEGLFVA 257
            K  S  C+  +TCLYQV YGDGS ++G F+TETLT   + V +  L GCG  N GLF  
Sbjct: 186 GKKFSQSCS-SSTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGG 244

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLA 317
           AAGLLGLGR +L+ P+QT + + + FSYCL   S+S    S+      VS++ +FTPL A
Sbjct: 245 AAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSL---GGQVSKSVKFTPLSA 301

Query: 318 NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
           +     FY +++ G+SVGG  +  I  S F      + G +IDSGT +TRL+  AY  L 
Sbjct: 302 DFDSTPFYGLDITGLSVGGRKLS-IDESAF------SAGTVIDSGTVITRLSPTAYSELS 354

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSS 436
            AF+   +       +S+FDTC+D S    V++P V + F+G  ++ +  +  L PV+  
Sbjct: 355 SAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGL 414

Query: 437 GTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              C AFAG    S  SI GN+QQ+ ++VVYD A  R+GFAP GC+
Sbjct: 415 KKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 148/360 (41%), Positives = 197/360 (54%), Gaps = 20/360 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRS 187
           SG A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA+S +
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSST 232

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALG 246
           +A V C +P C  L + GC+  + CLY V YGDGS ++G F+ +TLT       +    G
Sbjct: 233 YANVSCAAPACSDLYTRGCSGGH-CLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFG 291

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG  NEGLF  AAGLLGLGRG+ S P QT  ++   F++CL  RS+         G  A 
Sbjct: 292 CGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 351

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
               + TP+L +    TFYYV + GI VGG  +  I  S+F        G I+DSGT +T
Sbjct: 352 VGARQTTPMLTD-NGPTFYYVGMTGIRVGG-QLLSIPQSVFS-----TAGTIVDSGTVIT 404

Query: 367 RLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---D 421
           RL   AY +LR AF +   A   K+AP  SL DTC+D +G +EV +P V L F+G    D
Sbjct: 405 RLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLD 464

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           V+     Y   +      C  FA       + I+GN Q + F VVYD+    +GF+P  C
Sbjct: 465 VNASGIMYAASLSQ---VCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 168/434 (38%), Positives = 243/434 (55%), Gaps = 35/434 (8%)

Query: 63  SSLSLRLHH-----VDSLSFNRTPEHLFNLRI-QRDVLRVKSLTAFAESAVRVPPRNRSR 116
           +SLSL + H     +  ++  +  +   N+ I  RD  RV S+ A   S    P +  + 
Sbjct: 58  NSLSLEVVHRHGPCIGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPEKQAT- 116

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQ 175
                  +  V SG + G+G+Y   +G+GTP +   ++ DTGSD+ W QC PC K CY Q
Sbjct: 117 -------TLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQ 169

Query: 176 TDPVFDPAKSRSFATVPCRSPLCR-----KLDSSGCNRRNTCLYQVSYGDGSITVGDFST 230
            +P  +P+ S S+  + C S LC+     K  S  C+  +TCLYQV YGDGS ++G F+T
Sbjct: 170 KEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCS-SSTCLYQVQYGDGSYSIGFFAT 228

Query: 231 ETLTFRGTRVARVAL-GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD 289
           ETLT   + V +  L GCG  N GLF  AAGLLGLGR +L+ P+QT + + + FSYCL  
Sbjct: 229 ETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPA 288

Query: 290 RSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
            S+S    S+      VS++ +FTPL A+     FY +++ G+SVGG  +  I  S F  
Sbjct: 289 SSSSKGYLSL---GGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLS-IDESAF-- 342

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
               + G +IDSGT +TRL+  AY  L  AF+   +       +S+FDTC+D S    V+
Sbjct: 343 ----SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVR 398

Query: 410 VPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYD 466
           +P V + F+G  ++ +  +  L PV+     C AFAG    S  SI GN+QQ+ ++VVYD
Sbjct: 399 IPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYD 458

Query: 467 LAASRIGFAPRGCA 480
            A  R+GFAP GC+
Sbjct: 459 GAKGRVGFAPGGCS 472


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 155/416 (37%), Positives = 226/416 (54%), Gaps = 39/416 (9%)

Query: 85  FNLRIQR----DVLRVKSLTAFAESAVR--VPPRNRSRGRANGGFSSSVISGLAQGSGEY 138
           +N R+Q+    D LRV+S+    ++ +R  V   N    +     SS    G+   +  Y
Sbjct: 14  WNRRLQKQLISDDLRVRSM----QNRIRRVVSSHNVEASQTQIPLSS----GINLQTLNY 65

Query: 139 FTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC 198
              +G+G+    V  ++DTGSD+ W+QC PC  CY+Q  P+F P+ S S+ +V C S  C
Sbjct: 66  IVTMGLGSTNMTV--IIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTC 123

Query: 199 RKL-----DSSGCNRR-NTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE 252
           + L     ++  C    +TC Y V+YGDGS T G+   E L+F G  V+    GCG +N+
Sbjct: 124 QSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGVSVSDFVFGCGRNNK 183

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-- 310
           GLF   +GL+GLGR  LS  +QT   F   FSYCL    + A  S ++  +S+V +    
Sbjct: 184 GLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSVFKNVTP 243

Query: 311 -RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
             +T +L NP+L  FY + L GI V G  +        ++   GNGGV+IDSGT +TRL 
Sbjct: 244 ITYTRMLPNPQLSNFYILNLTGIDVDGVAL--------QVPSFGNGGVLIDSGTVITRLP 295

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATN 428
              Y AL+  F    +    AP FS+ DTCF+L+G  EV +PT+ +HF G A++ + AT 
Sbjct: 296 SSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATG 355

Query: 429 --YLIPVDSSGTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             Y++  D+S   C A A        +IIGN QQ+  RV+YD   S++GFA   C+
Sbjct: 356 TFYVVKEDAS-QVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 150/359 (41%), Positives = 207/359 (57%), Gaps = 19/359 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRS 187
           SGL+  +G Y   + +GTP     +V DTGSD  W+QC PC   CY Q +P+F P KS +
Sbjct: 156 SGLSLNTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSAT 215

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
           +A + C S  C  LD+ GC+  + CLY V YGDGS TVG ++ +TLT     V     GC
Sbjct: 216 YANISCTSSYCSDLDTRGCSGGH-CLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFRFGC 274

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF-GDSAV 306
           G  N GLF  AAGL+GLGRG+ S P Q   +++  F+YC+   +TS+    + F   +  
Sbjct: 275 GEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCI--PATSSGTGFLDFGPGAPA 332

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +  AR TP+L +    TFYYV + GI VGG H+  I A++F      + G ++DSGT +T
Sbjct: 333 AANARLTPMLVD-NGPTFYYVGMTGIKVGG-HLLSIPATVFS-----DAGALVDSGTVIT 385

Query: 367 RLTRPAYIALRDAFRAGASSL--KRAPDFSLFDTCFDLSG-KTEVKVPTVVLHFR-GADV 422
           RL   AY  LR AF  G   L  K AP FS+ DTC+DL+G +  + +P V L F+ GA +
Sbjct: 386 RLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACL 445

Query: 423 SLPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            + A+  L   D S   C AFA     + ++I+GN QQ+ + V+YDL    +GFAP  C
Sbjct: 446 DVDASGILYVADVSQA-CLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 163/406 (40%), Positives = 231/406 (56%), Gaps = 30/406 (7%)

Query: 86  NLRI-QRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGV 144
           N+ I  RD  RV S+ A   S    P +  +        +  V SG + G+G+Y   +G+
Sbjct: 26  NMEIFLRDQNRVDSIHARLSSRGMFPEKQAT--------TLPVQSGASIGAGDYVVTVGL 77

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCRSPLCR---- 199
           GTP +   ++ DTGSD+ W QC PC K CY Q +P  +P+ S S+  + C S LC+    
Sbjct: 78  GTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVAS 137

Query: 200 -KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-GCGHDNEGLFVA 257
            K  S  C+  +TCLYQV YGDGS ++G F+TETLT   + V +  L GCG  N GLF  
Sbjct: 138 GKKFSQSCS-SSTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGG 196

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLA 317
           AAGLLGLGR +L+ P+QT + + + FSYCL   S+S    S+      VS++ +FTPL A
Sbjct: 197 AAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSL---GGQVSKSVKFTPLSA 253

Query: 318 NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
           +     FY +++ G+SVGG  +  I  S F      + G +IDSGT +TRL+  AY  L 
Sbjct: 254 DFDSTPFYGLDITGLSVGGRQLS-IDESAF------SAGTVIDSGTVITRLSPTAYSELS 306

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSS 436
            AF+   +       +S+FDTC+D S    V++P V + F+G  ++ +  +  L PV+  
Sbjct: 307 SAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGL 366

Query: 437 GTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              C AFAG    S  SI GN+QQ+ ++VVYD A  R+GFAP GC+
Sbjct: 367 KKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 150/358 (41%), Positives = 199/358 (55%), Gaps = 16/358 (4%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRS 187
           SG A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA+S +
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALG 246
           +A V C +P C  L+  GC+  + CLY V YGDGS ++G F+ +TLT       +    G
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFG 289

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG  NEGLF  AAGLLGLGRG+ S P QT  ++   F++CL  RST         G  A 
Sbjct: 290 CGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAA 349

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +R    TP+L      TFYYV + GI VGG  +  I  S+F        G I+DSGT +T
Sbjct: 350 ARARLTTPMLTE-NGPTFYYVGMTGIRVGG-QLLSIPQSVFA-----TAGTIVDSGTVIT 402

Query: 367 RLTRPAYIALR--DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           RL   AY +LR   A    A   K+AP  SL DTC+D +G ++V +PTV L F+ GA + 
Sbjct: 403 RLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLD 462

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + A+  +    +S   C AFA    G  + I+GN Q + F V YD+    +GF P  C
Sbjct: 463 VDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 151/359 (42%), Positives = 203/359 (56%), Gaps = 19/359 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSF 188
           GL+ G+  Y   +G+GTPP    +V DTGSD  W+QC PC   CY Q D +FDPAKS ++
Sbjct: 155 GLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTY 214

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           A V C  P C  LD+SGCN  + CLY + YGDGS TVG F+ +TL      +     GCG
Sbjct: 215 ANVSCADPACADLDASGCNAGH-CLYGIQYGDGSYTVGFFAKDTLAVAQDAIKGFKFGCG 273

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF---GDSA 305
             N GLF   AGLLGLGRG  S   Q   ++   FSYCL   ++SA    + F     S+
Sbjct: 274 EKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCL--PASSAATGYLEFGPLSPSS 331

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
               A+ TP+L + K  TFYYV L GI VGG  +  I  S+F      N G ++DSGT +
Sbjct: 332 SGSNAKTTPMLTD-KGPTFYYVGLTGIRVGGKQLGAIPESVFS-----NSGTLVDSGTVI 385

Query: 366 TRL--TRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADV 422
           TRL  T  A ++   A    AS  K+A  +S+ DTC+D +G ++V +PTV L F+ GA +
Sbjct: 386 TRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGACL 445

Query: 423 SLPATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            L A+  +  +  S   C  FA  G    + I+GN QQ+ + V+YD++   +GFAP  C
Sbjct: 446 DLDASGIVYAISQS-QVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 158/397 (39%), Positives = 210/397 (52%), Gaps = 15/397 (3%)

Query: 90  QRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISG-------LAQGSGEYFTRL 142
           Q   LR ++L   +E  +    R   R RA    +  V++G       +A G+GEY   +
Sbjct: 38  QSSPLRSETLKTPSEIFIAAVKRGHER-RAR--LAKHVLAGDQLFETPVASGNGEYLIDI 94

Query: 143 GVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD 202
             G PP+    ++DTGSD+ W+QC PCK CY      FDP+KS S+ T+ C S  C+ L 
Sbjct: 95  SYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLGCGSNFCQDLP 154

Query: 203 SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLL 262
              C    +C Y   YGDGS T G  ST+ +T    ++  VA GCG+ N G F  A GL+
Sbjct: 155 FQSC--AASCQYDYMYGDGSSTSGALSTDDVTIGTGKIPNVAFGCGNSNLGTFAGAGGLV 212

Query: 263 GLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLD 322
           GLG+G LS  +Q G    +KFSYCLV    S K S +  GDS ++    +TP+L N    
Sbjct: 213 GLGKGPLSLVSQLGGTATKKFSYCLVPLG-STKTSPLYIGDSTLAGGVAYTPMLTNNNYP 271

Query: 323 TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA 382
           TFYY EL GISV G  V    A+ F +   G GG+I+DSGT++T L   A+  +  A +A
Sbjct: 272 TFYYAELQGISVEGKAVN-YPANTFDIAATGRGGLILDSGTTLTYLDVDAFNPMVAALKA 330

Query: 383 GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFA 442
                +    F   + CF  +G      PTVV HF GADV+L   N  I +D  GT C A
Sbjct: 331 ALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFNGADVALAPDNTFIALDFEGTTCLA 390

Query: 443 FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            A + +G SI GNIQQ    +V+DL   RIGF    C
Sbjct: 391 MASS-TGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  245 bits (625), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 187/451 (41%), Positives = 243/451 (53%), Gaps = 35/451 (7%)

Query: 61  AESSLSLRLH-HVDSLSFNRT-PEHLFNLRIQRDVLRVKSLT--AFAESAVRVP--PRNR 114
           A  S SL+LH +  +    RT  E + +L   +D +R++++   A      R P  P + 
Sbjct: 69  ASLSPSLKLHMNRRAAEGGRTRKESVLDL-ADKDAVRIETMHRRAARSGGDRTPASPSSS 127

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
            R   +    ++V SG+A GSGEY   + VGTPPR   M++DTGSD+ W+QCAPC  C+ 
Sbjct: 128 PRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFD 187

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKL----DSSGCNR--RNTCLYQVSYGDGSITVGDF 228
           Q  PVFDPA S S+  V C    C  +        C R   ++C Y   YGD S T GD 
Sbjct: 188 QVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDL 247

Query: 229 STETLTFRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRK 282
           + E+ T   T      RV  V  GCGH N GLF  AAGLLGLGRG LSF +Q    +   
Sbjct: 248 ALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHT 307

Query: 283 FSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL-------ANPKLDTFYYVELVGISVG 335
           FSYCLVD  +    S +VFG+      A   P L       A+   DTFYYV+L G+ VG
Sbjct: 308 FSYCLVDHGSDVA-SKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVG 366

Query: 336 GAHVRGITASLF--KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASSLKRAP 391
           G  +  I++  +       G+GG IIDSGT+++    PAY  +R AF  R G  S    P
Sbjct: 367 G-ELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMG-RSYPLIP 424

Query: 392 DFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSG 449
           DF +   C+++SG    +VP + L F  GA    PA NY I +D  G  C A  GT  +G
Sbjct: 425 DFPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 484

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +SIIGN QQQ F VVYDL  +R+GFAPR CA
Sbjct: 485 MSIIGNFQQQNFHVVYDLKNNRLGFAPRRCA 515


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  244 bits (624), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 146/367 (39%), Positives = 205/367 (55%), Gaps = 43/367 (11%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SG++ G+G Y   +G+G+P + + ++ DTGSD+ W +C+  +         FDP KS S+
Sbjct: 125 SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAE--------TFDPTKSTSY 176

Query: 189 ATVPCRSPLCRKLDSSGCN----RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARV 243
           A V C +PLC  + S+  N      +TC+Y + YGDGS ++G    E LT   T +    
Sbjct: 177 ANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTDIFNNF 236

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
             GCG D +GLF  AAGLLGLGR +LS  +QT  ++N+ FSYCL   S++     + FG 
Sbjct: 237 YFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTG---FLSFG- 292

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           S+ S++A+FTPL + P   +FY ++L GI+VGG  +  I  S+F        G IIDSGT
Sbjct: 293 SSQSKSAKFTPLSSGPS--SFYNLDLTGITVGGQKL-AIPLSVFS-----TAGTIIDSGT 344

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 423
            VTRL   AY ALR AFR   +S       S+ DTC+D S    +KVP +V+ F G    
Sbjct: 345 VVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGG--- 401

Query: 424 LPATNYLIPVDSSGTF--------CFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIG 473
                  + VD +G F        C AFAG       +I GN QQ+ F VVYD++  ++G
Sbjct: 402 -----VDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVG 456

Query: 474 FAPRGCA 480
           FAP  C+
Sbjct: 457 FAPASCS 463


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 146/347 (42%), Positives = 191/347 (55%), Gaps = 17/347 (4%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS 203
           +GTP      ++DTGSD+VW QC PC  C+ Q+ PVFDP+ S ++ATVPC S  C  L +
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232

Query: 204 SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL-FVAAAGLL 262
           S C   + C Y  +YGD S T G  +TET T   +++  V  GCG  NEG  F   AGL+
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLV 292

Query: 263 GLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-------VSRTARFTPL 315
           GLGRG LS  +Q G     KFSYCL     +   S ++ G  A        + + + TPL
Sbjct: 293 GLGRGPLSLVSQLGL---DKFSYCLTSLDDTNN-SPLLLGSLAGISEASAAASSVQTTPL 348

Query: 316 LANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIA 375
           + NP   +FYYV L  I+VG   +  + +S F +   G GGVI+DSGTS+T L    Y A
Sbjct: 349 IKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRA 407

Query: 376 LRDAFRAGASSLKRAPDFSLFDTCFDLSGK--TEVKVPTVVLHFR-GADVSLPATNYLIP 432
           L+ AF A  +           D CF    K   +V+VP +V HF  GAD+ LPA NY++ 
Sbjct: 408 LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVL 467

Query: 433 VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              SG  C    G+  GLSIIGN QQQ F+ VYD+    + FAP  C
Sbjct: 468 DGGSGALCLTVMGSR-GLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 157/389 (40%), Positives = 208/389 (53%), Gaps = 28/389 (7%)

Query: 114 RSRGRANGGFSSS-------------VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSD 160
           R R    GG+S+              + SG A  S  Y  +LG GTPP+  Y VLDTGS+
Sbjct: 87  RYRAMVKGGWSAGKTMVNPQEDADIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGSN 146

Query: 161 VVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNRRNTCLYQVSY 218
           + WI C PC  C S+  P F+P+KS ++  + C S  C+ L   +   N  N  L Q  Y
Sbjct: 147 IAWIPCNPCSGCSSKQQP-FEPSKSSTYNYLTCASQQCQLLRVCTKSDNSVNCSLTQ-RY 204

Query: 219 GDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRR 278
           GD S      S+ETL+    +V     GC +   GL      L+G GR  LSF +QT   
Sbjct: 205 GDQSEVDEILSSETLSVGSQQVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATL 264

Query: 279 FNRKFSYCLVDRSTSAKPSSMVFGDSAVS-RTARFTPLLANPKLDTFYYVELVGISVGGA 337
           ++  FSYCL    +SA   S++ G  A+S +  +FTPLL+N +  +FYYV L GISVG  
Sbjct: 265 YDSTFSYCLPSLFSSAFTGSLLLGKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEE 324

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD 397
            V  I A    LD +   G IIDSGT +TRL  PAY A+RD+FR+  S+L  A    LFD
Sbjct: 325 LV-SIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFD 383

Query: 398 TCFDL-SGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGT-FCFAF----AGTMSGL 450
           TC++  SG  +V+ P + LHF    D++LP  N L P +  G+  C AF     G    L
Sbjct: 384 TCYNRPSG--DVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVL 441

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           S  GN QQQ  R+V+D+A SR+G A   C
Sbjct: 442 STFGNYQQQKLRIVHDVAESRLGIASENC 470


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 149/418 (35%), Positives = 219/418 (52%), Gaps = 35/418 (8%)

Query: 82  EHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTR 141
           E +F  RI  D + V SL +  +SA+  P +      +       + SG    +  Y   
Sbjct: 94  EKIFQNRIILDAINVNSLFSHFKSAI-FPGQTHQLSDS----QIPISSGARLQTLNYIVT 148

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           +G+G   +   +++DTGSD+ W+QC PC+ CY+Q +P+F+P+ S SF ++PC SP C  L
Sbjct: 149 VGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVAL 206

Query: 202 D----SSG-CNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
                SSG C+ +N+  C YQ+ YGDGS + G+   E LT   T +     GCG +N+GL
Sbjct: 207 QPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGL 266

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
           F  A+GL+GL R  LS  +QT   F   FSYCL      +   S+  G +  S     +P
Sbjct: 267 FGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS-GSLTLGGADFSNFKNISP 325

Query: 315 -----LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV--IIDSGTSVTR 367
                ++ NP++  FY++ L GIS+GG ++     S        N GV  ++DSGT +TR
Sbjct: 326 ISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS-------SNEGVLSLLDSGTVITR 378

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD---VSL 424
           L+   Y A +  F    S  +  P FS+ +TCF+L+G  EV +PTV   F G     V +
Sbjct: 379 LSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDV 438

Query: 425 PATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
               Y +  D+S   C AFA  G      IIGN QQ+  RV+Y+   S++GFA   C+
Sbjct: 439 EGVFYFVKSDAS-QICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 178/447 (39%), Positives = 232/447 (51%), Gaps = 45/447 (10%)

Query: 64  SLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGR--ANG 121
           SL L + H  + +        F    ++D +R+ ++   A  +     R  S  R   + 
Sbjct: 73  SLKLHMTHRSAAAGETGKGSFFLDSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRALSE 132

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
              ++V SG+  GSGEY   + +GTPPR   M++DTGSD+ W+QCAPC  C+ Q+ P+FD
Sbjct: 133 RVVATVESGVPVGSGEYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFD 192

Query: 182 PAKSRSFATVPCRSPLCRKLDSSG------CN--RRNTCLYQVSYGDGSITVGDFSTETL 233
           PA S S+  V C    CR +          C   R + C Y   YGD S T GD + E  
Sbjct: 193 PAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAF 252

Query: 234 TFRGT-----RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQT-GRRFNRKFSYCL 287
           T   T     RV  VA GCGH N GLF  AAGLLGLGRG LSF +Q  G      FSYCL
Sbjct: 253 TVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCL 312

Query: 288 VDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKL-----------DTFYYVELVGISVGG 336
           V+  ++A  S ++FG            LLA+P+L           DTFYY++L  I VGG
Sbjct: 313 VEHGSAAG-SKIIFGHDDA--------LLAHPQLNYTAFAPTTDADTFYYLQLKSILVGG 363

Query: 337 AHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSL 395
             V          D    GG IIDSGT+++    PAY A+R AF    S S      F +
Sbjct: 364 EAVN------ISSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPV 417

Query: 396 FDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSII 453
              C+++SG  +V+VP + L F  GA    PA NY I ++  G  C A  GT  SG+SII
Sbjct: 418 LSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSII 477

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           GN QQQ F V+YDL  +R+GFAPR CA
Sbjct: 478 GNYQQQNFHVLYDLEHNRLGFAPRRCA 504


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 149/418 (35%), Positives = 219/418 (52%), Gaps = 35/418 (8%)

Query: 82  EHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTR 141
           E +F  RI  D + V SL +  +SA+  P +      +       + SG    +  Y   
Sbjct: 15  EKIFQNRIILDAINVNSLFSHFKSAI-FPGQTHQLSDS----QIPISSGARLQTLNYIVT 69

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           +G+G   +   +++DTGSD+ W+QC PC+ CY+Q +P+F+P+ S SF ++PC SP C  L
Sbjct: 70  VGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVAL 127

Query: 202 D----SSG-CNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
                SSG C+ +N+  C YQ+ YGDGS + G+   E LT   T +     GCG +N+GL
Sbjct: 128 QPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGL 187

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
           F  A+GL+GL R  LS  +QT   F   FSYCL      +   S+  G +  S     +P
Sbjct: 188 FGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS-GSLTLGGADFSNFKNISP 246

Query: 315 -----LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV--IIDSGTSVTR 367
                ++ NP++  FY++ L GIS+GG ++     S        N GV  ++DSGT +TR
Sbjct: 247 ISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS-------SNEGVLSLLDSGTVITR 299

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD---VSL 424
           L+   Y A +  F    S  +  P FS+ +TCF+L+G  EV +PTV   F G     V +
Sbjct: 300 LSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDV 359

Query: 425 PATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
               Y +  D+S   C AFA  G      IIGN QQ+  RV+Y+   S++GFA   C+
Sbjct: 360 EGVFYFVKSDAS-QICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 165/423 (39%), Positives = 223/423 (52%), Gaps = 31/423 (7%)

Query: 66  SLRLHHV----DSLSFNRTPEHLFNLRIQRDVLRVKSLTA--FAESAVRVPPRNRSRGRA 119
           SLR+ H+      LS +   +H  +  I+RD  RV+S+ +     SA  V     +   A
Sbjct: 64  SLRVVHMHGACSHLSSDARVDH--DEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPA 121

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDP 178
                    SG+  GSG Y   +G+GTP   + +V DTGSD+ W QC PC   CYSQ +P
Sbjct: 122 K--------SGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEP 173

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT 238
            F+P+ S ++  V C SP+C   D+  C+  N C+Y + YGD S T G  + E  T   +
Sbjct: 174 KFNPSSSSTYQNVSCSSPMCE--DAESCSASN-CVYSIGYGDKSFTQGFLAKEKFTLTNS 230

Query: 239 RVAR-VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
            V   V  GCG +N+GLF   AGLLGLG G+LS P QT   +N  FSYCL    TS    
Sbjct: 231 DVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCL-PSFTSNSTG 289

Query: 298 SMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            + FG + +S + +FTP+ + P     Y ++++GISVG   +  IT + F  +     G 
Sbjct: 290 HLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKEL-AITPNSFSTE-----GA 342

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           IIDSGT  TRL    Y  LR  F+   SS K    + LFDTC+D +G   V  PT+   F
Sbjct: 343 IIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSF 402

Query: 418 RGAD-VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
            G   V L  +   +P+  S   C AFAG     +I GN+QQ    VVYD+A  R+GFAP
Sbjct: 403 AGGTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAP 461

Query: 477 RGC 479
            GC
Sbjct: 462 NGC 464


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 165/423 (39%), Positives = 224/423 (52%), Gaps = 31/423 (7%)

Query: 66  SLRLHHV----DSLSFNRTPEHLFNLRIQRDVLRVKSLTA--FAESAVRVPPRNRSRGRA 119
           SLR+ H+      LS +   +H  +  I+RD  RV+S+ +     SA  V     +   A
Sbjct: 64  SLRVVHMHGACSHLSSDARVDH--DEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPA 121

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDP 178
                    SG+  GSG Y   +G+GTP   + +V DTGSD+ W QC PC   CYSQ +P
Sbjct: 122 K--------SGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEP 173

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT 238
            F+P+ S ++  V C SP+C   D+  C+  N C+Y + YGD S T G  + E  T   +
Sbjct: 174 KFNPSSSSTYQNVSCSSPMCE--DAESCSASN-CVYSIVYGDKSFTQGFLAKEKFTLTNS 230

Query: 239 RVAR-VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
            V   V  GCG +N+GLF   AGLLGLG G+LS P QT   +N  FSYCL    TS    
Sbjct: 231 DVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCL-PSFTSNSTG 289

Query: 298 SMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            + FG + +S + +FTP+ + P     Y ++++GISVG   +  IT + F  +     G 
Sbjct: 290 HLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKEL-AITPNSFSTE-----GA 342

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           IIDSGT  TRL    Y  LR  F+   SS K    + LFDTC+D +G   V  PT+   F
Sbjct: 343 IIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSF 402

Query: 418 RGAD-VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
            G+  V L  +   +P+  S   C AFAG     +I GN+QQ    VVYD+A  R+GFAP
Sbjct: 403 AGSTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAP 461

Query: 477 RGC 479
            GC
Sbjct: 462 NGC 464


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 158/363 (43%), Positives = 209/363 (57%), Gaps = 33/363 (9%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVP 192
           GSG YF  +G+GTP +   ++ DTGSD+ W QC PC K CY+Q + +F+P++S S+A + 
Sbjct: 149 GSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANIS 208

Query: 193 CRSPLCRKLDSSGCNRRN----TCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGC 247
           C S LC  L S+  N  N    TC+Y + YGD S ++G F  E L+   T V      GC
Sbjct: 209 CGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATDVFNDFYFGC 268

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G +N+GLF  AAGLLGLGR +LS  +QT +R+N+ FSYCL   S+S+    + FG S  S
Sbjct: 269 GQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCL--PSSSSSTGFLTFGGS-TS 325

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
           ++A FTPL       +FY ++L GISVGG  +  I+ S+F        G IIDSGT +TR
Sbjct: 326 KSASFTPLATISGGSSFYGLDLTGISVGGRKL-AISPSVFS-----TAGTIIDSGTVITR 379

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPAT 427
           L   AY AL   FR   S    AP  S+ DTCFD S    + VP + L F G  V     
Sbjct: 380 LPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVV----- 434

Query: 428 NYLIPVDSSGTF--------CFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
              + +D +G F        C AFAG    S ++I GN+QQ+   VVYD AA R+GFAP 
Sbjct: 435 ---VDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPA 491

Query: 478 GCA 480
           GC+
Sbjct: 492 GCS 494


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 150/385 (38%), Positives = 209/385 (54%), Gaps = 29/385 (7%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP--VF 180
           F S VISG + GSG+YF  L +GTPP+ + +V DTGSD++W++C+PC+ C S   P   F
Sbjct: 71  FRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNC-SHRSPGSAF 129

Query: 181 DPAKSRSFATVPCRSPLCRKL---DSSGCNR---RNTCLYQVSYGDGSITVGDFSTETLT 234
               S +++ + C SP C+ +     + CNR    + C YQ +Y D S T G FS E LT
Sbjct: 130 FARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALT 189

Query: 235 FRGT-----RVARVALGCGHDNEGL------FVAAAGLLGLGRGRLSFPTQTGRRFNRKF 283
              +     ++  ++ GCG    G       F  A G++GLGR  +SF +Q GRRF  KF
Sbjct: 190 LNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKF 249

Query: 284 SYCLVDRSTSAKPSS-MVFG---DSAVSRTA--RFTPLLANPKLDTFYYVELVGISVGGA 337
           SYCL+D + S  P+S +  G   + AVS+     FTPLL NP   TFYY+ + G+ V G 
Sbjct: 250 SYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGV 309

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD 397
            +  I  S++ +D  GNGG IIDSGT++T +T PAY  +  AF+        A     FD
Sbjct: 310 KLP-INPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFD 368

Query: 398 TCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPV-DSSGTFCFAFAGTMSGLSIIGN 455
            C ++SG T   +P +  +  G  V S P  NY I   D              G S++GN
Sbjct: 369 LCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGN 428

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           + QQGF + +D   SR+GF  RGCA
Sbjct: 429 LMQQGFLLEFDRDKSRLGFTRRGCA 453


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 170/433 (39%), Positives = 232/433 (53%), Gaps = 26/433 (6%)

Query: 62  ESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRA 119
            S L L LHH  S  S    P  + F+  +  D  R+ SL A         P    RG +
Sbjct: 38  SSGLHLTLHHPRSPCSPAPLPADVPFSAVLTHDHARIASLAARLAKTPSSRPTKLRRGSS 97

Query: 120 NGGFSSSVIS-----GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCY 173
           +   + S+ S     G + G G Y TR+G+GTP +   MV+DTGS + W+QC+PC   C+
Sbjct: 98  SSPDAESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCH 157

Query: 174 SQTDPVFDPAKSRSFATVPCRSPLCRKL-----DSSGCNRRNTCLYQVSYGDGSITVGDF 228
            Q+ PVF+P  S S+A+V C +P C  L     + S C+  N C+YQ SYGD S +VG  
Sbjct: 158 RQSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYL 217

Query: 229 STETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           S +T++F  T V     GCG DNEGLF  +AGL+GL R +LS   Q        FSYCL 
Sbjct: 218 SKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLP 277

Query: 289 DRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
             S+S+   S+   +        +TP+  +   D+ Y++++ GI+V G  +  ++AS + 
Sbjct: 278 TSSSSSGYLSIGSYNPG---QYSYTPMAKSSLDDSLYFIKMTGITVAGKPLS-VSASAYS 333

Query: 349 LDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEV 408
             P      IIDSGT +TRL    Y AL  A         RA  FS+ DTCF     + +
Sbjct: 334 SLP-----TIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQASRL 387

Query: 409 KVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDL 467
           +VP V + F  GA + L ATN L+ VDS+ T C AFA   S  +IIGN QQQ F VVYD+
Sbjct: 388 RVPQVSMAFAGGAALKLKATNLLVDVDSA-TTCLAFAPARSA-AIIGNTQQQTFSVVYDV 445

Query: 468 AASRIGFAPRGCA 480
             S+IGFA  GC+
Sbjct: 446 KNSKIGFAAGGCS 458


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 149/469 (31%), Positives = 233/469 (49%), Gaps = 58/469 (12%)

Query: 41  WPESVSVSESESSLPLPAPDAESSLSLRLHH--------VDSLSFNRTPEHLFNLRIQRD 92
           W    S   S S           S +L + H        +D     R    L N+R+Q  
Sbjct: 45  WSPKKSYEASSSCFSRSLGKGRESTTLEMKHRELCSGKTIDWGKKMRRALLLDNIRVQSL 104

Query: 93  VLRVKSLTAFAE----SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
            LR+K++T+       S  ++P                + SG+   +  Y   + +G   
Sbjct: 105 QLRIKAMTSSTTEQSVSETQIP----------------LTSGIKLETLNYIVTVELG--G 146

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           + + +++DTGSD+ W+QC PC+ CY+Q  P++DP+ S S+ TV C S  C+ L ++  N 
Sbjct: 147 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNS 206

Query: 209 ----------RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAA 258
                     + TC Y VSYGDGS T GD ++E++    T++  +  GCG +N+GLF  A
Sbjct: 207 GPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNNKGLFGGA 266

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR---FTPL 315
           +GL+GLGR  +S  +QT + FN  FSYCL      A  +     D +V + +    +TPL
Sbjct: 267 SGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPL 326

Query: 316 LANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIA 375
           + NP+L +FY + L G S+GG  ++ ++            G++IDSGT +TRL    Y A
Sbjct: 327 VQNPQLRSFYILNLTGASIGGVELKTLSFGR---------GILIDSGTVITRLPPSIYKA 377

Query: 376 LRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIP 432
           ++  F    S    AP +S+ DTCF+L+   ++ +PT+ + F G    +V +    Y + 
Sbjct: 378 VKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVK 437

Query: 433 VDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            D+S   C A A     + + IIGN QQ+  RV+YD    R+G A   C
Sbjct: 438 PDAS-LVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 140/350 (40%), Positives = 194/350 (55%), Gaps = 14/350 (4%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRS 195
           E+   +G GTP +   ++ DTGSDV WIQC PC   CY Q DP+FDP KS +++ VPC  
Sbjct: 134 EFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGL 254
           P C   D S C+   TCLY+V YGDGS + G  S ETL+   TR +   A GCG  N G 
Sbjct: 194 PQCAAADGSKCSN-GTCLYKVEYGDGSSSAGVLSHETLSLTSTRALPGFAFGCGQTNLGD 252

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
           F    GL+GLGRG+LS  +Q    F   FSYCL   +T+    ++     A +   ++T 
Sbjct: 253 FGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPASNDDVQYTA 312

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           ++      +FY+VELV I +GG ++  +  +LF  D     G  +DSGT +T L   AY 
Sbjct: 313 MVQKQDYPSFYFVELVSIDIGG-YILPVPPTLFTDD-----GTFLDSGTILTYLPPEAYT 366

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLI-P 432
           ALRD F+   +  K AP +  FDTC+D +G++ + +P V   F    V  L     LI P
Sbjct: 367 ALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFGILIFP 426

Query: 433 VDSSGTF-CFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            D++    C  F    S +  +I+GN+QQ+   V+YD+AA +IGFA   C
Sbjct: 427 DDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 147/357 (41%), Positives = 194/357 (54%), Gaps = 20/357 (5%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRSFAT 190
           A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA+S + A 
Sbjct: 180 ALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDAN 239

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGH 249
           + C +P C  L + GC+  + CLY V YGDGS ++G F+ +TLT       +    GCG 
Sbjct: 240 ISCAAPACSDLYTKGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGE 298

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            NEGLF  AAGLLGLGRG+ S P Q   ++   F++C   RS+         G S    T
Sbjct: 299 RNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSSPAVST 358

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              TP+L +  L TFYYV L GI VGG  +  I  S+F        G I+DSGT +TRL 
Sbjct: 359 KLTTPMLVDNGL-TFYYVGLTGIRVGG-KLLSIPPSVFT-----TAGTIVDSGTVITRLP 411

Query: 370 RPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVSL 424
             AY +LR AF +   A   K+AP  SL DTC+D +G ++V +PTV L F+G    DV  
Sbjct: 412 PAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDA 471

Query: 425 PATNYLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               Y   V  +   C  FA       + I+GN Q + F VVYD+    +GF+P  C
Sbjct: 472 SGIIYAASVSQA---CLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 155/385 (40%), Positives = 218/385 (56%), Gaps = 34/385 (8%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP--VFDP 182
           S V+SG + GSG+YF  L +G PP+ + ++ DTGSD+VW++C+ C+ C S   P  VF P
Sbjct: 70  SPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNC-SHHSPATVFFP 128

Query: 183 AKSRSFATVPCRSPLCRKLDSSG----CNR---RNTCLYQVSYGDGSITVGDFSTETLTF 235
             S +F+   C  P+CR +   G    CN     +TC Y+  Y DGS+T G F+ ET + 
Sbjct: 129 RHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSL 188

Query: 236 RGT-----RVARVALGCGHDNEGL------FVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           + +     ++  VA GCG    G       F  A G++GLGRG +SF +Q GRRF  KFS
Sbjct: 189 KTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFS 248

Query: 285 YCLVDRSTSAKPSS-MVFGD--SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
           YCL+D + S  P+S ++ GD   AVS+   FTPLL NP   TFYYV+L  + V GA +R 
Sbjct: 249 YCLMDYTLSPPPTSYLIIGDGGDAVSKLF-FTPLLTNPLSPTFYYVKLKSVFVNGAKLR- 306

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCF 400
           I  S++++D +GNGG ++DSGT++  L  PAY  +  A +     L  A + +  FD C 
Sbjct: 307 IDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRI-KLPNADELTPGFDLCV 365

Query: 401 DLSG--KTEVKVPTVVLHFRGADVSLPAT-NYLIPVDSSGTFCFAFAGT--MSGLSIIGN 455
           ++SG  K E  +P +   F G  V +P   NY I  +     C A        G S+IGN
Sbjct: 366 NVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKVGFSVIGN 424

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           + QQGF   +D   SR+GF+ RGCA
Sbjct: 425 LMQQGFLFEFDRDRSRLGFSRRGCA 449


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 149/367 (40%), Positives = 203/367 (55%), Gaps = 22/367 (5%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           SG Y   + +G+PP+    ++DTGSD+VWIQC PC +CYSQ+DP++DP+ S +FA   C 
Sbjct: 1   SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60

Query: 195 SPLCRKLDSSGCNRR-NTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA-----LGCG 248
           +  C+ L +SGC+    TC+Y   YGD S T GDF+ ETLT R +  +  A      GCG
Sbjct: 61  TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPSSMVFGDSAVS 307
             N G F  AAG++GLG+G++S  TQ G   N KFSYCLVD    S+K S ++FG SA +
Sbjct: 121 RLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180

Query: 308 RTARF-TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD------------PAGN 354
            +    TP++ N    T+Y+V L GISVGG  +   T ++  L                +
Sbjct: 181 GSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNS 240

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
           GG I DSGT++T L    Y  ++ AF +  S        S FD C+D+S     K P + 
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKFPALT 300

Query: 415 LHFRGADVSLPATNYLIPVDSSGTF-CFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRI 472
           L F+G   S P  NY + VD++ T  C A       GL IIGN+ QQ + VVYD   S I
Sbjct: 301 LAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRGTSTI 360

Query: 473 GFAPRGC 479
             +P  C
Sbjct: 361 SMSPAQC 367


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  241 bits (614), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 161/445 (36%), Positives = 223/445 (50%), Gaps = 39/445 (8%)

Query: 53  SLPLPAPDAESSL--SLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVP 110
           +L LPA     ++   L+L HVD+   + T   L +  I R   RV +L    +SA  +P
Sbjct: 15  TLSLPAAHCNDNVGFQLKLTHVDA-GTSYTKLQLLSRAIARSKARVAAL----QSAAVLP 69

Query: 111 PRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK 170
           P       A    ++S        SGEY   L +GTPP Y   ++DTGSD++W QCAPC 
Sbjct: 70  PVVDPITAARVLVTAS--------SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCL 121

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFST 230
            C  Q  P FD  KS ++  +PCRS  C  L S  C ++  C+YQ  YGD + T G  + 
Sbjct: 122 LCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKK-MCVYQYYYGDTASTAGVLAN 180

Query: 231 ETLTFRG-----TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
           ET TF        R   +A GCG  N G    ++G++G GRG LS  +Q G     +FSY
Sbjct: 181 ETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGP---SRFSY 237

Query: 286 CLVDRSTSAKPSSMVFG--------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGA 337
           CL     SA PS + FG        +++     + TP + NP L   Y++ L  IS+ G 
Sbjct: 238 CLTSY-LSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISL-GT 295

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-F 396
            +  I   +F ++  G GGVIIDSGTS+T L + AY A+R      A  L    D  +  
Sbjct: 296 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGL-VSAIPLPAMNDTDIGL 354

Query: 397 DTCFDL--SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIG 454
           DTCF         V VP +V HF  A+++L   NY++   ++G  C   A T  G +IIG
Sbjct: 355 DTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVG-TIIG 413

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGC 479
           N QQQ   ++YD+  S + F P  C
Sbjct: 414 NYQQQNLHLLYDIGNSFLSFVPAPC 438


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  240 bits (613), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 157/401 (39%), Positives = 212/401 (52%), Gaps = 36/401 (8%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           IQR   RV + T  +      P RNR    A+        SG A G+G Y   +G+GTP 
Sbjct: 126 IQR---RVSTTTTVSRGK---PKRNRPSLPAS--------SGSALGTGNYVVTIGLGTPA 171

Query: 149 RYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN 207
               +V DTGSD  W+QC PC   CY Q + +FDPA+S ++A + C +P C  L   GC+
Sbjct: 172 GRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAPACSDLYIKGCS 231

Query: 208 RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEGLFVAAAGLLGLGR 266
             + CLY V YGDGS ++G F+ +TLT       +    GCG  NEGL+  AAGLLGLGR
Sbjct: 232 GGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNEGLYGEAAGLLGLGR 290

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV-SRTARFTPLLANPKLDTFY 325
           G+ S P Q   ++   F++C   RS+      + FG  ++ + +A+ T  +      TFY
Sbjct: 291 GKTSLPVQAYDKYGGVFAHCFPARSSGT--GYLDFGPGSLPAVSAKLTTPMLVDNGPTFY 348

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           YV L GI VGG  +  I  S+F        G I+DSGT +TRL   AY +LR AF +  +
Sbjct: 349 YVGLTGIRVGG-KLLSIPQSVFT-----TSGTIVDSGTVITRLPPAAYSSLRSAFASAMA 402

Query: 386 S--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIPVDSSGTFC 440
               K+AP  SL DTC+D +G +EV +PTV L F+G    DV      Y   V  +   C
Sbjct: 403 ERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHASGIIYAASVSQA---C 459

Query: 441 FAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             FAG      + I+GN Q + F VVYD+    +GF P  C
Sbjct: 460 LGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 137/351 (39%), Positives = 195/351 (55%), Gaps = 13/351 (3%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           GSGEY  ++ +GTP   +  ++DTGSD+VW +C PC  C   T  ++DP+ S +++ V C
Sbjct: 38  GSGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLC 95

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
           +S LC+      CN    C Y   YGD S T G  S ET +     +  +  GCGHDN+G
Sbjct: 96  QSSLCQPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQSLPNITFGCGHDNQG 155

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA--VSRTAR 311
            F    GL+G GRG LS  +Q G     KFSYCLV R+ S+K S +  G++A   + T  
Sbjct: 156 -FDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVG 214

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRP 371
            TPL+ +   +  YY+ L GISVGG  +  I    F +   G+GG+IIDSGT++T L + 
Sbjct: 215 STPLVQSSSTN-HYYLSLEGISVGGQSL-AIPTGTFDIQSDGSGGLIIDSGTTLTFLQQT 272

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
           AY A+++A     SS+         D CF+  G +    P++  HF+GAD  +P  NYL 
Sbjct: 273 AYDAVKEAM---VSSINLPQADGQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLF 329

Query: 432 PVDSSGTFCFAFAGTMSGL---SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           P  +S   C A   T S L   +I GN+QQQ ++++YD   + + FAP  C
Sbjct: 330 PDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 152/384 (39%), Positives = 211/384 (54%), Gaps = 32/384 (8%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP--VFDP 182
           S V+SG A GSG+YF  L +G PP+ + ++ DTGSD+VW++C+ C+ C S   P  VF P
Sbjct: 71  SPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNC-SHHSPATVFFP 129

Query: 183 AKSRSFATVPCRSPLCRKLDSSG----CNR---RNTCLYQVSYGDGSITVGDFSTETLTF 235
             S +F+   C  P+CR +        CN     +TC Y+  Y DGS+T G F+ ET + 
Sbjct: 130 RHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSL 189

Query: 236 RGT-----RVARVALGCGHDNEGL------FVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           + +     R+  VA GCG    G       F  A G++GLGRG +SF +Q GRRF  KFS
Sbjct: 190 KTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFS 249

Query: 285 YCLVDRSTSAKPSSMVF---GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
           YCL+D + S  P+S +    G   +S+   FTPLL NP   TFYYV+L  + V GA +R 
Sbjct: 250 YCLMDYTLSPPPTSYLIIGNGGDGISKLF-FTPLLTNPLSPTFYYVKLKSVFVNGAKLR- 307

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD 401
           I  S++++D +GNGG ++DSGT++  L  PAY ++  A R              FD C +
Sbjct: 308 IDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVN 367

Query: 402 LSG--KTEVKVPTVVLHFRGADVSLPAT-NYLIPVDSSGTFCFAFAGT--MSGLSIIGNI 456
           +SG  K E  +P +   F G  V +P   NY I  +     C A        G S+IGN+
Sbjct: 368 VSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKVGFSVIGNL 426

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
            QQGF   +D   SR+GF+ RGCA
Sbjct: 427 MQQGFLFEFDRDRSRLGFSRRGCA 450


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 141/283 (49%), Positives = 179/283 (63%), Gaps = 14/283 (4%)

Query: 63  SSLSLRLHHVDSLSFNRTP------EHLFNLRIQRDVLRVKSLTAFAESAVRV--PPRNR 114
           S  S+ + H D+L            E     +++R+ +RV+ L    E  + +   P NR
Sbjct: 72  SPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNR 131

Query: 115 SRGRA--NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
               A  +  F   V+SG+ QGSGEYFTR+GVGTP R  YMVLDTGSDV WIQC PC++C
Sbjct: 132 YENVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCREC 191

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
           YSQ DP+F+P+ S SF+TV C S +C +LD+  C+    CLY+ SYGDGS + G F+TET
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCH-SGGCLYEASYGDGSYSTGSFATET 250

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-S 291
           LTF  T VA VA+GCGH N GLF+ AAGLLGLG G LSFP Q G +    FSYCLVDR S
Sbjct: 251 LTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRES 310

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISV 334
            S+ P  + FG  +V   + FTPL  NP L TFYY+ +  IS+
Sbjct: 311 DSSGP--LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISI 351


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 201/357 (56%), Gaps = 19/357 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSF 188
           G A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA S ++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGC 247
           A V C +P C  LD SGC+  + CLY V YGDGS ++G F+ +TLT       +    GC
Sbjct: 231 ANVSCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 289

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G  N+GLF  AAGLLGLGRG+ S P QT  ++   F++CL  RST         G    +
Sbjct: 290 GERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPAT 349

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
            T   TP+L      TFYYV + GI VGG  +  I  S+F        G I+DSGT +TR
Sbjct: 350 TT---TPMLTG-NGPTFYYVGMTGIRVGG-RLLPIAPSVFAA-----AGTIVDSGTVITR 399

Query: 368 LTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSL 424
           L   AY +LR AF A  +    ++A   SL DTC+D +G ++V +PTV L F+ GA + +
Sbjct: 400 LPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDV 459

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            A+  +  V +S   C AFAG   G  + I+GN Q + F V YD+    +GF+P  C
Sbjct: 460 DASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 159/434 (36%), Positives = 221/434 (50%), Gaps = 39/434 (8%)

Query: 69  LHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR--GRANGGFSSS 126
           L HVD+      PE +      R  +R     A A SAVR    NR+R  G+      + 
Sbjct: 35  LKHVDAGKQLSRPELI------RRAMRRSKARAAALSAVR----NRARFSGKNEQQTPAG 84

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+     G  EY   L +GTPP+ V  +LDTGSD++W QCAPC  C SQ DP+F P +S 
Sbjct: 85  VLPVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSA 144

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR---- 242
           S+  + C   LC  +    C R +TC Y+ +YGDG++TVG ++TE  TF  +        
Sbjct: 145 SYEPMRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTT 204

Query: 243 ---VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
              +  GCG  N G     +G++G GR  LS  +Q      R+FSYCL   + S + S++
Sbjct: 205 TVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLS---IRRFSYCLTSYA-SRRQSTL 260

Query: 300 VFGD-------SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           +FG         A  R  + TPLL +P+  TFYYV   G++VG   +R I  S F L P 
Sbjct: 261 LFGSLSDGVYGDATGRV-QTTPLLQSPQNPTFYYVHFTGLTVGARRLR-IPESAFALRPD 318

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFR-------AGASSLKRAPDFSLFDTCFDLSGK 405
           G+GGVI+DSGT++T L       +  AFR       A   + +    F +       S  
Sbjct: 319 GSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSST 378

Query: 406 TEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVY 465
           +++ VP +VLHF+GAD+ LP  NY++     G  C   A +    S IGN+ QQ  RV+Y
Sbjct: 379 SQMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLY 438

Query: 466 DLAASRIGFAPRGC 479
           DL A  +  AP  C
Sbjct: 439 DLEAETLSIAPARC 452


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 201/357 (56%), Gaps = 19/357 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSF 188
           G A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA S ++
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGC 247
           A V C +P C  LD SGC+  + CLY V YGDGS ++G F+ +TLT       +    GC
Sbjct: 232 ANVSCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G  N+GLF  AAGLLGLGRG+ S P QT  ++   F++CL  RST         G    +
Sbjct: 291 GERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPAT 350

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
            T   TP+L      TFYYV + GI VGG  +  I  S+F        G I+DSGT +TR
Sbjct: 351 TT---TPMLTG-NGPTFYYVGMTGIRVGG-RLLPIAPSVFAA-----AGTIVDSGTVITR 400

Query: 368 LTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSL 424
           L   AY +LR AF A  +    ++A   SL DTC+D +G ++V +PTV L F+ GA + +
Sbjct: 401 LPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDV 460

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            A+  +  V +S   C AFAG   G  + I+GN Q + F V YD+    +GF+P  C
Sbjct: 461 DASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 201/357 (56%), Gaps = 19/357 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSF 188
           G A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA S ++
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGC 247
           A V C +P C  LD SGC+  + CLY V YGDGS ++G F+ +TLT       +    GC
Sbjct: 235 ANVSCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 293

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G  N+GLF  AAGLLGLGRG+ S P QT  ++   F++CL  RST         G    +
Sbjct: 294 GERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPAT 353

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
            T   TP+L      TFYYV + GI VGG  +  I  S+F        G I+DSGT +TR
Sbjct: 354 TT---TPMLTG-NGPTFYYVGMTGIRVGG-RLLPIAPSVFAA-----AGTIVDSGTVITR 403

Query: 368 LTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSL 424
           L   AY +LR AF A  +    ++A   SL DTC+D +G ++V +PTV L F+ GA + +
Sbjct: 404 LPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDV 463

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            A+  +  V +S   C AFAG   G  + I+GN Q + F V YD+    +GF+P  C
Sbjct: 464 DASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 165/442 (37%), Positives = 235/442 (53%), Gaps = 46/442 (10%)

Query: 79  RTPEHLFNLRIQRDVLRVKSLTA-FAESAVRVPPRNRSRGRAN-----------GGFSSS 126
           RT   + +L+IQ D+ R+++L A F +S  +   + + +  ++           G   ++
Sbjct: 92  RTTHSVVDLQIQ-DLTRIQTLHARFKKSKKQRNEKVKKKITSDISLVGAPEVSPGKLIAT 150

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           + SG+  GSGEYF  + VGTPP++  ++LDTGSD+ W+QC PC  C+ Q +  +DP  S 
Sbjct: 151 LESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSA 210

Query: 187 SFATVPCRSPLCRKLDSS----GCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT--- 238
           SF  + C  P C  + S      C   N +C Y   YGD S T GDF+ ET T   T   
Sbjct: 211 SFKNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTE 270

Query: 239 ------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
                 +V  +  GCGH N GLF  A+GLLGLGRG LSF +Q    +   FSYCLVDR++
Sbjct: 271 GRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS 330

Query: 293 SAKPSS-MVFG---DSAVSRTARFTPLLANPK--LDTFYYVELVGISVGGAHVRGITASL 346
               SS ++FG   D        FT  +   +  ++TFYY+++  I VGG  +  I    
Sbjct: 331 DTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEAL-DIPEET 389

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA----PDFSLFDTCFDL 402
           + + P G GG IIDSGT+++    PAY  +++ F   A  +K       DF + D CF++
Sbjct: 390 WNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKF---AEKMKENYLVFRDFPVLDPCFNV 446

Query: 403 SGKTE--VKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQ 458
           SG  E  + +P + + F  GA  + PA N  I + S    C A  GT  S  SIIGN QQ
Sbjct: 447 SGIEENNIHLPELGIAFADGAVWNFPAENSFIWL-SEDLVCLAILGTPKSTFSIIGNYQQ 505

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
           Q F ++YD   SR+GF P  CA
Sbjct: 506 QNFHILYDTKMSRLGFTPTKCA 527


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 148/358 (41%), Positives = 199/358 (55%), Gaps = 16/358 (4%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRS 187
           SG A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDP +S +
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSST 228

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALG 246
           +A V C +P C  L+  GC+  + CLY V YGDGS ++G F+ +TLT       +    G
Sbjct: 229 YANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFG 287

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG  NEGLF  AAGLLGLGRG+ S P QT  ++   F++CL  RST         G  A 
Sbjct: 288 CGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAA 347

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     TP+L +    TFYY+ + GI VGG  +  I  S+F        G I+DSGT +T
Sbjct: 348 ASARLTTPMLTD-NGPTFYYIGMTGIRVGG-QLLSIPQSVFA-----TAGTIVDSGTVIT 400

Query: 367 RLTRPAYIALR--DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           RL  PAY +LR   A    A   K+AP  SL DTC+D +G ++V +PTV L F+ GA + 
Sbjct: 401 RLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLD 460

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + A+  +    +S   C AFA    G  + I+GN Q + F V YD+    +GF P  C
Sbjct: 461 VDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 149/358 (41%), Positives = 199/358 (55%), Gaps = 16/358 (4%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRS 187
           SG A G+G Y   +G+GTP     +V DTGSD  W+QC PC   CY Q + +FDPA+S +
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALG 246
           +A V C +P C  L+  GC+  + CLY V YGDGS ++G F+ +TLT       +    G
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFG 289

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG  NEGLF  AAGLLGLGRG+ S P QT  ++   F++CL  RST         G  A 
Sbjct: 290 CGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAA 349

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     TP+L +    TFYYV + GI VGG  +  I  S+F        G I+DSGT +T
Sbjct: 350 ASARLTTPMLTD-NGPTFYYVGMTGIRVGG-QLLSIPQSVFA-----TAGTIVDSGTVIT 402

Query: 367 RLTRPAYIALR--DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           RL   AY +LR   A    A   K+AP  SL DTC+D +G ++V +PTV L F+ GA + 
Sbjct: 403 RLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLD 462

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + A+  +    +S   C AFA    G  + I+GN Q + F V YD+    +GF P  C
Sbjct: 463 VDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 163/408 (39%), Positives = 215/408 (52%), Gaps = 45/408 (11%)

Query: 91  RDVLRVKSLTA----------FAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFT 140
           +D LRV S+ A          F E   ++P +                SG+A G+G Y  
Sbjct: 94  QDQLRVDSIQARLSKISGHGIFEEMVTKLPAQ----------------SGIAIGTGNYVV 137

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCRSPLCR 199
            +G+GTP     +V DTGS + W QC PC   CY Q +  FDP KS S+  V C S  C 
Sbjct: 138 TVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSASCN 197

Query: 200 KLDSS--GCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGLF 255
            L +S  GC+  N TCLYQ+ YGD S + G F+TETLT   + V      GCG  N GLF
Sbjct: 198 LLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSDVFTNFLFGCGQSNNGLF 257

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPL 315
             AAGLLGL    +S P+QT  ++ ++FSYCL   ST +    + FG   VS+TA FTP+
Sbjct: 258 GQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCL--PSTPSSTGYLNFG-GKVSQTAGFTPI 314

Query: 316 LANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIA 375
             +P   +FY +++VGISV G+ +  I  S+F        G IIDSGT +TRL   AY A
Sbjct: 315 --SPAFSSFYGIDIVGISVAGSQLP-IDPSIFT-----TSGAIIDSGTVITRLPPTAYKA 366

Query: 376 LRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVD 434
           L++AF    S+  +     L DTC+D S  T V  P V + F+G  +V + A+  L  V+
Sbjct: 367 LKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVN 426

Query: 435 SSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                C AFA     S   I GN QQ+ + VVYD A   IGFA   C+
Sbjct: 427 GVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGACS 474


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 138/351 (39%), Positives = 196/351 (55%), Gaps = 13/351 (3%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
            YFT L +GTP   + + LDTGSD  WIQC PC  CY Q + +FDP+KS +++ + C S 
Sbjct: 133 NYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSR 192

Query: 197 LCRKLDSS---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNE 252
            C++L SS    C+    C Y+++Y D S TVG+ + +TLT   T  V     GCGH+N 
Sbjct: 193 ECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAVPGFVFGCGHNNA 252

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G F    GLLGLGRG+ S  +Q   R+   FSYCL    ++    S     +A    A+F
Sbjct: 253 GSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLSFSGAAAAAPTNAQF 312

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           T ++A  +  +FYY+ L GI+V G  ++ +  S+F    A   G IIDSGT+ + L   A
Sbjct: 313 TEMVAG-QHPSFYYLNLTGITVAGRAIK-VPPSVF----ATAAGTIIDSGTAFSCLPPSA 366

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLI 431
           Y ALR + R+     KRAP  ++FDTC+DL+G   V++P+V L F  GA V L  +  L 
Sbjct: 367 YAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLY 426

Query: 432 PVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              +    C AF      + L ++GN QQ+   V+YD+   ++GF   GCA
Sbjct: 427 TWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGCA 477


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  238 bits (607), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 158/427 (37%), Positives = 223/427 (52%), Gaps = 24/427 (5%)

Query: 69  LHHVDSLSFNRTPEHLFNLRIQRDVLRVKS-LTAFAESAVRVPPR-----NRSRGRANGG 122
           +H V  +     P  L  LRI  D++R  S L+ F+   +    R      RS+ R    
Sbjct: 39  VHEVVGVRLQEEP--LIGLRI--DLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKL 94

Query: 123 FSS-----SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
             S     +V + +  G+GE+  ++ +GTP      +LDTGSD+ W QC PC  CY Q  
Sbjct: 95  QMSVDEVKAVEAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPT 154

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           P++DP++S +++ VPC S +C+ L    C+  N C Y  SYGD S T G  S E+ T   
Sbjct: 155 PIYDPSQSSTYSKVPCSSSMCQALPMYSCSGAN-CEYLYSYGDQSSTQGILSYESFTLTS 213

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDRSTS-AK 295
             +  +A GCG +NEG   +  G L       LS  +Q G+    KFSYCLV  + S +K
Sbjct: 214 QSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSK 273

Query: 296 PSSMVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
            S +  G +A   ++T   TPL+ +    TFYY+ L GISVGG  +  I    F L   G
Sbjct: 274 TSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGG-QLLDIADGTFDLQLDG 332

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD-LSGKTEVKVPT 412
            GGVIIDSGT+VT L +  Y  ++ A  +  +  +        D CF+  SG +    PT
Sbjct: 333 TGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFPT 392

Query: 413 VVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRI 472
           +  HF GAD +LP  NY I  DSSG  C A   + +G+SI GNIQQQ ++++YD   + +
Sbjct: 393 ITFHFEGADFNLPKENY-IYTDSSGIACLAMLPS-NGMSIFGNIQQQNYQILYDNERNVL 450

Query: 473 GFAPRGC 479
            FAP  C
Sbjct: 451 SFAPTVC 457


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score =  237 bits (605), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 163/398 (40%), Positives = 209/398 (52%), Gaps = 56/398 (14%)

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP 178
           A G   S V SG+   SGEYF  +GVGTP     +V+DTGSD+VW+QC+PC++CY+Q   
Sbjct: 67  ATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQ 126

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNT----CLYQVSYGDGSITVGDFSTETLT 234
           VFDP +S ++  VPC SP CR L   GC+        C Y V+YGDGS + GD +T+ L 
Sbjct: 127 VFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLA 186

Query: 235 F-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
           F   T V  V LGCG DNEGLF +AAGLLG  R    +P++  RR+ R+ +         
Sbjct: 187 FANDTYVNNVTLGCGRDNEGLFDSAAGLLGR-RAAARYPSR--RRWPRRTA--------- 234

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA- 352
             PSS     S  S T R     A               S      RG  A      P  
Sbjct: 235 --PSS-----STASATGRRAQRAARTSCSAARRSRRPRRSPPCCRTRGARACTTWTWPGS 287

Query: 353 -----GNGG----------------VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP 391
                G+ G                V++DSGT+++R  R AY ALRDAF A A +     
Sbjct: 288 ASAARGSPGSRTPASRWTRRRGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRR 347

Query: 392 ---DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVD------SSGTFCF 441
              + S+FD C+DL G+     P +VLHF  GAD++LP  NY +PVD      +S   C 
Sbjct: 348 LAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCL 407

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            F     GLS+IGN+QQQGFRVV+D+   RIGFAP+GC
Sbjct: 408 GFEAADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 445


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  237 bits (605), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 146/381 (38%), Positives = 201/381 (52%), Gaps = 23/381 (6%)

Query: 114 RSRGRANGGF----SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
           RS  RAN  F    +S+  S +    G Y     VGTPP  +Y + DTGSD+VW+QC PC
Sbjct: 59  RSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPC 118

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFS 229
           ++CY+QT P+F+P+KS S+  +PC S LC  +  + C+ +N+C Y++SYGD S + GD S
Sbjct: 119 EQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLS 178

Query: 230 TETLTFRGT-----RVARVALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKF 283
            +TL+   T        ++ +GCG DN G F  A++G++GLG G +S  TQ G     KF
Sbjct: 179 VDTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKF 238

Query: 284 SYCLVD--RSTSAKPSSMVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
           SYCLV      S   S + FGD+AV        TPL+   K   FY++ L   SVG   V
Sbjct: 239 SYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRV 296

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDT 398
               +S    D    G +IIDSGT++T +    Y  L  A       L R  D    F  
Sbjct: 297 EFGGSSEGGDD---EGNIIIDSGTTLTLIPSDVYTNLESAV-VDLVKLDRVDDPNQQFSL 352

Query: 399 CFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           C+ L    E   P + +HF+GADV L + +  +P+ + G  CFAF  +    SI GN+ Q
Sbjct: 353 CYSLK-SNEYDFPIITVHFKGADVELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGNLAQ 410

Query: 459 QGFRVVYDLAASRIGFAPRGC 479
           Q   V YDL    + F P  C
Sbjct: 411 QNLLVGYDLQQKTVSFKPTDC 431


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 150/374 (40%), Positives = 204/374 (54%), Gaps = 30/374 (8%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRS 187
           G++ G+G Y   +G+GTP R + +V DTGSD+ W+QC PC    CY Q DP+F P+ S +
Sbjct: 77  GISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSST 136

Query: 188 FATVPCRSPLCRKLDSSGCNRR---NTCLYQVSYGDGSITVGDFSTETLTFRGT------ 238
           F+ V C  P C +   S C+     + C Y+V YGD S TVG    +TLT   T      
Sbjct: 137 FSAVRCGEPECPRARQS-CSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNAS 195

Query: 239 -----RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
                ++     GCG +N GLF  A GL GLGRG++S  +Q   ++   FSYCL   S++
Sbjct: 196 ENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSN 255

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
           A     +   +     ARFTP+L      +FYYV+LVGI V G  ++   +S   L PA 
Sbjct: 256 AHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIK--VSSRPALWPA- 312

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSGKTE--VK 409
             G+I+DSGT +TRL   AY ALR AF +  G    KRAP  S+ DTC+D +      V 
Sbjct: 313 --GLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVS 370

Query: 410 VPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLS--IIGNIQQQGFRVVYD 466
           +P V L F  GA +S+  +  L  V      C AFA   +G S  I+GN QQ+   VVYD
Sbjct: 371 IPAVALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYD 429

Query: 467 LAASRIGFAPRGCA 480
           +   +IGFA +GC+
Sbjct: 430 VGRQKIGFAAKGCS 443


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 157/471 (33%), Positives = 233/471 (49%), Gaps = 56/471 (11%)

Query: 25  YQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHH----VDSLSFNRT 80
           Y+   + SL T S  S  ++V  S   +++PL             HH       L   + 
Sbjct: 30  YKVLSIGSLRTKSVCSESKAVRSSSGATTVPL-------------HHRHGPCSPLPTKKM 76

Query: 81  PEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVIS-----GLAQGS 135
           P      R+ RD LR   +       V+        G+  GG   S ++     G +  +
Sbjct: 77  PS--LEDRLHRDQLRAAYIKRKFSGDVK------KDGQGAGGVEQSHVTVPTTLGTSLNT 128

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
            EY   + +G+P +   +++D+GSDV W+QC PC +C+SQ DP+FDP+ S +++   C S
Sbjct: 129 LEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSS 188

Query: 196 PLCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
             C +L  D +GC+  + C Y V Y DGS T G +S++TL      ++    GC H   G
Sbjct: 189 AACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLALGSNTISNFQFGCSHVESG 248

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFT 313
                 GL+GLG G  S  +QT   F   FSYCL    +S+   ++  G S   +    T
Sbjct: 249 FNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPSSSGFLTLGAGTSGFVK----T 304

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           P+L +  + TFY V L  I VGG  +  I  S+F      + G+++DSGT +TRL R AY
Sbjct: 305 PMLRSSPVPTFYGVRLEAIRVGGTQLS-IPTSVF------SAGMVMDSGTIITRLPRTAY 357

Query: 374 IALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPV 433
            AL  AF+AG    + AP  S+ DTCFD SG++ V++P+V L F G  V        + +
Sbjct: 358 SALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGAV--------VNL 409

Query: 434 DSSGTF---CFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           D++G     C AFA     S   I+GN+QQ+ F V+YD+    +GF    C
Sbjct: 410 DANGIILGNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 140/361 (38%), Positives = 199/361 (55%), Gaps = 19/361 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           S +    GEY   L +GTPP  +  + DTGSD++W QC PC++CY Q DP+FDP  S+++
Sbjct: 86  SDVTSNRGEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTY 145

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARV 243
               C +  C  LD S C+  N C YQ SYGD S T+G+ +++T+T   T        + 
Sbjct: 146 RDFSCDARQCSLLDQSTCS-GNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKT 204

Query: 244 ALGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV-F 301
            +GCGH+N+G F    +G++GLG G LS  +Q G     KFSYCLV  S+ A  SS + F
Sbjct: 205 VIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNF 264

Query: 302 GDSAVSR--TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           G +AV      + TPLL++  + +FY++ L  +SVG   ++   +SL      G G +II
Sbjct: 265 GSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSL----GTGEGNIII 320

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF-DTCFDLSGKTEVKVPTVVLHFR 418
           DSGT++T +    +  L  A        +RA D S F   C+  S  +++KVP +  HF 
Sbjct: 321 DSGTTLTIVPDDFFSNLSTAVGNQVEG-RRAEDPSGFLSVCY--SATSDLKVPAITAHFT 377

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GADV L   N  + V S    C AFA T SG+SI GN+ Q  F V Y++    + F P  
Sbjct: 378 GADVKLKPINTFVQV-SDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTD 436

Query: 479 C 479
           C
Sbjct: 437 C 437


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 149/389 (38%), Positives = 213/389 (54%), Gaps = 35/389 (8%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP--VFDP 182
           S ++SG + GSG+YF  + +G+PP+ + +V DTGSD+ W++C+ CK   S   P   F  
Sbjct: 70  SPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLA 129

Query: 183 AKSRSFATVPCRSPLCR---KLDSSGCNR---RNTCLYQVSYGDGSITVGDFSTETLTF- 235
             S +F+   C S LC+   + + + CN     +TC Y+  Y DGS T G FS ET T  
Sbjct: 130 RHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLN 189

Query: 236 ----RGTRVARVALGCGHDNEG------LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
               R  ++  +A GCG    G       F  A+G++GLGRG +SF +Q GRRF R FSY
Sbjct: 190 TSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSY 249

Query: 286 CLVDRSTSAKPSS-MVFGDSAVSR-----TARFTPLLANPKLDTFYYVELVGISVGGAHV 339
           CL+D + S  P+S ++ GD   ++        FTPLL NP+  TFYY+ + G+ V G  +
Sbjct: 250 CLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKL 309

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP----DFSL 395
             I  S++ LD  GNGG +IDSGT++T LT PAY  +  AF+         P      S 
Sbjct: 310 H-IDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSG 368

Query: 396 FDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSG---LS 451
           FD C +++G +  + P + L   G  + S P  NY I + S G  C A     +     S
Sbjct: 369 FDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDI-SEGIKCLAIQPVEAESGRFS 427

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +IGN+ QQGF + +D   SR+GF+ RGCA
Sbjct: 428 VIGNLMQQGFLLEFDRGKSRLGFSRRGCA 456


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 144/367 (39%), Positives = 200/367 (54%), Gaps = 28/367 (7%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SG+   S  Y   + +G   R + +++DTGSD+ W+QC PC +CY+Q DPVF+P+KS S+
Sbjct: 57  SGIRLQSLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSY 114

Query: 189 ATVPCRSPLCRKLD----SSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
            TV C S  CR L     +SG    N  TC Y V+YGDGS T G+   E L    T V  
Sbjct: 115 RTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTTVNN 174

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
              GCG  N+GLF  A+GL+GLGR  LS  +Q    F   FSYCL      A  S ++ G
Sbjct: 175 FIFGCGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASGSLVMGG 234

Query: 303 DSAVSRTA---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVI 358
           +S+V +      +T ++ NP L  FY++ L GI+VGG  V+          P+ G   +I
Sbjct: 235 NSSVYKNTTPISYTRMIHNPLL-PFYFLNLTGITVGGVEVQA---------PSFGKDRMI 284

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           IDSGT ++RL    Y AL+  F    S    AP F + D+CF+LSG  EVK+P + ++F 
Sbjct: 285 IDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFE 344

Query: 419 GA---DVSLPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIG 473
           G+   +V +    Y +  D+S   C A A       + IIGN QQ+  R++YD   S +G
Sbjct: 345 GSAELNVDVTGVFYSVKTDAS-QVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLG 403

Query: 474 FAPRGCA 480
           FA   C+
Sbjct: 404 FAEEACS 410


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 165/442 (37%), Positives = 234/442 (52%), Gaps = 46/442 (10%)

Query: 79  RTPEHLFNLRIQRDVLRVKSLTA-FAESAVRVPPRNRSRGRAN-----------GGFSSS 126
           RT   + +L+IQ D+ R+K+L A F +S  +   + R +  ++           G   ++
Sbjct: 90  RTTHSVVDLQIQ-DLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIAT 148

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           + SG+  GSGEYF  + VGTPP++  ++LDTGSD+ W+QC PC  C+ Q    +DP  S 
Sbjct: 149 LESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSA 208

Query: 187 SFATVPCRSPLCRKLDSSG----CNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT--- 238
           SF  + C  P C  + S      C   N +C Y   YGD S T GDF+ ET T   T   
Sbjct: 209 SFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTE 268

Query: 239 ------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
                 +V  +  GCGH N GLF  A+GLLGLGRG LSF +Q    +   FSYCLVDR++
Sbjct: 269 GGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS 328

Query: 293 SAKPSS-MVFG---DSAVSRTARFTPLLANPK--LDTFYYVELVGISVGGAHVRGITASL 346
           +   SS ++FG   D        FT  +   +  ++TFYY+++  I VGG  +  I    
Sbjct: 329 NTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKAL-DIPEET 387

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA----PDFSLFDTCFDL 402
           + +   G+GG IIDSGT+++    PAY  +++ F   A  +K       DF + D CF++
Sbjct: 388 WNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKF---AEKMKENYPIFRDFPVLDPCFNV 444

Query: 403 SGKTE--VKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQ 458
           SG  E  + +P + + F  G   + PA N  I + S    C A  GT  S  SIIGN QQ
Sbjct: 445 SGIEENNIHLPELGIAFVDGTVWNFPAENSFIWL-SEDLVCLAILGTPKSTFSIIGNYQQ 503

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
           Q F ++YD   SR+GF P  CA
Sbjct: 504 QNFHILYDTKRSRLGFTPTKCA 525


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 155/377 (41%), Positives = 215/377 (57%), Gaps = 24/377 (6%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           S+V SG   G+GEYF  + VG PPR+  +++DTGSD+ W+QC PCK C+ Q+ PVFDP++
Sbjct: 74  STVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQ 133

Query: 185 SRSFATVPCRSPLCRKL------DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--- 235
           S SF  +PC +  C  +      D+S      TC Y   YGD S T GD + E+L+    
Sbjct: 134 STSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLS 193

Query: 236 ---RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQT-GRRFNRKFSYCLVDRS 291
                  +  + +GCGH N+GLF  A GLLGLG+G LSFP+Q       + FSYCLVDR+
Sbjct: 194 DHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRT 253

Query: 292 TSAKPSSMV-FGDS-AVSR---TARFTPLL-ANPKLDTFYYVELVGISVGGAHVRGITAS 345
            +   SS + FG   A+SR     +FTP +  N  ++TFYY+ + GI +    +  I A 
Sbjct: 254 NNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKI-DQELLPIPAE 312

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
            F +   G+GG IIDSGT++T L R AY A+  AF A   S  RA  F +   C++ +G+
Sbjct: 313 RFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGR 371

Query: 406 TEVKVPTVVLHFR-GADVSLPATNYLIPVD-SSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
             V  P + + F+ GA++ LP  NY I  D      C A   T  G+SIIGN QQQ    
Sbjct: 372 AAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPT-DGMSIIGNFQQQNIHF 430

Query: 464 VYDLAASRIGFAPRGCA 480
           +YD+  +R+GFA   C+
Sbjct: 431 LYDVQHARLGFANTDCS 447


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 145/368 (39%), Positives = 200/368 (54%), Gaps = 26/368 (7%)

Query: 131 LAQGSGEYFTRLGVGTPP-RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFA 189
           +A   GEY   L +GTPP RY  MV DTGSD++W QCAPC  C  Q  P F PA+S ++ 
Sbjct: 85  VAASQGEYLMDLAIGTPPLRYTAMV-DTGSDLIWTQCAPCVLCADQPTPYFRPARSATYR 143

Query: 190 TVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVA 244
            VPCRSPLC  L    C +R+ C+YQ  YGD + T G  ++ET TF         V+ VA
Sbjct: 144 LVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVA 203

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-- 302
            GCG+ N G    ++G++GLGRG LS  +Q G     +FSYCL     S +PS + FG  
Sbjct: 204 FGCGNINSGQLANSSGMVGLGRGPLSLVSQLGP---SRFSYCLTSF-LSPEPSRLNFGVF 259

Query: 303 -------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
                   S+     + TPL+ N  L + Y++ L GIS+G   +  I   +F ++  G G
Sbjct: 260 ATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLP-IDPLVFAINDDGTG 318

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDL--SGKTEVKVPT 412
           GV IDSGTS+T L + AY A+R    +    L    D  +  +TCF         V VP 
Sbjct: 319 GVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPD 378

Query: 413 VVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           + LHF  GA++++P  NY++   ++G  C A   +    +IIGN QQQ   ++YD+A S 
Sbjct: 379 MELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-TIIGNYQQQNMHILYDIANSL 437

Query: 472 IGFAPRGC 479
           + F P  C
Sbjct: 438 LSFVPAPC 445


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 157/451 (34%), Positives = 228/451 (50%), Gaps = 41/451 (9%)

Query: 58  APDAESSLS--LRLH--HVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRN 113
           +PD   + +  +RLH  HVD+     +   L    +QR   R  +L+     + RVP ++
Sbjct: 23  SPDTADAFAGDVRLHLTHVDA-GKQMSRRELIRRAMQRSKARAAALSVARSGSGRVPGKS 81

Query: 114 RSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCY 173
             +G  +       +     G  EY   L +GTPP+ V  +LDTGSD++W QCAPC  C 
Sbjct: 82  AQQGEQH---QQPGVPVRPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCL 138

Query: 174 SQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETL 233
           +Q DP+F PA S S+  + C   LC  +    C R +TC Y+ +YGDG+ T+G ++TE  
Sbjct: 139 AQPDPLFAPAASSSYVPMRCSGQLCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERF 198

Query: 234 TFRGTRVARVAL----GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD 289
           TF  +   ++++    GCG  N G     +G++G GR  LS  +Q      R+FSYCL  
Sbjct: 199 TFASSSGEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLS---IRRFSYCLTP 255

Query: 290 RSTSAKPSSMVF---------GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
             TS + S+++F         GD A +   + T LL + +  TFYYV   G++VG   +R
Sbjct: 256 Y-TSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLR 314

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS---SLKRAPDFSLFD 397
            I  S F L P G+GGVI+DSGT++T         +  AFRA      +   +PD  +  
Sbjct: 315 -IPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGV-- 371

Query: 398 TCF---------DLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMS 448
            CF           S  T V VP +  HF+GAD+ LP  NY++     G+ C   A +  
Sbjct: 372 -CFATPMAAGGRRASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGD 430

Query: 449 GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             + IGN  QQ  RV+YDL A  + FAP  C
Sbjct: 431 SGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  234 bits (598), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 145/368 (39%), Positives = 200/368 (54%), Gaps = 26/368 (7%)

Query: 131 LAQGSGEYFTRLGVGTPP-RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFA 189
           +A   GEY   L +GTPP RY  MV DTGSD++W QCAPC  C  Q  P F PA+S ++ 
Sbjct: 85  VAASQGEYLMDLAIGTPPLRYTAMV-DTGSDLIWTQCAPCVLCADQPTPYFRPARSATYR 143

Query: 190 TVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVA 244
            VPCRSPLC  L    C +R+ C+YQ  YGD + T G  ++ET TF         V+ VA
Sbjct: 144 LVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVA 203

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-- 302
            GCG+ N G    ++G++GLGRG LS  +Q G     +FSYCL     S +PS + FG  
Sbjct: 204 FGCGNINSGQLANSSGMVGLGRGPLSLVSQLGP---SRFSYCLTSF-LSPEPSRLNFGVF 259

Query: 303 -------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
                   S+     + TPL+ N  L + Y++ L GIS+G   +  I   +F ++  G G
Sbjct: 260 ATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLP-IDPLVFAINDDGTG 318

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDL--SGKTEVKVPT 412
           GV IDSGTS+T L + AY A+R    +    L    D  +  +TCF         V VP 
Sbjct: 319 GVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPD 378

Query: 413 VVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           + LHF  GA++++P  NY++   ++G  C A   +    +IIGN QQQ   ++YD+A S 
Sbjct: 379 MELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-TIIGNYQQQNMHILYDIANSL 437

Query: 472 IGFAPRGC 479
           + F P  C
Sbjct: 438 LSFVPAPC 445


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  234 bits (597), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 145/381 (38%), Positives = 199/381 (52%), Gaps = 23/381 (6%)

Query: 114 RSRGRANGGF----SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
           RS  RAN  F    +S+  S +    G Y     VGTPP  +Y + DTGSD+VW+QC PC
Sbjct: 59  RSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPC 118

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFS 229
           ++CY+QT P+F+P+KS S+  +PC S LC  +  + C+ +N+C Y++SYGD S + GD S
Sbjct: 119 EQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLS 178

Query: 230 TETLTFRGT-----RVARVALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKF 283
            +TL+   T        +  +GCG DN G F  A++G++GLG G +S  TQ G     KF
Sbjct: 179 VDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKF 238

Query: 284 SYCLVD--RSTSAKPSSMVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
           SYCLV      S   S + FGD+AV        TPL+   K   FY++ L   SVG   V
Sbjct: 239 SYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRV 296

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDT 398
               +S    D    G +IIDSGT++T +    Y  L  A       L R  D    F  
Sbjct: 297 EFGGSSEGGDD---EGNIIIDSGTTLTLIPSDVYTNLESAV-VDLVKLDRVDDPNQQFSL 352

Query: 399 CFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           C+ L    E   P +  HF+GAD+ L + +  +P+ + G  CFAF  +    SI GN+ Q
Sbjct: 353 CYSLK-SNEYDFPIITAHFKGADIELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGNLAQ 410

Query: 459 QGFRVVYDLAASRIGFAPRGC 479
           Q   V YDL    + F P  C
Sbjct: 411 QNLLVGYDLQQKTVSFKPTDC 431


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 158/463 (34%), Positives = 227/463 (49%), Gaps = 35/463 (7%)

Query: 25  YQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHH----VDSLSFNRT 80
           Y+   L SL T S  S  ++V  S   +++PL             HH       L   + 
Sbjct: 31  YKVLSLGSLRTKSVCSESKAVKSSTGAATVPL-------------HHRHGPCSPLPTKKM 77

Query: 81  PEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFT 140
           P      R+ RD LR   +            R  +        +     G +  + EY  
Sbjct: 78  PT--LEERLHRDQLRAAYIQRKFSGGGVNGSRGGAGDVQQSHATVPTTLGTSLDTLEYLI 135

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
            + +G+P +   M++DTGSDV W+QC PC +C+SQ DP+FDP+ S +++   C S  C +
Sbjct: 136 TVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQ 195

Query: 201 L--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAA 258
           L  + +GC+    C Y V+YGDGS T G +S++TL      V +   GC +   G     
Sbjct: 196 LGQEGNGCSSSQ-CQYTVTYGDGSSTTGTYSSDTLALGSNAVRKFQFGCSNVESGFNDQT 254

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLAN 318
            GL+GLG G  S  +QT   F   FSYCL   S+S+   ++  G S   +T    P+L +
Sbjct: 255 DGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATSSSSGFLTLGAGTSGFVKT----PMLRS 310

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
            ++ TFY V +  I VGG  +  I  S+F      + G I+DSGT +TRL   AY AL  
Sbjct: 311 SQVPTFYGVRIQAIRVGGRQLS-IPTSVF------SAGTIMDSGTVLTRLPPTAYSALSS 363

Query: 379 AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGT 438
           AF+AG      AP   + DTCFD SG++ V +PTV L F G  V   A++ ++   S+  
Sbjct: 364 AFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVFSGGAVVDIASDGIMLQTSNSI 423

Query: 439 FCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            C AFA     S L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 424 LCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 139/379 (36%), Positives = 196/379 (51%), Gaps = 22/379 (5%)

Query: 114 RSRGRANGGFSSSVI----SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
           RS  RAN  F  S+     S +    GEY     VGTPP  VY V+DTGSD+VW+QC PC
Sbjct: 59  RSINRANRLFKDSLSNTPESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPC 118

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFS 229
           ++CY QT P+F+P+KS S+  +PC S LC+ +  + CN++N+C Y +++ D S + G+ S
Sbjct: 119 EQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELS 178

Query: 230 TETLTFRGT-----RVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKF 283
            ETLT   T        +  +GCGH+N G+F    +G++GLG G +S  TQ       KF
Sbjct: 179 VETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKF 238

Query: 284 SYCLVDRST-SAKPSSMVFGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
           SYCL+     S K S + FGD+A VS     +          FYY+ L   SVG   +  
Sbjct: 239 SYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIE- 297

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCF 400
                  LD +  G +I+DSGT++T L    Y  L  A  A    L R  D   L + C+
Sbjct: 298 ----FEVLDDSEEGNIILDSGTTLTLLPSHVYTNLESAV-AQLVKLDRVDDPNQLLNLCY 352

Query: 401 DLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQG 460
            ++   +   P +  HF+GAD+ L   +    V + G  C AF  + +G  I GN+ Q  
Sbjct: 353 SITSD-QYDFPIITAHFKGADIKLNPISTFAHV-ADGVVCLAFTSSQTG-PIFGNLAQLN 409

Query: 461 FRVVYDLAASRIGFAPRGC 479
             V YDL  + + F P  C
Sbjct: 410 LLVGYDLQQNIVSFKPSDC 428


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 164/408 (40%), Positives = 223/408 (54%), Gaps = 33/408 (8%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGE 137
           NRT E L + +I+ D  R++ L   + S         S+  AN          +  GSGE
Sbjct: 70  NRTWESLMSEKIRGDANRLRFLKRTSRS---------SKQDANANVP------VRSGSGE 114

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  ++  GTP + +Y ++DTGSDV WI C  C+ C+S T P+FDPAKS S+    C S  
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHS-TAPIFDPAKSSSYKPFACDSQP 173

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA 257
           C+++ S  C   + C ++VSYGDG+   G  +++ +T     +   + GC          
Sbjct: 174 CQEI-SGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTSP 232

Query: 258 AAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV--SRTARFT 313
           + GL+GLG G LS  TQ  T   F   FSYCL   S+S    S+V G  A   S + +FT
Sbjct: 233 SPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL--PSSSTSSGSLVLGKEAAVSSSSLKFT 290

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
            L+ +P + TFY+V L  ISVG   +     S+   + A  GG IIDSGT++T L   AY
Sbjct: 291 TLIKDPSIPTFYFVTLKAISVGNTRI-----SVPGTNIASGGGTIIDSGTTITHLVPSAY 345

Query: 374 IALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIP 432
            ALRDAFR   SSL+  P     DTC+DLS  + V VPT+ LH  R  D+ LP  N LI 
Sbjct: 346 TALRDAFRQQLSSLQPTP-VEDMDTCYDLS-SSSVDVPTITLHLDRNVDLVLPKENILI- 402

Query: 433 VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              SG  C AF+ T S  SIIGN+QQQ +R+V+D+  S++GFA   CA
Sbjct: 403 TQESGLACLAFSSTDS-RSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 151/385 (39%), Positives = 205/385 (53%), Gaps = 29/385 (7%)

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           G   +++ SG+  GSGEYF  + VG+PP++  ++LDTGSD+ WIQC PC  C+ Q    +
Sbjct: 138 GQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFY 197

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSG----CNRRN-TCLYQVSYGDGSITVGDFSTETLTF 235
           DP  S S+  + C  P C  +        C   N +C Y   YGD S T GDF+ ET T 
Sbjct: 198 DPKASASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTV 257

Query: 236 RGT---------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
             T          V  +  GCGH N GLF  AAGLLGLGRG LSF +Q    +   FSYC
Sbjct: 258 NLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYC 317

Query: 287 LVDRSTSAKPSS-MVFG---DSAVSRTARFTPLLANPK--LDTFYYVELVGISVGGAHVR 340
           LVDR++    SS ++FG   D        FT  +A  +  +DTFYYV++  I V G  V 
Sbjct: 318 LVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAG-EVL 376

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP---DFSLFD 397
            I    + +   G GG IIDSGT+++    PAY  +++     A    + P   DF + D
Sbjct: 377 NIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG--KYPVYRDFPILD 434

Query: 398 TCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGN 455
            CF++SG   +++P + + F  GA  + P  N  I ++     C A  GT  S  SIIGN
Sbjct: 435 PCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAILGTPKSAFSIIGN 493

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
            QQQ F ++YD   SR+G+AP  CA
Sbjct: 494 YQQQNFHILYDTKRSRLGYAPTKCA 518


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 155/402 (38%), Positives = 209/402 (51%), Gaps = 29/402 (7%)

Query: 104 ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVW 163
           +  V   P   S     G   +++ SG+  GSGEYF  + VG+PP++  ++LDTGSD+ W
Sbjct: 136 KEVVTTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNW 195

Query: 164 IQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG----CNRRN-TCLYQVSY 218
           IQC PC  C+ Q    +DP  S S+  + C    C  + S      C   N +C Y   Y
Sbjct: 196 IQCLPCYDCFQQNGAFYDPKASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWY 255

Query: 219 GDGSITVGDFSTETLTFRGT---------RVARVALGCGHDNEGLFVAAAGLLGLGRGRL 269
           GD S T GDF+ ET T   T          V  +  GCGH N GLF  AAGLLGLGRG L
Sbjct: 256 GDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPL 315

Query: 270 SFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFG---DSAVSRTARFTPLLANPK--LDT 323
           SF +Q    +   FSYCLVDR++    SS ++FG   D        FT  +A  +  +DT
Sbjct: 316 SFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDT 375

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           FYYV++  I V G  V  I    + +   G GG IIDSGT+++    PAY  +++     
Sbjct: 376 FYYVQIKSILVAG-EVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEK 434

Query: 384 ASSLKRAP---DFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTF 439
           A    + P   DF + D CF++SG   V++P + + F  GA  + P  N  I ++     
Sbjct: 435 AKG--KYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNED-LV 491

Query: 440 CFAFAGT-MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           C A  GT  S  SIIGN QQQ F ++YD   SR+G+AP  CA
Sbjct: 492 CLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 154/439 (35%), Positives = 224/439 (51%), Gaps = 32/439 (7%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
           +  + + L HVD+     +   L    +QR   R  +L+A    A       R  G+ + 
Sbjct: 29  DDDVRVALKHVDA-GKQLSRSELIRRAMQRSKARAAALSAVRNRAASA----RFSGKNDD 83

Query: 122 GFSS--SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV 179
             ++  + +S    G  EY   L +GTPP+ V  +LDTGSD++W QCAPC  C +Q DP+
Sbjct: 84  QRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPL 143

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR--- 236
           F P +S S+  + C   LC  +   GC   +TC Y+ +YGDG++T+G ++TE  TF    
Sbjct: 144 FAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSG 203

Query: 237 GTRVARVAL--GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
           G R+  V L  GCG  N G     +G++G GR  LS  +Q      R+FSYCL    +  
Sbjct: 204 GDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLS---IRRFSYCLTSYGSGR 260

Query: 295 KPSSM-------VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
           K + +       V+GD+  +   + TPLL + +  TFYYV L G++VG   +R I  S F
Sbjct: 261 KSTLLFGSLSGGVYGDA--TGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLR-IPESAF 317

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR-------AGASSLKRAPDFSLFDTCF 400
            L P G+GGVI+DSGT++T L       +  AFR       A   + +    F +     
Sbjct: 318 ALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWR 377

Query: 401 DLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQG 460
             S  ++V VP +V HF+ AD+ LP  NY++     G  C   A +    S IGN+ QQ 
Sbjct: 378 RSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQD 437

Query: 461 FRVVYDLAASRIGFAPRGC 479
            RV+YDL A  + FAP  C
Sbjct: 438 MRVLYDLEAETLSFAPAQC 456


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 140/346 (40%), Positives = 192/346 (55%), Gaps = 20/346 (5%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           +G+GTP     MV+DTGS + W+QC+PC   C+ Q+ PVF+P  S ++A+V C +  C  
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60

Query: 201 LDS-----SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLF 255
           L S     S C+  N C+YQ SYGD S +VG  S +T++F  T +     GCG DNEGLF
Sbjct: 61  LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNEGLF 120

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPL 315
             +AGL+GL R +LS   Q        F+YCL   S+S   S   +     S    +TP+
Sbjct: 121 GRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNPGQYS----YTPM 176

Query: 316 LANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIA 375
           +++   D+ Y+++L G++V G  +   +++   L        IIDSGT +TRL    Y A
Sbjct: 177 VSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP------TIIDSGTVITRLPTSVYSA 230

Query: 376 LRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVD 434
           L  A  A      RA  +S+ DTCF     + V  P V + F  GA + L A N L+ VD
Sbjct: 231 LSKAVAAAMKGTSRASAYSILDTCFK-GQASRVSAPAVTMSFAGGAALKLSAQNLLVDVD 289

Query: 435 SSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            S T C AFA   S  +IIGN QQQ F VVYD+ +SRIGFA  GC+
Sbjct: 290 DSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 154/433 (35%), Positives = 222/433 (51%), Gaps = 45/433 (10%)

Query: 75  LSFNRTPEHLFNLR-IQRDVLRVKSLTAFAESAVR--VPPRNRSRGRANGGFSSSVISGL 131
           +SF+   ++ F++  I RD L+   L    ++  +  V    RS  RAN  +  S ++ +
Sbjct: 18  VSFSHAQKNGFSVELIHRDSLK-SPLYKPTQNKYQYFVDAARRSINRANHFYKYS-LANI 75

Query: 132 AQGS-----GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
            Q +     GEY     VGTPP  +Y ++DTGSD+VW+QC PC++CY+QT P+F+P+KS 
Sbjct: 76  PQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSS 135

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VA 241
           S+  +PC S LC+ ++ + CN +N C Y   YGD S + GD S +TLT   T        
Sbjct: 136 SYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFP 195

Query: 242 RVALGCGHDN----EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-----VDRST 292
            + +GCG +N    EG   A++G++G G G  SF TQ G     KFSYCL     V    
Sbjct: 196 NIVIGCGTNNILSYEG---ASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQ 252

Query: 293 SAKPSSMVFGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR--GITASLFKL 349
           S   S + FGD+A VS     T  +     +TFYY+ L   SVG   V   G+       
Sbjct: 253 SNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGV------- 305

Query: 350 DPAGN--GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFDLSGKT 406
            P G+  G +IIDSGT++T LT+  Y  L  A       L+R  D     + C+ +  + 
Sbjct: 306 -PNGDNEGNIIIDSGTTLTSLTKDDYSFLESAV-VDLVKLERVDDPTQTLNLCYSVKAEG 363

Query: 407 EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYD 466
               P + +HF+GADV L   +  + V + G FC AF  +    +I GN+ QQ   V YD
Sbjct: 364 -YDFPIITMHFKGADVDLHPISTFVSV-ADGVFCLAFESSQDH-AIFGNLAQQNLMVGYD 420

Query: 467 LAASRIGFAPRGC 479
           L    + F P  C
Sbjct: 421 LQQKIVSFKPSDC 433


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 154/375 (41%), Positives = 202/375 (53%), Gaps = 24/375 (6%)

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTD 177
            GG S     G +  S EY   LG+GTP     +++DTGSD+ W+QC PC   +CY+Q D
Sbjct: 100 GGGTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKD 159

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSS----GCNRRNT--CLYQVSYGDGSITVGDFSTE 231
           P+FDP+ S S+A+VPC S  CRKL +     GC       C Y + YG+ + T G +STE
Sbjct: 160 PLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTE 219

Query: 232 TLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
           TLT + G  VA    GCG    G +    GLLGLG    S  +QT  +F   FSYCL   
Sbjct: 220 TLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPT 279

Query: 291 STSAKPSSMVFGDSAVSRTAR----FTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
           S  A   ++   +S+ S TA     FTP+   P + TFY V L GISVGGA +  +  S 
Sbjct: 280 SGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLA-VPPSA 338

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP--DFSLFDTCFDLSG 404
           F      + G++IDSGT +T L   AY ALR AFR+  S  +  P  + ++ DTC+D +G
Sbjct: 339 F------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTG 392

Query: 405 KTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVV 464
            T V VPT+ L F G      AT   + VD  G   FA AGT   + IIGN+ Q+ F V+
Sbjct: 393 HTNVTVPTIALTFSGGATIDLATPAGVLVD--GCLAFAGAGTDDTIGIIGNVNQRTFEVL 450

Query: 465 YDLAASRIGFAPRGC 479
           YD     +GF    C
Sbjct: 451 YDSGKGTVGFRAGAC 465


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 178/442 (40%), Positives = 232/442 (52%), Gaps = 35/442 (7%)

Query: 62  ESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSL---------TAFAESAVRVP 110
            S L L LHH  S  S    P  L F+  +  D  R+  L         T+ + S++   
Sbjct: 40  SSGLHLTLHHPQSPCSPAPLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHG 99

Query: 111 PRNRSRGRANGGFSSSVISGLAQGS----GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
            R +  G   G  +SS    L  G+    G Y TRLG+GTP     MV+DTGS + W+QC
Sbjct: 100 HRKKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQC 159

Query: 167 APCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRNTCLYQVSYGD 220
           +PC   C+ Q  PVFDP  S ++A V C S  C +L +     S C+  N C+YQ SYGD
Sbjct: 160 SPCSVSCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGD 219

Query: 221 GSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN 280
            S +VG  S +T++F          GCG DNEGLF  +AGL+GL + +LS   Q      
Sbjct: 220 SSYSVGYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLG 279

Query: 281 RKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
             FSYCL   S +A   S+    S       +TP+ ++    + Y+V L GISV GA + 
Sbjct: 280 YAFSYCLPTSSAAAGYLSI---GSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPL- 335

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL-RDAFRAGASSLKRAPDFSLFDTC 399
            +  S ++  P      IIDSGT +TRL    Y AL R    A AS+  RAP +S+ DTC
Sbjct: 336 AVPPSEYRSLP-----TIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTC 390

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           F  S    ++VP V + F  GA ++L   N LI VD S T C AFA T  G +IIGN QQ
Sbjct: 391 FRGSAA-GLRVPRVDMAFAGGATLALSPGNVLIDVDDS-TTCLAFAPT-GGTAIIGNTQQ 447

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
           Q F VVYD+A SRIGFA  GC+
Sbjct: 448 QTFSVVYDVAQSRIGFAAGGCS 469


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 155/382 (40%), Positives = 205/382 (53%), Gaps = 33/382 (8%)

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQT 176
           A GG S     G +  S EY   LG+GTP     +++DTGSD+ W+QC PC   +CY+Q 
Sbjct: 152 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 211

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSS----GCNRRNT-----CLYQVSYGDGSITVGD 227
           DP+FDP+ S S+A+VPC S  CRKL +     GC   +      C Y + YG+ + T G 
Sbjct: 212 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 271

Query: 228 FSTETLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
           +STETLT + G  VA    GCG    G +    GLLGLG    S  +QT  +F   FSYC
Sbjct: 272 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 331

Query: 287 LVDRSTSAKPSSMVFGDSAVSRTA----RFTPLLANPKLDTFYYVELVGISVGGAHVRGI 342
           L   S  A   ++    ++ S TA     FTP+   P + TFY V L GISVGGA +  I
Sbjct: 332 LPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLA-I 390

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP--DFSLFDTCF 400
             S F      + G++IDSGT +T L   AY ALR AFR+  S  +  P  +  + DTC+
Sbjct: 391 PPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY 444

Query: 401 DLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQ 457
           D +G   V VPT+ L F G    D++ PA    + VD  G   FA AGT + + IIGN+ 
Sbjct: 445 DFTGHANVTVPTISLTFSGGATIDLAAPAG---VLVD--GCLAFAGAGTDNAIGIIGNVN 499

Query: 458 QQGFRVVYDLAASRIGFAPRGC 479
           Q+ F V+YD     +GF    C
Sbjct: 500 QRTFEVLYDSGKGTVGFRAGAC 521


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 155/382 (40%), Positives = 205/382 (53%), Gaps = 33/382 (8%)

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQT 176
           A GG S     G +  S EY   LG+GTP     +++DTGSD+ W+QC PC   +CY+Q 
Sbjct: 72  AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 131

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSS----GCNRRN-----TCLYQVSYGDGSITVGD 227
           DP+FDP+ S S+A+VPC S  CRKL +     GC   +      C Y + YG+ + T G 
Sbjct: 132 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 191

Query: 228 FSTETLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
           +STETLT + G  VA    GCG    G +    GLLGLG    S  +QT  +F   FSYC
Sbjct: 192 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 251

Query: 287 LVDRSTSAKPSSMVFGDSAVSRTA----RFTPLLANPKLDTFYYVELVGISVGGAHVRGI 342
           L   S  A   ++    ++ S TA     FTP+   P + TFY V L GISVGGA +  I
Sbjct: 252 LPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLA-I 310

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP--DFSLFDTCF 400
             S F      + G++IDSGT +T L   AY ALR AFR+  S  +  P  +  + DTC+
Sbjct: 311 PPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY 364

Query: 401 DLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQ 457
           D +G   V VPT+ L F G    D++ PA    + VD  G   FA AGT + + IIGN+ 
Sbjct: 365 DFTGHANVTVPTISLTFSGGATIDLAAPAG---VLVD--GCLAFAGAGTDNAIGIIGNVN 419

Query: 458 QQGFRVVYDLAASRIGFAPRGC 479
           Q+ F V+YD     +GF    C
Sbjct: 420 QRTFEVLYDSGKGTVGFRAGAC 441


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 155/382 (40%), Positives = 200/382 (52%), Gaps = 39/382 (10%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRS 187
           GLA  S EY   +G+GTPPR   ++ DTGSD+ W+QC PC    CY Q +P+FDP+KS +
Sbjct: 114 GLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSST 173

Query: 188 FATVPCRSPLCR--KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-----GTRV 240
           +  VPC +P C    +  + C    +C Y V YGD S T G  + ET T           
Sbjct: 174 YVDVPCSAPECHIGGVQQTRCG-ATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAA 232

Query: 241 ARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQTGRRFNRK---FSYCLVDRSTS 293
             V  GC H+   +F    +  AGLLGLGRG  S  +QT R  N     FSYCL  R +S
Sbjct: 233 TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSS 292

Query: 294 AKPSSMVFGDSAVSR---TARFTPLLAN-PKLDTFYYVELVGISVGGAHVRGITASLFKL 349
               ++  G +A  +      FTPL+    +L + Y V L G+SV GA V  I AS F L
Sbjct: 293 TGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVD-IPASAFSL 351

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS--LFDTCFDLSGKTE 407
                 G +IDSGT VT +   AY  LRD FR    S K  P+ S  L DTC+D++G+  
Sbjct: 352 ------GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDV 405

Query: 408 VKVPTVVLHFRGA---DVSLPATNYLIPV-DSSGT----FCFAFAGTMS-GLSIIGNIQQ 458
           V  P V L F G    DV       ++P  D SG      C AF  T S GL I+GN+QQ
Sbjct: 406 VTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQ 465

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
           + + VV+D+   RIGF P GC+
Sbjct: 466 RAYNVVFDVDGGRIGFGPNGCS 487


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 148/385 (38%), Positives = 203/385 (52%), Gaps = 28/385 (7%)

Query: 113 NRSRGRANGGFSS--SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK 170
           +RS  RAN    +  +  + + Q  GEY     VG PP  +Y ++DTGSD++W+QC PC+
Sbjct: 59  HRSVNRANHFHKAHKAAKATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCE 118

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDF 228
           KCY+QT  +FDP+KS ++  +P  S  C+ ++ + C  + R  C Y + YGDGS + GD 
Sbjct: 119 KCYNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDL 178

Query: 229 STETLTFRGT-----RVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRR---F 279
           S ETLT   T     +  R  +GCG +N   F   ++G++GLG G +S   Q  RR    
Sbjct: 179 SVETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSI 238

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAV--SRTARFTPLLA-NPKLDTFYYVELVGISVGG 336
            RKFSYCL   S S   S + FGD+AV        TP++  +PK+  FYY+ L   SVG 
Sbjct: 239 GRKFSYCLA--SMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKV--FYYLTLEAFSVGN 294

Query: 337 AHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSL 395
             +   T+S F+    GN  +IIDSGT++T L    Y  L  A  A    L R  D    
Sbjct: 295 NRIE-FTSSSFRFGEKGN--IIIDSGTTLTLLPNDIYSKLESAV-ADLVELDRVKDPLKQ 350

Query: 396 FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGN 455
              C+  S   E+  P ++ HF GADV L A N  I V+  G  C AF  +  G  I GN
Sbjct: 351 LSLCYR-STFDELNAPVIMAHFSGADVKLNAVNTFIEVE-QGVTCLAFISSKIG-PIFGN 407

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           + QQ F V YDL    + F P  C+
Sbjct: 408 MAQQNFLVGYDLQKKIVSFKPTDCS 432


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 136/348 (39%), Positives = 193/348 (55%), Gaps = 18/348 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   +G+G+P     M++DTGSDV W+QC PC +C+SQ DP+FDP+ S +++   C S 
Sbjct: 127 EYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSA 186

Query: 197 LCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C +L  + +GC+  + C Y V+YGDGS T G +S++TL    + V     GC +   G 
Sbjct: 187 DCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGF 246

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
                GL+GLG G  S  +QT     R FSYCL    +S+   ++     + +     TP
Sbjct: 247 NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTP 306

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           +L + ++ TFY V L  I VGG  +  I AS+F      + G ++DSGT +TRL   AY 
Sbjct: 307 MLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------SAGTVMDSGTVITRLPPTAYS 359

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPV 433
           AL  AF+AG      A    + DTCFD SG++ V +P+V L F  GA VSL A+  ++  
Sbjct: 360 ALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-- 417

Query: 434 DSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               + C AFAG    S L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 418 ----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 143/355 (40%), Positives = 206/355 (58%), Gaps = 21/355 (5%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           Y   + +GTPP  +  VLDTGSD++W QC APC++C+ Q  P++ PA+S ++A V CRSP
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151

Query: 197 LCRKLDS--SGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNE 252
           +C+ L S  S C+  +T C Y  SYGDG+ T G  +TET T    T V  VA GCG +N 
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPSSMVFGDSA-VSRTA 310
           G    ++GL+G+GRG LS  +Q G     +FSYC     +T+A P  +  G SA +S  A
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLG---VTRFSYCFTPFNATAASP--LFLGSSARLSSAA 266

Query: 311 RFTPLLANP-----KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           + TP + +P     +  ++YY+ L GI+VG   +  I  ++F+L P G+GGVIIDSGT+ 
Sbjct: 267 KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLP-IDPAVFRLTPMGDGGVIIDSGTTF 325

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGADVSL 424
           T L   A++AL  A  A    L  A    L    CF  +    V+VP +VLHF GAD+ L
Sbjct: 326 TALEESAFVALARAL-ASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMEL 384

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +Y++   S+G  C     +  G+S++G++QQQ   ++YDL    + F P  C
Sbjct: 385 RRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 147/364 (40%), Positives = 188/364 (51%), Gaps = 27/364 (7%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   L +GTPP+ V ++LDTGSD+VW QC PC  C+S+     DP+ S +F  +PC SP
Sbjct: 414 EYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSP 473

Query: 197 LCRKLDSSGCNRRN----TCLYQVSYGDGSITVGDFSTETLTF---RGTRVARV---ALG 246
           +C  L  S C + N    TC+Y  +Y DGSIT G    ET TF    GT  A V   A G
Sbjct: 474 VCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFG 533

Query: 247 CGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           CG  N G+F +   G+ G GRG LS P+Q        FS+C     T ++PSS++ G  A
Sbjct: 534 CGLFNNGIFTSNETGIAGFGRGALSLPSQLKV---DNFSHCFT-AITGSEPSSVLLGLPA 589

Query: 306 -----VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
                     + TPL+ N      YY+ L GI+VG   +  I  S F L   G GG IID
Sbjct: 590 NLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRL-PIPESTFALKQDGTGGTIID 648

Query: 361 SGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFDLSGKTEVK--VPTVVLHF 417
           SGT +T L + AY  + DAF A     +  A   SL   CF  S     K  VP +VLHF
Sbjct: 649 SGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHF 708

Query: 418 RGADVSLPATNYLIPVDSSG--TFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
            GA + LP  NY+   + +G    C A       L+IIGN QQQ   V+YDL  + + F 
Sbjct: 709 EGATLDLPRENYMFEFEDAGGSVTCLAI-NAGDDLTIIGNYQQQNLHVLYDLVRNMLSFV 767

Query: 476 PRGC 479
           P  C
Sbjct: 768 PAQC 771


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  227 bits (579), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 136/348 (39%), Positives = 193/348 (55%), Gaps = 18/348 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   +G+G+P     M++DTGSDV W+QC PC +C+SQ DP+FDP+ S +++   C S 
Sbjct: 197 EYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSA 256

Query: 197 LCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C +L  + +GC+  + C Y V+YGDGS T G +S++TL    + V     GC +   G 
Sbjct: 257 DCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGF 316

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
                GL+GLG G  S  +QT     R FSYCL    +S+   ++     + +     TP
Sbjct: 317 NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTP 376

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           +L + ++ TFY V L  I VGG  +  I AS+F      + G ++DSGT +TRL   AY 
Sbjct: 377 MLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------SAGTVMDSGTVITRLPPTAYS 429

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPV 433
           AL  AF+AG      A    + DTCFD SG++ V +P+V L F  GA VSL A+  ++  
Sbjct: 430 ALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-- 487

Query: 434 DSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               + C AFAG    S L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 488 ----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 151/381 (39%), Positives = 204/381 (53%), Gaps = 31/381 (8%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPV 179
           G S     G++ G+G Y   +G+GTP R + +V DTGSD+ W+QC PC    CY Q DP+
Sbjct: 138 GVSLPAERGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPL 197

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF--- 235
           F P+ S +F+ V C +  CR   S G +   + C Y+V YGD S T G    +TLT    
Sbjct: 198 FAPSDSSTFSAVRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTM 257

Query: 236 --------RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
                      ++     GCG +N GLF  A GL GLGRG++S  +Q   +F   FSYCL
Sbjct: 258 APANASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCL 317

Query: 288 VDRSTSAKPSSMVFGDSAVSRT-ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
              S+SA P  +  G    +   A+FTP+L      +FYYV+LVGI V G  +R +++  
Sbjct: 318 PSSSSSA-PGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIR-VSSPR 375

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSG 404
             L       +I+DSGT +TRL   AY ALR AF +  G    KRAP  S+ DTC+D + 
Sbjct: 376 VALP------LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTA 429

Query: 405 KTE--VKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLS--IIGNIQQQ 459
                V +P V L F  GA +S+  +  L  V      C AFA    G S  I+GN QQ+
Sbjct: 430 HANATVSIPAVALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGDGRSAGILGNTQQR 488

Query: 460 GFRVVYDLAASRIGFAPRGCA 480
              VVYD+A  +IGFA +GC+
Sbjct: 489 TLAVVYDVARQKIGFAAKGCS 509


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 192/348 (55%), Gaps = 18/348 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   +G+G+P     M++DTGSDV W+QC PC +C+SQ DP+FDP+ S +++   C S 
Sbjct: 127 EYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSA 186

Query: 197 LCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C +L  + +GC+  + C Y V+YGDGS T G +S++TL    + V     GC +   G 
Sbjct: 187 ACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVKSFQFGCSNVESGF 246

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
                GL+GLG G  S  +QT     R FSYCL    +S+   ++     + +     TP
Sbjct: 247 NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTP 306

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           +L + ++ TFY V L  I VGG  +  I AS+F      + G ++DSGT +TRL   AY 
Sbjct: 307 MLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------SAGTVMDSGTVITRLPPTAYS 359

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPV 433
           AL  AF+AG      A    + DTCFD SG++ V +P+V L F  GA VSL A+  ++  
Sbjct: 360 ALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-- 417

Query: 434 DSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               + C AFA     S L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 418 ----SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 143/355 (40%), Positives = 206/355 (58%), Gaps = 21/355 (5%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           Y   + +GTPP  +  VLDTGSD++W QC APC++C+ Q  P++ PA+S ++A V CRSP
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151

Query: 197 LCRKLDS--SGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNE 252
           +C+ L S  S C+  +T C Y  SYGDG+ T G  +TET T    T V  VA GCG +N 
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPSSMVFGDSA-VSRTA 310
           G    ++GL+G+GRG LS  +Q G     +FSYC     +T+A P  +  G SA +S  A
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLG---VTRFSYCFTPFNATAASP--LFLGSSARLSSAA 266

Query: 311 RFTPLLANP-----KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           + TP + +P     +  ++YY+ L GI+VG   +  I  ++F+L P G+GGVIIDSGT+ 
Sbjct: 267 KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLP-IDPAVFRLTPMGDGGVIIDSGTTF 325

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGADVSL 424
           T L   A++AL  A  A    L  A    L    CF  +    V+VP +VLHF GAD+ L
Sbjct: 326 TALEERAFVALARAL-ASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMEL 384

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +Y++   S+G  C     +  G+S++G++QQQ   ++YDL    + F P  C
Sbjct: 385 RRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 142/365 (38%), Positives = 193/365 (52%), Gaps = 45/365 (12%)

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK---- 200
           G+P   + +++DTGSD+ W+QC PC  CY+Q DP+FDPA S ++A V C +  C      
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 214

Query: 201 -------LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
                    S+G      C Y ++YGDGS + G  +T+T+   G  +     GCG  N G
Sbjct: 215 ATGTPGSCGSTGAGSEK-CYYALAYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRG 273

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF--GDSAVSRTAR 311
           LF   AGL+GLGR  LS  +QT  R+   FSYCL   ++     S+    GD A S    
Sbjct: 274 LFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRN 333

Query: 312 FTP-----LLANPKLDTFYYVELVGISVGGAHV--RGITASLFKLDPAGNGGVIIDSGTS 364
            TP     ++A+P    FY++ + G +VGG  +  +G+ AS           V+IDSGT 
Sbjct: 334 TTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGAS----------NVLIDSGTV 383

Query: 365 VTRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GAD 421
           +TRL    Y A+R  F  + GA+    AP FS+ DTC+DL+G  EVKVP + L    GAD
Sbjct: 384 ITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAD 443

Query: 422 VSLPATNYLIPVDSSGT-FCFAFAGTMSGLS------IIGNIQQQGFRVVYDLAASRIGF 474
           V++ A   L  V   G+  C A    M+ LS      IIGN QQ+  RVVYD   SR+GF
Sbjct: 444 VTVDAAGMLFVVRKDGSQVCLA----MASLSYEDETPIIGNYQQKNKRVVYDTLGSRLGF 499

Query: 475 APRGC 479
           A   C
Sbjct: 500 ADEDC 504


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 136/348 (39%), Positives = 193/348 (55%), Gaps = 18/348 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   +G+G+P     M++DTGSDV W+QC PC +C+SQ DP+FDP+ S +++   C S 
Sbjct: 51  EYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSA 110

Query: 197 LCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C +L  + +GC+  + C Y V+YGDGS T G +S++TL    + V     GC +   G 
Sbjct: 111 DCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGF 170

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
                GL+GLG G  S  +QT     R FSYCL    +S+   ++     + +     TP
Sbjct: 171 NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTP 230

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           +L + ++ TFY V L  I VGG  +  I AS+F      + G ++DSGT +TRL   AY 
Sbjct: 231 MLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------SAGTVMDSGTVITRLPPTAYS 283

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPV 433
           AL  AF+AG      A    + DTCFD SG++ V +P+V L F  GA VSL A+  ++  
Sbjct: 284 ALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-- 341

Query: 434 DSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               + C AFAG    S L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 342 ----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 157/456 (34%), Positives = 219/456 (48%), Gaps = 50/456 (10%)

Query: 52  SSLPLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPP 111
           ++LP+        L LRL H+   +   TP    +    R         +F  SA+  P 
Sbjct: 24  NALPIAQNGTVEYLKLRLLHIKPFT---TPSQALSFDSHR--------LSFFFSALHTP- 71

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
                        S V+SG + GSG+YF  L +GTPP+ + +V DTGSD+VW++C+ C+ 
Sbjct: 72  ---------QSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRN 122

Query: 172 CYSQTD-PVFDPAKSRSFATVPCRSPLCRKL---DSSGCNR---RNTCLYQVSYGDGSIT 224
           C   T    F    S +F+   C    C+ +       CN     + C Y+ SYGDGS T
Sbjct: 123 CTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKT 182

Query: 225 VGDFSTETLTF-----RGTRVARVALGCGHDNEGL------FVAAAGLLGLGRGRLSFPT 273
            G FS ET T      R  ++  +A GC     G       F  A G++GLGRG +S  +
Sbjct: 183 SGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSS 242

Query: 274 QTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS------RTARFTPLLANPKLDTFYYV 327
           Q G RF  KFSYCL+D   S  P+S +   S  +      R  RFTPL  NP   TFYY+
Sbjct: 243 QLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYI 302

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
            +  +SV G  +  I  S++ LD  GNGG I+DSGT++T L  PAY+ +    +      
Sbjct: 303 GIESVSVDGIKLP-INPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLP 361

Query: 388 KRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGT 446
             A     FD C ++S     ++P +     G  V S P  NY +  D     C A    
Sbjct: 362 SPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDED-VKCLALQAV 420

Query: 447 M--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           M  SG S+IGN+ QQGF + +D   +R+GF+  GCA
Sbjct: 421 MTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGCA 456


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 162/408 (39%), Positives = 221/408 (54%), Gaps = 33/408 (8%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGE 137
           NRT E L + +I+ D  R++ L   + S         S+  AN          +  GSGE
Sbjct: 70  NRTWESLMSEKIRGDANRLRFLKRTSRS---------SKEDANANVP------VRSGSGE 114

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  ++  GTP + +Y ++DTGSDV WI C  C+ C+S T P+FDPAKS S+    C S  
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHS-TAPIFDPAKSSSYKPFACDSQP 173

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA 257
           C+++ S  C   + C ++V YGDG+   G  +++ +T     +   + GC         +
Sbjct: 174 CQEI-SGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTYS 232

Query: 258 AAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV--SRTARFT 313
           + GL+GLG G LS  TQ  T   F   FSYCL   S+S    S+V G  A   S + +FT
Sbjct: 233 SPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL--PSSSTSSGSLVLGKEAAVSSSSLKFT 290

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
            L+ +P   TFY+V L  ISVG   +     S+   + A  GG IIDSGT++T L   AY
Sbjct: 291 TLIKDPSFPTFYFVTLKAISVGNTRI-----SVPATNIASGGGTIIDSGTTITYLVPSAY 345

Query: 374 IALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIP 432
             LRDAFR   SSL+  P     DTC+DLS  + V VPT+ LH  R  D+ LP  N LI 
Sbjct: 346 KDLRDAFRQQLSSLQPTP-VEDMDTCYDLSSSS-VDVPTITLHLDRNVDLVLPKENILI- 402

Query: 433 VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              SG  C AF+ T S  SIIGN+QQQ +R+V+D+  S++GFA   CA
Sbjct: 403 TQESGLSCLAFSSTDS-RSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 139/368 (37%), Positives = 193/368 (52%), Gaps = 30/368 (8%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G  EY   L VGTPP+ V  +LDTGSD++W QCAPC  C  Q DP+F P  S S+  + C
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRC 159

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-------RGTRV-ARVAL 245
              LC  +    C R +TC Y+ SYGDG+ T G ++TE  TF         T++ A +  
Sbjct: 160 AGELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGF 219

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD-- 303
           GCG  N+G     +G++G GR  LS  +Q      R+FSYCL   + S + S+++FG   
Sbjct: 220 GCGTMNKGSLNNGSGIVGFGRAPLSLVSQLA---IRRFSYCLTPYA-SGRKSTLLFGSLR 275

Query: 304 ----SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
                A + T + T LL + +  TFYYV   G++VG   +R I  S F L P G+GG I+
Sbjct: 276 GGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLR-IPISAFALRPDGSGGAIV 334

Query: 360 DSGTSVTRLTRPAYIALRDAFRAG-----ASSLKRAPDFSLFDTCFDLSGKTEVK---VP 411
           DSGT++T    P    +  AFR+      A++    PD  +   CF  +     +   VP
Sbjct: 335 DSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGV---CFAAAASRVPRPAVVP 391

Query: 412 TVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
            +V H +GAD+ LP  NY++     G  C   A +    + IGN  QQ  RV+YDL A  
Sbjct: 392 RMVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADT 451

Query: 472 IGFAPRGC 479
           + FAP  C
Sbjct: 452 LSFAPAQC 459


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 144/394 (36%), Positives = 207/394 (52%), Gaps = 35/394 (8%)

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC-YSQT 176
           R N    S +ISG + GSG+YF  + +GTPP+ + +V DTGSD+VW++C+ C+ C +   
Sbjct: 68  RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 127

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSSG---CNR---RNTCLYQVSYGDGSITVGDFST 230
              F P  S SF+   C  P CR L  +    CN     + C +  SY DGS++ G FS 
Sbjct: 128 SSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSK 187

Query: 231 ETLTFRGTRVARVAL-----GCGHDNEG------LFVAAAGLLGLGRGRLSFPTQTGRRF 279
           ET T +    + + L     GCG    G       F  A G++GLGRG +SF +Q GRRF
Sbjct: 188 ETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRF 247

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAV-------SRTARFTPLLANPKLDTFYYVELVGI 332
             KFSYCL+D + S  P+S +     +       +    +TPL  NP   TFYY+ +  I
Sbjct: 248 GNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSI 307

Query: 333 SVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD 392
           ++ G  +  I  +++++D  GNGG ++DSGT++T LT+ AY  +  + R        A  
Sbjct: 308 TIDGVKLP-INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAEL 366

Query: 393 FSLFDTCFDLSGKTEVKVPTVV-LHFR---GADVSLPATNYLIPVDSSGTFCFAFAGTMS 448
              FD C + SG  E + P++  L FR   GA  + P  NY +  +  G  C A     S
Sbjct: 367 TPGFDLCVNASG--ESRRPSLPRLRFRLGGGAVFAPPPRNYFLETE-EGVMCLAIRAVES 423

Query: 449 --GLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             G S+IGN+ QQGF + +D   SR+GF  RGC 
Sbjct: 424 GNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCG 457


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 134/361 (37%), Positives = 191/361 (52%), Gaps = 19/361 (5%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L   SGEY   + +GTPP  +  + DTGSD++W QCAPC  CY+Q DP+FDP  S ++  
Sbjct: 83  LTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKD 142

Query: 191 VPCRSPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARV 243
           V C S  C  L++  S     NTC Y +SYGD S T G+ + +TLT      R  ++  +
Sbjct: 143 VSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNI 202

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV-F 301
            +GCGH+N G F      +    G  +S   Q G   + KFSYCLV  ++    +S + F
Sbjct: 203 IIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINF 262

Query: 302 GDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           G +A+   +    TPL+A    +TFYY+ L  ISVG   ++   +       +  G +II
Sbjct: 263 GTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSE----SSEGNIII 318

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           DSGT++T L    Y  L DA  +   + K+    S    C+  +G  ++KVP + +HF G
Sbjct: 319 DSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKVPVITMHFDG 376

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           ADV L ++N  + V S    CFAF G+ S  SI GN+ Q  F V YD  +  + F P  C
Sbjct: 377 ADVKLDSSNAFVQV-SEDLVCFAFRGSPS-FSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434

Query: 480 A 480
           A
Sbjct: 435 A 435


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 146/358 (40%), Positives = 194/358 (54%), Gaps = 23/358 (6%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRS 187
           G   G+  Y   + +GTP     + +DTGSD+ W+QC PC    CYSQ DP+FDPA+S S
Sbjct: 132 GFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSS 191

Query: 188 FATVPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VA 244
           +A VPC  P+C  L   +S C+    C Y VSYGDGS T G +S++TLT       R   
Sbjct: 192 YAAVPCGGPVCGGLGIYASSCSAAQ-CGYVVSYGDGSKTTGVYSSDTLTLSPNDAVRGFF 250

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GCGH   G F    GLLGLGR   S   QT   +   FSYCL  R ++    ++     
Sbjct: 251 FGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLPTRPSTTGYLTLGGPSG 309

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A       T LL++P   T+Y V L GISVGG  +  + +S+F       GG ++D+GT 
Sbjct: 310 AAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLS-VPSSVFA------GGTVVDTGTV 362

Query: 365 VTRLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GAD 421
           +TRL   AY ALR AFR+G +S     AP   + DTC++ SG   V +P V L F  GA 
Sbjct: 363 ITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTFSGGAT 422

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           V+L A   L    S G   FA +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 423 VTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 474


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 134/361 (37%), Positives = 191/361 (52%), Gaps = 19/361 (5%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L   SGEY   + +GTPP  +  + DTGSD++W QCAPC  CY+Q DP+FDP  S ++  
Sbjct: 83  LTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKD 142

Query: 191 VPCRSPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARV 243
           V C S  C  L++  S     NTC Y +SYGD S T G+ + +TLT      R  ++  +
Sbjct: 143 VSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNI 202

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV-F 301
            +GCGH+N G F      +    G  +S   Q G   + KFSYCLV  ++    +S + F
Sbjct: 203 IIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINF 262

Query: 302 GDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           G +A+   +    TPL+A    +TFYY+ L  ISVG   ++   +       +  G +II
Sbjct: 263 GTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSE----SSEGNIII 318

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           DSGT++T L    Y  L DA  +   + K+    S    C+  +G  ++KVP + +HF G
Sbjct: 319 DSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKVPVITMHFDG 376

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           ADV L ++N  + V S    CFAF G+ S  SI GN+ Q  F V YD  +  + F P  C
Sbjct: 377 ADVKLDSSNAFVQV-SEDLVCFAFRGSPS-FSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434

Query: 480 A 480
           A
Sbjct: 435 A 435


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  225 bits (573), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 160/397 (40%), Positives = 211/397 (53%), Gaps = 22/397 (5%)

Query: 91  RDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRY 150
           +D LRVKS+ A      R   +N             V SG+  G+G Y  ++ +GTP   
Sbjct: 4   QDQLRVKSMHA------RFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLS 57

Query: 151 VYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR- 208
           + + LDTGSD+ W QC PC   CY Q    FDP KS S+  V C S  CR +  SG  R 
Sbjct: 58  LSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARG 117

Query: 209 --RNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGLFVAAAGLLGLG 265
              +TC+Y+V YGDGS +VG F+TE LT   + V +    GCG  N G F   AGLLGLG
Sbjct: 118 CVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLG 177

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
           RG+LS   QT  ++N  F+YCL   S+S+     + G   V ++ +FTPL    K   FY
Sbjct: 178 RGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQ--VPKSVKFTPLSPAFKNTPFY 235

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
            +++ G+SVGG HV  I AS+F      N G IIDSGT +TRL    Y AL   F+    
Sbjct: 236 GIDIKGLSVGG-HVLPIDASVFS-----NAGAIIDSGTVITRLQPTVYSALSSKFQQLMK 289

Query: 386 SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA 444
              +   FS+ DTC+D SG   + VP +   F+G  +V +     L  +++    C AFA
Sbjct: 290 DYPKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFA 349

Query: 445 --GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                    + GN QQQ + VV+DLA  RIGFAP GC
Sbjct: 350 PNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  224 bits (572), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 144/366 (39%), Positives = 199/366 (54%), Gaps = 26/366 (7%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           ++VIS L    GEY     VGTP   V+ +LDTGSD++W+QC PCKKCY QT P+FD +K
Sbjct: 80  TTVISAL----GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSK 135

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV- 243
           S+++ T+PC S  C+ +  + C+ R  CLY + Y DGS ++GD S ETLT   T  + V 
Sbjct: 136 SQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQ 195

Query: 244 ----ALGCGHDNE-GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
                +GCG  N  G+    +G++GLGRG +S  TQ       KFSYCLV   ++A  S 
Sbjct: 196 FPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTAS-SK 254

Query: 299 MVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVR-GITASLFKLDPAGNG 355
           + FG++AV   R    TPL +   L  FY++ L   SVG   +  G   S       G G
Sbjct: 255 LNFGNAAVVSGRGTVSTPLFSKNGL-VFYFLTLEAFSVGRNRIEFGSPGS------GGKG 307

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFDLS-GKTEVKVPTV 413
            +IIDSGT++T L    Y  L  A  A    L+R  D   +   C+ ++  K +  VP +
Sbjct: 308 NIIIDSGTTLTALPNGVYSKLEAAV-AKTVILQRVRDPNQVLGLCYKVTPDKLDASVPVI 366

Query: 414 VLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
             HF GADV+L A N  + V +    CFAF  T +G ++ GN+ QQ   V YDL  + + 
Sbjct: 367 TAHFSGADVTLNAINTFVQV-ADDVVCFAFQPTETG-AVFGNLAQQNLLVGYDLQMNTVS 424

Query: 474 FAPRGC 479
           F    C
Sbjct: 425 FKHTDC 430


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  224 bits (571), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 127/339 (37%), Positives = 197/339 (58%), Gaps = 20/339 (5%)

Query: 153 MVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR--- 208
           M+LDTGS + W+QC PC   C++Q DP++DP+ S+++  + C S  C +L ++  N    
Sbjct: 1   MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 209 ---RNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGL 264
               N CLY  SYGD S ++G  S + LT   ++ + +   GCG DN+GLF  AAG++GL
Sbjct: 61  ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGL 120

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTF 324
            R +LS   Q   ++   FSYCL   ++ +     +   S    + +FTP+L + K  + 
Sbjct: 121 ARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL 180

Query: 325 YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAG 383
           Y++ L  I+V G  +  + A+++++        +IDSGT +TRL    Y ALR AF +  
Sbjct: 181 YFLRLTAITVSGRPL-DLAAAMYRVP------TLIDSGTVITRLPMSMYAALRQAFVKIM 233

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFA 442
           ++   +AP +S+ DTCF  S K+   VP + + F+ GAD++L A + LI  D  G  C A
Sbjct: 234 STKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEAD-KGITCLA 292

Query: 443 FAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           FAG+   + ++IIGN QQQ + + YD++ SRIGFAP  C
Sbjct: 293 FAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 146/356 (41%), Positives = 194/356 (54%), Gaps = 23/356 (6%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFAT 190
           G+  Y     +GTP     M +DTGSD+ W+QC PC     CYSQ DP+FDPA+S S+A 
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAA 195

Query: 191 VPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGC 247
           VPC  P+C  L   ++       C Y VSYGDGS T G +S++TLT   +  V     GC
Sbjct: 196 VPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGC 255

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM-VFGDSAV 306
           GH   GLF    GLLGLGR + S   QT   +   FSYCL  + ++A   ++ + G S  
Sbjct: 256 GHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSGA 315

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     T LL +P   T+Y V L GISVGG  +  + AS F       GG ++D+GT +T
Sbjct: 316 APGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-VPASAFA------GGTVVDTGTVIT 368

Query: 367 RLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVS 423
           RL   AY ALR AFR+G +S     AP   + DTC++ +G   V +P V L F  GA V 
Sbjct: 369 RLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVM 428

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           L A   L    S G   FA +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 429 LGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 155/437 (35%), Positives = 225/437 (51%), Gaps = 45/437 (10%)

Query: 69  LHHVDSLSFNRTPEHL--------------FNLRIQRDVLRVKSLTAFAESAVRVPPRNR 114
             H++S   ++T  H               F+  I  D  R+  L      A R+  +++
Sbjct: 34  FQHLNSTGLHQTLHHPQSPCSPAPLSSDLPFSAFITHDAARIAGL------ASRLATKDK 87

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCY 173
               A+   S  + SG + G G Y TRLG+GTP     MV+D+GS + W+QCAPC   C+
Sbjct: 88  DWVAAS---SVPLASGASVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCH 144

Query: 174 SQTDPVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRNTCLYQVSYGDGSITVGDF 228
            Q  P++DP  S ++A VPC +P C +L +     S C+    C YQ SYGDGS + G  
Sbjct: 145 PQAGPLYDPRASSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYL 204

Query: 229 STETLTFRGT-RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
           S +T++   +        GCG DN GLF  AAGL+GL R +LS  +Q        F+YCL
Sbjct: 205 SKDTVSLSSSGSFPGFYYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCL 264

Query: 288 VDRSTSAKPSSMVFGDSAVSRTA---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITA 344
              S +A    + FG ++ ++      +T ++++    + Y+V L G+SV G+ +  + +
Sbjct: 265 -PTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPL-AVPS 322

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSG 404
           S +   P      IIDSGT +TRL  P Y AL  A  A  ++      +S+  TCF    
Sbjct: 323 SEYGSLP-----TIIDSGTVITRLPTPVYTALSKAVGAALAAPSAP-AYSILQTCFK-GQ 375

Query: 405 KTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
             ++ VP V + F  GA + L   N L+ V+ + T C AFA T S  +IIGN QQQ F V
Sbjct: 376 VAKLPVPAVNMAFAGGATLRLTPGNVLVDVNET-TTCLAFAPTDS-TAIIGNTQQQTFSV 433

Query: 464 VYDLAASRIGFAPRGCA 480
           VYD+  SRIGFA  GC+
Sbjct: 434 VYDVKGSRIGFAAGGCS 450


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 140/361 (38%), Positives = 197/361 (54%), Gaps = 20/361 (5%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L   SGEY   + +GTPP  +  + DTGSD++W QC PC  CY+Q DP+FDP  S ++  
Sbjct: 87  LTSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKD 146

Query: 191 VPCRSPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARV 243
           V C S  C  L++  S     NTC Y  SYGD S T G+ + +TLT      R  ++  +
Sbjct: 147 VSCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNI 206

Query: 244 ALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPSSMVF 301
            +GCGH+N G F    +G++GLG G +S  TQ G   + KFSYCLV   S + + S + F
Sbjct: 207 IIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINF 266

Query: 302 GDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           G +AV        TPL+A  + +TFYY+ L  ISVG   V+   +       +G G +II
Sbjct: 267 GTNAVVSGTGVVSTPLIAKSQ-ETFYYLTLKSISVGSKEVQYPGSD----SGSGEGNIII 321

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           DSGT++T L    Y  L DA  +   + K+    +    C+  +G  ++KVP + +HF G
Sbjct: 322 DSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATG--DLKVPAITMHFDG 379

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           ADV+L  +N  + + S    CFAF G+ S  SI GN+ Q  F V YD  +  + F P  C
Sbjct: 380 ADVNLKPSNCFVQI-SEDLVCFAFRGSPS-FSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 437

Query: 480 A 480
           A
Sbjct: 438 A 438


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/356 (38%), Positives = 187/356 (52%), Gaps = 33/356 (9%)

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL--- 201
           G+P   + +++DTGSD+ W+QC PC  CY+Q DP+FDPA S ++A V C +  C      
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKA 256

Query: 202 ------DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLF 255
                    G N R  C Y ++YGDGS + G  +T+T+   G  +     GCG  N GLF
Sbjct: 257 ATGTPGSCGGGNER--CYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCGLSNRGLF 314

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS----RTAR 311
              AGL+GLGR  LS  +QT  R+   FSYCL   ++     S+  G  A S        
Sbjct: 315 GGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVA 374

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHV--RGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
           +T ++A+P    FY++ + G +VGG  +  +G+ AS           V+IDSGT +TRL 
Sbjct: 375 YTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGAS----------NVLIDSGTVITRLA 424

Query: 370 RPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPA 426
              Y  +R  F  +  A+    AP FS+ DTC+DL+G  EVKVP + L    GA+V++ A
Sbjct: 425 PSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDA 484

Query: 427 TNYLIPVDSSGT-FCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              L  V   G+  C A A         IIGN QQ+  RVVYD   SR+GFA   C
Sbjct: 485 AGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 147/359 (40%), Positives = 194/359 (54%), Gaps = 26/359 (7%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRS 187
           G + G+ +Y   + +GTP     + +DTGSDV W+QC PC    CYSQ DP+FDP +S S
Sbjct: 134 GFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSS 193

Query: 188 FATVPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL 245
           ++ VPC +  C +L   S+GC+    C Y VSYGDGS T G +S++TLT  G+   +  L
Sbjct: 194 YSAVPCAAASCSQLALYSNGCSG-GQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFL 252

Query: 246 -GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GCGH  +GLF    GLLGLGR   S  +Q    +   FSYCL     S    S+     
Sbjct: 253 FGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISL----G 308

Query: 305 AVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
             S TA F  TPLL      T+Y V L GISVGG  +  I AS+F        G ++D+G
Sbjct: 309 GPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLS-IDASVFA------SGAVVDTG 361

Query: 363 TSVTRLTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           T VTRL   AY ALR AFRA  +      AP   + DTC+D +    V +PT+ + F G 
Sbjct: 362 TVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGG 421

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                 T+ ++   +SG   FA  G  S  SI+GN+QQ+ F V +D   S +GF P  C
Sbjct: 422 AAMDLGTSGIL---TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 138/368 (37%), Positives = 182/368 (49%), Gaps = 35/368 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           + EY  RL VGTP R V + LDTGSD+VW QCAPC+ C+ Q  PV DPA S ++A +PC 
Sbjct: 81  TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140

Query: 195 SPLCRKLDSSGCNRRN-----TCLYQVSYGDGSITVGDFSTETLTF-------RGTRVAR 242
           +  CR L  + C  R      +C+Y   YGD S+TVG+ +T+  TF             R
Sbjct: 141 AARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRR 200

Query: 243 VALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
           +  GCGH N+G+F +   G+ G GRGR S P+Q        FSYC      S K S +  
Sbjct: 201 LTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLN---VTSFSYCFTSMFES-KSSLVTL 256

Query: 302 GDS-------AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
           G S       A S   R TP+L NP   + Y++ L GISVG   +  +  + F+      
Sbjct: 257 GGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLP-VPETKFR------ 309

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK---VP 411
              IIDSG S+T L    Y A++  F A         + S  D CF L      +   VP
Sbjct: 310 -STIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVP 368

Query: 412 TVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           ++ LH  GAD  LP +NY+     +   C          ++IGN QQQ   VVYDL   R
Sbjct: 369 SLTLHLEGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDR 428

Query: 472 IGFAPRGC 479
           + FAP  C
Sbjct: 429 LSFAPARC 436


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 152/424 (35%), Positives = 209/424 (49%), Gaps = 36/424 (8%)

Query: 87  LRIQRDVLRVKSLTAFA--ESAVRVPPRNRSRG----RANGGFSSSVISGLAQGSGEYFT 140
           L ++ D+  V     F   E   R+  R+R+R     +  G +   V +     SGEY  
Sbjct: 30  LTMRADLTHVDKGRGFTRWERLSRMAVRSRARAASLYQRGGHYGQPVTATAVPSSGEYLI 89

Query: 141 RLGVGTP-PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR 199
              +GTP P+ V + +DTGSD+VW QC PC  C+ Q  P+FDP+ S +F  V C  P+CR
Sbjct: 90  HFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDPICR 149

Query: 200 K---LDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTR--------VARVALGC 247
               L  S C  +   C Y  SYGD SIT G    +T TF            V+ +A GC
Sbjct: 150 PSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGC 209

Query: 248 GHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV--DRSTSAKPSSMVFGDS 304
           G  N G+F +  +G+ G GRG LS P+Q   R  R FSYCL   D + S K S++  G  
Sbjct: 210 GDYNTGVFASNESGIAGFGRGPLSLPSQL--RVGR-FSYCLTSHDETESNKTSAVFLGTP 266

Query: 305 AVSRTA------RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
                A      R TP++ +P   TFYY+ L GI+VG   +  + +S+F L   G+GG +
Sbjct: 267 PNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP-VDSSVFALKKDGSGGTV 325

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT--CFDL-SGKTEVKVPTVVL 415
           IDSGT VT      +  L++ F A    L R  + S      CF    G  +V VP ++ 
Sbjct: 326 IDSGTGVTTFPAAVFEQLKNEFVAQL-PLPRYDNTSEVGNLLCFQRPKGGKQVPVPKLIF 384

Query: 416 HFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           H   AD+ LP  NY+     SG  C    G    + +IGN QQQ   +VYD+  S++ FA
Sbjct: 385 HLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFA 444

Query: 476 PRGC 479
              C
Sbjct: 445 SAQC 448


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 141/353 (39%), Positives = 191/353 (54%), Gaps = 17/353 (4%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRS 195
           E+   +G G+P +   + +DTGSDV WIQC PC   CY Q DPVFDP KS +++ VPC  
Sbjct: 160 EFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGL 254
           P C       C+   TCLY+V+YGDGS T G  S ETL+   TR +   A GCG  N G 
Sbjct: 220 PQCAAAGGK-CSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPGFAFGCGQTNLGE 278

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT---AR 311
           F    GL+GLGRG LS P+Q    F   FSYCL    T+    +M     A S      +
Sbjct: 279 FGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDTTHGYLTMGSTTPAASNDDDDVQ 338

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRP 371
           +T ++      + Y+VE+V I +GG ++  +  ++F  D     G + DSGT +T L   
Sbjct: 339 YTAMIQKEDYPSLYFVEVVSIDIGG-YILPVPPTVFTRD-----GTLFDSGTILTYLPPE 392

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSL-PATNY 429
           AY +LRD F+   +  K AP +  FDTC+D +G   + +P V   F  GA   L P    
Sbjct: 393 AYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDGAVFDLSPVAIL 452

Query: 430 LIPVDSS-GTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + P D++  T C AF    S +  +IIGN QQ+G  V+YD+AA +IGF    C
Sbjct: 453 IYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 149/382 (39%), Positives = 200/382 (52%), Gaps = 25/382 (6%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S  +  GG S     G +  +  Y   L +GTP   + + LDTGSD  W+QC PC  CY 
Sbjct: 116 SSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYE 175

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKL------DSSGCNRRNTCLYQVSYGDGSITVGDF 228
           Q DPVFDP  S +++ VPC +  C++L       +   +    C Y+VSY D S TVGD 
Sbjct: 176 QRDPVFDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDL 235

Query: 229 STETLTFRGTRVARVA-------LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNR 281
           + +TLT   +     A        GCGH N G F    GLLGLG G+ S P+Q   R+  
Sbjct: 236 ARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGA 295

Query: 282 KFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
            FSYCL    ++A    + FG +A    A+FT ++   +  T YY+ L GI V G  ++ 
Sbjct: 296 AFSYCLPSSPSAA--GYLSFGGAAARANAQFTEMVTG-QDPTSYYLNLTGIVVAGRAIK- 351

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTC 399
           + AS F    A   G IIDSGT+ +RL   AY ALR +FR+  G    KRAP   +FDTC
Sbjct: 352 VPASAF----ATAAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTC 407

Query: 400 FDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           +D +G   V++P V L F  GA V L  +  L   +     C AF      L I+GN QQ
Sbjct: 408 YDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTCLAFVPNHD-LGILGNTQQ 466

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
           +   V+YD+ + RIGF  +GCA
Sbjct: 467 RTLAVIYDVGSQRIGFGRKGCA 488


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 142/418 (33%), Positives = 221/418 (52%), Gaps = 50/418 (11%)

Query: 84  LFNLRIQRDVLRVKSLTAFAE----SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYF 139
           L N+R+Q   L++K++T+       S  ++P                + SG+   S  Y 
Sbjct: 45  LDNIRVQSLQLKIKAMTSSTTEQSVSETQIP----------------LTSGIKLESLNYI 88

Query: 140 TRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR 199
             + +G   + + +++DTGSD+ W+QC PC+ CY+Q  P++DP+ S S+ TV C S  C+
Sbjct: 89  VTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQ 146

Query: 200 KL-----DSSGCNRRNT-----CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
            L     +S  C   N      C Y VSYGDGS T GD ++E++    T++     GCG 
Sbjct: 147 DLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGR 206

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV--- 306
           +N+GLF  ++GL+GLGR  +S  +QT + FN  FSYCL      A  S     DS+V   
Sbjct: 207 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTN 266

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           S +  +TPL+ NP+L +FY + L G S+GG  ++  ++S  +       G++IDSGT +T
Sbjct: 267 STSVSYTPLVQNPQLRSFYILNLTGASIGGVELK--SSSFGR-------GILIDSGTVIT 317

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVS 423
           RL    Y A++  F    S    AP +S+ DTCF+L+   ++ +P + + F+G    +V 
Sbjct: 318 RLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVD 377

Query: 424 LPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +    Y +  D+S   C A A     + + IIGN QQ+  RV+YD    R+G     C
Sbjct: 378 VTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 147/359 (40%), Positives = 194/359 (54%), Gaps = 26/359 (7%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRS 187
           G + G+ +Y   + +GTP     + +DTGSDV W+QC PC    CYSQ DP+FDP +S S
Sbjct: 123 GFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSS 182

Query: 188 FATVPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL 245
           ++ VPC +  C +L   S+GC+    C Y VSYGDGS T G +S++TLT  G+   +  L
Sbjct: 183 YSAVPCAAASCSQLALYSNGCSG-GQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFL 241

Query: 246 -GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GCGH  +GLF    GLLGLGR   S  +Q    +   FSYCL     S    S+     
Sbjct: 242 FGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISL----G 297

Query: 305 AVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
             S TA F  TPLL      T+Y V L GISVGG  +  I AS+F        G ++D+G
Sbjct: 298 GPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLS-IDASVFA------SGAVVDTG 350

Query: 363 TSVTRLTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           T VTRL   AY ALR AFRA  +      AP   + DTC+D +    V +PT+ + F G 
Sbjct: 351 TVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGG 410

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                 T+ ++   +SG   FA  G  S  SI+GN+QQ+ F V +D   S +GF P  C
Sbjct: 411 AAMDLGTSGIL---TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 464


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 143/418 (34%), Positives = 220/418 (52%), Gaps = 50/418 (11%)

Query: 84  LFNLRIQRDVLRVKSLTAFAE----SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYF 139
           L N+R+Q   L++K++T+       S  ++P                + SG+   S  Y 
Sbjct: 93  LDNIRVQSLQLKIKAMTSSTTEQSVSETQIP----------------LTSGIKLESLNYI 136

Query: 140 TRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR 199
             + +G   + + +++DTGSD+ W+QC PC+ CY+Q  P++DP+ S S+ TV C S  C+
Sbjct: 137 VTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQ 194

Query: 200 KL-----DSSGCNRRNT-----CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
            L     +S  C   N      C Y VSYGDGS T GD ++E++    T++     GCG 
Sbjct: 195 DLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGR 254

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV--- 306
           +N+GLF  ++GL+GLGR  +S  +QT + FN  FSYCL      A  S     DS+V   
Sbjct: 255 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTN 314

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           S +  +TPL+ NP+L +FY + L G S+GG  ++   +S F        G++IDSGT +T
Sbjct: 315 STSVSYTPLVQNPQLRSFYILNLTGASIGGVELK---SSSF------GRGILIDSGTVIT 365

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVS 423
           RL    Y A++  F    S    AP +S+ DTCF+L+   ++ +P + + F+G    +V 
Sbjct: 366 RLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVD 425

Query: 424 LPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +    Y +  D+S   C A A     + + IIGN QQ+  RV+YD    R+G     C
Sbjct: 426 VTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  221 bits (564), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 152/379 (40%), Positives = 202/379 (53%), Gaps = 42/379 (11%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSF 188
           GLA  S EY   +G+GTP R   ++ DTGSD+ W+QC PC   CY Q +P+FDP+KS ++
Sbjct: 118 GLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTY 177

Query: 189 ATVPCRSPLCR-----KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR--VA 241
             VPC +P C+      L   G     TC Y V YGD S+T G+ + E  T   +    A
Sbjct: 178 VDVPCGTPQCKIGGGQDLTCGG----TTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA 233

Query: 242 RVALGCGHD-NEGL-----FVAAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTS 293
            V  GC H+ + G+      ++ AGLLGLGRG  S  +QT RR N    FSYCL  R +S
Sbjct: 234 GVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQT-RRGNSGDVFSYCLPPRGSS 292

Query: 294 AKPSSMVFGDSAVSRTA-RFTPLLA-NPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           A    +  G +A  ++   FTPL+  N +L + Y V LVGISV GA +  I AS F +  
Sbjct: 293 A--GYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALP-IDASAFYI-- 347

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVK 409
               G +IDSGT +T +   AY  LRD FR         P+  +   DTC+D++G   V 
Sbjct: 348 ----GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVT 403

Query: 410 VPTVVLHFRGA---DVSLPATNYLIPVDSSGT----FCFAFAGT-MSGLSIIGNIQQQGF 461
            P V L F G    DV       +  VD+SG      C AF  T + G  IIGN+QQ+ +
Sbjct: 404 APPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAY 463

Query: 462 RVVYDLAASRIGFAPRGCA 480
            VV+D+   RIGF   GC+
Sbjct: 464 NVVFDVEGRRIGFGANGCS 482


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  221 bits (563), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 144/361 (39%), Positives = 195/361 (54%), Gaps = 19/361 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRS 187
           +G + G+ E+   +G GTP +   ++ DTGSDV WIQC PC   CY Q DP+FDP KS +
Sbjct: 111 TGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSAT 170

Query: 188 FATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALG 246
           ++ VPC  P C       C+   TCLY+V YGDGS T G  S ETL+    R +   A G
Sbjct: 171 YSAVPCGHPQCAAAGGK-CSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARALPGFAFG 229

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG  N G F    GL+GLGRG+LS  +Q    F   FSYCL   +TS     +  G +  
Sbjct: 230 CGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSH--GYLTIGTTTP 287

Query: 307 ---SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
              S   R+T ++      +FY+V+LV I VGG  V  +   LF  D     G ++DSGT
Sbjct: 288 ASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGG-FVLPVPPILFTRD-----GTLLDSGT 341

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADV 422
            +T L   AY ALRD F+   +  K AP +  FDTC+D +G+  + +P V   F  G+  
Sbjct: 342 VLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSF 401

Query: 423 SLPATNYLI-PVDSS-GTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            L     LI P D++  T C AF    S +  +I+GN QQ+   ++YD+AA +IGF    
Sbjct: 402 DLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGS 461

Query: 479 C 479
           C
Sbjct: 462 C 462


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  221 bits (563), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 161/403 (39%), Positives = 224/403 (55%), Gaps = 30/403 (7%)

Query: 91  RDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVI---SGLAQGSGEYFTRLGVGTP 147
           +D  RVKS+ +      R+     S G+      S+ I    G   GSG Y   +G+GTP
Sbjct: 105 QDQSRVKSIHS------RLSNSKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTP 158

Query: 148 PRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-- 204
            + + ++ DTGSD+ W QC PC + CY Q + +FDP++S S+  + C S +C  L S+  
Sbjct: 159 KKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATG 218

Query: 205 ---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGLFVAAAG 260
              GC   + C+Y + YGD S +VG F TE LT   T     +  GCG +N+GLF  +AG
Sbjct: 219 NTPGC-ASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAG 277

Query: 261 LLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK 320
           LLGLGR +LS  +QT +++N+ FSYCL   S+S+    + FG SA S+ A+FTPL     
Sbjct: 278 LLGLGRDKLSVVSQTAQKYNKIFSYCL--PSSSSSTGFLTFGGSA-SKNAKFTPLSTISA 334

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
             +FY ++  GISVGG  +  I+AS+F        G IIDSGT +TRL   AY ALR +F
Sbjct: 335 GPSFYGLDFTGISVGGKKL-AISASVFS-----TAGAIIDSGTVITRLPPAAYSALRASF 388

Query: 381 RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTF 439
           R   S        S+ DTC+D S  T + VP +   F  G +V + AT  L    S    
Sbjct: 389 RNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGIEVDIDATGILY-ASSLSQV 447

Query: 440 CFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           C AFAG    + + I GN+QQ+   V YD +A ++GFAP GC+
Sbjct: 448 CLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGCS 490


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  221 bits (563), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 152/418 (36%), Positives = 211/418 (50%), Gaps = 30/418 (7%)

Query: 81  PEHLFNLR-IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISG-LAQGSGEY 138
           P++ F +  I RD  +            RV    R     N G  ++ +   +    GEY
Sbjct: 26  PDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVTNTVEAPIYNNRGEY 85

Query: 139 FTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC 198
             +L VGTPP  +  V DTGSD++W QC PC  CY Q  P+F+P+KS ++  V C SP+C
Sbjct: 86  LMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVC 145

Query: 199 R-KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT--RVA---RVALGCGHDNE 252
               + + C+ +  C Y +SYGD S + GDF+ +TLT   T  RV    R A+GCGHDN 
Sbjct: 146 SFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNA 205

Query: 253 GLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV----DRSTSAK----PSSMVFGD 303
           G F A  +G++GLG G  S   Q G     KFSYCL     D   S K     ++ V G 
Sbjct: 206 GSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGS 265

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
            AVS     TP+  + K  +FY ++L  +SVG  +    TA+       G   +IIDSGT
Sbjct: 266 GAVS-----TPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSIL---GGKANIIIDSGT 317

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF-DTCFDLSGKTEVKVPTVVLHFRGADV 422
           ++T L    Y     A  + + +L+R  D + F + CF+ +   + KVP + +HF GA++
Sbjct: 318 TLTLLPVDLYHNFAKAI-SNSINLQRTDDPNQFLEYCFETT-TDDYKVPFIAMHFEGANL 375

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            L   N LI V S    C AFAG     +SI GNI Q  F V YD+    + F P  C
Sbjct: 376 RLQRENVLIRV-SDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  221 bits (562), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 138/355 (38%), Positives = 185/355 (52%), Gaps = 29/355 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCR 194
           EY   +G+GTP     + +DTGSDV W+QC PC    CY+QT  +FDPAKS ++  V C 
Sbjct: 126 EYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCA 185

Query: 195 SPLCRKLDS--SGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTR--VARVALGCGH 249
           +  C +L+   +GC   N  C Y V YGDGS T G +S +TLT  G    V     GC H
Sbjct: 186 AAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSH 245

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
              G      GL+GLG G  S  +QT   +   FSYCL    TS     +  G       
Sbjct: 246 VESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL--PPTSGSSGFLTLGGGGGVSG 303

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T +L + ++ TFY   L  I+VGG  + G++ S+F        G ++DSGT +TRL 
Sbjct: 304 FVTTRMLRSRQIPTFYGARLQDIAVGGKQL-GLSPSVFA------AGSVVDSGTIITRLP 356

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
             AY AL  AF+AG    + AP  S+ DTCFD +G+T++ +PTV L F G          
Sbjct: 357 PTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAA------- 409

Query: 430 LIPVDSSGTF---CFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            I +D +G     C AFA T       IIGN+QQ+ F V+YD+ +S +GF    C
Sbjct: 410 -IDLDPNGIMYGNCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  221 bits (562), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 152/418 (36%), Positives = 211/418 (50%), Gaps = 30/418 (7%)

Query: 81  PEHLFNLR-IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISG-LAQGSGEY 138
           P++ F +  I RD  +            RV    R     N G  ++ +   +    GEY
Sbjct: 26  PDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVTNTVEAPIYNNRGEY 85

Query: 139 FTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC 198
             +L VGTPP  +  V DTGSD++W QC PC  CY Q  P+F+P+KS ++  V C SP+C
Sbjct: 86  LMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVC 145

Query: 199 R-KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT--RVA---RVALGCGHDNE 252
               + + C+ +  C Y +SYGD S + GDF+ +TLT   T  RV    R A+GCGHDN 
Sbjct: 146 SFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNA 205

Query: 253 GLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV----DRSTSAK----PSSMVFGD 303
           G F A  +G++GLG G  S   Q G     KFSYCL     D   S K     ++ V G 
Sbjct: 206 GSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGS 265

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
            AVS     TP+  + K  +FY ++L  +SVG  +    TA+       G   +IIDSGT
Sbjct: 266 GAVS-----TPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSIL---GGKANIIIDSGT 317

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF-DTCFDLSGKTEVKVPTVVLHFRGADV 422
           ++T L    Y     A  + + +L+R  D + F + CF+ +   + KVP + +HF GA++
Sbjct: 318 TLTLLPVDLYHNFAKAI-SNSINLQRTDDPNQFLEYCFETT-TDDYKVPFIAMHFEGANL 375

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            L   N LI V S    C AFAG     +SI GNI Q  F V YD+    + F P  C
Sbjct: 376 RLQRENVLIRV-SDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  221 bits (562), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 143/418 (34%), Positives = 220/418 (52%), Gaps = 50/418 (11%)

Query: 84  LFNLRIQRDVLRVKSLTAFAE----SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYF 139
           L N+R+Q   L++K++T+       S  ++P                + SG+   S  Y 
Sbjct: 93  LDNIRVQSLQLKIKAMTSSTTEQSVSETQIP----------------LTSGIKLESLNYI 136

Query: 140 TRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR 199
             + +G   + + +++DTGSD+ W+QC PC+ CY+Q  P++DP+ S S+ TV C S  C+
Sbjct: 137 VTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQ 194

Query: 200 KL-----DSSGCNRRNT-----CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
            L     +S  C   N      C Y VSYGDGS T GD ++E++    T++     GCG 
Sbjct: 195 DLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGR 254

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV--- 306
           +N+GLF  ++GL+GLGR  +S  +QT + FN  FSYCL      A  S     DS+V   
Sbjct: 255 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTN 314

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           S +  +TPL+ NP+L +FY + L G S+GG  ++   +S F        G++IDSGT +T
Sbjct: 315 STSVSYTPLVQNPQLRSFYILNLTGASIGGVELK---SSSF------GRGILIDSGTVIT 365

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVS 423
           RL    Y A++  F    S    AP +S+ DTCF+L+   ++ +P + + F+G    +V 
Sbjct: 366 RLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVD 425

Query: 424 LPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +    Y +  D+S   C A A     + + IIGN QQ+  RV+YD    R+G     C
Sbjct: 426 VTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 143/364 (39%), Positives = 196/364 (53%), Gaps = 20/364 (5%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  G  EY   L +GTPP     + DTGSD+ W QC PCK C+ Q  P++D A S SF+ 
Sbjct: 86  LRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSP 145

Query: 191 VPCRSPLCRKLDSS-GCNRRNT-CLYQVSYGDGSITVGDFSTETLTF---RGTRVARVAL 245
           VPC S  C  + SS  C   ++ C Y+ +YGDG+ + G   TETLTF    G  V  +A 
Sbjct: 146 VPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAF 205

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           GCG DN GL   + G +GLGRG LS   Q G     KFSYCL D   ++  S ++FG  A
Sbjct: 206 GCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGSPVLFGALA 262

Query: 306 ------VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
                      + TPL+ +P + T+YYV L GIS+G A +  I    F L   G+GG+I+
Sbjct: 263 ELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLP-IPNGTFDLRDDGSGGMIV 321

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD-LSGKTEV-KVPTVVLHF 417
           DSGT+ T L   A+  + D   AG          SL   CF   +G+ ++  +P +VLHF
Sbjct: 322 DSGTTFTFLVESAFRVVVDHV-AGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHF 380

Query: 418 R-GADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFA 475
             GAD+ L   NY+       +FC   AG+ S  +SI+GN QQQ  ++++D+   ++ F 
Sbjct: 381 AGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQLSFM 440

Query: 476 PRGC 479
           P  C
Sbjct: 441 PTDC 444


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 154/402 (38%), Positives = 212/402 (52%), Gaps = 47/402 (11%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGS---GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP 168
           R+ +R  A    S + +S   Q S   GEY   L +GTPP     + DTGSD++W QCAP
Sbjct: 61  RHNARKLALAASSGATVSAPTQNSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAP 120

Query: 169 C-KKCYSQTDPVFDPAKSRSFATVPCRSPL--CRKLDSS-------GCNRRNTCLYQVSY 218
           C  +C+ Q  P+++P+ S +FA +PC S L  C    +        GC     C Y V+Y
Sbjct: 121 CTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC----ACTYNVTY 176

Query: 219 GDGSITVGDFSTETLTFRGT-----RVARVALGCGHDNEGLFV-AAAGLLGLGRGRLSFP 272
           G G  +V   S ET TF  T     RV  +A GC   + G    +A+GL+GLGRGRLS  
Sbjct: 177 GSGWTSVFQGS-ETFTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLV 235

Query: 273 TQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-VSRTARF--TPLLANPK---LDTFYY 326
           +Q G     KFSYCL     +   S+++ G SA ++ TA    TP +A+P    ++TFYY
Sbjct: 236 SQLGV---PKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYY 292

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           + L GIS+G   +  I    F L+  G GG+IIDSGT++T L   AY       RA   S
Sbjct: 293 LNLTGISLGTTALS-IPPDAFLLNADGTGGLIIDSGTTITLLGNTAY----QQVRAAVVS 347

Query: 387 LKRAP--DFSL---FDTCFDLSGKTEV--KVPTVVLHFRGADVSLPATNYLIPVDSSGTF 439
           L   P  D S     D CF L   T     +P++ LHF GAD+ LPA +Y++  D SG +
Sbjct: 348 LVTLPTTDGSAATGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMM-SDDSGLW 406

Query: 440 CFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           C A      G ++I+GN QQQ   ++YD+    + FAP  C+
Sbjct: 407 CLAMQNQTDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 448


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 154/402 (38%), Positives = 212/402 (52%), Gaps = 47/402 (11%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGS---GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP 168
           R+ +R  A    S + +S   Q S   GEY   L +GTPP     + DTGSD++W QCAP
Sbjct: 63  RHNARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAP 122

Query: 169 C-KKCYSQTDPVFDPAKSRSFATVPCRSPL--CRKLDSS-------GCNRRNTCLYQVSY 218
           C  +C+ Q  P+++P+ S +FA +PC S L  C    +        GC     C Y V+Y
Sbjct: 123 CTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC----ACTYNVTY 178

Query: 219 GDGSITVGDFSTETLTFRGT-----RVARVALGCGHDNEGLFV-AAAGLLGLGRGRLSFP 272
           G G  +V   S ET TF  T     RV  +A GC   + G    +A+GL+GLGRGRLS  
Sbjct: 179 GSGWTSVFQGS-ETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLV 237

Query: 273 TQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-VSRTARF--TPLLANPK---LDTFYY 326
           +Q G     KFSYCL     +   S+++ G SA ++ TA    TP +A+P    ++TFYY
Sbjct: 238 SQLGV---PKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYY 294

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           + L GIS+G   +  I    F L+  G GG+IIDSGT++T L   AY       RA   S
Sbjct: 295 LNLTGISLGTTALS-IPPDAFSLNADGTGGLIIDSGTTITLLGNTAY----QQVRAAVVS 349

Query: 387 LKRAP--DFSL---FDTCFDLSGKTEV--KVPTVVLHFRGADVSLPATNYLIPVDSSGTF 439
           L   P  D S     D CF L   T     +P++ LHF GAD+ LPA +Y++  D SG +
Sbjct: 350 LVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMM-SDDSGLW 408

Query: 440 CFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           C A      G ++I+GN QQQ   ++YD+    + FAP  C+
Sbjct: 409 CLAMQNQTDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 450


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 154/402 (38%), Positives = 212/402 (52%), Gaps = 47/402 (11%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGS---GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP 168
           R+ +R  A    S + +S   Q S   GEY   L +GTPP     + DTGSD++W QCAP
Sbjct: 3   RHNARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAP 62

Query: 169 C-KKCYSQTDPVFDPAKSRSFATVPCRSPL--CRKLDSS-------GCNRRNTCLYQVSY 218
           C  +C+ Q  P+++P+ S +FA +PC S L  C    +        GC     C Y V+Y
Sbjct: 63  CTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGC----ACTYNVTY 118

Query: 219 GDGSITVGDFSTETLTFRGT-----RVARVALGCGHDNEGLFV-AAAGLLGLGRGRLSFP 272
           G G  +V   S ET TF  T     RV  +A GC   + G    +A+GL+GLGRGRLS  
Sbjct: 119 GSGWTSVFQGS-ETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLV 177

Query: 273 TQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-VSRTARF--TPLLANPK---LDTFYY 326
           +Q G     KFSYCL     +   S+++ G SA ++ TA    TP +A+P    ++TFYY
Sbjct: 178 SQLGV---PKFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYY 234

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           + L GIS+G   +  I    F L+  G GG+IIDSGT++T L   AY       RA   S
Sbjct: 235 LNLTGISLGTTALS-IPPDAFSLNADGTGGLIIDSGTTITLLGNTAY----QQVRAAVVS 289

Query: 387 LKRAP--DFSL---FDTCFDLSGKTEV--KVPTVVLHFRGADVSLPATNYLIPVDSSGTF 439
           L   P  D S     D CF L   T     +P++ LHF GAD+ LPA +Y++  D SG +
Sbjct: 290 LVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMM-SDDSGLW 348

Query: 440 CFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           C A      G ++I+GN QQQ   ++YD+    + FAP  C+
Sbjct: 349 CLAMQNQTDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 390


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 169/465 (36%), Positives = 230/465 (49%), Gaps = 57/465 (12%)

Query: 36  PSTLSWPESVSVSESESSLPLP-----APDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQ 90
           P  +    SV++  S ++L +P      P A S  S     + + SF+ T  H    R +
Sbjct: 37  PKAVCSASSVNLEPSSATLSVPLVHRYGPCAASQYS----DMPTPSFSETLRHS---RAR 89

Query: 91  RDVLRVKSLTAFA----ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGT 146
            + ++ ++ T  A    ++AV VP R        GGF  S+         EY   LG GT
Sbjct: 90  TNYIKSRASTGMASTPDDAAVTVPTRL-------GGFVDSL---------EYMVTLGFGT 133

Query: 147 PPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS- 203
           P     +++DTGSDV W+QCAPC   +CY Q DP+FDP+KS ++A + C +  C KL   
Sbjct: 134 PSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDH 193

Query: 204 --SGCNRRNT-CLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAAA 259
             +GC    T C Y+V YGDGS T G +S ET+TF  G  V     GCGHD  G      
Sbjct: 194 YRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAPGITVKDFHFGCGHDQRGPSDKFD 253

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR--FTPLLA 317
           GLLGLG    S   QT   +   FSYCL   ++ A   ++    SA + T+   FTP+  
Sbjct: 254 GLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAATNTSAFVFTPMWH 313

Query: 318 NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
            P   T Y V + GISVGG  +  I  S F+      GG++IDSGT VT L   AY AL 
Sbjct: 314 LPMDATSYMVNMTGISVGGKPLD-IPRSAFR------GGMLIDSGTIVTELPETAYNALN 366

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSS 436
            A R   ++         FDTC++ +G + V VP V L F  GA + L   N ++  D  
Sbjct: 367 AALRKAFAAYPMVASED-FDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILVKD-- 423

Query: 437 GTFCFAF--AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              C AF  +G   GL IIGN+ Q+   V+YD    ++GF    C
Sbjct: 424 ---CLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 147/412 (35%), Positives = 207/412 (50%), Gaps = 53/412 (12%)

Query: 89  IQRDVLRVKSLTA-----------FAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGE 137
           ++RD LRVKS+ A           F E   RVP  +   G                    
Sbjct: 92  LRRDQLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGG-------------------- 131

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAKSRSFATVPCRSP 196
           Y   +G+GTP +   ++ DTGSD+ W QC PC   C+ Q D  FDP KS S+  + C S 
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSE 191

Query: 197 LCR---KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNE 252
            C+   K  + GC+  N+CLY V YG G  TVG  +TETLT   + V     +GCG  N 
Sbjct: 192 PCKSIGKESAQGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSDVFENFVIGCGERNG 250

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G F   AGLLGLGR  ++ P+QT   +   FSYCL   S+S     + FG   VS+ A+F
Sbjct: 251 GRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPASSSST--GHLSFG-GGVSQAAKF 307

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           TP+ +  K+   Y +++ GISVGG  +  I  S+F+       G IIDSGT++T L   A
Sbjct: 308 TPITS--KIPELYGLDVSGISVGGRKLP-IDPSVFR-----TAGTIIDSGTTLTYLPSTA 359

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLS--GKTEVKVPTVVLHFRGA-DVSLPATNY 429
           + AL  AF+   ++       S    C+D S      + +P + + F G  +V +  +  
Sbjct: 360 HSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGI 419

Query: 430 LIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            I  +     C AF   G  + ++I GN+QQ+ + VVYD+A   +GFAP GC
Sbjct: 420 FIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 153/463 (33%), Positives = 229/463 (49%), Gaps = 43/463 (9%)

Query: 25  YQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRT---P 81
           + T  ++SLP+ + +    S +++E  SSL L            +H     + +RT   P
Sbjct: 35  FHTLKISSLPS-TEVCKESSKALNEGSSSLKL------------VHRFGPCNPHRTSTAP 81

Query: 82  EHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQ-GSGEYFT 140
              FN  ++RD LRV S+     S          +       SS    GL++  + +Y  
Sbjct: 82  ASSFNEILRRDKLRVDSIIQARRSMNLTSSVEHMK-------SSVPFYGLSKITASDYIV 134

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
            +G+GTP + + ++ DTGS ++W QC PCK CY +  PVFDP KS SF  +PC S LC+ 
Sbjct: 135 NVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSSKLCQS 193

Query: 201 LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV--ARVALGCGHDNEGLFVAA 258
           +   GC+    C Y  +Y D S + G  +TET++F   +     + +GC     G  +  
Sbjct: 194 I-RQGCSSPK-CTYLTAYVDNSSSTGTLATETISFSHLKYDFKNILIGCSDQVSGESLGE 251

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLAN 318
           +G++GL R  +S  +QT   +++ FSYC+   ST      + FG   V    RF+P+   
Sbjct: 252 SGIMGLNRSPISLASQTANIYDKLFSYCI--PSTPGSTGHLTFG-GKVPNDVRFSPVSKT 308

Query: 319 -PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
            P  D  Y +++ GISVGG  +  I AS FK+         IDSG  +TRL   AY ALR
Sbjct: 309 APSSD--YDIKMTGISVGGRKLL-IDASAFKI------ASTIDSGAVLTRLPPKAYSALR 359

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSS 436
             FR               DTC+D S  + V +P++ + F G  ++ +  +  +  V  S
Sbjct: 360 SVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGS 419

Query: 437 GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +C AFA     +SI GN QQ+ + VV+D A  RIGFAP GC
Sbjct: 420 KVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 186/355 (52%), Gaps = 29/355 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCR 194
           EY   +G+GTP     + +DTGSDV W+QC PC    C++QT  +FDPAKS ++  V C 
Sbjct: 126 EYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCA 185

Query: 195 SPLCRKLDS--SGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTR--VARVALGCGH 249
           +  C +L+   +GC   N  C Y V YGDGS T G +S +TLT  G    V     GC H
Sbjct: 186 AAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSH 245

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
              G      GL+GLG G  S  +QT   +   FSYCL    TS     +  G    +  
Sbjct: 246 LESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL--PPTSGSSGFLTLGGGGGASG 303

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T +L + ++ TFY   L  I+VGG  + G++ S+F        G ++DSGT +TRL 
Sbjct: 304 FVTTRMLRSKQIPTFYGARLQDIAVGGKQL-GLSPSVFA------AGSVVDSGTIITRLP 356

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
             AY AL  AF+AG    + AP  S+ DTCFD +G+T++ +PTV L F G          
Sbjct: 357 PTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAA------- 409

Query: 430 LIPVDSSGTF---CFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            I +D +G     C AFA T       IIGN+QQ+ F V+YD+ +S +GF    C
Sbjct: 410 -IDLDPNGIMYGNCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 139/365 (38%), Positives = 195/365 (53%), Gaps = 27/365 (7%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCR 194
           GEY   L +GTPP     V DTGSD++W QCAPC  +C+ Q  P+++PA S +F+ +PC 
Sbjct: 110 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCN 169

Query: 195 SPL--CRKLDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVALG 246
           S L  C    +         C+Y  +YG G  T G   +ET TF  +     RV  VA G
Sbjct: 170 SSLSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFG 228

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           C + +   +  +AGL+GLGRG LS  +Q G     +FSYCL     +   S+++ G SA 
Sbjct: 229 CSNASSSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLLGPSAA 285

Query: 307 --SRTARFTPLLANPK---LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
                 R TP +A+P    + T+YY+ L GIS+ GA    I+   F L P G GG+IIDS
Sbjct: 286 LNGTGVRSTPFVASPARAPMSTYYYLNLTGISL-GAKALPISPGAFSLKPDGTGGLIIDS 344

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVK---VPTVVLH 416
           GT++T L   AY  +R A ++  ++L      D +  D CF L   T      +P++ LH
Sbjct: 345 GTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLH 404

Query: 417 FRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFA 475
           F GAD+ LPA +Y+I    SG +C A      G +S  GN QQQ   ++YD+    + FA
Sbjct: 405 FDGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSFA 462

Query: 476 PRGCA 480
           P  C+
Sbjct: 463 PAKCS 467


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 158/446 (35%), Positives = 216/446 (48%), Gaps = 53/446 (11%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLT------AFAESAVRVPPRNRSRGR 118
           + + L HVD+         L    +QR   R  +L+       F  S  +   R R  G 
Sbjct: 30  IRVDLTHVDA-GKELPKRELIRRAMQRSKARAAALSVVRNGGGFYGSIAQAREREREPGM 88

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP 178
           A            A G  EY   L VGTPP+ +  +LDTGSD++W QC  C  C  Q DP
Sbjct: 89  AV----------RASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDP 138

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG- 237
           +F P  S S+  + C   LC  +    C R +TC Y+ SYGDG+ T+G ++TE  TF   
Sbjct: 139 LFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASS 198

Query: 238 ---TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
              T+   +  GCG  N G    A+G++G GR  LS  +Q      R+FSYCL   ++S 
Sbjct: 199 SGETQSVPLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLS---IRRFSYCLTPYASSR 255

Query: 295 KPSSMVFGDSA-------VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
           K S++ FG  A        +   + TP+L + +  TFYYV   G++VG   +R I AS F
Sbjct: 256 K-STLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLR-IPASAF 313

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK------RAPDFSLFDTCFD 401
            L P G+GGVIIDSGT++T    PA + L +  RA  S L+       +PD  +   CF 
Sbjct: 314 ALRPDGSGGVIIDSGTALTLF--PAAV-LAEVVRAFRSQLRLPFANGSSPDDGV---CFA 367

Query: 402 LSGKT--------EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSII 453
                        +V VP +V HF+GAD+ LP  NY++     G  C     +    + I
Sbjct: 368 APAVAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATI 427

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGC 479
           GN  QQ  RVVYDL    + FAP  C
Sbjct: 428 GNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 164/437 (37%), Positives = 220/437 (50%), Gaps = 29/437 (6%)

Query: 62  ESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRA 119
            S L L LHH  S  S    P  L F+  +  D  R+ SL A         P      RA
Sbjct: 40  SSGLHLTLHHPQSPCSPAPLPADLPFSAVLAHDGARIASLAARLAKTPSSRPTLLDESRA 99

Query: 120 NGGFSSSVISGLAQ---------GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK 170
               SS     LA          G G Y TR+G+GTP +   MV+DTGS + W+QC+PC 
Sbjct: 100 GSSSSSPDDESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCV 159

Query: 171 -KCYSQTDPVFDPAKSRSFATVPCRSPLCRKL-----DSSGCNRRNTCLYQVSYGDGSIT 224
             C+ Q+ PVF+P  S S+A+V C +  C  L     + + C+  N C+YQ SYGD S +
Sbjct: 160 VSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFS 219

Query: 225 VGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           VG  S +T++F  T V     GCG DNEGLF  +AGL+GL R +LS   Q        FS
Sbjct: 220 VGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFS 279

Query: 285 YCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITA 344
           YCL   S+S+     +   +    +  +TP+ ++   D+ Y++++ GI V G  +   ++
Sbjct: 280 YCLPTSSSSSSGYLSIGSYNPGQYS--YTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSS 337

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSG 404
           +   L        IIDSGT +TRL    Y AL  A         RA  FS+ DTCF    
Sbjct: 338 AYSSLP------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQ 390

Query: 405 KTEVKVPTVVLHFRGADVSLPAT-NYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
              ++VP V + F G      A  N L+ VDS+ T C AFA   S  +IIGN QQQ F V
Sbjct: 391 AARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSV 448

Query: 464 VYDLAASRIGFAPRGCA 480
           VYD+  S+IGFA  GC+
Sbjct: 449 VYDVKNSKIGFAAAGCS 465


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 164/441 (37%), Positives = 221/441 (50%), Gaps = 31/441 (7%)

Query: 60  DAESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRG 117
           +  S L L LHH  S  S    P  L F+  +  D  RV SL A         P      
Sbjct: 38  NNSSGLHLTLHHPQSPCSPAPLPADLPFSAVLAHDGARVASLAARLAKTPSSRPTLLDES 97

Query: 118 RANGGFSSSVIS-----------GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           RA    SSS              G + G G Y TR+G+GTP +   MV+DTGS + W+QC
Sbjct: 98  RAGSSSSSSPDDESSLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQC 157

Query: 167 APCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKL-----DSSGCNRRNTCLYQVSYGD 220
           +PC   C+ Q+ PVF+P  S S+ +V C +  C  L     + + C+  N C+YQ SYGD
Sbjct: 158 SPCVVSCHRQSGPVFNPKASSSYTSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGD 217

Query: 221 GSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN 280
            S +VG  S +T++F  T V     GCG DNEGLF  +AGL+GL R +LS   Q      
Sbjct: 218 SSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMG 277

Query: 281 RKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
             FSYCL   S+S+     +   +    +  +TP+ ++   D+ Y++++ GI V G  + 
Sbjct: 278 YSFSYCLPTSSSSSSGYLSIGSYNPGQYS--YTPMASSSLDDSLYFIKMTGIKVAGKPLS 335

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF 400
             +++   L        IIDSGT +TRL    Y AL  A         RA  FS+ DTCF
Sbjct: 336 VSSSAYSSLP------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCF 389

Query: 401 DLSGKTEVKVPTVVLHFRGADVSLPAT-NYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
                  ++VP V + F G      A  N L+ VDS+ T C AFA   S  +IIGN QQQ
Sbjct: 390 Q-GQAARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQ 446

Query: 460 GFRVVYDLAASRIGFAPRGCA 480
            F VVYD+  S+IGFA  GC+
Sbjct: 447 TFSVVYDVKNSKIGFAAGGCS 467


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 158/480 (32%), Positives = 229/480 (47%), Gaps = 59/480 (12%)

Query: 19  AAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHH----VDS 74
           AA    ++   + SL + +T S P++              P     +++ LHH       
Sbjct: 27  AADHRTHKVLSVGSLKSAATCSEPKAT------------PPSTSGGITVPLHHRHGPCSP 74

Query: 75  LSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVIS----- 129
           +  N+ P  L   R+QRD LR   +            + +  G   G    S  +     
Sbjct: 75  VPSNKMPASL-EERLQRDQLRAAYI------------KRKFSGAKGGDVEQSDAATVPTT 121

Query: 130 -GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
            G +  + EY   +G+G+P     M +DTGSDV W+QC PC +C+S+ D +FDP+ S ++
Sbjct: 122 LGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTY 181

Query: 189 ATVPCRSPLCRKLDSS----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           +   C S  C +L  S    GC+    C Y VSY DGS T G +S++TLT     +    
Sbjct: 182 SPFSCSSAACVQLSQSQQGNGCSSSQ-CQYIVSYVDGSSTTGTYSSDTLTLGSNAIKGFQ 240

Query: 245 LGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
            GC     G F     GL+GLG    S  +QT   F + FSYCL    T      +  G 
Sbjct: 241 FGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCL--PPTPGSSGFLTLG- 297

Query: 304 SAVSRTARF-TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            A SR+    TP+L + ++ T+Y V L  I VGG  +  I  S+F      + G ++DSG
Sbjct: 298 -AASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLN-IPTSVF------SAGSVMDSG 349

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GAD 421
           T +TRL   AY AL  AF+AG      A    + DTCFD SG++ V +P+V L F  GA 
Sbjct: 350 TVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAV 409

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           V+L     ++ +D+   +C AFA     S L  IGN+QQ+ F V+YD+    +GF    C
Sbjct: 410 VNLDFNGIMLELDN---WCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 164/441 (37%), Positives = 221/441 (50%), Gaps = 31/441 (7%)

Query: 60  DAESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRG 117
           +  S L L LHH  S  S    P  L F+  +  D  RV SL A         P      
Sbjct: 38  NNSSGLHLTLHHPQSPCSPAPLPADLPFSAVLAHDGARVASLAARLAKTPSSRPTLLDES 97

Query: 118 RANGGFSSSVIS-----------GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           RA    SSS              G + G G Y TR+G+GTP +   MV+DTGS + W+QC
Sbjct: 98  RAGSSSSSSPDDESSLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQC 157

Query: 167 APCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-----GCNRRNTCLYQVSYGD 220
           +PC   C+ Q+ PVF+P  S S+ +V C +  C  L ++      C+  N C+YQ SYGD
Sbjct: 158 SPCVVSCHRQSGPVFNPKASSSYTSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGD 217

Query: 221 GSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN 280
            S +VG  S +T++F  T V     GCG DNEGLF  +AGL+GL R +LS   Q      
Sbjct: 218 SSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMG 277

Query: 281 RKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
             FSYCL   S+S+     +   +    +  +TP+ ++   D+ Y++++ GI V G  + 
Sbjct: 278 YSFSYCLPTSSSSSSGYLSIGSYNPGQYS--YTPMASSSLDDSLYFIKMTGIKVAGKPLS 335

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF 400
             +++   L        IIDSGT +TRL    Y AL  A         RA  FS+ DTCF
Sbjct: 336 VSSSAYSSLP------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCF 389

Query: 401 DLSGKTEVKVPTVVLHFRGADVSLPAT-NYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
                  ++VP V + F G      A  N L+ VDS+ T C AFA   S  +IIGN QQQ
Sbjct: 390 Q-GQAARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQ 446

Query: 460 GFRVVYDLAASRIGFAPRGCA 480
            F VVYD+  S+IGFA  GC+
Sbjct: 447 TFSVVYDVKNSKIGFAAGGCS 467


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 117/207 (56%), Positives = 154/207 (74%), Gaps = 7/207 (3%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           ++SG +QGSGEYF+R+G+G+PP++VYMV+DTGSDV W+QCAPC  CY Q DP+F+P+ S 
Sbjct: 42  LVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSS 101

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVAL 245
           S+A + C +  C+ LD S C R ++CLY+VSYGDGS TVGDF+TET+T  G+  +  VA+
Sbjct: 102 SYAPLTCETHQCKSLDVSEC-RNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAI 160

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           GCGHDNEGLFV AAGLLGLG G LSFP+Q        FSYCLV+R T +  S++ F +S 
Sbjct: 161 GCGHDNEGLFVGAAGLLGLGGGSLSFPSQINA---SSFSYCLVNRDTDSA-STLEF-NSP 215

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGI 332
           +   +   PLL N +LDTFYY+ + GI
Sbjct: 216 IPSHSVTAPLLRNNQLDTFYYLGMTGI 242


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  218 bits (555), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 164/437 (37%), Positives = 220/437 (50%), Gaps = 29/437 (6%)

Query: 62  ESSLSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRA 119
            S L L LHH  S  S    P  L F+  +  D  R+ SL A         P      RA
Sbjct: 40  SSGLHLTLHHPQSPCSPAPLPADLPFSAVLAHDGARIASLAARLAKTPSSRPTLLDESRA 99

Query: 120 NGGFSSSVISGLAQ---------GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK 170
               SS     LA          G G Y TR+G+GTP +   MV+DTGS + W+QC+PC 
Sbjct: 100 GSSSSSPDDESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCV 159

Query: 171 -KCYSQTDPVFDPAKSRSFATVPCRSPLCRKL-----DSSGCNRRNTCLYQVSYGDGSIT 224
             C+ Q+ PVF+P  S S+A+V C +  C  L     + + C+  N C+YQ SYGD S +
Sbjct: 160 VSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFS 219

Query: 225 VGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           VG  S +T++F  T V     GCG DNEGLF  +AGL+GL R +LS   Q        FS
Sbjct: 220 VGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFS 279

Query: 285 YCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITA 344
           YCL   S+S+     +   +    +  +TP+ ++   D+ Y++++ GI V G  +   ++
Sbjct: 280 YCLPTSSSSSSGYLSIGSYNPGQYS--YTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSS 337

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSG 404
           +   L        IIDSGT +TRL    Y AL  A         RA  FS+ DTCF    
Sbjct: 338 AYSSLP------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQ 390

Query: 405 KTEVKVPTVVLHFRGADVSLPAT-NYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
              ++VP V + F G      A  N L+ VDS+ T C AFA   S  +IIGN QQQ F V
Sbjct: 391 AARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSV 448

Query: 464 VYDLAASRIGFAPRGCA 480
           VYD+  S+IGFA  GC+
Sbjct: 449 VYDVKNSKIGFAAGGCS 465


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 138/380 (36%), Positives = 186/380 (48%), Gaps = 49/380 (12%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           + EY   L VGTPPR V + LDTGSD+VW QCAPC+ C+ Q  P+ DPA S ++A +PC 
Sbjct: 89  TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148

Query: 195 SPLCRKLDSSGC---------NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR------ 239
           +P CR L  + C         N   +C Y   YGD S+TVG+ +T+  TF G        
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208

Query: 240 --VARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS--- 293
               R+  GCGH N+G+F +   G+ G GRGR S P+Q        FSYC      S   
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNV---TTFSYCFTSMFESKSS 265

Query: 294 ------AKPSSMVFGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
                 A  +++++  +A +S   R TPLL NP   + Y++ L GISVG   +    A L
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKL 325

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP----DFSLFDTCFDL 402
                      IIDSG S+T L    Y A++  F   A+ +   P    + S  D CF L
Sbjct: 326 RS--------TIIDSGASITTLPEAVYEAVKAEF---AAQVGLPPTGVVEGSALDLCFAL 374

Query: 403 SGKTEVK---VPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
                 +   VP++ LH  GAD  LP  NY+    ++   C          ++IGN QQQ
Sbjct: 375 PVTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQ 434

Query: 460 GFRVVYDLAASRIGFAPRGC 479
              VVYDL    + FAP  C
Sbjct: 435 NTHVVYDLENDWLSFAPARC 454


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  217 bits (553), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 141/360 (39%), Positives = 191/360 (53%), Gaps = 25/360 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY   + VGTPP  +  V DTGSDV+W QC PC  CY Q  P+FDP+KS ++  V C S
Sbjct: 81  GEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSS 140

Query: 196 PLCR-KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGH 249
           P+C    D S C+  + CLY ++YGD S + G+ + +T+T + T        R  +GCGH
Sbjct: 141 PVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGH 200

Query: 250 DNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS--MVFGDSA- 305
           DN G F A  +G++GLGRG  S  TQ G     KFSYCL+   T +   S  + FG +A 
Sbjct: 201 DNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNAN 260

Query: 306 VSRTARF-TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           VS +    TP+ ++ +  TFY ++L  +SVG            KL   G   +IIDSGT+
Sbjct: 261 VSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFN-FPEGASKL--GGESNIIIDSGTT 317

Query: 365 VTRLTRPAYIALRDAFRAGAS---SLKRAPDFSLF-DTCFDLSGKTEVKVPTVVLHFRGA 420
           +T L      AL ++F +  S   SL  A D S F D CF  +   + ++P V +HF GA
Sbjct: 318 LTYLPS----ALLNSFGSAISQSMSLPHAQDPSEFLDYCF-ATTTDDYEMPPVTMHFEGA 372

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAG-TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           DV L   N  + + S  T C AF       + I GNI Q  F V YD+    + F P  C
Sbjct: 373 DVPLQRENLFVRL-SDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 147/398 (36%), Positives = 201/398 (50%), Gaps = 47/398 (11%)

Query: 118 RANGGFSSSV-----------ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           R  GGF  S+           ++  A G  EY   L VGTPP+ +  +LDTGSD++W QC
Sbjct: 67  RNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQC 126

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVG 226
             C  C  Q DP+F P  S S+  + C   LC  +    C R +TC Y+ SYGDG+ T+G
Sbjct: 127 DTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYGDGTTTLG 186

Query: 227 DFSTETLTFRG----TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRK 282
            ++TE  TF      T+   +  GCG  N G    A+G++G GR  LS  +Q      R+
Sbjct: 187 YYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLS---IRR 243

Query: 283 FSYCLVDRSTSAKPSSMVFGDSA-------VSRTARFTPLLANPKLDTFYYVELVGISVG 335
           FSYCL   ++S K S++ FG  A        +   + TP+L + +  TFYYV   G++VG
Sbjct: 244 FSYCLTPYASSRK-STLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVG 302

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK------R 389
              +R I AS F L P G+GGVIIDSGT++T    P  + L +  RA  S L+       
Sbjct: 303 ARRLR-IPASAFALRPDGSGGVIIDSGTALTLF--PVAV-LAEVVRAFRSQLRLPFANGS 358

Query: 390 APDFSLFDTCFDLSGKT--------EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCF 441
           +PD  +   CF              +V VP +V HF+GAD+ LP  NY++     G  C 
Sbjct: 359 SPDDGV---CFAAPAVAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCV 415

Query: 442 AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               +    + IGN  QQ  RVVYDL    + FAP  C
Sbjct: 416 LLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 141/353 (39%), Positives = 191/353 (54%), Gaps = 22/353 (6%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATV 191
           G+  Y     +GTP     + +DTGSD+ W+QC PC    CY Q DP+FDPA+S S+A V
Sbjct: 133 GTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAV 192

Query: 192 PCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-GCG 248
           PC    C  L   +S C+    C Y VSYGDGS T G +S++TLT       +  L GCG
Sbjct: 193 PCGRSACAGLGIYASACSAAQ-CGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFLFGCG 251

Query: 249 H-DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           H  + GLF    GLLG GR + S   QT   +   FSYCL  +S++    ++  G S V+
Sbjct: 252 HAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTL-GGPSGVA 310

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
                T LL +P   T+Y V L GISVGG  +  + AS F        G ++D+GT +TR
Sbjct: 311 PGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLS-VPASAFA------AGTVVDTGTVITR 363

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPA 426
           L   AY ALR AFR+G +S   AP   + DTC+  +G   V + +V L F  GA ++L A
Sbjct: 364 LPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGA 423

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +    S G   FA +G+   ++I+GN+QQ+ F V  D   S +GF P  C
Sbjct: 424 DGIM----SFGCLAFASSGSDGSMAILGNVQQRSFEVRID--GSSVGFRPSSC 470


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 139/367 (37%), Positives = 194/367 (52%), Gaps = 18/367 (4%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           + S  S +    GEY  R  VG+PP  V  ++DTGSD++W+QC PC+ CY QT P+FDP+
Sbjct: 77  TDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPS 136

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT----- 238
           KS+++ T+PC S  C  L ++ C+  N C Y + YGDGS + GD S ETLT   T     
Sbjct: 137 KSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSV 196

Query: 239 RVARVALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDR-STSAKP 296
              +  +GCGH+N G F      +    G  +S  +Q       KFSYCL    S S   
Sbjct: 197 HFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSS 256

Query: 297 SSMVFGDSAV--SRTARFTPLLANP-KLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
           S + FGD+AV   R    TPL  +P     FY++ L   SVG   +   + S      +G
Sbjct: 257 SKLNFGDAAVVSGRGTVSTPL--DPLNGQVFYFLTLEAFSVGDNRIE-FSGSSSSGSGSG 313

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS-LFDTCFDLSGKTEVKVPT 412
           +G +IIDSGT++T L +  Y+ L  A  +    L+RA D S L   C+  +   E+ +P 
Sbjct: 314 DGNIIIDSGTTLTLLPQEDYLNLESAV-SDVIKLERARDPSKLLSLCYKTTSD-ELDLPV 371

Query: 413 VVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRI 472
           +  HF+GADV L   +  +PV+  G  CFAF  +  G +I GN+ QQ   V YDL    +
Sbjct: 372 ITAHFKGADVELNPISTFVPVE-KGVVCFAFISSKIG-AIFGNLAQQNLLVGYDLVKKTV 429

Query: 473 GFAPRGC 479
            F P  C
Sbjct: 430 SFKPTDC 436


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 148/432 (34%), Positives = 219/432 (50%), Gaps = 30/432 (6%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           L LR HH    S  ++     +  +  D  RV SL     S   +  R+     A+    
Sbjct: 43  LELR-HHASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLI--RSSDAASASKLAQ 99

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
             V SG    +  Y   +G+G     V  ++DT S++ W+QC PC  C+ Q +P+FDP+ 
Sbjct: 100 VPVTSGARLRTLNYVATVGIGGGEATV--IVDTASELTWVQCEPCDACHDQQEPLFDPSS 157

Query: 185 SRSFATVPCRSPLCRKL------DSSGCNRR-NTCLYQVSYGDGSITVGDFSTETLTFRG 237
           S S+A VPC S  C  L          C+ +   C Y +SY DGS + G  + + L+  G
Sbjct: 158 SPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAG 217

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
             +     GCG  N+G F   +GL+GLGR +LS  +QT  +F   FSYCL  + + +  S
Sbjct: 218 EDIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGSSGS 277

Query: 298 SMVFGDSAVSRTAR---FTPLLANPKLDTFYYVELVGISVGGAHVR--GITASLFKLDPA 352
            ++  D++V R +    +T ++++P    FY   L GI+VGG  V+  G +A        
Sbjct: 278 LVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSA-------G 330

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
           G G  I+DSGT +T L    Y A+R  F +  +   +A  FS+ DTCFDL+G  EV+VP+
Sbjct: 331 GGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPS 390

Query: 413 VVLHFR-GADVSLPATN--YLIPVDSSGTFCFAFAGTMSGLS--IIGNIQQQGFRVVYDL 467
           + L F  GA+V + +    Y++  D+S   C A A   S     IIGN QQ+  RV++D 
Sbjct: 391 LKLVFDGGAEVEVDSKGVLYVVTGDAS-QVCLALASLKSEYDTPIIGNYQQKNLRVIFDT 449

Query: 468 AASRIGFAPRGC 479
             S+IGFA   C
Sbjct: 450 VGSQIGFAQETC 461


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 149/378 (39%), Positives = 208/378 (55%), Gaps = 25/378 (6%)

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
           RANG  ++S+ S +   +GEY   + +GTPP  ++ + DTGSD++W QC PC  CY Q +
Sbjct: 75  RANGVSTNSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIE 134

Query: 178 PVFDPAKSRSFATVPCRSPLCRKL-DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF- 235
           P+FDPAKS+++  + C    C  L    GC+  NTC+Y  SYGDGS T GD + +TLT  
Sbjct: 135 PIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIG 194

Query: 236 ----RGTRVARVALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV-- 288
               R   V +V  GCGH+N G F +  +GL+GLG G LS  +Q       +FSYCLV  
Sbjct: 195 STTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPL 254

Query: 289 --DRSTSAKPSSMVFGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHV--RGIT 343
             D S S+K   M FG    VS     +  LA+ + DTFYY+ L  +SVG   +  +G +
Sbjct: 255 GNDPSVSSK---MHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFS 311

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR-DAFRAGASSLKRAPDFSLFDTCF-D 401
                L  A  G +IIDSGT++T L +  Y  L  +   A      R P+ ++F  C+ +
Sbjct: 312 KVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPN-NVFSLCYSN 370

Query: 402 LSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGF 461
           LSG   +++PT+  HF GAD+ L   N  + V     FCFA    +S L+I GN+ Q  F
Sbjct: 371 LSG---LRIPTITAHFVGADLELKPLNTFVQVQED-LFCFAMI-PVSDLAIFGNLAQMNF 425

Query: 462 RVVYDLAASRIGFAPRGC 479
            V YDL +  + F P  C
Sbjct: 426 LVGYDLKSRTVSFKPTDC 443


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 128/358 (35%), Positives = 191/358 (53%), Gaps = 30/358 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y   +G+G     V  ++DT S++ W+QCAPC  C+ Q  P+FDPA S S+A +PC S  
Sbjct: 127 YVATVGLGGGEATV--IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSS 184

Query: 198 CRKLD--------SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
           C  L         + G   + +C Y +SY DGS + G  + + L+  G  +     GCG 
Sbjct: 185 CDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT 244

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            N+G F   +GL+GLGR +LS  +QT  +F   FSYCL  + + +  S ++  D++V R 
Sbjct: 245 SNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRN 304

Query: 310 AR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +    +T ++++P    FY+V L GI++GG  V            +  G VI+DSGT +T
Sbjct: 305 STPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE-----------SSAGKVIVDSGTIIT 353

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG---ADVS 423
            L    Y A++  F +  +   +AP FS+ DTCF+L+G  EV++P++   F G    +V 
Sbjct: 354 SLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVD 413

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                Y +  DSS   C A A   S    SIIGN QQ+  RV++D   S+IGFA   C
Sbjct: 414 SSGVLYFVSSDSS-QVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/360 (36%), Positives = 191/360 (53%), Gaps = 29/360 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y     +GTPP  +Y V+DTGSD +W QC PCK C +QT P+F+P+KS ++  + C SP+
Sbjct: 90  YVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSSPI 149

Query: 198 CRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFR---GTRVA--RVALGCGHD 250
           C++ + + C  NR+  C Y+++Y D S + GD S +TLT     G+ ++  ++ +GCGH 
Sbjct: 150 CKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCGHK 209

Query: 251 N----EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSA 305
           N    EGL   A+G++G GRG  S  +Q G     KFSYCL    + A  SS + FGD A
Sbjct: 210 NSLTTEGL---ASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYFGDMA 266

Query: 306 VSRTARFTPLLANPKLDTF----YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           V        +++ P + +F    Y+  L   SVG   ++   +SL    P   G  +IDS
Sbjct: 267 VVSGHG---VVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLI---PDNEGNAVIDS 320

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFDLSGKTEVKVPTVVLHFRGA 420
           G+++T+L    Y  L  A  +    LKR  D       C+  + K + +VP +  HFRGA
Sbjct: 321 GSTITQLPNDVYSQLETAVISMV-KLKRVKDPTQQLSLCYKTTLK-KYEVPIITAHFRGA 378

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           DV L A N  I ++     CFAF  +     + GNI QQ F V YD   + I F P  C 
Sbjct: 379 DVKLNAFNTFIQMNHE-VMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFKPTNCT 437


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 128/358 (35%), Positives = 191/358 (53%), Gaps = 30/358 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y   +G+G     V  ++DT S++ W+QCAPC  C+ Q  P+FDPA S S+A +PC S  
Sbjct: 126 YVATVGLGGGEATV--IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSS 183

Query: 198 CRKLD--------SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
           C  L         + G   + +C Y +SY DGS + G  + + L+  G  +     GCG 
Sbjct: 184 CDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT 243

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            N+G F   +GL+GLGR +LS  +QT  +F   FSYCL  + + +  S ++  D++V R 
Sbjct: 244 SNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRN 303

Query: 310 AR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +    +T ++++P    FY+V L GI++GG  V            +  G VI+DSGT +T
Sbjct: 304 STPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE-----------SSAGKVIVDSGTIIT 352

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG---ADVS 423
            L    Y A++  F +  +   +AP FS+ DTCF+L+G  EV++P++   F G    +V 
Sbjct: 353 SLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVD 412

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                Y +  DSS   C A A   S    SIIGN QQ+  RV++D   S+IGFA   C
Sbjct: 413 SSGVLYFVSSDSS-QVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 145/383 (37%), Positives = 185/383 (48%), Gaps = 40/383 (10%)

Query: 127 VISGLAQGSG----EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQ-TDPVFD 181
           V +GL  G G    EY   + VGTPPR V + LDTGSD+VW QCAPC  C+ Q   PV D
Sbjct: 75  VRAGLGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLD 134

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRN----TCLYQVSYGDGSITVGDFSTETLTFRG 237
           PA S + A +PC +PLCR L  + C  R+    +C+Y   YGD S+TVG  +T++ TF G
Sbjct: 135 PAASSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGG 194

Query: 238 TRVA------RVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
              A      RV  GCGH N+G+F A   G+ G GRGR S P+Q        FSYC    
Sbjct: 195 DDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNV---TSFSYCFTSM 251

Query: 291 STSAKPSSMVFGDSAV----------SRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
             +   S +  G +A           +   R T L+ NP   + Y+V L GISVGGA V 
Sbjct: 252 FDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVA 311

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF 400
              + L           IIDSG S+T L    Y A++  F +       A   +  D CF
Sbjct: 312 VPESRL-------RSSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCF 364

Query: 401 DLSGKTEVK---VPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNI 456
            L      +   VP + LH   GAD  LP  NY+    ++   C           +IGN 
Sbjct: 365 ALPVAALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNY 424

Query: 457 QQQGFRVVYDLAASRIGFAPRGC 479
           QQQ   VVYDL    + FAP  C
Sbjct: 425 QQQNTHVVYDLENDVLSFAPARC 447


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 145/388 (37%), Positives = 203/388 (52%), Gaps = 29/388 (7%)

Query: 114 RSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKC 172
            S GR +   S+     L  G GEY   L +GTPP     V DTGSD++W QCAPC  +C
Sbjct: 91  ESDGRTSTTVSARTRKDLPNG-GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQC 149

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPL--CRKLDSSGCNRRN-TCLYQVSYGDGSITVGDFS 229
           + Q  P+++PA S +F+ +PC S L  C    +         C+Y  +YG G  T G   
Sbjct: 150 FEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQG 208

Query: 230 TETLTFRGT-----RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           +ET TF  +     RV  VA GC + +   +  +AGL+GLGRG LS  +Q G     +FS
Sbjct: 209 SETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGA---GRFS 265

Query: 285 YCLVDRSTSAKPSSMVFGDSAV--SRTARFTPLLANPK---LDTFYYVELVGISVGGAHV 339
           YCL     +   S+++ G SA       R TP +A+P    + T+YY+ L GIS+ GA  
Sbjct: 266 YCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISL-GAKA 324

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG-ASSLKR--APDFSLF 396
             I+   F L P G GG+IIDSGT++T L   AY  +R A ++   ++L      D +  
Sbjct: 325 LPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGL 384

Query: 397 DTCFDLSGKTEVK---VPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSI 452
           D CF L   T      +P++ LHF GAD+ LPA +Y+I    SG +C A      G +S 
Sbjct: 385 DLCFALPAPTSAPPAVLPSMTLHFDGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMST 442

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            GN QQQ   ++YD+    + FAP  C+
Sbjct: 443 FGNYQQQNMHILYDVREETLSFAPAKCS 470


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 140/352 (39%), Positives = 196/352 (55%), Gaps = 19/352 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRS 195
           E+   +G GTP +   ++LDTGSD+ WIQC PC   CY Q DP FDPAKS S+A VPC +
Sbjct: 136 EFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGT 195

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDNEGL 254
           P+C    + G     TCLY V YGDGS T G  S +TLTF  + +      GCG  N G 
Sbjct: 196 PVCAA--AGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTGFTFGCGEKNIGD 253

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--RF 312
           F    GLLGLGRG+LS P+Q    F   FSYCL   +T+  P  +  G +  + T   ++
Sbjct: 254 FGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTT--PGYLNIGATKPTSTVPVQY 311

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           T ++  P+  +FY++ELV I++GG ++  +  S+F        G ++DSGT +T L  PA
Sbjct: 312 TAMIKKPQYPSFYFIELVSINIGG-YILPVPPSVFT-----KTGTLLDSGTILTYLPPPA 365

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLI 431
           Y +LRD F+      K AP +   DTC+D +G+  + +P V  +F  GA   L     +I
Sbjct: 366 YTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGIMI 425

Query: 432 PVDSSGTF--CFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             D +     C AF    + +  SI+GN QQ+   V+YD+ + +IGF P  C
Sbjct: 426 FPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 122/234 (52%), Positives = 148/234 (63%), Gaps = 15/234 (6%)

Query: 57  PAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSR 116
           P   + S+LSL+LH   SLS +   + L   R+ RD  RVK +T              ++
Sbjct: 62  PFTSSTSTLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITT-----------KLNQ 110

Query: 117 GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT 176
                  S  +ISG +QGSGEYF+R+G+G PP   YMVLDTGSD+ W+QCAPC  CY Q 
Sbjct: 111 NFNTDKLSGPIISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQA 170

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR 236
           DP+F+P  S S+A + C +  CR LD S C R   CLYQVSYGDGS TVGDF TET+T  
Sbjct: 171 DPIFEPTASASYAPLSCEAAQCRYLDQSQC-RNGNCLYQVSYGDGSYTVGDFVTETVTIG 229

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
             +V  VALGCGH+NEGLFV AAGL+GLG G LSFP Q     +  FSYCLVDR
Sbjct: 230 VNKVKNVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLN---STSFSYCLVDR 280


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 189/354 (53%), Gaps = 31/354 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCR 194
           EY  R+  GTP     +V+DTGSDV W+QC PC   +C+ Q DP++DP+ S +++ VPC 
Sbjct: 78  EYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCA 137

Query: 195 SPLCRKLDS----SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGH 249
           S +C+KL +    SGC     C + +SY DG+ TVG +S + LT   G  V     GCGH
Sbjct: 138 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGCGH 197

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
               +     G+LGLGR R S     G R+   FSYCL   S S+KP  +  G       
Sbjct: 198 GKHAVRGLFDGVLGLGRLRESL----GARYGGVFSYCL--PSVSSKPGFLALGAGKNPSG 251

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD---PAGNGGVIIDSGTSVT 366
             FTP+   P   TF  V L GI+VGG           KLD    A +GG+I+DSGT +T
Sbjct: 252 FVFTPMGTVPGQPTFSTVTLAGINVGGK----------KLDLRPSAFSGGMIVDSGTVIT 301

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLP 425
            L   AY ALR AFR    + +  P+  L DTC++L+G   V VP + L F  GA ++L 
Sbjct: 302 GLQSTAYRALRSAFRKAMEAYRLLPNGDL-DTCYNLTGYKNVVVPKIALTFTGGATINLD 360

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             N ++    +G   FA +G      ++GN+ Q+ F V++D + S+ GF  + C
Sbjct: 361 VPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 411


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 139/372 (37%), Positives = 193/372 (51%), Gaps = 42/372 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRSFATVPC 193
           GEY   L +GTPP     + DTGSD++W QCAPC   +C++Q  P+++PA S +F  +PC
Sbjct: 90  GEYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPC 149

Query: 194 RSPLCR-------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG-----TRVA 241
            S L         K    GC     C+Y  +YG G  T G   +ET TF        RV 
Sbjct: 150 NSSLSMCAGVLAGKAPPPGC----ACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVP 204

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
            +A GC + +   +  +AGL+GLGRG LS  +Q G     +FSYCL     +   S+++ 
Sbjct: 205 GIAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLL 261

Query: 302 GDSAV--SRTARFTPLLANPK---LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
           G SA       R TP +A+P    + T+YY+ L GIS+ GA    I+   F L   G GG
Sbjct: 262 GPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISL-GAKALSISPDAFSLKADGTGG 320

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-----DFSLFDTCFDLSGKTEV--K 409
           +IIDSGT++T L   AY  +R A +    SL   P     D +  D C+ L   T     
Sbjct: 321 LIIDSGTTITSLVNAAYQQVRAAVQ----SLVTLPAIDGSDSTGLDLCYALPTPTSAPPA 376

Query: 410 VPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLA 468
           +P++ LHF GAD+ LPA +Y+I    SG +C A      G +S  GN QQQ   ++YD+ 
Sbjct: 377 MPSMTLHFDGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVR 434

Query: 469 ASRIGFAPRGCA 480
              + FAP  C+
Sbjct: 435 NEMLSFAPAKCS 446


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 134/352 (38%), Positives = 189/352 (53%), Gaps = 27/352 (7%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCR 194
           EY  R+  GTP     +V+DTGSDV W+QC PC   +C+ Q DP++DP+ S +++ VPC 
Sbjct: 112 EYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCA 171

Query: 195 SPLCRKLDS----SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGH 249
           S +C+KL +    SGC     C + +SY DG+ TVG +S + LT   G  V     GCGH
Sbjct: 172 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGCGH 231

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
               +     G+LGLGR R S     G R+   FSYCL   S S+KP  +  G       
Sbjct: 232 GKHAVRGLFDGVLGLGRLRESL----GARYGGVFSYCL--PSVSSKPGFLALGAGKNPSG 285

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG-NGGVIIDSGTSVTRL 368
             FTP+   P   TF  V L GI+VGG  +         L P+  +GG+I+DSGT +T L
Sbjct: 286 FVFTPMGTVPGQPTFSTVTLAGINVGGKKL--------DLRPSAFSGGMIVDSGTVITGL 337

Query: 369 TRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPAT 427
              AY ALR AFR    + +  P+  L DTC++L+G   V VP + L F  GA ++L   
Sbjct: 338 QSTAYRALRSAFRKAMEAYRLLPNGDL-DTCYNLTGYKNVVVPKIALTFTGGATINLDVP 396

Query: 428 NYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           N ++    +G   FA +G      ++GN+ Q+ F V++D + S+ GF  + C
Sbjct: 397 NGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  214 bits (545), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 144/370 (38%), Positives = 194/370 (52%), Gaps = 27/370 (7%)

Query: 129 SGLAQGSGEYFTRLGVGTP-PRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKS 185
           SG+   +  Y T + +G    + + +++DTGSD+ W+QC PC    CY+Q DP+FDPA S
Sbjct: 171 SGIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAAS 230

Query: 186 RSFATVPCRSPLCRK--LDSSGC---------NRRNTCLYQVSYGDGSITVGDFSTETLT 234
            +FA VPC SP C     D++G          N    C Y +SYGDGS + G  + +TL 
Sbjct: 231 PTFAAVPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLG 290

Query: 235 F-RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
               T++     GCG  N GLF   AGL+GLGR  LS  +QT  RF   FSYCL   +TS
Sbjct: 291 LGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTS 350

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
               S+  G S+      +T ++A+P    FY++ + G +V       +TA  F     G
Sbjct: 351 TGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAV--GGGAALTAPGF-----G 403

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
            G V++DSGT +TRL    Y A+R  F A       AP FS+ D C+DL+G+ EV VP +
Sbjct: 404 AGNVLVDSGTVITRLAPSVYKAVRAEF-ARRFEYPAAPGFSILDACYDLTGRDEVNVPLL 462

Query: 414 VLHFR-GADVSLPATNYLIPVDSSGT-FCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAA 469
            L    GA V++ A   L  V   G+  C A A         IIGN QQ+  RVVYD   
Sbjct: 463 TLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVG 522

Query: 470 SRIGFAPRGC 479
           SR+GFA   C
Sbjct: 523 SRLGFADEDC 532


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 144/369 (39%), Positives = 193/369 (52%), Gaps = 24/369 (6%)

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQT 176
           A G  S++V + +  G+ +Y   + +GTP     + +DTGSDV W+QC PC    C SQ 
Sbjct: 124 ATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQR 183

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
           D +FDPAKS +++ VPC +  C +L    +GC+    C Y VSYGDGS T G + ++TL 
Sbjct: 184 DQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQ-CGYVVSYGDGSNTTGVYGSDTLA 242

Query: 235 FR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
              G  V     GCGH   G+F    GLL LGR  +S  +Q    +   FSYCL  + ++
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSA 302

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
           A    +  G  + +     T LL      TFY V L GISVGG  V  + AS F      
Sbjct: 303 A--GYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQV-AVPASAFA----- 354

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLSGKTEVKVP 411
            GG ++D+GT +TRL   AY ALR AFR   +      AP   + DTC+D S    V +P
Sbjct: 355 -GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVTLP 413

Query: 412 TVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           TV L F  GA ++L A   L    SSG   FA  G     +I+GN+QQ+ F V +D   S
Sbjct: 414 TVALTFSGGATLALEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GS 467

Query: 471 RIGFAPRGC 479
            +GF P  C
Sbjct: 468 TVGFMPGAC 476


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 142/361 (39%), Positives = 187/361 (51%), Gaps = 28/361 (7%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   L +GTPP+ V + LDTGSD++W QC PC  C+ Q  P FDP+ S + +   C S 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 197 LCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVARVALGCGH 249
           LC+ L  + C         TC+Y  SYGD S+T G    +  TF   G  V  VA GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 250 DNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-VS 307
            N G+F +   G+ G GRG LS P+Q        FS+C        KPS+++    A + 
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFT-AVNGLKPSTVLLDLPADLY 256

Query: 308 RTAR----FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           ++ R     TPL+ NP   TFYY+ L GI+VG   +  +  S F L   G GG IIDSGT
Sbjct: 257 KSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLP-VPESEFTLK-NGTGGTIIDSGT 314

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFS--LFDTCFDLSGKTEVK--VPTVVLHFRG 419
           ++T L    Y  +RDAF A      + P  S    D  F LS     K  VP +VLHF G
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQV----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           A + LP  NY+  V+ +G+     A    G ++ IGN QQQ   V+YDL  S++ F P  
Sbjct: 371 ATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430

Query: 479 C 479
           C
Sbjct: 431 C 431


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 142/361 (39%), Positives = 187/361 (51%), Gaps = 28/361 (7%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   L +GTPP+ V + LDTGSD++W QC PC  C+ Q  P FDP+ S + +   C S 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 197 LCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVARVALGCGH 249
           LC+ L  + C         TC+Y  SYGD S+T G    +  TF   G  V  VA GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 250 DNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-VS 307
            N G+F +   G+ G GRG LS P+Q        FS+C        KPS+++    A + 
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFT-AVNGLKPSTVLLDLPADLY 256

Query: 308 RTAR----FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           ++ R     TPL+ NP   TFYY+ L GI+VG   +  +  S F L   G GG IIDSGT
Sbjct: 257 KSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLP-VPESEFALK-NGTGGTIIDSGT 314

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFS--LFDTCFDLSGKTEVK--VPTVVLHFRG 419
           ++T L    Y  +RDAF A      + P  S    D  F LS     K  VP +VLHF G
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQV----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           A + LP  NY+  V+ +G+     A    G ++ IGN QQQ   V+YDL  S++ F P  
Sbjct: 371 ATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430

Query: 479 C 479
           C
Sbjct: 431 C 431


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 153/445 (34%), Positives = 212/445 (47%), Gaps = 57/445 (12%)

Query: 66  SLRLH--HVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGF 123
           +LRLH  H D+     T E L  +  +      + L+  A SA RV P + + G  +   
Sbjct: 53  ALRLHATHADAGRGLSTRELLHRMAARSKARSARLLSGRAASA-RVDPGSYTDGVPDT-- 109

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
                        EY   + +GTPP+ V ++LDTGSD+ W QCAPC  C+ Q+ P F+P+
Sbjct: 110 -------------EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPS 156

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRN----TCLYQVSYGDGSITVGDFSTETLTFR--- 236
           +S +F+ +PC   +CR L  S C  ++     C+Y  +Y D SIT G   ++T +F    
Sbjct: 157 RSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASAD 216

Query: 237 ----GTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
               G  V  +  GCG  N G+FV+   G+ G  RG LS P Q        FSYC     
Sbjct: 217 HAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV---DNFSYCFT-AI 272

Query: 292 TSAKPSSMVFG-------DSA------VSRTARFTPLLANPKLDTFYYVELVGISVGGAH 338
           T ++PS +  G       D+A      V  TA      +  K    YY+ L G++VG   
Sbjct: 273 TGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKA---YYISLKGVTVGTTR 329

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           +  I  S+F L   G GG I+DSGT +T L    Y  + DAF A           SL   
Sbjct: 330 LP-IPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQL 388

Query: 399 CFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---CFAF-AGTMSGLSIIG 454
           CF +    +  VP +VLHF GA + LP  NY+  ++ +G     C A  AG    LS+IG
Sbjct: 389 CFSVPPGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAG--EDLSVIG 446

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGC 479
           N QQQ   V+YDLA   + F P  C
Sbjct: 447 NFQQQNMHVLYDLANDMLSFVPARC 471


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 144/369 (39%), Positives = 192/369 (52%), Gaps = 24/369 (6%)

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQT 176
           A G  S++V + +  G+ +Y   + +GTP     + +DTGSDV W+QC PC    C SQ 
Sbjct: 124 ATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQR 183

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
           D +FDPAKS +++ VPC +  C +L    +GC+    C Y VSYGDGS T G + ++TL 
Sbjct: 184 DQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQ-CGYVVSYGDGSNTTGVYGSDTLA 242

Query: 235 FR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
              G  V     GCGH   G+F    GLL LGR  +S  +Q    +   FSYCL  + ++
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSA 302

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
           A    +  G    +     T LL      TFY V L GISVGG  V  + AS F      
Sbjct: 303 A--GYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQV-AVPASAFA----- 354

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLSGKTEVKVP 411
            GG ++D+GT +TRL   AY ALR AFR   +      AP   + DTC+D S    V +P
Sbjct: 355 -GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLP 413

Query: 412 TVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           TV L F  GA ++L A   L    SSG   FA  G     +I+GN+QQ+ F V +D   S
Sbjct: 414 TVALTFSGGATLALEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GS 467

Query: 471 RIGFAPRGC 479
            +GF P  C
Sbjct: 468 TVGFMPGAC 476


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 144/397 (36%), Positives = 202/397 (50%), Gaps = 47/397 (11%)

Query: 124 SSSVISGLAQGS---------GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC----- 169
           SS+  +GL  G+         GEY   L +GTPP     + DTGSD++W QCAPC     
Sbjct: 64  SSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVT 123

Query: 170 ---KKCYSQTDPVFDPAKSRSFATVPCRSPL--CRKLDSSGCNRRNTCLYQVSYGDGSIT 224
               +C+ Q+  +++P+ S +F  +PC SPL  C  +          C+Y  +YG G  T
Sbjct: 124 DTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTG-WT 182

Query: 225 VGDFSTETLTFRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRR 278
            G  S ET TF  +      RV  +A GC + +   +  +AGL+GLGRG +S  +Q G  
Sbjct: 183 AGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQLGA- 241

Query: 279 FNRKFSYCLVDRSTSAKPSSMVFGDSAVSR-----TARFTPLLANPK---LDTFYYVELV 330
               FSYCL     +   S+++ G SA +        R TP +A P    + T+YY+ L 
Sbjct: 242 --GAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLT 299

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS---L 387
           GISVG   +  I    F L   G GG+IIDSGT++T L   AY  +R A R+   +   L
Sbjct: 300 GISVGETAL-AIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPL 358

Query: 388 KRAPDFSL-FDTCFDLSGKT-EVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFA 444
              PD S   D CF L   T    +P++ LHF  GAD+ LP  NY+I    SG +C A  
Sbjct: 359 AHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI--LGSGVWCLAMR 416

Query: 445 G-TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             T+  +S++GN QQQ   V+YD+    + FAP  C+
Sbjct: 417 NQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVCS 453


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 129/381 (33%), Positives = 187/381 (49%), Gaps = 21/381 (5%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S+  + G  S+ V SG  Q    Y  R G+G+P + + + LDT +D  W  C+PC  C S
Sbjct: 56  SKAASTGVSSAPVASG--QSPPSYVVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPS 113

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN---------TCLYQVSYGDGSITV 225
               +F PA S S+A +PC S +C  L    C  ++          C +   + D S   
Sbjct: 114 SGS-LFAPANSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQ- 171

Query: 226 GDFSTETLTFRGTRVARVALGCGHDNEG--LFVAAAGLLGLGRGRLSFPTQTGRRFNRKF 283
              +++ L      +   A GC     G    +   GLLGLGRG ++  +Q G  +N  F
Sbjct: 172 ASLASDWLHLGKDAIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVF 231

Query: 284 SYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGIT 343
           SYCL    +     S+  G +   R  R+TP+L NP   + YYV + G+SVG A V+ + 
Sbjct: 232 SYCLPSYKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVK-VP 290

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLS 403
           A  F  DPA   G ++DSGT +TR T P Y ALR+ FR   ++         FDTCF+  
Sbjct: 291 AGSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTD 350

Query: 404 GKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA----GTMSGLSIIGNIQQ 458
                  P V +H  G  D++LP  N LI   ++   C A A       + ++++ N+QQ
Sbjct: 351 EVAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQ 410

Query: 459 QGFRVVYDLAASRIGFAPRGC 479
           Q  RVV+D+A SR+GFA   C
Sbjct: 411 QNLRVVFDVANSRVGFARESC 431


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 142/353 (40%), Positives = 193/353 (54%), Gaps = 19/353 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFATVPC 193
           E+   +G+GTP +   ++ DTGSD+ W+QC PC     C+ Q DP+FDP+KS ++A V C
Sbjct: 148 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 207

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNE 252
             P C            TCLY V YGDGS T G  S +TL    +R +A    GCG  N 
Sbjct: 208 GEPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRALAGFPFGCGTRNL 267

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT--A 310
           G F    GLLGLGRG LS P+Q    F   FSYCL   S+++    +  G +  + T  A
Sbjct: 268 GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL--PSSNSTTGYLTIGATPATDTGAA 325

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
           ++T +L  P+  +FY+VELV I +GG ++  +  ++F       GG ++DSGT +T L  
Sbjct: 326 QYTAMLRKPQFPSFYFVELVSIDIGG-YILPVPPAVFT-----RGGTLLDSGTVLTYLPA 379

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNY 429
            AY  LRD FR        AP   + D C+D +G++EV VP V   F  GA   L     
Sbjct: 380 QAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFFGV 439

Query: 430 LIPVDSSGTFCFAFAGTMSG---LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +I +D +   C AFA   +G   LSIIGN QQ+   V+YD+AA +IGF P  C
Sbjct: 440 MIFLDEN-VGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 146/403 (36%), Positives = 205/403 (50%), Gaps = 40/403 (9%)

Query: 91  RDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRY 150
           +D LRVKS        VR+   N S G       +++ + +    G Y   +G+GTP + 
Sbjct: 101 QDQLRVKSF------QVRLS-MNPSSGVFKE-MQTTIPASIVPTGGAYVVTVGLGTPKKD 152

Query: 151 VYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRR 209
             +  DTGSD+ W QC PC   C+ Q  P FDP  S S+  V C S  C+ +       +
Sbjct: 153 FTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQ 212

Query: 210 ----NTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-GCGHDNEGLFVAAAGLLGL 264
               NTCLY + YG G  T+G  +TETL    + V +  L GC  ++ G F    GLLGL
Sbjct: 213 DCISNTCLYGIQYGSG-YTIGFLATETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGL 271

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS---MVFGDSAVSRTARFTPLLANPKL 321
           GR  ++ P+QT  ++   FSYCL      A PSS   + FG   VS+ A+ TP+  +PKL
Sbjct: 272 GRSPIALPSQTTNKYKNLFSYCL-----PASPSSTGHLSFG-VEVSQAAKSTPI--SPKL 323

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
              Y +  VGISV G  +  I  S+ +         IIDSGT+ T L  P Y AL  AFR
Sbjct: 324 KQLYGLNTVGISVRGRELP-INGSISR--------TIIDSGTTFTFLPSPTYSALGSAFR 374

Query: 382 AGASSLKRAPDFSLFDTCFDLS--GKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGT 438
              ++       S F  C+D S  G   + +P + + F G  +V +  +  +IPV+    
Sbjct: 375 EMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKE 434

Query: 439 FCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            C AFA  G+ S  +I GN QQ+ + V+YD+A   +GFAP+GC
Sbjct: 435 VCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  212 bits (539), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 144/417 (34%), Positives = 201/417 (48%), Gaps = 43/417 (10%)

Query: 91  RDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRY 150
           R++LR  +  + A SA  +  R  S     G ++  V         EY   + +GTPP+ 
Sbjct: 44  RELLRRMAARSKARSARLLSGRAASARMDPGSYTDGV------PDTEYLVHMAIGTPPQP 97

Query: 151 VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN 210
           V ++LDTGSD+ W QCAPC  C+ Q+ P F+P++S +F+ +PC   +CR L  S C  ++
Sbjct: 98  VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 157

Query: 211 ----TCLYQVSYGDGSITVGDFSTETLTFR-------GTRVARVALGCGHDNEGLFVA-A 258
                C+Y  +Y D SIT G   ++T +F        G  V  +  GCG  N G+FV+  
Sbjct: 158 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 217

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-------DSA------ 305
            G+ G  RG LS P Q        FSYC     T ++PS +  G       D+A      
Sbjct: 218 TGIAGFSRGALSMPAQLKV---DNFSYCFT-AITGSEPSPVFLGVPPNLYSDAAGGGHGV 273

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           V  TA      +  K    YY+ L G++VG   +  I  S+F L   G GG I+DSGT +
Sbjct: 274 VQSTALIRYHSSQLKA---YYISLKGVTVGTTRLP-IPESVFALKEDGTGGTIVDSGTGM 329

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLP 425
           T L    Y  + DAF A           SL   CF +    +  VP +VLHF GA + LP
Sbjct: 330 TMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLP 389

Query: 426 ATNYLIPVDSSGTF---CFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             NY+  ++ +G     C A       LS+IGN QQQ   V+YDLA   + F P  C
Sbjct: 390 RENYMFEIEEAGGIRLTCLAI-NAGEDLSVIGNFQQQNMHVLYDLANDMLSFVPARC 445


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  212 bits (539), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 132/356 (37%), Positives = 182/356 (51%), Gaps = 18/356 (5%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFA 189
           G+ Q S  Y  R  +GTP + + + LDT +D  WI C+ C  C S    +FDP+KS S  
Sbjct: 81  GIVQ-SPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSR 137

Query: 190 TVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
           T+ C +P C++  +  C    +C + ++YG GS      + +TLT     +     GC +
Sbjct: 138 TLQCEAPQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLATDVIPNYTFGCIN 196

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
              G  + A GL+GLGRG LS  +Q+   +   FSYCL +  +S    S+  G       
Sbjct: 197 KASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR 256

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
            + TPLL NP+  + YYV LVGI VG   V  I  S    DPA   G I DSGT  TRL 
Sbjct: 257 IKTTPLLKNPRRSSLYYVNLVGIRVGNKIV-DIPTSALAFDPATGAGTIFDSGTVYTRLV 315

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPAT 427
            PAY+A+R+ FR     +K A   SL  FDTC+  S    V  P+V   F G +V+LP  
Sbjct: 316 EPAYVAMRNEFR---RRVKNANATSLGGFDTCYSGS----VVFPSVTFMFAGMNVTLPPD 368

Query: 428 NYLIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           N LI   +    C A A       S L++I ++QQQ  RV+ D+  SR+G +   C
Sbjct: 369 NLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  212 bits (539), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 146/418 (34%), Positives = 203/418 (48%), Gaps = 45/418 (10%)

Query: 91  RDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRY 150
           R++LR  +  + A SA  +  R  S     G ++  V         EY   + +GTPP+ 
Sbjct: 70  RELLRRMAARSKARSARLLSGRAASARMDPGSYTDGV------PDTEYLVHMAIGTPPQP 123

Query: 151 VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN 210
           V ++LDTGSD+ W QCAPC  C+ Q+ P F+P++S +F+ +PC   +CR L  S C  ++
Sbjct: 124 VQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQS 183

Query: 211 ----TCLYQVSYGDGSITVGDFSTETLTFR-------GTRVARVALGCGHDNEGLFVA-A 258
                C+Y  +Y D SIT G   ++T +F        G  V  +  GCG  N G+FV+  
Sbjct: 184 WGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNE 243

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-------DSA------ 305
            G+ G  RG LS P Q        FSYC     T ++PS +  G       D+A      
Sbjct: 244 TGIAGFSRGALSMPAQLKV---DNFSYCFT-AITGSEPSPVFLGVPPNLYSDAAGGGHGV 299

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           V  TA      +  K    YY+ L G++VG   +  I  S+F L   G GG I+DSGT +
Sbjct: 300 VQSTALIRYHSSQLKA---YYISLKGVTVGTTRLP-IPESVFALKEDGTGGTIVDSGTGM 355

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLP 425
           T L    Y  + DAF A           SL   CF +    +  VP +VLHF GA + LP
Sbjct: 356 TMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLP 415

Query: 426 ATNYLIPVDSSGTF---CFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             NY+  ++ +G     C A  AG    LS+IGN QQQ   V+YDLA   + F P  C
Sbjct: 416 RENYMFEIEEAGGIRLTCLAINAG--EDLSVIGNFQQQNMHVLYDLANDMLSFVPARC 471


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  212 bits (539), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 141/363 (38%), Positives = 197/363 (54%), Gaps = 18/363 (4%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           S +    GEY     VGTPP  +  ++DTGSD++W+QC PC+ CY+QT P+FDP++S+++
Sbjct: 85  STVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTY 144

Query: 189 ATVPCRSPLCRKLDSSG-CNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT-----RVA 241
            T+PC S +C+ + S+  C+  N  C Y ++YGD S + GD S ETLT   T     +  
Sbjct: 145 KTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFP 204

Query: 242 RVALGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSM 299
           +  +GCGH+N+G F    +G++GLG G +S  +Q       KFSYCL    S S   S +
Sbjct: 205 KTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKL 264

Query: 300 VFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            FGD AV   R    TP++    L  FY++ L   SVG   +   ++S       GN  +
Sbjct: 265 NFGDEAVVSGRGTVSTPIVPKNGLG-FYFLTLEAFSVGDNRIEFGSSSFESSGGEGN--I 321

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF-DTCFDLSGKTEVKVPTVVLH 416
           IIDSGT++T L    Y+ L  A  A A  L+R  D S F   C+  +   E+ VP +  H
Sbjct: 322 IIDSGTTLTILPEDDYLNLESAV-ADAIELERVEDPSKFLRLCYRTTSSDELNVPVITAH 380

Query: 417 FRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
           F+GADV L   +  I VD  G  CFAF  +  G  I GN+ QQ   V YDL    + F P
Sbjct: 381 FKGADVELNPISTFIEVD-EGVVCFAFRSSKIG-PIFGNLAQQNLLVGYDLVKQTVSFKP 438

Query: 477 RGC 479
             C
Sbjct: 439 TDC 441


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  211 bits (538), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 142/353 (40%), Positives = 192/353 (54%), Gaps = 19/353 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFATVPC 193
           E+   +G+GTP +   ++ DTGSD+ W+QC PC     C+ Q DP+FDP+KS ++A V C
Sbjct: 143 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 202

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNE 252
             P C            TCLY V YGDGS T G  S +TL    +R +     GCG  N 
Sbjct: 203 GEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALTGFPFGCGTRNL 262

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT--A 310
           G F    GLLGLGRG LS P+Q    F   FSYCL   S+++    +  G +  + T  A
Sbjct: 263 GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL--PSSNSTTGYLTIGATPATDTGAA 320

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
           ++T +L  P+  +FY+VELV I +GG +V  +  ++F       GG ++DSGT +T L  
Sbjct: 321 QYTAMLRKPQFPSFYFVELVSIDIGG-YVLPVPPAVFT-----RGGTLLDSGTVLTYLPA 374

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNY 429
            AY  LRD FR        AP   + D C+D +G++EV VP V   F  GA   L     
Sbjct: 375 QAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDFFGV 434

Query: 430 LIPVDSSGTFCFAFAGTMSG---LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +I +D +   C AFA   +G   LSIIGN QQ+   V+YD+AA +IGF P  C
Sbjct: 435 MIFLDEN-VGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 190/358 (53%), Gaps = 32/358 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCR 194
           EY   +G+GTP     +++DTGSD+ W+QCAPC    CY Q DP+FDP++S ++A +PC 
Sbjct: 119 EYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCN 178

Query: 195 SPLCRKLD--------SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVAL 245
           +  CR L         +SG      C Y ++YGDGS T G +S ETLT   G  V     
Sbjct: 179 TDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKDFHF 238

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           GCGHD +G      GLLGLG    S   QT   +   FSYCL   + + +   +  G   
Sbjct: 239 GCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCL--PAANDQAGFLALGAPV 296

Query: 306 VSRTA-RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
              +   FTP++   +  TFY V + GI+VGG  +  +  S F      +GG+IIDSGT 
Sbjct: 297 NDASGFVFTPMVREQQ--TFYVVNMTGITVGGEPID-VPPSAF------SGGMIIDSGTV 347

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           VT L   AY AL+ AFR   ++    P+  L DTC++ +G + V VP V L F  GA V 
Sbjct: 348 VTELQHTAYAALQAAFRKAMAAYPLLPNGEL-DTCYNFTGHSNVTVPRVALTFSGGATVD 406

Query: 424 LPATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           L   + ++ +D+    C AF  AG  +   I+GN+ Q+   V+YD+   R+GF    C
Sbjct: 407 LDVPDGIL-LDN----CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 141/380 (37%), Positives = 193/380 (50%), Gaps = 20/380 (5%)

Query: 113 NRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
           +RSR RA  G+ ++    L     EY   L +G PP     + DTGSD+ W QC PCK C
Sbjct: 47  HRSRLRALSGYDATSPR-LHSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLC 105

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
           + Q  PV+DP+ S +F+ +PC S  C  + S  C   + C Y+ +YGDG+ + G   TET
Sbjct: 106 FPQDTPVYDPSASSTFSPLPCSSATCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTET 165

Query: 233 LTFRGT----RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           LT   +     V  VA GCG DN G  + + G +GLGRG LS   Q G     KFSYCL 
Sbjct: 166 LTLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGV---GKFSYCLT 222

Query: 289 DRSTSAKPSSMVFGDSAV----SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITA 344
           D   SA  S  + G  A       T + TPLL +P+  + Y+V L GIS+G   +  I  
Sbjct: 223 DFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLP-IPN 281

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP--DFSLFDTCFDL 402
             F L   G GG+I+DSGT+ T L    +   R+     A  L + P    SL   CF  
Sbjct: 282 GTFDLRGDGTGGMIVDSGTTFTILAESGF---REVVGRVARVLGQPPVNASSLDAPCFPA 338

Query: 403 SGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTM-SGLSIIGNIQQQG 460
                  +P +VLHF  GAD+ L   NY+   +   +FC   AGT     S++GN QQQ 
Sbjct: 339 PAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQN 398

Query: 461 FRVVYDLAASRIGFAPRGCA 480
            ++++D    ++ F P  C+
Sbjct: 399 IQMLFDTTVGQLSFLPTDCS 418


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 130/351 (37%), Positives = 179/351 (50%), Gaps = 17/351 (4%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  R  +GTP + + + LDT +D  WI C+ C  C S    +FDP+KS S  T+ C 
Sbjct: 85  SPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCE 142

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C++  +  C    +C + ++YG GS      + +TLT     +     GC +   G 
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGT 201

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            + A GL+GLGRG LS  +Q+   +   FSYCL +  +S    S+  G        + TP
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV LVGI VG   V  I  S    DPA   G I DSGT  TRL  PAY+
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIV-DIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYV 320

Query: 375 ALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIP 432
           A+R+ FR     +K A   SL  FDTC+  S    V  P+V   F G +V+LP  N LI 
Sbjct: 321 AVRNEFR---RRVKNANATSLGGFDTCYSGS----VVFPSVTFMFAGMNVTLPPDNLLIH 373

Query: 433 VDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +    C A A       S L++I ++QQQ  RV+ D+  SR+G +   C
Sbjct: 374 SSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  211 bits (536), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 134/363 (36%), Positives = 190/363 (52%), Gaps = 32/363 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y   +G+G     V  V+DT S++ W+QC PC+ C+ Q DP+FDP+ S S+A VPC S  
Sbjct: 120 YVATVGLGAAEATV--VVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSS 177

Query: 198 CRKL------DSSGCNRRN----TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
           C  L       +S C   N     C Y +SY DGS + G  + + L   G  +     GC
Sbjct: 178 CDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEGFVFGC 237

Query: 248 GHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           G  N+G  F   +GL+GLGR  +S  +QT  +F   FSYCL  R + +  S ++  DS+ 
Sbjct: 238 GTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGSSGSLVLGDDSSA 297

Query: 307 SRTAR---FTPLLAN--PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
            R +    +T ++++  P    FY++ L GI+VGG  V     S         G VIIDS
Sbjct: 298 YRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFSA--------GRVIIDS 349

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA- 420
           GT +T L    Y A+R  F +  +   +AP FS+ DTCF+L+G  EV+VP++   F G+ 
Sbjct: 350 GTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCFNLTGLKEVQVPSLKFVFEGSV 409

Query: 421 --DVSLPATNYLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAP 476
             +V      Y +  D+S   C A A   S    SIIGN QQ+  RV++D   S+IGFA 
Sbjct: 410 EVEVDSKGVLYFVSSDAS-QVCLALASLKSEYDTSIIGNYQQKNLRVIFDTLGSQIGFAQ 468

Query: 477 RGC 479
             C
Sbjct: 469 ETC 471


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  211 bits (536), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 130/351 (37%), Positives = 179/351 (50%), Gaps = 17/351 (4%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  R  +GTP + + + LDT +D  WI C+ C  C S    +FDP+KS S  T+ C 
Sbjct: 85  SPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCE 142

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C++  +  C    +C + ++YG GS      + +TLT     +     GC +   G 
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGT 201

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            + A GL+GLGRG LS  +Q+   +   FSYCL +  +S    S+  G        + TP
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV LVGI VG   V  I  S    DPA   G I DSGT  TRL  PAY+
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIV-DIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYV 320

Query: 375 ALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIP 432
           A+R+ FR     +K A   SL  FDTC+  S    V  P+V   F G +V+LP  N LI 
Sbjct: 321 AVRNEFR---RRVKNANATSLGGFDTCYSGS----VVFPSVTFMFAGMNVTLPPDNLLIH 373

Query: 433 VDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +    C A A       S L++I ++QQQ  RV+ D+  SR+G +   C
Sbjct: 374 SSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 147/397 (37%), Positives = 197/397 (49%), Gaps = 55/397 (13%)

Query: 104 ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVW 163
           +  V   P   S     G   +++ SG+  GSGEYF  + VG+PP++  ++LDTGSD+ W
Sbjct: 136 KEVVTTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNW 195

Query: 164 IQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSI 223
           IQC PC  C+ Q D                                 +C Y   YGD S 
Sbjct: 196 IQCLPCYDCFQQND-------------------------------NQSCPYYYWYGDSSN 224

Query: 224 TVGDFSTETLTFRGT---------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ 274
           T GDF+ ET T   T          V  +  GCGH N GLF  AAGLLGLGRG LSF +Q
Sbjct: 225 TTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQ 284

Query: 275 TGRRFNRKFSYCLVDRSTSAKPSS-MVFG---DSAVSRTARFTPLLANPK--LDTFYYVE 328
               +   FSYCLVDR++    SS ++FG   D        FT  +A  +  +DTFYYV+
Sbjct: 285 LQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQ 344

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           +  I V G  V  I    + +   G GG IIDSGT+++    PAY  +++     A    
Sbjct: 345 IKSILVAG-EVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG-- 401

Query: 389 RAP---DFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFA 444
           + P   DF + D CF++SG   V++P + + F  GA  + P  N  I ++     C A  
Sbjct: 402 KYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAML 460

Query: 445 GT-MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           GT  S  SIIGN QQQ F ++YD   SR+G+AP  CA
Sbjct: 461 GTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 497


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 149/388 (38%), Positives = 190/388 (48%), Gaps = 43/388 (11%)

Query: 104 ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVW 163
           ++AV +P R        GGF  S+         EY   LG GTP     +++DTGSDV W
Sbjct: 113 DAAVTIPTRL-------GGFVDSL---------EYVVTLGFGTPSVPQVLLMDTGSDVSW 156

Query: 164 IQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS---SGCNRRNT-CLYQVS 217
           +QC PC   KCY Q DP+FDP+KS ++A + C +  CRKL     +GC    T C Y V 
Sbjct: 157 VQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVE 216

Query: 218 YGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTG 276
           Y DGS + G +S ETLT   G  V     GCG D  G      GLLGLG   +S   QT 
Sbjct: 217 YADGSHSRGVYSNETLTLAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTS 276

Query: 277 RRFNRKFSYCLVDRSTSAKPSSMVFGD--SAVSRTARFTPLLANPKLDTFYYVELVGISV 334
             +   FSYCL   ++ A    +V G   S       FTP+   P   TFY V + GISV
Sbjct: 277 SVYGGAFSYCLPALNSEA--GFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISV 334

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
           GG  +  I  S F+      GG+IIDSGT  T L   AY AL  A R    +    P   
Sbjct: 335 GGKPLH-IPQSAFR------GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD 387

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF--AGTMSGLS 451
            FDTC++ +G + + VP V   F  GA + L   N ++  D     C AF  +G   GL 
Sbjct: 388 -FDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILVND-----CLAFQESGPDDGLG 441

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           IIGN+ Q+   V+YD     +GF    C
Sbjct: 442 IIGNVNQRTLEVLYDAGRGNVGFRAGAC 469


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 177/341 (51%), Gaps = 24/341 (7%)

Query: 155 LDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLY 214
           +DTGSD++W QCAPC  C  Q  P FD  KS ++  +PCRS  C  L S  C ++  C+Y
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKK-MCVY 59

Query: 215 QVSYGDGSITVGDFSTETLTFRG-----TRVARVALGCGHDNEGLFVAAAGLLGLGRGRL 269
           Q  YGD + T G  + ET TF        R   +A GCG  N G    ++G++G GRG L
Sbjct: 60  QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPL 119

Query: 270 SFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG--------DSAVSRTARFTPLLANPKL 321
           S  +Q G     +FSYCL     SA PS + FG        +++     + TP + NP L
Sbjct: 120 SLVSQLGP---SRFSYCLTSY-LSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPAL 175

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
              Y++ L  IS+ G  +  I   +F ++  G GGVIIDSGTS+T L + AY A+R    
Sbjct: 176 PNMYFLSLKAISL-GTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGL- 233

Query: 382 AGASSLKRAPDFSL-FDTCFDL--SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGT 438
             A  L    D  +  DTCF         V VP +V HF  A+++L   NY++   ++G 
Sbjct: 234 VSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGY 293

Query: 439 FCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            C   A T  G +IIGN QQQ   ++YD+  S + F P  C
Sbjct: 294 LCLVMAPTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 134/366 (36%), Positives = 190/366 (51%), Gaps = 43/366 (11%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC---KKCYSQTDPVFDPAKSRSFATVPC 193
           EY   +G+G+P     +V+DTGSDV W+QC PC     C++    +FDPA S ++A   C
Sbjct: 134 EYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNC 193

Query: 194 RSPLCRKL----DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCG 248
            +  C +L    +++GC+ ++ C Y V YGDGS T G +S++ LT  G+ V R    GC 
Sbjct: 194 SAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQFGCS 253

Query: 249 HDN--EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF----- 301
           H     G+     GL+GLG    S  +QT  R+ + FSYCL      A P+S  F     
Sbjct: 254 HAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCL-----PATPASSGFLTLGA 308

Query: 302 -GDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
                    +RF  TP+L + K+ T+Y+  L  I+VGG  + G++ S+F        G +
Sbjct: 309 PASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL-GLSPSVFA------AGSL 361

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           +DSGT +TRL   AY AL  AFRAG +   RA    + DTCF+ +G  +V +PTV L F 
Sbjct: 362 VDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFA 421

Query: 419 GADVSLPATNYLIPVDSSGTF---CFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIG 473
           G  V        + +D+ G     C AFA T        IGN+QQ+ F V+YD+     G
Sbjct: 422 GGAV--------VDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGVFG 473

Query: 474 FAPRGC 479
           F    C
Sbjct: 474 FRAGAC 479


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 149/390 (38%), Positives = 197/390 (50%), Gaps = 36/390 (9%)

Query: 114 RSRGRANGGFSSSVISGLAQGS-------GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           RS+ RA    SSS  + ++ G+        EY   L +GTPP+ V + LDTGSD+VW QC
Sbjct: 60  RSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSDLVWTQC 119

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS--GCNRR--NTCLYQVSYGDGS 222
            PC  C++Q+ P +D ++S +FA   C S  C KLD S   C  +   TC +  SYGD S
Sbjct: 120 QPCAVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQTCAFSYSYGDKS 178

Query: 223 ITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFN 280
            T+G    ET++F  G  V  V  GCG +N G+F +   G+ G GRG LS P+Q      
Sbjct: 179 ATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV--- 235

Query: 281 RKFSYCLVDRSTSAKPSSMVFGDSAV-----SRTARFTPLLANPKLDTFYYVELVGISVG 335
             FS+C    S   KPS+++F   A        T + TPL+ NP   TFYY+ L GI+VG
Sbjct: 236 GNFSHCFTAVS-GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVG 294

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
              +  +  S F L   G GG IIDSGT+ T L    Y  + D F A      + P    
Sbjct: 295 STRLP-VPESAFALK-NGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV----KLPVVPS 348

Query: 396 FDT----CFDLS--GKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG 449
            +T    CF     GK    VP +VLHF GA + LP  NY+      G      A     
Sbjct: 349 NETGPLLCFSAPPLGKAP-HVPKLVLHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGE 407

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           ++IIGN QQQ   V+YDL  S++ F    C
Sbjct: 408 MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 132/376 (35%), Positives = 197/376 (52%), Gaps = 29/376 (7%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V SG    +  Y   +G+G     V  ++DT S++ W+QCAPC+ C+ Q DP+FDP+ S 
Sbjct: 142 VTSGAKLRTLNYVATVGLGGGEATV--IVDTASELTWVQCAPCESCHDQQDPLFDPSSSP 199

Query: 187 SFATVPCRSPLCRKLD---------SSGCNRRN----TCLYQVSYGDGSITVGDFSTETL 233
           S+A VPC S  C  L          ++ C  ++     C Y +SY DGS + G  + + L
Sbjct: 200 SYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRL 259

Query: 234 TFRGTRVARVALGCGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
           +  G  +     GCG  N+G  F   +GL+GLGR +LS  +QT  +F   FSYCL  + +
Sbjct: 260 SLAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKES 319

Query: 293 SAKPSSMVFGDSAVSRTAR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
            +  S ++  DS+V R +    +  ++++P    FY+V L GI+VGG  V     S    
Sbjct: 320 DSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGG 379

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
                   IIDSGT +T L    Y A++  F +  +   +AP FS+ DTCF+++G  EV+
Sbjct: 380 GGK----AIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREVQ 435

Query: 410 VPTVVLHFRGA---DVSLPATNYLIPVDSSGTFCFAFAGTMSGL--SIIGNIQQQGFRVV 464
           VP++ L F G    +V      Y +  DSS   C A A   S    +IIGN QQ+  RV+
Sbjct: 436 VPSLKLVFDGGVEVEVDSGGVLYFVSSDSS-QVCLAMAPLKSEYETNIIGNYQQKNLRVI 494

Query: 465 YDLAASRIGFAPRGCA 480
           +D + S++GFA   C 
Sbjct: 495 FDTSGSQVGFAQETCG 510


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 138/373 (36%), Positives = 188/373 (50%), Gaps = 49/373 (13%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y   +G+G     V  ++DT S++ W+QCAPC+ C+ Q  P+FDP+ S S+A VPC SP 
Sbjct: 143 YVATVGLGGGEATV--IVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPS 200

Query: 198 CRKLDSS------------GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL 245
           C  L                  R   C Y +SY DGS + G  + + L+  G  +     
Sbjct: 201 CDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGEVIDGFVF 260

Query: 246 GCGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           GCG  N+G  F   +GL+GLGR +LS  +QT  +F   FSYCL     S    S+V GD 
Sbjct: 261 GCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGDD 320

Query: 305 A-----------VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR--GITASLFKLDP 351
                        S  +   PLL  P    FY V L GI+VGG  V   G +A       
Sbjct: 321 PSAYRNSTPVVYTSMVSNSDPLLQGP----FYLVNLTGITVGGQEVESTGFSAR------ 370

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
                 I+DSGT +T L    Y A+R  F +  +   +AP FS+ DTCF+++G  EV+VP
Sbjct: 371 -----AIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVP 425

Query: 412 TVVLHFR-GADVSLPATN--YLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYD 466
           ++ L F  GA+V + +    Y +  DSS   C A A   S    SIIGN QQ+  RVV+D
Sbjct: 426 SLTLVFDGGAEVEVDSGGVLYFVSSDSS-QVCLAVASLKSEDETSIIGNYQQKNLRVVFD 484

Query: 467 LAASRIGFAPRGC 479
            +AS++GFA   C
Sbjct: 485 TSASQVGFAQETC 497


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 144/418 (34%), Positives = 203/418 (48%), Gaps = 41/418 (9%)

Query: 77  FNRTPEHL--FNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQG 134
           +N    HL  +N  ++R V RV     F  +A  V P+              V S +   
Sbjct: 46  YNSQQTHLQRWNKAMRRSVSRVHH---FQRTAATVSPKE-------------VESEIIAN 89

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
            GEY   L +GTPP  +  + DTGSD++W QC PC KCY Q  P+FDP  S+++  + C 
Sbjct: 90  GGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCD 149

Query: 195 SPLCRKL-DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCG 248
           +  C+ L +SS C+    C Y   YGD S T G+ + +T+T   T        +  +GCG
Sbjct: 150 TRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCG 209

Query: 249 HDNEGLFVAA-AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA--KPSSMVFGDSA 305
             N G F    +G++GLG G +S  +Q G     KFSYCLV  S+ +    S + FG +A
Sbjct: 210 RRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNA 269

Query: 306 V--SRTARFTPLLA-NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
           V      + TPL++ NP  DTFYY+ L  +SVG   +    +S         G +IIDSG
Sbjct: 270 VVSGSGVQSTPLISKNP--DTFYYLTLEAMSVGDKKIEFGGSSFGGS----EGNIIIDSG 323

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFS-LFDTCFDLSGKTEVKVPTVVLHFRGAD 421
           TS+T      +     A      + +R  D S L   C+  +   ++KVP +  HF GAD
Sbjct: 324 TSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPT--PDLKVPVITAHFNGAD 381

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           V L   N  I + S    C AF  T SG +I GN+ Q  F + YD+    + F P  C
Sbjct: 382 VVLQTLNTFILI-SDDVLCLAFNSTQSG-AIFGNVAQMNFLIGYDIQGKSVSFKPTDC 437


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 145/402 (36%), Positives = 207/402 (51%), Gaps = 48/402 (11%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGS---GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP 168
           R+ +R  A    + + +S   Q S   GEY   L +GTPP     + DTGSD++W QCAP
Sbjct: 57  RHNARQLAASSSNGTTVSAPTQISPTAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAP 116

Query: 169 C-KKCYSQTDPVFDPAKSRSFATVPCRSPLCR-------KLDSSGCNRRNTCLYQVSYGD 220
           C  +C+ Q  P+++P+ S +FA +PC S L              GC    TC+Y ++YG 
Sbjct: 117 CSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTTPPPGC----TCMYNMTYGS 172

Query: 221 GSITVGDFSTETLTF------RGTRVARVALGCGHDNEGLFV-AAAGLLGLGRGRLSFPT 273
           G  +V   S ET TF        T V  +A GC + + G    +A+GL+GLGRG LS  +
Sbjct: 173 GWTSVYQGS-ETFTFGSSTPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVS 231

Query: 274 QTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV---SRTARFTPLLANPK---LDTFYYV 327
           Q G     KFSYCL     +   S+++ G SA    +     TP +A+P    + T+YY+
Sbjct: 232 QLGV---PKFSYCLTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYL 288

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
            L GIS+G   +  I  +   L   G GG IIDSGT++T L   AY       RA   SL
Sbjct: 289 NLTGISLGTTALS-IPTTALSLKADGTGGFIIDSGTTITLLGNTAY----QQVRAAVVSL 343

Query: 388 KRAPDF------SLFDTCFDLSGKTEV--KVPTVVLHFRGADVSLPATNYLIPVDSSGTF 439
              P        +  D CF+L   T     +P++ LHF GAD+ LPA +Y++ +DS+  +
Sbjct: 344 VTLPTTDGGSAATGLDLCFELPSSTSAPPTMPSMTLHFDGADMVLPADSYMM-LDSN-LW 401

Query: 440 CFAFAG-TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           C A    T  G+SI+GN QQQ   ++YD+    + FAP  C+
Sbjct: 402 CLAMQNQTDGGVSILGNYQQQNMHILYDVGQETLTFAPAKCS 443


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 142/388 (36%), Positives = 191/388 (49%), Gaps = 21/388 (5%)

Query: 106 AVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQ 165
            V++ PRN S+   N   + + +S       +Y   L +GTPP   Y  +DTGSD++W+Q
Sbjct: 30  TVKLIPRNSSQVLFNRITAQTPVS---VHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQ 86

Query: 166 CAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSIT 224
           C PC  CY Q +P+FDP  S +++ +   S  C KL S+ C+  +N C Y  SY D SIT
Sbjct: 87  CIPCTNCYKQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSIT 146

Query: 225 VGDFSTETLTFRGTRVARVAL-----GCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRR 278
            G  + ETLT   T    VAL     GCGH+N G+F     G++GLGRG LS  +Q G  
Sbjct: 147 EGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSS 206

Query: 279 FNRK-FSYCLVDRSTS---AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISV 334
           F  K FS CLV   T+     P S   G   +      TPL++      FY+V L+GISV
Sbjct: 207 FGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISV 266

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
              ++     S   L+P   G ++IDSGT  T L    Y  L +  R   +      D +
Sbjct: 267 EDINLPFNDGS--SLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPT 324

Query: 395 L-FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSI 452
           L +  C+     T +K  T+  HF GADV L  T   IPV   G FCFAF  T S    I
Sbjct: 325 LGYQLCY--RTPTNLKGTTLTAHFEGADVLLTPTQIFIPVQ-DGIFCFAFTSTFSNEYGI 381

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            GN  Q  + + +DL    + F    C 
Sbjct: 382 YGNHAQSNYLIGFDLEKQLVSFKATDCT 409


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 147/403 (36%), Positives = 204/403 (50%), Gaps = 31/403 (7%)

Query: 95  RVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
           R++S  A A+  +R     R      G    + + G    S EY   LG+GTP     ++
Sbjct: 83  RLRSDRARADHILRKASGRRMMSEGGGASIPTYLGGFVD-SLEYVVTLGIGTPAVQQTVL 141

Query: 155 LDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD----SSGCNR 208
           +DTGSD+ W+QC PC    CY Q DP+FDP+KS +FAT+PC S  C++L      +GC  
Sbjct: 142 IDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCASDACKQLPVDGYDNGCTN 201

Query: 209 RNT-----CLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEGLFVAAAGLL 262
             +     C Y + YG+G+IT G +STETL    + V +    GCG D  G +    GLL
Sbjct: 202 NTSGMPPQCGYAIEYGNGAITEGVYSTETLALGSSAVVKSFRFGCGSDQHGPYDKFDGLL 261

Query: 263 GLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR--FTPLLA-NP 319
           GLG    S  +QT   +   FSYCL   ++ A   ++   +S  +  +   FTP+ A +P
Sbjct: 262 GLGGAPESLVSQTASVYGGAFSYCLPPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSP 321

Query: 320 KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
           K+ TFY V L GISVGG  +  I  ++F        G I+DSGT +T +   AY ALR A
Sbjct: 322 KIATFYVVTLTGISVGGKALD-IPPAVFAK------GNIVDSGTVITGIPTTAYKALRTA 374

Query: 380 FRAGASSLKRAPDF-SLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSG 437
           FR+  +     P   S  DTC++ +G   V VP V L F  GA V L   + ++  D   
Sbjct: 375 FRSAMAEYPLLPPADSALDTCYNFTGHGTVTVPKVALTFVGGATVDLDVPSGVLVED--- 431

Query: 438 TFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             C AFA    G   IIGN+  +   V+YD     +GF    C
Sbjct: 432 --CLAFADAGDGSFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  208 bits (530), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 145/421 (34%), Positives = 205/421 (48%), Gaps = 35/421 (8%)

Query: 67  LRLHHVDSLS--FNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
           LR+ HV+S    F +     +   + +D  R++ L++ A+     P    + GRA     
Sbjct: 34  LRVFHVNSPCSPFKQPNTVSWESTLLKDKARLQYLSSLAKK----PSVPIASGRA----- 84

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
                 + Q S  Y  R  +GTP + + + LDT +D  W+ C+ C  C S    +FDP+K
Sbjct: 85  ------IVQ-SPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV--LFDPSK 135

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           S S   + C +P C++  +  C    +C + ++YG GS      + +TLT     +    
Sbjct: 136 SSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLANDVIKSYT 194

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GC     G  + A GL+GLGRG LS  +QT   +   FSYCL +  +S    S+  G  
Sbjct: 195 FGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLRLGPK 254

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
                 + TPLL NP+  + YYV LVGI VG   V  I  S    D +   G I DSGT 
Sbjct: 255 YQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIV-DIPTSALAFDASTGAGTIFDSGTV 313

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADV 422
            TRL  PAY+A+R+ FR     +K A   SL  FDTC+  S    V  P+V   F G +V
Sbjct: 314 FTRLVEPAYVAVRNEFR---RRIKNANATSLGGFDTCYSGS----VVYPSVTFMFAGMNV 366

Query: 423 SLPATNYLIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           +LP  N LI   S  T C A A       S L++I ++QQQ  RV+ DL  SR+G +   
Sbjct: 367 TLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRET 426

Query: 479 C 479
           C
Sbjct: 427 C 427


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  208 bits (529), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 131/362 (36%), Positives = 189/362 (52%), Gaps = 24/362 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCR 194
           GEY   L +GTPP     + DTGSD++W QCAPC  +C+ Q    ++P+ S +F  +PC 
Sbjct: 86  GEYIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCN 145

Query: 195 S--PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG-----TRVARVALGC 247
           S   +C  L         +C+Y  +YG G  T G  S ET TF       TRV  +A GC
Sbjct: 146 SSVSMCAALAGPSPPPGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGC 204

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
            + +   +  +AGL+GLGRG +S  +Q G      FSYCL     +   S+++ G SA  
Sbjct: 205 SNASSDDWNGSAGLVGLGRGSMSLVSQLGAGM---FSYCLTPFQDANSTSTLLLGPSAAL 261

Query: 308 RTARF--TPLLANPK---LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
                  TP +A+P    + T+YY+ L GIS+G   +  I  + F L   G GG+IIDSG
Sbjct: 262 NGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALS-IPPNAFALRTDGTGGLIIDSG 320

Query: 363 TSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFDLSGKTEV--KVPTVVLHFRG 419
           T++T L   AY  +R A  +  +  +    D +  D CF L+ +T     +P++  HF G
Sbjct: 321 TTITSLVDAAYQQVRAAIESLVTLPVADGSDSTGLDLCFALTSETSTPPSMPSMTFHFDG 380

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAG-TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           AD+ LP  NY+I    SG +C A    T+  +S  GN QQQ   ++YD+    + FAP  
Sbjct: 381 ADMVLPVDNYMI--LGSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAK 438

Query: 479 CA 480
           C+
Sbjct: 439 CS 440


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  208 bits (529), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 165/483 (34%), Positives = 230/483 (47%), Gaps = 48/483 (9%)

Query: 11  LLFSFFFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLH 70
           + FS     A   Q    V +S   PS +   + V+ S++ ++LPL       S  +   
Sbjct: 18  IAFSIVHGTADDAQRYMVVASSSLEPSEVCSGQKVTSSKNGATLPLVHRHGPCSPVMS-- 75

Query: 71  HVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRS-RGRANGGFSSSVIS 129
                     P H   L   RD LR  ++ A   S     PRN S +     G +    S
Sbjct: 76  -------KEKPSHEETLG--RDQLRAANIHAKLSS-----PRNSSAKELQQSGVTIPTSS 121

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRS 187
           G + G+ EY   + +GTP     M +DTGSDV W+QCAPC  + C SQ D +FDPAKS +
Sbjct: 122 GYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSAT 181

Query: 188 FATVPCRSPLCRKLDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTFRGTR-VARVAL 245
           ++   C S  C +L   G    N+ C Y V Y D S T G + ++TL    +  V     
Sbjct: 182 YSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAVKNFQF 241

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAK---PSSMVFG 302
           GC H   G      GL+GLG    S  +QT   + + FSYCL   S+SA          G
Sbjct: 242 GCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLTLGAAAG 301

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            ++ SR +R TPL+    + TFY V L  I+V G  +  + AS+F      +G  ++DSG
Sbjct: 302 GTSSSRYSR-TPLV-RFNVPTFYGVFLQAITVAGTKLN-VPASVF------SGASVVDSG 352

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGAD 421
           T +T+L   AY ALR AF+    +   A    + DTCFD SG   V+VP V L F RGA 
Sbjct: 353 TVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRGA- 411

Query: 422 VSLPATNYLIPVDSSGTF---CFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAP 476
                   ++ +D SG F   C AF  T       I+GN+QQ+ F +++D+  S +GF P
Sbjct: 412 --------VMDLDVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRP 463

Query: 477 RGC 479
             C
Sbjct: 464 GAC 466


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  208 bits (529), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 147/401 (36%), Positives = 200/401 (49%), Gaps = 24/401 (5%)

Query: 95  RVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
            V S   F ++ +     +RSR +A  G+ ++    L     EY   L +GTPP     +
Sbjct: 24  HVDSKIGFTKTELMRRAAHRSRLQALSGYDANSPR-LHSVQVEYLMELAIGTPPVPFVAL 82

Query: 155 LDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR-KLDSSGC-NRRNTC 212
            DTGSD+ W QC PCK C+ Q  PV+DP+ S +F+ VPC S  C     S  C N  + C
Sbjct: 83  ADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPSSPC 142

Query: 213 LYQVSYGDGSITVGDFSTETLTF------RGTRVARVALGCGHDNEGLFVAAAGLLGLGR 266
            Y  SY DG+ +VG   TETLT       +   V  VA GCG DN G  + + G +GLGR
Sbjct: 143 RYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNSTGTVGLGR 202

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV----SRTARFTPLLANPKLD 322
           G LS   Q G     KFSYCL D   S   S    G  A       T + TPLL +P   
Sbjct: 203 GTLSLLAQLGV---GKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNP 259

Query: 323 TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA 382
           + Y+V L GIS+G   +  I    F L   GNGG+++DSGT+ T L +  +   R+    
Sbjct: 260 SRYFVNLQGISLGDVRLP-IPNGTFDLRADGNGGMMVDSGTTFTILAKSGF---REVVDR 315

Query: 383 GASSLKRAP--DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTF 439
            A  L + P    SL   CF  S   E  +P +VLHF  GAD+ L   NY+   +   +F
Sbjct: 316 VAQLLGQPPVNASSLDSPCFP-SPDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSF 374

Query: 440 CFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           C    G+ S  S +GN QQQ  ++++D+   ++ F P  C+
Sbjct: 375 CLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDCS 415


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 134/379 (35%), Positives = 188/379 (49%), Gaps = 19/379 (5%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S+  ++GG +S+ ++   Q    Y  R G+GTP + + + LDT +D  W  CAPC  C +
Sbjct: 57  SKAASSGGITSAPVAS-GQTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPA 115

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN-------TCLYQVSYGDGSITVGD 227
            +   F PA S S+A++PC S  C   +   C            C +   + D S     
Sbjct: 116 GSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-S 172

Query: 228 FSTETLTFRGTRVARVALGCGHDNEG--LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
             ++TL      +A  A GC     G    +   GLLGLGRG +S  +QTG R+N  FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSY 232

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    +     S+  G +   R  R+TPLL NP   + YYV + G+SVG   V+ + A 
Sbjct: 233 CLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVK-VPAG 291

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
            F  DPA   G +IDSGT +TR T P Y ALR+ FR   ++         FDTCF+    
Sbjct: 292 SFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEV 351

Query: 406 TEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA----GTMSGLSIIGNIQQQG 460
                P V LH  G  D++LP  N LI   ++   C A A       + ++++ N+QQQ 
Sbjct: 352 AAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQN 411

Query: 461 FRVVYDLAASRIGFAPRGC 479
            RVV D+A SR+GFA   C
Sbjct: 412 VRVVVDVAGSRVGFAREPC 430


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  207 bits (528), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 146/363 (40%), Positives = 192/363 (52%), Gaps = 34/363 (9%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCR 194
           +Y   LG GTP     +++DTGSD+ W+QC PC    CY Q DPVFDP+ S ++A VPC 
Sbjct: 121 QYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCG 180

Query: 195 SPLCRKLD----SSGCNRRNT----CLYQVSYGDGSITVGDFSTETLTFR---GTRVARV 243
           S  CR LD    ++GC   ++    C Y + YG+G  TVG +STETLT      T V   
Sbjct: 181 SEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVNNF 240

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
           + GCG   +G+F    GLLGLG    S  +QT   +   FSYCL   +++A   ++    
Sbjct: 241 SFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPA 300

Query: 304 SAVSRTA--RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           +  + TA  +FTPL       TFY V+L GISVGG  +  I  ++F       GG+IIDS
Sbjct: 301 TGGNNTAGFQFTPLQVVET--TFYLVKLTGISVGGKQLD-IEPTVFA------GGMIIDS 351

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAP--DFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           GT VT L   AY ALR AFR+  S+    P  D    DTC+D +G T V VPTV L F G
Sbjct: 352 GTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEG 411

Query: 420 A---DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
               D+ +P+   L      G   F    +     IIGN+ Q+ F V+YD A   +GF  
Sbjct: 412 GVTIDLDVPSGVLL-----DGCLAFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRA 466

Query: 477 RGC 479
             C
Sbjct: 467 GAC 469


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  207 bits (528), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 149/390 (38%), Positives = 196/390 (50%), Gaps = 36/390 (9%)

Query: 114 RSRGRANGGFSSSVISGLAQGS-------GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           RS+ RA    SSS  + ++ G+        EY   L +GTPP+ V + LDTGS +VW QC
Sbjct: 60  RSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQC 119

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS--GCNRR--NTCLYQVSYGDGS 222
            PC  C++Q+ P +D ++S +FA   C S  C KLD S   C  +   TC Y  SYGD S
Sbjct: 120 QPCAVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQTCAYSYSYGDKS 178

Query: 223 ITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFN 280
            T+G    ET++F  G  V  V  GCG +N G+F +   G+ G GRG LS P+Q      
Sbjct: 179 ATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV--- 235

Query: 281 RKFSYCLVDRSTSAKPSSMVFGDSAV-----SRTARFTPLLANPKLDTFYYVELVGISVG 335
             FS+C    S   KPS+++F   A        T + TPL+ NP   TFYY+ L GI+VG
Sbjct: 236 GNFSHCFTAVS-GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVG 294

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
              +  +  S F L   G GG IIDSGT+ T L    Y  + D F A      + P    
Sbjct: 295 STRLP-VPESAFALK-NGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV----KLPVVPS 348

Query: 396 FDT----CFDLS--GKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG 449
            +T    CF     GK    VP +VLHF GA + LP  NY+      G      A     
Sbjct: 349 NETGPLLCFSAPPLGKAP-HVPKLVLHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGE 407

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           ++IIGN QQQ   V+YDL  S++ F    C
Sbjct: 408 MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  207 bits (528), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 134/379 (35%), Positives = 188/379 (49%), Gaps = 19/379 (5%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S+  ++GG +S+ ++   Q    Y  R G+GTP + + + LDT +D  W  CAPC  C +
Sbjct: 57  SKAASSGGVTSAPVAS-GQTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPA 115

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN-------TCLYQVSYGDGSITVGD 227
            +   F PA S S+A++PC S  C   +   C            C +   + D S     
Sbjct: 116 GSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-S 172

Query: 228 FSTETLTFRGTRVARVALGCGHDNEG--LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
             ++TL      +A  A GC     G    +   GLLGLGRG +S  +QTG R+N  FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSY 232

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    +     S+  G +   R  R+TPLL NP   + YYV + G+SVG   V+ + A 
Sbjct: 233 CLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVK-VPAG 291

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
            F  DPA   G +IDSGT +TR T P Y ALR+ FR   ++         FDTCF+    
Sbjct: 292 SFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEV 351

Query: 406 TEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA----GTMSGLSIIGNIQQQG 460
                P V LH  G  D++LP  N LI   ++   C A A       + ++++ N+QQQ 
Sbjct: 352 AAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQN 411

Query: 461 FRVVYDLAASRIGFAPRGC 479
            RVV D+A SR+GFA   C
Sbjct: 412 VRVVVDVAGSRVGFAREPC 430


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 147/386 (38%), Positives = 194/386 (50%), Gaps = 28/386 (7%)

Query: 114 RSRGRANGGFSSSVISGLAQGS-------GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           RS+ RA    SSS  + ++ G+        EY   L +GTPP+ V + LDTGS +VW QC
Sbjct: 4   RSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQC 63

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS--GCNRR--NTCLYQVSYGDGS 222
            PC  C++Q+ P +D ++S +FA   C S  C KLD S   C  +   TC Y  SYGD S
Sbjct: 64  QPCAVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQTCAYSYSYGDKS 122

Query: 223 ITVGDFSTETLTF-RGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFN 280
            T+G    ET++F  G  V  V  GCG +N G+F +   G+ G GRG LS P+Q      
Sbjct: 123 ATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV--- 179

Query: 281 RKFSYCLVDRSTSAKPSSMVFGDSAV-----SRTARFTPLLANPKLDTFYYVELVGISVG 335
             FS+C    S   KPS+++F   A        T + TPL+ NP   TFYY+ L GI+VG
Sbjct: 180 GNFSHCFTAVS-GRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVG 238

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
              +  +  S F L   G GG IIDSGT+ T L    Y  + D F A         + + 
Sbjct: 239 STRLP-VPESAFALK-NGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETG 296

Query: 396 FDTCFDLS--GKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSII 453
              CF     GK    VP +VLHF GA + LP  NY+      G      A     ++II
Sbjct: 297 PLLCFSAPPLGKAP-HVPKLVLHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTII 355

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGC 479
           GN QQQ   V+YDL  S++ F    C
Sbjct: 356 GNFQQQNMHVLYDLKNSKLSFVRAKC 381


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 136/353 (38%), Positives = 182/353 (51%), Gaps = 33/353 (9%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN----- 207
           +++DTGSD+ W+QC PC  CY+Q DP+FDP+ S S+A VPC +  C     +        
Sbjct: 179 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 238

Query: 208 ----------RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA 257
                     +   C Y ++YGDGS + G  +T+T+   G  V     GCG  N GLF  
Sbjct: 239 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 298

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-VDRSTSAKPSSMVFGDSAVSRTA---RFT 313
            AGL+GLGR  LS  +QT  RF   FSYCL    S  A  S  + GD++  R A    +T
Sbjct: 299 TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPVSYT 358

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
            ++A+P    FY++ + G SVGGA V                 V++DSGT +TRL    Y
Sbjct: 359 RMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN--------VLLDSGTVITRLAPSVY 410

Query: 374 IALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYL 430
            A+R  F  + GA     AP FSL D C++L+G  EVKVP + L    GAD+++ A   L
Sbjct: 411 RAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGML 470

Query: 431 IPVDSSGT-FCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                 G+  C A A         IIGN QQ+  RVVYD   SR+GFA   C+
Sbjct: 471 FMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 136/353 (38%), Positives = 182/353 (51%), Gaps = 33/353 (9%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN----- 207
           +++DTGSD+ W+QC PC  CY+Q DP+FDP+ S S+A VPC +  C     +        
Sbjct: 178 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237

Query: 208 ----------RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA 257
                     +   C Y ++YGDGS + G  +T+T+   G  V     GCG  N GLF  
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 297

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-VDRSTSAKPSSMVFGDSAVSRTA---RFT 313
            AGL+GLGR  LS  +QT  RF   FSYCL    S  A  S  + GD++  R A    +T
Sbjct: 298 TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPVSYT 357

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
            ++A+P    FY++ + G SVGGA V                 V++DSGT +TRL    Y
Sbjct: 358 RMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN--------VLLDSGTVITRLAPSVY 409

Query: 374 IALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYL 430
            A+R  F  + GA     AP FSL D C++L+G  EVKVP + L    GAD+++ A   L
Sbjct: 410 RAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGML 469

Query: 431 IPVDSSGT-FCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                 G+  C A A         IIGN QQ+  RVVYD   SR+GFA   C+
Sbjct: 470 FMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 146/356 (41%), Positives = 193/356 (54%), Gaps = 23/356 (6%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFAT 190
           G+  Y     +GTP     M +DTGSD+ W+QC PC     CYSQ DP+FDPA+S S+A 
Sbjct: 44  GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAA 103

Query: 191 VPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGC 247
           VPC  P+C  L   ++       C Y VSYGDGS T G +S++TLT   +  V     GC
Sbjct: 104 VPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGC 163

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM-VFGDSAV 306
           GH   GLF    GLLGLGR + S   QT   +   FSYCL  + ++A   ++ V G S  
Sbjct: 164 GHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGA 223

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     T LL +P   T+Y V L GISVGG  +  + AS F          ++D+GT VT
Sbjct: 224 APGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-VPASAFAGG------TVVDTGTVVT 276

Query: 367 RLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVS 423
           RL   AY ALR AFR+G +S     AP   + DTC++ +G   V +P V L F  GA V+
Sbjct: 277 RLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVT 336

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           L A   L    S G   FA +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 337 LGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 386


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 134/381 (35%), Positives = 185/381 (48%), Gaps = 22/381 (5%)

Query: 113 NRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
           NR         +S+  S +    G+Y     VGTPP   Y ++DTGSD+VW+QC PC++C
Sbjct: 62  NRVNHSNKNSLASTPESTVISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQC 121

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
           Y+QT P F+P+KS S+  + C S LC+ +  + CN +  C Y ++YG+ S + GD S ET
Sbjct: 122 YNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQGDLSLET 181

Query: 233 LTFRGT-----RVARVALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYC 286
           LT   T        +  +GCG +N G F   +  +    G   S  TQ G     KFSYC
Sbjct: 182 LTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYC 241

Query: 287 LVDRSTSAKPSSM-----VFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
           LV  S + K  SM      FGD A+        TP++       FYY+ +   SVG   V
Sbjct: 242 LVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDH-SFFYYLTIEAFSVGDKRV 300

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDT 398
               +S         G +IIDS T VT +    Y  L  A      +L+R  D    F  
Sbjct: 301 EFAGSS----KGVEEGNIIIDSSTIVTFVPSDVYTKLNSAI-VDLVTLERVDDPNQQFSL 355

Query: 399 CFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           C+++S   E   P +  HF+GAD+ L ATN  + V +    CFAFA +  G +I G+  Q
Sbjct: 356 CYNVSSDEEYDFPYMTAHFKGADILLYATNTFVEV-ARDVLCFAFAPSNGG-AIFGSFSQ 413

Query: 459 QGFRVVYDLAASRIGFAPRGC 479
           Q F V YDL    + F    C
Sbjct: 414 QDFMVGYDLQQKTVSFKSVDC 434


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 137/364 (37%), Positives = 185/364 (50%), Gaps = 31/364 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   L +GTPP+ V + LDTGSD++W QC PC  C+ Q  P FD ++S + A +PC S 
Sbjct: 34  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCEST 93

Query: 197 LCRKLDS--SGCNRRN----TCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGH 249
            C KLD   + C + N    TC Y  SYGD S+T+G  + +  TF  GT +  V  GCG 
Sbjct: 94  QC-KLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAGTSLPGVTFGCGL 152

Query: 250 DNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF------- 301
           +N G+F +   G+ G GRG LS P+Q        FS+C     T A PS+++        
Sbjct: 153 NNTGVFNSNETGIAGFGRGPLSLPSQLKV---GNFSHCFT-TITGAIPSTVLLDLPADLF 208

Query: 302 --GDSAVSRTA--RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
             G  AV  T   ++    ANP   T YY+ L GI+VG   +  +  S F L   G GG 
Sbjct: 209 SNGQGAVQTTPLIQYAKNEANP---TLYYLSLKGITVGSTRLP-VPESAFALT-NGTGGT 263

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           IIDSGTS+T L    Y  +RD F A         + +   TCF    + +  VP +VLHF
Sbjct: 264 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 323

Query: 418 RGADVSLPATNYL--IPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
            GA + LP  NY+  +P D+  +            +IIGN QQQ   V+YDL  + + F 
Sbjct: 324 EGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFV 383

Query: 476 PRGC 479
              C
Sbjct: 384 AAQC 387


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 141/375 (37%), Positives = 191/375 (50%), Gaps = 30/375 (8%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  G  EY   L +GTPP     + DTGSD+ W QC PCK C+ Q  P++D A S SF+ 
Sbjct: 88  LRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSP 147

Query: 191 VPCRSPLCRKL--DSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGT-------- 238
           VPC S  C  +   S  C    T  C Y+ +Y DG+ + G   TETLTF G+        
Sbjct: 148 VPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPG 207

Query: 239 -RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
             V  VA GCG DN GL   + G +GLGRG LS   Q G     KFSYCL D   ++  S
Sbjct: 208 VSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGS 264

Query: 298 SMVFGDSAV--------SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
            ++FG  A             + TPL+  P   + YYV L GIS+G A +  I    F L
Sbjct: 265 PVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLP-IPNGTFDL 323

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD-LSGKTEV 408
              G+GG+I+DSGT  T L   A+  + +   AG  +       SL   CF   +G+ ++
Sbjct: 324 RDDGSGGMIVDSGTIFTVLVESAFRVVVNHV-AGVLNQPVVNASSLDSPCFPATAGEQQL 382

Query: 409 -KVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVY 465
             +P ++LHF  GAD+ L   NY+     S +FC   AG  S   SI+GN QQQ  ++++
Sbjct: 383 PDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLF 442

Query: 466 DLAASRIGFAPRGCA 480
           D+   ++ F P  C+
Sbjct: 443 DITVGQLSFVPTDCS 457


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 134/363 (36%), Positives = 192/363 (52%), Gaps = 26/363 (7%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           Y     +GTPP  +  VLDTGSD++W QC APC++C+ Q  P++ PA+S ++A V C S 
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSR 159

Query: 197 LCRKLDS------------SGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARV 243
           LC  L S            +    R  C Y  SYGDGS T G  +TET TF  GT V  +
Sbjct: 160 LCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGAGTTVHDL 219

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
           A GCG DN G    ++GL+G+GRG LS  +Q G     KFSYC    + +   S +  G 
Sbjct: 220 AFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGV---TKFSYCFTPFNDTTTSSPLFLGS 276

Query: 304 SA----VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           SA     +++  F P  + P+  ++YY+ L GI+VG   +  I  ++F+L  +G GG+II
Sbjct: 277 SASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLP-IDPAVFRLTASGRGGLII 335

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLS---GKTEVKVPTVVLH 416
           DSGT+ T L   A++ L  A  A  +    +        CF      G   V VP +VLH
Sbjct: 336 DSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDVPRLVLH 395

Query: 417 FRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
           F GAD+ LP ++ ++    +G  C     +  G+S++G++QQQ   V YD+    + F P
Sbjct: 396 FDGADMELPRSSAVVEDRVAGVACLGIV-SARGMSVLGSMQQQNMHVRYDVGRDVLSFEP 454

Query: 477 RGC 479
             C
Sbjct: 455 ANC 457


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 146/356 (41%), Positives = 193/356 (54%), Gaps = 23/356 (6%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFAT 190
           G+  Y     +GTP     M +DTGSD+ W+QC PC     CYSQ DP+FDPA+S S+A 
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAA 195

Query: 191 VPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGC 247
           VPC  P+C  L   ++       C Y VSYGDGS T G +S++TLT   +  V     GC
Sbjct: 196 VPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGC 255

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM-VFGDSAV 306
           GH   GLF    GLLGLGR + S   QT   +   FSYCL  + ++A   ++ V G S  
Sbjct: 256 GHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGA 315

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     T LL +P   T+Y V L GISVGG  +  + AS F          ++D+GT VT
Sbjct: 316 APGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-VPASAFAGG------TVVDTGTVVT 368

Query: 367 RLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVS 423
           RL   AY ALR AFR+G +S     AP   + DTC++ +G   V +P V L F  GA V+
Sbjct: 369 RLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVT 428

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           L A   L    S G   FA +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 429 LGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 139/385 (36%), Positives = 189/385 (49%), Gaps = 33/385 (8%)

Query: 114 RSRGRANGGFSSSVISGLAQG------SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA 167
           RS  RAN     SV S   +        G+Y     +GTPP  VY ++DT SD++W+QC 
Sbjct: 58  RSMNRANHFNQISVYSNAVESPVTLLDDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQ 117

Query: 168 PCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC--NRRNTCLYQVSYGDGSITV 225
            C+ CY+ T P+FDP+ S+++  +PC S  C+ +  + C  + R  C + V+Y DGS + 
Sbjct: 118 LCETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQ 177

Query: 226 GDFSTETLTFRG-----TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN 280
           GD   ET+T            R  +GC   N  +   + G++GLG G +S   Q     +
Sbjct: 178 GDLIVETVTLGSYNDPFVHFPRTVIGCIR-NTNVSFDSIGIVGLGGGPVSLVPQLSSSIS 236

Query: 281 RKFSYCLV---DRSTSAK--PSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVG 335
           +KFSYCL    DRS+  K   ++MV GD  VS    F           FYY+ L   SVG
Sbjct: 237 KKFSYCLAPISDRSSKLKFGDAAMVSGDGTVSTRIVF------KDWKKFYYLTLEAFSVG 290

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FS 394
              +   ++S      +G G +IIDSGT+ T L    Y  L  A  A    L+RA D   
Sbjct: 291 NNRIEFRSSSSRS---SGKGNIIIDSGTTFTVLPDDVYSKLESAV-ADVVKLERAEDPLK 346

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIG 454
            F  C+  S   +V VP +  HF GADV L A N  I V S    C AF  + SG +I G
Sbjct: 347 QFSLCYK-STYDKVDVPVITAHFSGADVKLNALNTFI-VASHRVVCLAFLSSQSG-AIFG 403

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGC 479
           N+ QQ F V YDL    + F P  C
Sbjct: 404 NLAQQNFLVGYDLQRKIVSFKPTDC 428


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 185/358 (51%), Gaps = 26/358 (7%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y     +GTPP  +Y V+DT +D +W QC PCK C++ T P+FDP+KS ++ T+PC SP 
Sbjct: 89  YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPK 148

Query: 198 CRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHD 250
           C+ ++++ C  + +  C Y  +YG  + + GD S +TLT             + +GCGH 
Sbjct: 149 CKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIGCGHR 208

Query: 251 NEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGD-SAVS 307
           N+G L    +G +GLGRG LSF +Q       KFSYCLV   S       + FGD S VS
Sbjct: 209 NKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLHFGDKSVVS 268

Query: 308 RTARF-TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
                 TP+ A    +  Y   L  +SVG  H+     S  K D  GN   IIDSGT++T
Sbjct: 269 GVGTVSTPITAG---EIGYSTTLNALSVGD-HIIKFENSTSKNDNLGN--TIIDSGTTLT 322

Query: 367 RLTRPAYIALRDAFRAGASSLKRA--PDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSL 424
            L    Y  L ++       L+RA  P+   F  C+  + K  + VP +  HF GADV L
Sbjct: 323 ILPENVYSRL-ESIVTSMVKLERAKSPN-QQFKLCYKATLK-NLDVPIITAHFNGADVHL 379

Query: 425 PATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            + N   P+D     CFAF   G   G +IIGNI QQ F V +DL  + I F P  C 
Sbjct: 380 NSLNTFYPIDHE-VVCFAFVSVGNFPG-TIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 137/347 (39%), Positives = 184/347 (53%), Gaps = 20/347 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   +G+G+P     M++DTGSDV W+QC PC +C+SQ D +FDP+ S +++   C S 
Sbjct: 126 EYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSA 185

Query: 197 LCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG--L 254
            C +L   GC+    C Y V YGDGS   G +S++TL    + V     GC     G  L
Sbjct: 186 ACAQLRQRGCSSSQ-CQYTVKYGDGSTGSGTYSSDTLALGSSTVENFQFGCSQSESGNLL 244

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
               AGL+GLG G  S  TQT   F + FSYCL    T      +  G S      + TP
Sbjct: 245 QDQTAGLMGLGGGAESLATQTAGTFGKAFSYCL--PPTPGSSGFLTLGASTSGFVVK-TP 301

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           +L + ++ ++Y V L  I VGG  +  I AS F      + G I+DSGT +TRL R AY 
Sbjct: 302 MLRSTQVPSYYGVLLQAIRVGGRQLN-IPASAF------SAGSIMDSGTIITRLPRTAYS 354

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           AL  AF+AG      A    +FDTCFD SG++ V +PTV L F G  V   A++ +I   
Sbjct: 355 ALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAVVDLASDGIILGS 414

Query: 435 SSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                C AFA     + L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 415 -----CLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 146/356 (41%), Positives = 193/356 (54%), Gaps = 23/356 (6%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFAT 190
           G+  Y     +GTP     M +DTGSD+ W+QC PC     CYSQ DP+FDPA+S S+A 
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAA 195

Query: 191 VPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGC 247
           VPC  P+C  L   ++       C Y VSYGDGS T G +S++TLT   +  V     GC
Sbjct: 196 VPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGC 255

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM-VFGDSAV 306
           GH   GLF    GLLGLGR + S   QT   +   FSYCL  + ++A   ++ V G S  
Sbjct: 256 GHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGA 315

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     T LL +P   T+Y V L GISVGG  +  + AS F          ++D+GT VT
Sbjct: 316 APGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-VPASAFAGG------TVVDTGTVVT 368

Query: 367 RLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVS 423
           RL   AY ALR AFR+G +S     AP   + DTC++ +G   V +P V L F  GA V+
Sbjct: 369 RLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVT 428

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           L A   L    S G   FA +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 429 LGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 133/366 (36%), Positives = 181/366 (49%), Gaps = 15/366 (4%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           + + S +  G G Y   + +GTPP  +  + DTGSD++W QC PC  CY Q +P+FDP +
Sbjct: 81  NDIQSDVISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKE 140

Query: 185 SRSFATVPCRSPLCRKLDSSG-CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR---- 239
           S ++ T+ C +  C+ L   G C+  NTC Y  SYGD S T GD S++TLT   T     
Sbjct: 141 SETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPA 200

Query: 240 -VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPT-QTGRRFNRKFSYCLVDRSTSAKPS 297
               +A GCGHDN G F    G L    G       Q       +FSYCLV  S+ +  S
Sbjct: 201 SFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVS 260

Query: 298 SMV-FGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG-- 353
           S + FG S  VS +   +  L     DTFYY+ L G+SVG   V     S  K  PA   
Sbjct: 261 SKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVE 320

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
            G +IIDSGT++T L +  Y  +  A               +F  C+  S    +++PT+
Sbjct: 321 EGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY--SSVNNLEIPTI 378

Query: 414 VLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
             HF GADV LP  N  + V      CF+   + S L+I GN+ Q  F V YDL  +++ 
Sbjct: 379 TAHFTGADVQLPPLNTFVQVQED-LVCFSMIPS-SNLAIFGNLAQINFLVGYDLKNNKVS 436

Query: 474 FAPRGC 479
           F    C
Sbjct: 437 FKQTDC 442


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 133/379 (35%), Positives = 187/379 (49%), Gaps = 19/379 (5%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S+  ++GG +S+ ++   Q    Y  R G+GTP + + + LDT +D  W  CAPC  C +
Sbjct: 57  SKAASSGGVTSAPVAS-GQTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPA 115

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN-------TCLYQVSYGDGSITVGD 227
            +   F PA S S+A++PC S  C   +   C            C +   + D S     
Sbjct: 116 GSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-S 172

Query: 228 FSTETLTFRGTRVARVALGCGHDNEG--LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
             ++TL      +A  A GC     G    +   GLLGLGRG +S  +QTG  +N  FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSY 232

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    +     S+  G +   R  R+TPLL NP   + YYV + G+SVG   V+ + A 
Sbjct: 233 CLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVK-VPAG 291

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
            F  DPA   G +IDSGT +TR T P Y ALR+ FR   ++         FDTCF+    
Sbjct: 292 SFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEV 351

Query: 406 TEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA----GTMSGLSIIGNIQQQG 460
                P V LH  G  D++LP  N LI   ++   C A A       + ++++ N+QQQ 
Sbjct: 352 AAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQN 411

Query: 461 FRVVYDLAASRIGFAPRGC 479
            RVV D+A SR+GFA   C
Sbjct: 412 VRVVVDVAGSRVGFAREPC 430


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 171/352 (48%), Gaps = 18/352 (5%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y  R+ +GTP + ++MVLDT  D  W+ CA C  C S   P F P  S ++A++ C  
Sbjct: 97  GNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSS---PTFSPNTSSTYASLQCSV 153

Query: 196 PLCRKLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
           P C ++    C    T  C +  +YG  S      S ++L      +   + GC +   G
Sbjct: 154 PQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDTLPSYSFGCVNAVSG 213

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFT 313
             +   GLLGLGRG +S  +Q+G  ++  FSYC     +     S+  G     +  R T
Sbjct: 214 STLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKNIRTT 273

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           PLL NP   T YYV L G+SVG   V  +   L   DP    G IIDSGT +TR   P Y
Sbjct: 274 PLLRNPHRPTLYYVNLTGVSVGRVLVP-VAPELLAFDPNTGAGTIIDSGTVITRFVEPVY 332

Query: 374 IALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
            A+RD FR       + P  ++  FDTCF  +   E   P V  HF G D+ LP  N LI
Sbjct: 333 AAIRDEFRKQV----KGPFATIGAFDTCF--AATNEDIAPPVTFHFTGMDLKLPLENTLI 386

Query: 432 PVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +    C A A       S L++I N+QQQ  R+++D+  SR+G A   C
Sbjct: 387 HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 438


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 125/360 (34%), Positives = 182/360 (50%), Gaps = 20/360 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           S +  GSGEY   + +GTPP     + DTGSD+ W QC PC KCY Q  P+F+P KS SF
Sbjct: 83  SSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSF 142

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           + VPC +  C  +D   C  +  C Y  +YGD + + GD   E +T   + V  V +GCG
Sbjct: 143 SHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSVKSV-IGCG 201

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           H + G F  A+G++GLG G+LS  +Q  +    +R+FSYCL    + A    + FG++AV
Sbjct: 202 HASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN-GKINFGENAV 260

Query: 307 SRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
                   TPL++   + T+YY+ L  IS+G             +  A  G VIIDSGT+
Sbjct: 261 VSGPGVVSTPLISKNTV-TYYYITLEAISIGNER---------HMAFAKQGNVIIDSGTT 310

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHFR-GAD 421
           +T L +  Y  +  +      + +        D CFD  ++    + +P +  HF  GA+
Sbjct: 311 LTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGAN 370

Query: 422 VS-LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           V+ LP   +    D+        A   +   IIGN+ Q  F + YDL A R+ F P  CA
Sbjct: 371 VNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 131/386 (33%), Positives = 184/386 (47%), Gaps = 27/386 (6%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S+    G  S+ V SG  Q    Y  R G+G+P + + + LDT +D  W  C+PC  C S
Sbjct: 60  SKAATAGVSSAPVASG--QAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPS 117

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN-------------TCLYQVSYGDG 221
            +  +F PA S S+A++PC S  C       C                 TC +   + D 
Sbjct: 118 SS--LFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADA 175

Query: 222 SITVGDFSTETLTFRGTRVARVALGCGHDNEG--LFVAAAGLLGLGRGRLSFPTQTGRRF 279
           S      +++TL      +     GC     G    +   GLLGLGRG ++  +Q G  +
Sbjct: 176 SFQAA-LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLY 234

Query: 280 NRKFSYCLVDRSTSAKPSSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAH 338
           N  FSYCL    +     S+  G      R+ R+TP+L NP   + YYV + G+SVG A 
Sbjct: 235 NGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAW 294

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           V+ + A  F  D A   G ++DSGT +TR T P Y ALR+ FR   ++         FDT
Sbjct: 295 VK-VPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDT 353

Query: 399 CFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA----GTMSGLSII 453
           CF+         P V +H  G  D++LP  N LI   ++   C A A       S +++I
Sbjct: 354 CFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVI 413

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGC 479
            N+QQQ  RVV+D+A SRIGFA   C
Sbjct: 414 ANLQQQNIRVVFDVANSRIGFAKESC 439


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 130/386 (33%), Positives = 184/386 (47%), Gaps = 27/386 (6%)

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           S+    G  S+ V SG  Q    Y  R G+G+P + + + LDT +D  W  C+PC  C S
Sbjct: 58  SKAATAGVSSAPVASG--QAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPS 115

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN-------------TCLYQVSYGDG 221
            +  +F PA S S+A++PC S  C       C                 TC +   + D 
Sbjct: 116 SS--LFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADA 173

Query: 222 SITVGDFSTETLTFRGTRVARVALGCGHDNEG--LFVAAAGLLGLGRGRLSFPTQTGRRF 279
           S      +++TL      +     GC     G    +   GLLGLGRG ++  +Q G  +
Sbjct: 174 SFQAA-LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLY 232

Query: 280 NRKFSYCLVDRSTSAKPSSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAH 338
           N  FSYCL    +     S+  G      R+ R+TP+L NP   + YYV + G+SVG A 
Sbjct: 233 NGVFSYCLPSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAW 292

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT 398
           V+ + A  F  D A   G ++DSGT +TR T P Y ALR+ FR   ++         FDT
Sbjct: 293 VK-VPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDT 351

Query: 399 CFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA----GTMSGLSII 453
           CF+         P V +H  G  D++LP  N LI   ++   C A A       S +++I
Sbjct: 352 CFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVI 411

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGC 479
            N+QQQ  RVV+D+A SR+GFA   C
Sbjct: 412 ANLQQQNIRVVFDVANSRVGFAKESC 437


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 155/459 (33%), Positives = 206/459 (44%), Gaps = 56/459 (12%)

Query: 61  AESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRAN 120
           A + +  RL HVD+      PE +  +  +    R        E A   P R R R    
Sbjct: 26  AGAGIVARLTHVDAGRGLARPELVRRMAQRSRARRRLLSHDEKEEAADRPVRARVRTAGA 85

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQ-TDPV 179
           GG       G+   + EY   L VGTPPR V + LDTGSD+VW QCAPC  C+ Q   PV
Sbjct: 86  GG-------GIV--TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPV 136

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSGCNR------RNTCLYQVSYGDGSITVGDFSTETL 233
            DPA S + A V C +P+CR L  + C R        +C+Y   YGD SITVG  +++  
Sbjct: 137 LDPAASSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRF 196

Query: 234 TF--------RGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           TF         G    R+  GCGH N+G+F A   G+ G GRGR S P+Q G      FS
Sbjct: 197 TFGPGDNADGGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLG---VTSFS 253

Query: 285 YC---LVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
           YC   + + ++S     +   +  ++   + TPLL +P   + Y++ L  I+VG   +  
Sbjct: 254 YCFTSMFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIP- 312

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD 401
           I     +L  A     IIDSG S+T L    Y A++  F A       A + S  D CF 
Sbjct: 313 IPERRQRLREA---SAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFA 369

Query: 402 LSGKTE-----------------VKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAF 443
           L                      V+VP +V H   GAD  LP  NY+     +   C   
Sbjct: 370 LPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVL 429

Query: 444 AGTMSG---LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                G     +IGN QQQ   VVYDL    + FAP  C
Sbjct: 430 DAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 135/364 (37%), Positives = 179/364 (49%), Gaps = 30/364 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   L +GTPP+ V + LDTGSD++W QC PC  C+ Q  P FDP+ S + +   C S 
Sbjct: 34  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 93

Query: 197 LCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVARVALGCGH 249
           LC+ L  + C         TC+Y  SYGD S+T G    +  TF   G  V  VA GCG 
Sbjct: 94  LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 153

Query: 250 DNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF------- 301
            N G+F +   G+ G GRG LS P+Q        FS+C     T A PS+++        
Sbjct: 154 FNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFT-TITGAIPSTVLLDLPADLF 209

Query: 302 --GDSAVSRTA--RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
             G  AV  T   ++    ANP   T YY+ L GI+VG   +  +  S F L   G GG 
Sbjct: 210 SNGQGAVQTTPLIQYAKNEANP---TLYYLSLKGITVGSTRLP-VPESAFALT-NGTGGT 264

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           IIDSGTS+T L    Y  +RD F A         + +   TCF    + +  VP +VLHF
Sbjct: 265 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 324

Query: 418 RGADVSLPATNYL--IPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
            GA + LP  NY+  +P D+  +            +IIGN QQQ   V+YDL  + + F 
Sbjct: 325 EGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFV 384

Query: 476 PRGC 479
              C
Sbjct: 385 AAQC 388


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 143/377 (37%), Positives = 194/377 (51%), Gaps = 33/377 (8%)

Query: 127 VISGLAQ-GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAK 184
           ++  LA+ G+G Y   L VGTPP     ++DTGSD+ W QCAPC   C++Q  P++DPA+
Sbjct: 84  LLEALAENGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPAR 143

Query: 185 SRSFATVPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFR------ 236
           S +F+ +PC SPLC+ L S+   CN    C+Y   Y  G  T G  + +TL         
Sbjct: 144 SSTFSKLPCASPLCQALPSAFRACNATG-CVYDYRYAVG-FTAGYLAADTLAIGDGDGDG 201

Query: 237 --GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-VDRSTS 293
              +  A VA GC   N G    A+G++GLGR  LS  +Q G     +FSYCL  D    
Sbjct: 202 DASSSFAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGV---GRFSYCLRSDADAG 258

Query: 294 AKPSSMVFGDSA--VSRTARFTPLLANP----KLDTFYYVELVGISVGGAHVRGITASLF 347
           A P  ++FG  A       + T LL NP    +   +YYV L GI+VG   +  +T+S F
Sbjct: 259 ASP--ILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLP-VTSSTF 315

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF---RAGASSLKRAPDFSLFDTCFDLSG 404
               AG GGVI+DSGT+ T L    Y  LR AF    AG  +      F  FD CF+ +G
Sbjct: 316 GFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFD-FDLCFE-AG 373

Query: 405 KTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
             +  VP +V  F  GA+ ++P  +Y   VD  G           G+S+IGN+ Q    V
Sbjct: 374 AADTPVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHV 433

Query: 464 VYDLAASRIGFAPRGCA 480
           +YDL  +   FAP  CA
Sbjct: 434 LYDLDGATFSFAPADCA 450


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 136/371 (36%), Positives = 192/371 (51%), Gaps = 42/371 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCR 194
           GE+   L +GTPP     + DTGSD++W QCAPC ++C+ Q  P+++P+ S +F+ +PC 
Sbjct: 83  GEFLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCN 142

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT------RVARVALGCG 248
           S L        C     C+Y ++YG G   V    TET TF  +      RV  +A GC 
Sbjct: 143 SSL------GLCAPACACMYNMTYGSGWTYVFQ-GTETFTFGSSTPADQVRVPGIAFGCS 195

Query: 249 HDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV- 306
           + + G    +A+GL+GLGRG LS  +Q G     KFSYCL     +   S+++ G SA  
Sbjct: 196 NASSGFNASSASGLVGLGRGSLSLVSQLGA---PKFSYCLTPYQDTNSTSTLLLGPSASL 252

Query: 307 --SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
             +     TP +A+P    +YY+ L GIS+G   +  I  + F L   G GG+IIDSGT+
Sbjct: 253 NDTGVVSSTPFVASPS-SIYYYLNLTGISLGTTALP-IPPNAFSLKADGTGGLIIDSGTT 310

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAP--DFSL---FDTCFDLSGKTEV--KVPTVVLHF 417
           +T L   AY       RA   SL   P  D S     D CF+L   T     +P++ LHF
Sbjct: 311 ITMLGNTAY----QQVRAAVLSLVTLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHF 366

Query: 418 RGADVSLPATNYLI----PVDSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAA 469
            GAD+ LPA NY++    P   S  +C A           +SI+GN QQQ   ++YD+  
Sbjct: 367 DGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGK 426

Query: 470 SRIGFAPRGCA 480
             + FAP  C+
Sbjct: 427 ETLSFAPAKCS 437


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 160/454 (35%), Positives = 217/454 (47%), Gaps = 58/454 (12%)

Query: 61  AESS-LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRA 119
           AES+ L   L HVDS     T   L    + R   R+ SL + A       P +      
Sbjct: 31  AESAALRADLTHVDS-GRGFTKHELLRRMVARSKARLASLRSSACDTALTAPVDHG---- 85

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTP-PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP 178
                     G   GS EY   LG+GTP P+ V + LDTGSD+VW QCA C  C+ Q  P
Sbjct: 86  ----------GSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVP 134

Query: 179 VFDPAKSRSFATVPCRSPLCRK---LDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLT 234
           VF  + S +F+ VPC  PLC     L  SGC  R+ +C Y   Y D SIT G  + +T T
Sbjct: 135 VFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFT 194

Query: 235 FRG-------TRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
           F+          V  +  GCG  N GLF    +G+ G G G LS P+Q   R   +FSYC
Sbjct: 195 FKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVR---RFSYC 251

Query: 287 LVDRSTSAKPSSMVFGDSAVSRTARFT-PLLANP----------KLDTFYYVELVGISVG 335
                 S + S ++ G    +  A  T P+ + P              FY++ L G++VG
Sbjct: 252 FTAMEES-RVSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVG 310

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
              +    AS F L   G+GG  IDSGT++T   +  + +LR+AF A    L  A  ++ 
Sbjct: 311 ETRLP-FNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVP-LPVAKGYTD 368

Query: 396 FDT--CFDLSGKTEV-KVPTVVLHFRGADVSLPATNYLIPVDSSGT-----FCFAF--AG 445
            D   CF +  K +   VP ++LH  GAD  LP  NY++  D  G+      C     AG
Sbjct: 369 PDNLLCFSVPAKKKAPAVPKLILHLEGADWELPRENYVLDNDDDGSGAGRKLCVVILSAG 428

Query: 446 TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +G +IIGN QQQ   +VYDL ++++ FAP  C
Sbjct: 429 NSNG-TIIGNFQQQNMHIVYDLESNKMVFAPARC 461


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/351 (34%), Positives = 179/351 (50%), Gaps = 16/351 (4%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
            Y  R G+GTP + + + +D  +D  W+ C+ C  C + + P F P +S ++ TVPC SP
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159

Query: 197 LCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C ++ S  C     ++C + ++Y   +        ++L      V     GC     G 
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQ-AVLGQDSLALENNVVVSYTFGCLRVVSGN 218

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GL+G GRG LSF +QT   +   FSYCL +  +S    ++  G     +  + TP
Sbjct: 219 SVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTP 278

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP   + YYV ++GI VG   V+ +  S    +P    G IID+GT  TRL  P Y 
Sbjct: 279 LLYNPHRPSLYYVNMIGIRVGSKVVQ-VPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYA 337

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPV 433
           A+RDAFR G      AP    FDTC++++    V VPTV   F GA  V+LP  N +I  
Sbjct: 338 AVRDAFR-GRVRTPVAPPLGGFDTCYNVT----VSVPTVTFMFAGAVAVTLPEENVMIHS 392

Query: 434 DSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            S G  C A A     G  + L+++ ++QQQ  RV++D+A  R+GF+   C
Sbjct: 393 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/351 (34%), Positives = 179/351 (50%), Gaps = 16/351 (4%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
            Y  R G+GTP + + + +D  +D  W+ C+ C  C + + P F P +S ++ TVPC SP
Sbjct: 82  NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 140

Query: 197 LCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C ++ S  C     ++C + ++Y   +        ++L      V     GC     G 
Sbjct: 141 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQ-AVLGQDSLALENNVVVSYTFGCLRVVSGN 199

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GL+G GRG LSF +QT   +   FSYCL +  +S    ++  G     +  + TP
Sbjct: 200 SVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTP 259

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP   + YYV ++GI VG   V+ +  S    +P    G IID+GT  TRL  P Y 
Sbjct: 260 LLYNPHRPSLYYVNMIGIRVGSKVVQ-VPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYA 318

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPV 433
           A+RDAFR G      AP    FDTC++++    V VPTV   F GA  V+LP  N +I  
Sbjct: 319 AVRDAFR-GRVRTPVAPPLGGFDTCYNVT----VSVPTVTFMFAGAVAVTLPEENVMIHS 373

Query: 434 DSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            S G  C A A     G  + L+++ ++QQQ  RV++D+A  R+GF+   C
Sbjct: 374 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 149/439 (33%), Positives = 223/439 (50%), Gaps = 41/439 (9%)

Query: 64  SLSLRLHHVDS-LSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGG 122
           +LS+ L H DS LS    P++    R+    LR  S               RSR   N  
Sbjct: 25  NLSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSIS---------------RSRRLNNIL 69

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
             + + SGL    GE+F  + +GTPP  V+ + DTGSD+ W+QC PC++CY +  P+FD 
Sbjct: 70  SQTDLQSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDK 129

Query: 183 AKSRSFATVPCRSPLCRKLDSS--GCNR-RNTCLYQVSYGDGSITVGDFSTETLTFRGTR 239
            KS ++ + PC S  C  L SS  GC+  +N C Y+ SYGD S + GD +TET++     
Sbjct: 130 KKSSTYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSAS 189

Query: 240 VARVA-----LGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
            + V+      GCG++N G F    +G++GLG G LS  +Q G   ++KFSYCL  +S +
Sbjct: 190 GSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSAT 249

Query: 294 AKPSSMV-FGDSAV-SRTARFTPLLANPKLD----TFYYVELVGISVGGAHVRGITASLF 347
              +S++  G +++ S  ++ + +++ P +D    T+YY+ L  ISVG   +   T S +
Sbjct: 250 TNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIP-YTGSSY 308

Query: 348 KLDPAG-----NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFD 401
             +  G     +G +IIDSGT++T L    +     A     +  KR  D   L   CF 
Sbjct: 309 NPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFK 368

Query: 402 LSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGF 461
            SG  E+ +P + +HF GADV L   N  + V S    C +   T   ++I GN  Q  F
Sbjct: 369 -SGSAEIGLPEITVHFTGADVRLSPINAFVKV-SEDMVCLSMVPTTE-VAIYGNFAQMDF 425

Query: 462 RVVYDLAASRIGFAPRGCA 480
            V YDL    + F    C+
Sbjct: 426 LVGYDLETRTVSFQRMDCS 444


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 159/463 (34%), Positives = 217/463 (46%), Gaps = 54/463 (11%)

Query: 39  LSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDS-LSFNRTPEHLFNLRIQRDVLRVK 97
           L+WP   + S S S+      +    L   L H+DS   F R      N  ++R VLR +
Sbjct: 14  LAWP---ATSGSGSA------NHHHGLRADLTHIDSGRGFTR------NELLRRMVLRSR 58

Query: 98  SLTAFAESAVRVPPRNRSRGRANGGFSSSVISG-LAQGSGEYFTRLGVGTP-PRYVYMVL 155
                A +A ++ P   SR       ++ V SG    G  EY    G+GTP P+ V + +
Sbjct: 59  -----ARAAKQLCP---SRSGTPVRVTAPVASGSHVVGYTEYLIHFGIGTPRPQQVALEV 110

Query: 156 DTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQ 215
           DTGSDVVW QC PC  C++Q  P FD + S +   V C  P+CR L    C     C YQ
Sbjct: 111 DTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPICRALRPHAC-FLGGCTYQ 169

Query: 216 VSYGDGSITVGDFSTETLTFRG-----TRVARVALGCGHDNEGLFVA-AAGLLGLGRGRL 269
           V+YGD S+T+G  + ++ TF G       V  +  GCG  N G F +   G+ G GRG L
Sbjct: 170 VNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNTGNFHSNETGIAGFGRGPL 229

Query: 270 SFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFT-PLLANPKLDT---FY 325
           S P Q G      FSYC      S      + G  A    A  T P+L+ P L     +Y
Sbjct: 230 SLPRQLGV---SSFSYCFTTIFESKSTPVFLGGAPADGLRAHATGPILSTPFLPNHPEYY 286

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           Y+ L GI+VG   +  +  S F +   G+GG IIDSGT++T   R  + +L +AF A   
Sbjct: 287 YLSLKGITVGKTRL-AVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVP 345

Query: 386 SLKRAPDFSLFDT------CF---DLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSS 436
                P  S  DT      CF    +   ++V VP + LH  GAD  LP  NY+     S
Sbjct: 346 ----LPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLHLEGADWELPRENYMAEYPDS 401

Query: 437 GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              C          ++IGN QQQ   +V+DLA +++   P  C
Sbjct: 402 DQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQC 444


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 156/469 (33%), Positives = 225/469 (47%), Gaps = 41/469 (8%)

Query: 21  ASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLPAPDAESSLSLRLHHVDSLSFNRT 80
           A  Q    V  S   PS +     V+ S++ S+L         +LS R      +     
Sbjct: 27  ADAQRYIVVATSSLKPSEVCSGHKVTPSKNGSTL---------ALSHRHGPCSPVISKEK 77

Query: 81  PEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFT 140
           P H   LR  RD LR     A+ ++ V     N ++       +    SG + G+ EY  
Sbjct: 78  PSHEETLR--RDQLRA----AYIQAKVSSRYNNVAKELQQSAVTIPTSSGYSLGTTEYVI 131

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRSFATVPCRSPLC 198
            + +GTP     M +DTGSDV W+QCAPC  + C SQ D +FDPA S +++   C S  C
Sbjct: 132 TVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQC 191

Query: 199 RKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGLF 255
            +L  + +GC  ++ C Y V YGDGS T G + ++TL+   +  V     GC H   G  
Sbjct: 192 AQLGDEGNGC-LKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAVKSFQFGCSHRAAGFV 250

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF--T 313
               GL+GLG    S  +QT   + + FSYCL   S+S     +  G +  + ++R+  T
Sbjct: 251 GELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGG-GFLTLGAAGGASSSRYSHT 309

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           P++    + TFY V L GI+V G  +  + AS+F      +G  ++DSGT +T+L   AY
Sbjct: 310 PMV-RFSVPTFYGVFLQGITVAGTMLN-VPASVF------SGASVVDSGTVITQLPPTAY 361

Query: 374 IALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIP 432
            ALR AF+    +   A      DTCFD SG   + VPTV L F RGA + L  +  L  
Sbjct: 362 QALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAMDLDISGILY- 420

Query: 433 VDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                  C AF  T       I+GN+QQ+ F +++D+    IGF    C
Sbjct: 421 -----AGCLAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 141/337 (41%), Positives = 186/337 (55%), Gaps = 23/337 (6%)

Query: 153 MVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCN 207
           M +DTGSD+ W+QC PC     CYSQ DP+FDPA+S S+A VPC  P+C  L   ++   
Sbjct: 1   MEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASAC 60

Query: 208 RRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGR 266
               C Y VSYGDGS T G +S++TLT   +  V     GCGH   GLF    GLLGLGR
Sbjct: 61  SAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGR 120

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM-VFGDSAVSRTARFTPLLANPKLDTFY 325
            + S   QT   +   FSYCL  + ++A   ++ V G S  +     T LL +P   T+Y
Sbjct: 121 EQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYY 180

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
            V L GISVGG  +  + AS F          ++D+GT VTRL   AY ALR AFR+G +
Sbjct: 181 VVMLTGISVGGQQLS-VPASAFAGG------TVVDTGTVVTRLPPTAYAALRSAFRSGMA 233

Query: 386 S--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFA 442
           S     AP   + DTC++ +G   V +P V L F  GA V+L A   L    S G   FA
Sbjct: 234 SYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFGCLAFA 289

Query: 443 FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 290 PSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 324


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 126/367 (34%), Positives = 182/367 (49%), Gaps = 32/367 (8%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L    GEY  R  +G+PP     ++DTGS ++W+QC+PC  C+ Q  P+F+P KS ++  
Sbjct: 82  LIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKY 141

Query: 191 VPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA---- 244
             C S  C  L  S   C +   C+Y + YGD S +VG   TETL+F  T  A+      
Sbjct: 142 ATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPN 201

Query: 245 --LGCGHDNEGLFVAA---AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
              GCG DN      +    G+ GLG G LS  +Q G +   KFSYCL+   +++  S +
Sbjct: 202 TIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCLLPYDSTST-SKL 260

Query: 300 VFGDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVR-GITASLFKLDPAGNGG 356
            FG  A+  T     TPL+  P L T+Y++ L  +++G   V  G T          +G 
Sbjct: 261 KFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQT----------DGN 310

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
           ++IDSGT +T L    Y     + +   G   L+  P  S   TCF    +  + +P + 
Sbjct: 311 IVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLP--SPLKTCF--PNRANLAIPDIA 366

Query: 415 LHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAASRIG 473
             F GA V+L   N LIP+  S   C A   +   G+S+ G+I Q  F+V YDL   ++ 
Sbjct: 367 FQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVS 426

Query: 474 FAPRGCA 480
           FAP  CA
Sbjct: 427 FAPTDCA 433


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 124/354 (35%), Positives = 178/354 (50%), Gaps = 25/354 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  R  +GTP + + + +DT +D  WI C+ C  C S    VF+  KS +F TV C 
Sbjct: 93  SPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSST---VFNNVKSTTFKTVGCE 149

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C+++ +S C   + C + ++YG  SI   + S + +T     +     GC  +  G 
Sbjct: 150 APQCKQVPNSKCGG-SACAFNMTYGSSSI-AANLSQDVVTLATDSIPSYTFGCLTEATGS 207

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            +   GLLGLGRG +S  +QT   +   FSYCL    +     S+  G     +  + TP
Sbjct: 208 SIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKRIKTTP 267

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV L+ I V G  V  I  S    +P    G I DSGT  TRL  PAY 
Sbjct: 268 LLKNPRRSSLYYVNLMAIRV-GRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYT 326

Query: 375 ALRDAFRAGASSLKRAPDFSL-----FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
           A+RDAFR      KR  + ++     FDTC+     + +  PT+   F G +V+LP  N 
Sbjct: 327 AVRDAFR------KRVGNATVTSLGGFDTCY----TSPIVAPTITFMFSGMNVTLPPDNL 376

Query: 430 LIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LI   +S   C A A       S L++I N+QQQ  R+++D+  SR+G A   C
Sbjct: 377 LIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPC 430


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 145/423 (34%), Positives = 214/423 (50%), Gaps = 32/423 (7%)

Query: 78  NRTPEHLFNLR-IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSS------SVISG 130
           N  P+  F +  I RD  +     +   S+ R+    R   R+   FS+      S  S 
Sbjct: 19  NAYPKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSF 78

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           +    GEY   + +GTPP  +  + DTGSD++W QC PC+ CY QT P+FDP +S ++  
Sbjct: 79  ITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRK 138

Query: 191 VPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARVA 244
           V C S  CR L+ + C+   NTC Y ++YGD S T GD + +T+T      R   +  + 
Sbjct: 139 VSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMI 198

Query: 245 LGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD-RSTSAKPSSMVFG 302
           +GCGH+N G F  A +G++GLG G  S  +Q  +  N KFSYCLV   S +   S + FG
Sbjct: 199 IGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFG 258

Query: 303 -DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
            +  VS     +  +      T+Y++ L  ISVG   ++  T+++F     G G ++IDS
Sbjct: 259 TNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQ-FTSTIFG---TGEGNIVIDS 314

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLK----RAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           GT++T L    Y  L       AS++K    + PD  +   C+  S  +  KVP + +HF
Sbjct: 315 GTTLTLLPSNFYYELESVV---ASTIKAERVQDPD-GILSLCYRDS--SSFKVPDITVHF 368

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +G DV L   N  + V S    CFAFA     L+I GN+ Q  F V YD  +  + F   
Sbjct: 369 KGGDVKLGNLNTFVAV-SEDVSCFAFAAN-EQLTIFGNLAQMNFLVGYDTVSGTVSFKKT 426

Query: 478 GCA 480
            C+
Sbjct: 427 DCS 429


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 144/385 (37%), Positives = 198/385 (51%), Gaps = 22/385 (5%)

Query: 114 RSRGRANGGFSSSVI-------SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           RS  RAN     S +       S +    GEY     VGTPP  +  V+DTGS + W+QC
Sbjct: 66  RSINRANHFNKKSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTGSGITWMQC 125

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-GCNRRNT-CLYQVSYGDGSIT 224
             C+ CY QT P+FDP+KS+++ T+PC S +C+ + S+  C+     C Y + YGDGS +
Sbjct: 126 QRCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHS 185

Query: 225 VGDFSTETLTFRGTRVARV-----ALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRR 278
            GD S ETLT   T  + V      +GCGH+N+G F    +G++GLG G +S  +Q    
Sbjct: 186 QGDLSVETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSS 245

Query: 279 FNRKFSYCLVDR-STSAKPSSMVFGDSAVSR--TARFTPLLANPKLDTFYYVELVGISVG 335
              KFSYCL    S S   S + FGD+AV     A  TPL++    + FYY+ L   SVG
Sbjct: 246 IGGKFSYCLAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVG 305

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
              +  +  S       G G +IIDSGT++T L +  Y  L  A  A A    R  D S 
Sbjct: 306 DKRIEFVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAV-ADAIQANRVSDPSN 364

Query: 396 F-DTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIG 454
           F   C+  +   ++ VP +  HF+GADV L   +  + V + G  CFAF  +   +SI G
Sbjct: 365 FLSLCYQTTPSGQLDVPVITAHFKGADVELNPISTFVQV-AEGVVCFAFHSS-EVVSIFG 422

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGC 479
           N+ Q    V YDL    + F P  C
Sbjct: 423 NLAQLNLLVGYDLMEQTVSFKPTDC 447


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 135/384 (35%), Positives = 201/384 (52%), Gaps = 32/384 (8%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP---VFD 181
           S ++SG + GSG+YF  L VGTP +   +++DTGSD+ WIQC P     + + P    +D
Sbjct: 14  SRLVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYD 73

Query: 182 PAKSRSFATVPCRSPLCRKLDS---SGCNRRNT--CLYQVSYGDGSITVGDFSTETLTF- 235
            + S S+  +PC    C  L +   S C+ ++   C Y   Y D S T G  + ET++  
Sbjct: 74  KSSSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMK 133

Query: 236 --------------RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRR-F 279
                         R  R+  VALGC  ++ G  F+ A+G+LGLG+G +S  TQT     
Sbjct: 134 SRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 193

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
              FSYCLVD    +  SS +       R    TP++ NP   +FYYV + G++V G  V
Sbjct: 194 GGIFSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPV 253

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF-SLFDT 398
            GI +S + +D  GN G I DSGT+++ L  PAY  +  A  A +  L RA +    F+ 
Sbjct: 254 DGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNA-SIYLPRAQEIPEGFEL 312

Query: 399 CFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAG--TMSGLSIIGN 455
           C++++ + E  +P + + F+G  V  LP  NY++ V +    C A     T +G +I+GN
Sbjct: 313 CYNVT-RMEKGMPKLGVEFQGGAVMELPWNNYMVLV-AENVQCVALQKVTTTNGSNILGN 370

Query: 456 IQQQGFRVVYDLAASRIGFAPRGC 479
           + QQ   + YDLA +RIGF    C
Sbjct: 371 LLQQDHHIEYDLAKARIGFKWSPC 394


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 135/384 (35%), Positives = 201/384 (52%), Gaps = 32/384 (8%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP---VFD 181
           S ++SG + GSG+YF  L VGTP +   +++DTGSD+ WIQC P     + + P    +D
Sbjct: 46  SRLVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYD 105

Query: 182 PAKSRSFATVPCRSPLCRKLDS---SGCN--RRNTCLYQVSYGDGSITVGDFSTETLTF- 235
            + S S+  +PC    C+ L +   S C+    + C Y   Y D S T G  + ET++  
Sbjct: 106 KSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMK 165

Query: 236 --------------RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRR-F 279
                         R  R+  VALGC  ++ G  F+ A+G+LGLG+G +S  TQT     
Sbjct: 166 SRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 225

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
              FSYCLVD    +  SS +       R    TP++ NP   +FYYV + G++V G  V
Sbjct: 226 GGIFSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPV 285

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF-SLFDT 398
            GI +S + +D  GN G I DSGT+++ L  PAY  +  A  A +  L RA +    F+ 
Sbjct: 286 DGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNA-SIYLPRAQEIPEGFEL 344

Query: 399 CFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAG--TMSGLSIIGN 455
           C++++ + E  +P + + F+G  V  LP  NY++ V +    C A     T +G +I+GN
Sbjct: 345 CYNVT-RMEKGMPKLGVEFQGGAVMELPWNNYMVLV-AENVQCVALQKVTTTNGSNILGN 402

Query: 456 IQQQGFRVVYDLAASRIGFAPRGC 479
           + QQ   + YDLA +RIGF    C
Sbjct: 403 LLQQDHHIEYDLAKARIGFKWSPC 426


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  201 bits (511), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 129/366 (35%), Positives = 179/366 (48%), Gaps = 15/366 (4%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           + + S +  G G Y   + +GTPP  +  + DTGSD++W QC PC  CY Q +P+FDP K
Sbjct: 81  NDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKK 140

Query: 185 SRSFATVPCRSPLCRKLDSSG-CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR---- 239
           S+++ T+ C +  C+ L   G C   NTC    SYGD S T  D S+ET T   T     
Sbjct: 141 SKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPA 200

Query: 240 -VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPT-QTGRRFNRKFSYCLVDRSTSAKPS 297
               +A GCGH N G F      L    G       Q   +   +FSYCLV  S+ +  S
Sbjct: 201 SFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTAS 260

Query: 298 SMV-FGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHV--RGITASLFKLDPAG 353
           S + FG SA VS +   +  L     DTFYY+ L G+S+G   V  +G + +      A 
Sbjct: 261 SKINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAE 320

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
              +IIDSGT++T L R  Y  +  A                F  C+  SG  ++++PT+
Sbjct: 321 ESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTI 378

Query: 414 VLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
             HF GADV LP  N  +        CF+   + S L+I GN+ Q  F V YDL  +++ 
Sbjct: 379 TAHFIGADVQLPPLNTFVQAQED-LVCFSMIPS-SNLAIFGNLSQMNFLVGYDLKNNKVS 436

Query: 474 FAPRGC 479
           F P  C
Sbjct: 437 FKPTDC 442


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 130/353 (36%), Positives = 185/353 (52%), Gaps = 43/353 (12%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC---KKCYSQTDPVFDPAKSRSFATVPC 193
           EY   +G+G+P     +V+DTGSDV W+QC PC     C++    +FDPA S ++A   C
Sbjct: 107 EYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNC 166

Query: 194 RSPLCRKL----DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCG 248
            +  C +L    +++GC+ ++ C Y V YGDGS T G +S++ LT  G+ V R    GC 
Sbjct: 167 SAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQFGCS 226

Query: 249 HDN--EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF----- 301
           H     G+     GL+GLG    S  +QT  R+ + F YCL      A P+S  F     
Sbjct: 227 HAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCL-----PATPASSGFLTLGA 281

Query: 302 -GDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
                    +RF  TP+L + K+ T+Y+  L  I+VGG  + G++ S+F        G +
Sbjct: 282 PASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL-GLSPSVFA------AGSL 334

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           +DSGT +TRL   AY AL  AFRAG +   RA    + DTCF+ +G  +V +PTV L F 
Sbjct: 335 VDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFA 394

Query: 419 GADVSLPATNYLIPVDSSGTF---CFAFAGTMS--GLSIIGNIQQQGFRVVYD 466
           G  V        + +D+ G     C AFA T        IGN+QQ+ F V+YD
Sbjct: 395 GGAV--------VDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 132/400 (33%), Positives = 187/400 (46%), Gaps = 38/400 (9%)

Query: 91  RDVLRVKSLTAFAE---SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP 147
           +D  R+K L+  A+   +AV + P  +    AN                 Y  R+ +GTP
Sbjct: 65  KDPERLKYLSTLADQKTTAVPIAPGQQVLKIAN-----------------YVVRVKLGTP 107

Query: 148 PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC- 206
            + ++MVLDT +D  W+ C+ C  C S T   F P  S +  ++ C    C ++    C 
Sbjct: 108 GQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSGAQCSQVRGFSCP 164

Query: 207 -NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLG 265
               + CL+  SYG  S        + +T     +     GC +   G  +   GLLGLG
Sbjct: 165 ATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLG 224

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
           RG +S  +Q G  ++  FSYCL    +     S+  G     ++ R TPLL NP   + Y
Sbjct: 225 RGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLY 284

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           YV L G+SVG   V  I +     DP    G IIDSGT +TR  +P Y A+RD FR   +
Sbjct: 285 YVNLTGVSVGRIKVP-IPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN 343

Query: 386 SLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF 443
                P  SL  FDTCF  +   E + P + LHF G ++ LP  N LI   S    C + 
Sbjct: 344 ----GPISSLGAFDTCF--AATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSM 397

Query: 444 AG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A       S L++I N+QQQ  R+++D   SR+G A   C
Sbjct: 398 AAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 139/385 (36%), Positives = 195/385 (50%), Gaps = 25/385 (6%)

Query: 113 NRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
           +RSR RA  G+ ++    L     EY   L +GTPP     + DTGSD+ W QC PCK C
Sbjct: 53  HRSRLRALSGYDANSPR-LHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLC 111

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRK-LDSSGCNRRNT-CLYQVSYGDGSITVGDFST 230
           + Q  PV+DP+ S +F+ VPC S  C   L S  C+  ++ C Y  SY DG+ + G   T
Sbjct: 112 FPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGT 171

Query: 231 ETLTF------RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           ETLT       +   V+ VA GCG DN G  + + G +GLGRG LS   Q G     KFS
Sbjct: 172 ETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGV---GKFS 228

Query: 285 YCLVDRSTSAKPSSMVFGDSAV----SRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
           YCL D   S   S  + G  A         + TPLL +P   + Y V L GI++G   + 
Sbjct: 229 YCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLP 288

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP--DFSLFDT 398
            I    F L     GG+++DSGT+ + L    +  + D     A  L + P    SL   
Sbjct: 289 -IPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHV---AQVLGQPPVNASSLDSP 344

Query: 399 CFDL-SGKTEVK-VPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGN 455
           CF   +G+ ++  +P +VLHF  GAD+ L   NY+       +FC    GT S  S++GN
Sbjct: 345 CFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGN 404

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
            QQQ  ++++D+   ++ F P  C+
Sbjct: 405 FQQQNIQMLFDMTVGQLSFLPTDCS 429


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 134/361 (37%), Positives = 183/361 (50%), Gaps = 33/361 (9%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCR 194
           EY   +G+GTP     +++DTGSD+ W+QC PC    CY Q DP+FDP+KS ++A +PC 
Sbjct: 123 EYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCN 182

Query: 195 SPLCRKLDSS----GCNRRN---TCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALG 246
           +  CR L       GC   +    C + ++YGDGS T G +S ETL    G  V     G
Sbjct: 183 TDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDFRFG 242

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV---DRSTSAKPSSMVFGD 303
           CGHD +G      GLLGLG    S   QT   +   FSYCL    ++             
Sbjct: 243 CGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPS 302

Query: 304 SAVSRTAR--FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
             V  T+   FTP++   + +TFY V + GI+VGG  +  +  S F      +GG+IIDS
Sbjct: 303 GGVVNTSGFVFTPMIR--EEETFYVVNMTGITVGGEPID-VPPSAF------SGGMIIDS 353

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GA 420
           GT VT L   AY AL+ AFR   ++     +  L DTC+D SG + V +P V L F  GA
Sbjct: 354 GTVVTELQHTAYNALQAAFRKAMAAYPLVRNGEL-DTCYDFSGYSNVTLPKVALTFSGGA 412

Query: 421 DVSLPATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            + L   N ++  D     C AF  +G      I+GN+ Q+   V+YD    R+GF    
Sbjct: 413 TIDLDVPNGILLDD-----CLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAV 467

Query: 479 C 479
           C
Sbjct: 468 C 468


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 144/440 (32%), Positives = 201/440 (45%), Gaps = 46/440 (10%)

Query: 59  PDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGR 118
           P   ++L   L HVD      T   L    + R   R  +L  ++ +  R  P     GR
Sbjct: 27  PVTSATLRAHLSHVDD-GRGFTKRELLRRMVVRSRARAANLCPYSGATAR--PATAPVGR 83

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTP-PRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
           AN   +S           EY   L +G P  + V + LDTGSDVVW QC PC +C++Q  
Sbjct: 84  ANTDVNS-----------EYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPL 132

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           P FD A S +  +V C  PLC      GC     C Y   YGDGS++ G F  ++ TF  
Sbjct: 133 PRFDTAASNTVRSVACSDPLCNAHSEHGCFLHG-CTYVSGYGDGSLSFGHFLRDSFTFDD 191

Query: 238 TR------VARVALGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
            +      V  +  GCG  N G F+    G+ G GRG LS P+Q      R+FSYC   R
Sbjct: 192 GKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKV---RQFSYCFTTR 248

Query: 291 STSAKPSSMVFGDSAVSRTARFTPLLANPKL--------DTFYYVELVGISVGGAHVRGI 342
              AK S +  G +   +     P+L+ P +        ++ Y +   G++VG   +   
Sbjct: 249 -FEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRL--- 304

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFD 401
              + ++   G+G   IDSGT +T      +  L+ AF A A+  + +  D    D CF 
Sbjct: 305 --PVPEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADED--DICFS 360

Query: 402 LSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQQQ 459
             GK    +P +V H  GAD  LP  NY+     SG  C A   +G M   ++IGN QQQ
Sbjct: 361 WDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMD-RTLIGNFQQQ 419

Query: 460 GFRVVYDLAASRIGFAPRGC 479
              +VYDLAA ++   P  C
Sbjct: 420 NTHIVYDLAAGKLLLVPAQC 439


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 124/361 (34%), Positives = 175/361 (48%), Gaps = 13/361 (3%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V SG     G Y  R  +GTPP+ ++MVLDT +D VW+ C+ C  C S     F+   S 
Sbjct: 93  VASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 151

Query: 187 SFATVPCRSPLCRKLDSSGCN----RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
           +++TV C +  C +     C     + + C +  SYG  S        +TLT     +  
Sbjct: 152 TYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPN 211

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
            + GC +   G  +   GL+GLGRG +S  +QT   ++  FSYCL    +     S+  G
Sbjct: 212 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 271

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
                ++ R+TPLL NP+  + YYV L G+SVG   V  +       D     G IIDSG
Sbjct: 272 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP-VDPVYLTFDANSGAGTIIDSG 330

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV 422
           T +TR  +P Y A+RD FR    ++        FDTCF  S   E   P + LH    D+
Sbjct: 331 TVITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCF--SADNENVAPKITLHMTSLDL 387

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            LP  N LI   +    C + AG        L++I N+QQQ  R+++D+  SRIG AP  
Sbjct: 388 KLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEP 447

Query: 479 C 479
           C
Sbjct: 448 C 448


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 147/436 (33%), Positives = 215/436 (49%), Gaps = 42/436 (9%)

Query: 63  SSLSLRLHHVDS-LSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
           S  S+ L H +S LS    P    + RI+  VLR     +FA S      + R R   N 
Sbjct: 27  SGFSINLIHRESPLSPFYNPSLTPSERIKNTVLR-----SFARS------KRRLRLSQND 75

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
             S   I+   +   EY  R  +GTPP   + + DTGSD++W+QCAPC+KC  Q  P+FD
Sbjct: 76  DRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFD 135

Query: 182 PAKSRSFATVPCRSPLCRKLDSS--GC-NRRNTCLYQVSYGDGSITVGDFSTETLTF--- 235
           P KS +F TVPC S  C  L  S   C  +   C YQ  YGD ++  G    E++ F   
Sbjct: 136 PRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSK 195

Query: 236 -RGTRVARVALGCGHDNEGLFVAAA---GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
               +  ++  GC   N      +    GL+GLG G LS  +Q G +  RKFSYC    S
Sbjct: 196 NNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLS 255

Query: 292 TSAKPSSMVFGDSAVSRTAR---FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
           +++  S M FG+ A+ +  +    TPL+      ++YY+ L G+S+G   V+   +    
Sbjct: 256 SNST-SKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQT-- 312

Query: 349 LDPAGNGGVIIDSGTSVTRLTRPAY---IAL-RDAFRAGASSLKRAPDFSLFDTCFDLSG 404
                +G ++IDSGTS T L +  Y   +AL ++ +  G  ++K  P   +++ CF+  G
Sbjct: 313 -----DGNILIDSGTSFTILKQSFYNKFVALVKEVY--GVEAVKIPP--LVYNFCFENKG 363

Query: 405 KTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVV 464
           K + + P VV  F GA V + A+N     D++     A   +    SI GN  Q G++V 
Sbjct: 364 KRK-RFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVE 422

Query: 465 YDLAASRIGFAPRGCA 480
           YDL    + FAP  CA
Sbjct: 423 YDLQGGMVSFAPADCA 438


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 139/365 (38%), Positives = 203/365 (55%), Gaps = 32/365 (8%)

Query: 137 EYFTRLGVGTPP-RYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCR 194
           EY   + +G+PP +   M++DTGSD+ W++C PC ++C  Q DP+FDP+ S +++   C 
Sbjct: 139 EYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCS 198

Query: 195 SPLCRKL----DSSGCNRRNTCLYQVSYGDGSI-TVGDFSTETLTFRGTR----VARVAL 245
           S  C +L    +++GC+    C Y   YGDGS+ T G +S++TL          V++   
Sbjct: 199 SAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRF 258

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF-NRKFSYCLVDRSTSAKPSSMVFGDS 304
           GC H   G+    AGL+GLG G  S  +QT   F    FSYCL    +S   S  +   +
Sbjct: 259 GCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSS---SGFLTLGA 315

Query: 305 AVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
           A + +A F  TP+L + ++  FY V L  I VGG  +  I  ++F      + G+I+DSG
Sbjct: 316 AGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLS-IPTTVF------SAGMIMDSG 368

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFS---LFDTCFDLSGKTEVKVPTVVLHFRG 419
           T VTRL   AY +L  AF+AG      AP  +     DTCFD+SG++ V +PTV L F G
Sbjct: 369 TVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVFSG 428

Query: 420 AD---VSLPATNYLIPVDSSGTFCFAFAGTMSGLS--IIGNIQQQGFRVVYDLAASRIGF 474
           A    V+L A+  L+ +++S  FC AF  T    S  IIGN+QQ+ F+V+YD+A   +GF
Sbjct: 429 AGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAVGF 488

Query: 475 APRGC 479
               C
Sbjct: 489 KAGAC 493


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 124/361 (34%), Positives = 175/361 (48%), Gaps = 13/361 (3%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V SG     G Y  R  +GTPP+ ++MVLDT +D VW+ C+ C  C S     F+   S 
Sbjct: 19  VASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 77

Query: 187 SFATVPCRSPLCRKLDSSGCN----RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
           +++TV C +  C +     C     + + C +  SYG  S        +TLT     +  
Sbjct: 78  TYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPN 137

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
            + GC +   G  +   GL+GLGRG +S  +QT   ++  FSYCL    +     S+  G
Sbjct: 138 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 197

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
                ++ R+TPLL NP+  + YYV L G+SVG   V  +       D     G IIDSG
Sbjct: 198 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP-VDPVYLTFDANSGAGTIIDSG 256

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV 422
           T +TR  +P Y A+RD FR    ++        FDTCF  S   E   P + LH    D+
Sbjct: 257 TVITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCF--SADNENVAPKITLHMTSLDL 313

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            LP  N LI   +    C + AG        L++I N+QQQ  R+++D+  SRIG AP  
Sbjct: 314 KLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEP 373

Query: 479 C 479
           C
Sbjct: 374 C 374


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 124/368 (33%), Positives = 186/368 (50%), Gaps = 33/368 (8%)

Query: 128 ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRS 187
           +S L    GEY     VGTPP  VY  +DTGS++VW+QC PC  C++QT P+F+P+KS S
Sbjct: 79  VSTLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSS 138

Query: 188 FATVPCRSPLCRKLDS---SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR----- 239
           +  +PC S  C+  +    S  N  + C Y ++YG  + + GD S ++LT   T      
Sbjct: 139 YKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVL 198

Query: 240 VARVALGCGH-----DNEGLFVAAAGLLGLGRGRLSFPTQTG-RRFNRKFSYCLVDRSTS 293
              + +GCGH     DN      ++G++G+GRG +S   Q G      KFSYCL+  ++ 
Sbjct: 199 FPNIVIGCGHINVLQDNS----QSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSD 254

Query: 294 AKPSS-MVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
           +  SS ++FG+  V        TP++     + +Y++ L   SVG   +     S     
Sbjct: 255 SNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERS----- 309

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA--PDFSLFDTCFDLSGKTEV 408
            A    ++IDSGT +T L    +++   ++ A    L R   PD  L   C++ +GK ++
Sbjct: 310 NASTQNILIDSGTPLTMLPN-LFLSKLVSYVAQEVKLPRIEPPDHHL-SLCYNTTGK-QL 366

Query: 409 KVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
            VP +  HF GADV L +     P +  G  CF F  + +GL I GNI Q    + YDL 
Sbjct: 367 NVPDITAHFNGADVKLNSNGTFFPFE-DGIMCFGFISS-NGLEIFGNIAQNNLLIDYDLE 424

Query: 469 ASRIGFAP 476
              I F P
Sbjct: 425 KEIISFKP 432


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 132/361 (36%), Positives = 180/361 (49%), Gaps = 30/361 (8%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y   + +GTPP  +Y + DTGSD+ W  C PC KCY Q +P+FDP KS S+  + C S
Sbjct: 23  GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDS 82

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-----GCGHD 250
            LC KLD+  C+ +  C Y  +Y   +IT G  + ET+T   T+   V L     GCGH+
Sbjct: 83  KLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHN 142

Query: 251 NEGLFVA-AAGLLGLGRGRLSFPTQTGRRF-NRKFSYCLV----DRSTSAKPS----SMV 300
           N G F     G++GLG G +SF +Q G  F  ++FS CLV    D S S+K S    S V
Sbjct: 143 NTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEV 202

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
            G   VS     TPL+A  +  T Y+V L+GISVG  ++    +S   ++    G V +D
Sbjct: 203 SGKGVVS-----TPLVAK-QDKTPYFVTLLGISVGNTYLHFNGSSSQSVE---KGNVFLD 253

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF-DTCFDLSGKTEVKVPTVVLHFRG 419
           SGT  T L    Y  L    R+  +      D  L    C+    K  ++ P +  HF G
Sbjct: 254 SGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRT--KNNLRGPVLTAHFEG 311

Query: 420 ADVS-LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            DV  LP   ++ P D  G FC  F  T S   + GN  Q  + + +DL    + F P  
Sbjct: 312 GDVKLLPTQTFVSPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMD 369

Query: 479 C 479
           C
Sbjct: 370 C 370


>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
          Length = 225

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 114/227 (50%), Positives = 149/227 (65%), Gaps = 3/227 (1%)

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFT 313
           +FV AAGLLGLG G +SF  Q G +    FSYCLV R T +   S+ FG  +V   A + 
Sbjct: 1   MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESS-GSLEFGRESVPVGASWV 59

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
            L+ NP+  +FYY+ L G+ VGG  V  I+  +F+L+  G GGV++D+GT+VTRL   AY
Sbjct: 60  SLIHNPRAPSFYYIGLSGLGVGGLRVP-ISEDIFRLNELGEGGVVMDTGTAVTRLPAAAY 118

Query: 374 IALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIP 432
            A RDAF A  ++L +    S+FDTC+DL+G   V+VPT+  +F G  + +LPA N+LIP
Sbjct: 119 NAFRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIP 178

Query: 433 VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           VDS GTFCFAFA + SGLSIIGNIQQ+G  +  D A   IGF P  C
Sbjct: 179 VDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 141/390 (36%), Positives = 203/390 (52%), Gaps = 38/390 (9%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC----AP---CKKCYSQTD 177
           S + SG   G G+Y   +  GTPP+ V ++ DTGSD++W+QC    AP   C K      
Sbjct: 41  SPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR 100

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRN--TCLYQVSYGDGSITVGDFST 230
           P F  +KS + + VPC +  C  + +       C+      C Y   Y DGS T G  + 
Sbjct: 101 PAFVASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLAR 160

Query: 231 ETLTFR-----GTRVARVALGCGHDNE-GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           +T T       G  V  VA GCG  N+ G F    G++GLG+G+LSFP Q+G  F + FS
Sbjct: 161 DTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFS 220

Query: 285 YCLVDRSTS--AKPSSMVFGDSAVSRTA-RFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
           YCL+D       + SS +F      R A  +TPL++NP   TFYYV +V I VG   V  
Sbjct: 221 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGN-RVLP 279

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD----FSLFD 397
           +  S + +D  GNGG +IDSG+++T L   AY+ L  AF A +  L R P     F   +
Sbjct: 280 VPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAF-AASVHLPRIPSSATFFQGLE 338

Query: 398 TCFDLSGKTEVK-----VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMS--G 449
            C+++S  + +       P + + F +G  + LP  NYL+ V +    C A   T+S   
Sbjct: 339 LCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFA 397

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +++GN+ QQG+ V +D A++RIGFA   C
Sbjct: 398 FNVLGNLMQQGYHVEFDRASARIGFARTEC 427


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  198 bits (504), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 136/343 (39%), Positives = 179/343 (52%), Gaps = 35/343 (10%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS---GCN 207
           +V+DT SD+ W+QC PC   +C+ Q DP++DPAKS +FA +PC SP C++L SS   GC+
Sbjct: 171 VVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCS 230

Query: 208 -RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEGLFV-AAAGLLGL 264
              + C Y V+YGDG  T G + T+TLT   T V +    GC H   G F    AG+L L
Sbjct: 231 PTTDECKYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGILAL 290

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF----GDSAVSRTARFTPLLANPK 320
           G GR S   QT   +   FSYC+       KPSS  F    G    S    +TPL+ N  
Sbjct: 291 GGGRGSLLEQTADAYGNAFSYCI------PKPSSAGFLSLGGPVEASLKFSYTPLIKNKH 344

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
             TFY V L  I V G  +  +  + F        G ++DSG  VT+L    Y ALR AF
Sbjct: 345 APTFYIVHLEAIIVAGKQL-AVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRAAF 397

Query: 381 RAGASSLK--RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGT 438
           R+  ++     AP  +L DTC+D +   +VKVP V L F G      AT  L P      
Sbjct: 398 RSAMAAYGPLAAPVRNL-DTCYDFTRFPDVKVPKVSLVFAGG-----ATLDLEPASIILD 451

Query: 439 FCFAFAGTMSGLSI--IGNIQQQGFRVVYDLAASRIGFAPRGC 479
            C AFA T    S+  IGN+QQQ + V+YD+   ++GF    C
Sbjct: 452 GCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  198 bits (504), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 124/361 (34%), Positives = 175/361 (48%), Gaps = 14/361 (3%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V SG     G Y  R  +GTPP+ ++MVLDT +D VW+ C+ C  C S     F+   S 
Sbjct: 94  VASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 152

Query: 187 SFATVPCRSPLCRKLDSSGCN----RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR 242
           +++TV C +  C +     C     + + C +  SYG  S    +   +TLT     +  
Sbjct: 153 TYSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPDVIPN 212

Query: 243 VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
            + GC +   G  +   GL+GLGRG +S  +QT   ++  FSYCL    +     S+  G
Sbjct: 213 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 272

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
                ++ R+TPLL NP+  + YYV L G+SVG   V  +       D     G IIDSG
Sbjct: 273 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP-VDPVYLTFDSNSGAGTIIDSG 331

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV 422
           T +TR  +P Y A+RD FR   +          FDTCF  S   E   P + LH    D+
Sbjct: 332 TVITRFAQPVYEAIRDEFRKQVNG--SFSTLGAFDTCF--SADNENVTPKITLHMTSLDL 387

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            LP  N LI   +    C + AG        L++I N+QQQ  R+++D+  SRIG AP  
Sbjct: 388 KLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEP 447

Query: 479 C 479
           C
Sbjct: 448 C 448


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 136/390 (34%), Positives = 208/390 (53%), Gaps = 27/390 (6%)

Query: 113 NRSRGRANGGFSSSVI-SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
           +RSR R N   S + + SGL    GE+F  + +GTPP  V+ + DTGSD+ W+QC PC++
Sbjct: 60  SRSR-RFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQ 118

Query: 172 CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS--GCNR-RNTCLYQVSYGDGSITVGDF 228
           CY +  P+FD  KS ++ + PC S  C+ L S+  GC+   N C Y+ SYGD S + GD 
Sbjct: 119 CYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDV 178

Query: 229 STETLTFRGTRVARVA-----LGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRK 282
           +TET++      + V+      GCG++N G F    +G++GLG G LS  +Q G   ++K
Sbjct: 179 ATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKK 238

Query: 283 FSYCLVDRSTSAKPSSMV-FGDSAV-SRTARFTPLLANPKLD----TFYYVELVGISVGG 336
           FSYCL  +S +   +S++  G +++ S  ++ + +++ P +D    T+YY+ L  ISVG 
Sbjct: 239 FSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGK 298

Query: 337 AHVRGITASLFKLDPAG-----NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP 391
             +   T S +  +  G     +G +IIDSGT++T L    +     A     +  KR  
Sbjct: 299 KKIP-YTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVS 357

Query: 392 D-FSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGL 450
           D   L   CF  SG  E+ +P + +HF GADV L   N  + + S    C +   T   +
Sbjct: 358 DPQGLLSHCFK-SGSAEIGLPEITVHFTGADVRLSPINAFVKL-SEDMVCLSMVPTTE-V 414

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +I GN  Q  F V YDL    + F    C+
Sbjct: 415 AIYGNFAQMDFLVGYDLETRTVSFQHMDCS 444


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 101/163 (61%), Positives = 119/163 (73%), Gaps = 2/163 (1%)

Query: 318 NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
           NP+LDT+YYV LVGISVGG  +  I  + F++D AGNGG+I+DSGT+VTRL    Y  +R
Sbjct: 4   NPQLDTYYYVGLVGISVGG-ELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVR 62

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSS 436
           DAF  G   L    + SLFDTC+DLS KT V+VPTV  HF  G  + LPA NYL+PVDS 
Sbjct: 63  DAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSV 122

Query: 437 GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           GTFCFAFA TMS LSIIGNIQQQG RV +DLA S +GF+P  C
Sbjct: 123 GTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 125/355 (35%), Positives = 178/355 (50%), Gaps = 25/355 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +  VGTPP+ + M LD   D  WI   PCK C   +  VF+  KS +F T+ C 
Sbjct: 32  SPSYIVKAKVGTPPQTLLMALDNSYDAAWI---PCKGCVGCSSTVFNTVKSTTFKTLGCG 88

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C+++ +  C   +TC +  +YG  +I + + + +T+      V   A GC     G 
Sbjct: 89  APQCKQVPNPICGG-STCTWNTTYGSSTI-LSNLTRDTIALSMDPVPYYAFGCIQKATGS 146

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GLLG GRG LSF +QT   +   FSYCL    T     S+  G        + TP
Sbjct: 147 SVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPRIKTTP 206

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV+L GI V G  +  I  S    +P    G I DSGT  TRL  PAYI
Sbjct: 207 LLKNPRRSSLYYVKLNGIRV-GRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPAYI 265

Query: 375 ALRDAFRAGASSLKRAPDFSL-----FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
           A+R+ FR      KR  + ++     FDTC+ +     +  PT+   F G +V++P  N 
Sbjct: 266 AVRNEFR------KRVGNATVSSLGGFDTCYSV----PIVPPTITFMFSGMNVTMPPENL 315

Query: 430 LIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LI   +  T C A A       S L++I ++QQQ  R+++D+  SR+G A   C+
Sbjct: 316 LIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 138/394 (35%), Positives = 197/394 (50%), Gaps = 32/394 (8%)

Query: 101 AFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSD 160
           AF  S  RV      R R     S  + S +   +GEY   L +GTPP  V  ++DTGSD
Sbjct: 60  AFRRSVSRV-----GRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSD 114

Query: 161 VVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-GCNRRNTCLYQVSYG 219
           + W QC PC  CY Q  P+FDP  S ++    C +  C  L     C++   C ++ SY 
Sbjct: 115 LTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYA 174

Query: 220 DGSITVGDFSTETLTFRGTRVARV-----ALGCGHDNEGLF-VAAAGLLGLGRGRLSFPT 273
           DGS T G+ ++ETLT   T    V     A GCGH + G+F  +++G++GLG G LS  +
Sbjct: 175 DGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLIS 234

Query: 274 QTGRRFNRKFSYCLVDRSTSAKPSSMV-FGDSA-VSRTARFTPLLANPKLDTFYYVELVG 331
           Q     N  FSYCL+  ST +  SS + FG S  VS     +  L     DTFYY+ L G
Sbjct: 235 QLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEG 294

Query: 332 ISVGGAHV--RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK- 388
           ISVG   +  +G +    K      G +I+DSGT+ T L +  Y  L    ++ A+S+K 
Sbjct: 295 ISVGKKRLPYKGYS----KKTEVEEGNIIVDSGTTYTFLPQEFYSKLE---KSVANSIKG 347

Query: 389 ---RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAG 445
              R P+  +F  C++ +   E+  P +  HF+ A+V L   N  + +      CF  A 
Sbjct: 348 KRVRDPN-GIFSLCYNTTA--EINAPIITAHFKDANVELQPLNTFMRMQED-LVCFTVAP 403

Query: 446 TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           T S + ++GN+ Q  F V +DL   R+ F    C
Sbjct: 404 T-SDIGVLGNLAQVNFLVGFDLRKKRVSFKAADC 436


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 184/348 (52%), Gaps = 36/348 (10%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           Y   + +GTPP  +  VLDTGSD++W QC APC++C+ Q  P++ PA+S ++A V CRSP
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151

Query: 197 LCRKLDS--SGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNE 252
           +C+ L S  S C+  +T C Y  SYGDG+ T G  +TET T    T V  VA GCG +N 
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G    ++GL+G+GRG LS  +Q G    R+             P++              
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLGVTRPRRSCRARAAARGGGAPTT-------------- 257

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
                +P         L GI+VG   +  I  ++F+L P G+GGVIIDSGT+ T L   A
Sbjct: 258 ----TSP---------LEGITVGDTLLP-IDPAVFRLTPMGDGGVIIDSGTTFTALEERA 303

Query: 373 YIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
           ++AL  A  A    L  A    L    CF  +    V+VP +VLHF GAD+ L   +Y++
Sbjct: 304 FVALARAL-ASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVV 362

Query: 432 PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              S+G  C     +  G+S++G++QQQ   ++YDL    + F P  C
Sbjct: 363 EDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 409


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 143/366 (39%), Positives = 189/366 (51%), Gaps = 28/366 (7%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRS 187
           G A  S EY   LG+GTP     +++DTGSD+ W+QC PC    CY Q DP++DP  S +
Sbjct: 119 GAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASST 178

Query: 188 FATVPCRSPLCRKL-----DSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFR-GTR 239
           +A VPC S  C+ L     D    N   T  C Y + YG+   TVG +STETLT      
Sbjct: 179 YAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQVS 238

Query: 240 VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-VDRSTSAKPSS 298
           V     GCG   +G F    GLLGLG    S  +QT   +   FSYCL    ST+   + 
Sbjct: 239 VKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTTGFLAL 298

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
               ++  +    FTPL + P+  TFY V L G+SVGG  +  I  ++       +GG+I
Sbjct: 299 GAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLD-IPPTVL------SGGMI 351

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP--DFSLFDTCFDLSGKTEVKVPTVVLH 416
           IDSGT +T L   AY ALR AFR   S+    P  +  + DTC++ +G   V VPTV L 
Sbjct: 352 IDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVALT 411

Query: 417 FR-GADVSLPATNYLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIG 473
           F  GA + L   + ++  D     C AFAG  S   + IIGN+ Q+ F V+YD     +G
Sbjct: 412 FDGGATIDLDVPSGVLIQD-----CLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGHVG 466

Query: 474 FAPRGC 479
           F P  C
Sbjct: 467 FRPGAC 472


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 124/370 (33%), Positives = 183/370 (49%), Gaps = 21/370 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SGL    GEYF  + +GTPP     + DTGSD+ W+QC PC++CY Q  P+FD  KS ++
Sbjct: 76  SGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTY 135

Query: 189 ATVPCRSPLCRKLD--SSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV-- 243
            T  C S  C  L     GC+  RN C Y+ SYGD S T G+ +TET++   +  + V  
Sbjct: 136 KTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSF 195

Query: 244 ---ALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
              A GCG++N G F      +    G  LS  +Q G    +KFSYCL   S +   +S+
Sbjct: 196 PGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSATTNGTSV 255

Query: 300 VF--GDSAVSRTARFTPLLANPKL----DTFYYVELVGISVGGAHVRGITASLFKLD--P 351
           +    +S  S+ ++ + +L  P +    +T+Y++ L  I+VG   +       + L+   
Sbjct: 256 INLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKS 315

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFDLSGKTEVKV 410
              G +IIDSGT++T L    Y           +  KR  D   +   CF  SG  E+ +
Sbjct: 316 KKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGILTHCFK-SGDKEIGL 374

Query: 411 PTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           PT+ +HF GADV L   N  + + S    C +   T   ++I GN+ Q  F V YDL   
Sbjct: 375 PTITMHFTGADVKLSPINSFVKL-SEDIVCLSMIPTTE-VAIYGNMVQMDFLVGYDLETK 432

Query: 471 RIGFAPRGCA 480
            + F    C+
Sbjct: 433 TVSFQRMDCS 442


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 157/445 (35%), Positives = 218/445 (48%), Gaps = 57/445 (12%)

Query: 67  LRLHHVDS-LSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSS 125
           L L H DS LS   TP   F+ R+Q   LR  S                 R   +  F +
Sbjct: 29  LDLIHRDSPLSPLHTPNLTFSDRLQASFLRAIS-----------------RQSRHVDFQT 71

Query: 126 SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKS 185
            ++       GEY   L +GTPP  +  + DTGSD+ W+Q  PC +CY Q  P+FDP+ S
Sbjct: 72  DLLPS----GGEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNS 127

Query: 186 RSFATVPCRSPLCRKLDSSG--CNRRNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVA 241
            +F  +PC +  C  LD S   C    TC Y  SYGD S T G  +++T+T      ++ 
Sbjct: 128 TTFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIR 187

Query: 242 RVALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL--VDRSTSAKPSS 298
            VA GCG  N G F    +G++GLG G LSF +Q G    +KFSYCL  ++   S++PS 
Sbjct: 188 NVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSD 247

Query: 299 ------MVFGDSAVSRTAR-------FTPLLANPKLDTFYYVELVGISVGGAHV-----R 340
                 +VFGD+ V  ++         TPL+ N +  T+YY+ +  I+VG   +      
Sbjct: 248 SPATSRIVFGDNPVFSSSSTNGVVFATTPLV-NKEPSTYYYLTIEAITVGRKKLLYSSSS 306

Query: 341 GITASLFKLDPAG--NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF--SLF 396
             TAS      +    G +IIDSGT++T L    Y AL  A       ++R  D   S+F
Sbjct: 307 SKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAAL-VEEIKMERVNDVKNSMF 365

Query: 397 DTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGN 455
             CF  SGK EV++P + +HFR GADV L   N  +  +  G  CF    T + + I GN
Sbjct: 366 SLCFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAE-EGLVCFTMLPT-NDVGIYGN 422

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           + Q  F V YDL    + F P  C+
Sbjct: 423 LAQMNFVVGYDLGKRTVSFLPADCS 447


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 167/350 (47%), Gaps = 13/350 (3%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y  R+ +GTP + +YMVLDT +D  W  C+ C  C S T   F    S +FAT+ C  
Sbjct: 93  GNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTT--TFSAQNSSTFATLDCSK 150

Query: 196 PLCRKLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
           P C +     C       CL+  +YG  S        ++L      +   + GC     G
Sbjct: 151 PECTQARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGPNVIPNFSFGCISSASG 210

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFT 313
             +   GL+GLGRG LS  +Q+G  ++  FSYCL    +     S+  G     +  R T
Sbjct: 211 SSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTT 270

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           PLL NP   + YYV L GISVG   V  I+  L   DP    G IIDSGT +TR     Y
Sbjct: 271 PLLHNPHRPSLYYVNLTGISVGRVLVP-ISPELLAFDPNTGAGTIIDSGTVITRFVPAIY 329

Query: 374 IALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPV 433
            A+RD FR        +P    FDTCF  +   EV  P + LH  G D+ LP  N LI  
Sbjct: 330 TAVRDEFRKQVGG-SFSP-LGAFDTCF--ATNNEVSAPAITLHLSGLDLKLPMENSLIHS 385

Query: 434 DSSGTFCFAFAGT----MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +    C A A       S +++I N+QQQ  R+++D+  S++G A   C
Sbjct: 386 SAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELC 435


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 132/353 (37%), Positives = 184/353 (52%), Gaps = 30/353 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y   + +GTP     +++DTGSDV W+ C    +  + +   FDP KS ++    C S  
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSLFFDPGKSSTYTPFSCSSAA 182

Query: 198 CRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDN--- 251
           C +L+   +GC+  +TC Y V YGDGS T G + ++TL    T +V     GC   +   
Sbjct: 183 CTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEKVENFQFGCSETSDPG 242

Query: 252 EGLFVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           EGL      GL+GLG G  S  +QT   +   FSYCL   +T+     +  G S  +   
Sbjct: 243 EGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCL--PATTRSSGFLTLGASTGTSGF 300

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
             TP+  + +  TFY+V L GI+VGG  V  I+ ++F        G I+DSGT +TRL  
Sbjct: 301 VTTPMFRSRRAPTFYFVILQGINVGGDPV-AISPTVFA------AGSIMDSGTIITRLPP 353

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYL 430
            AY AL  AFRAG     RA  FS+ DTCFD +G+  V +P V L F G  V        
Sbjct: 354 RAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVFSGGAV-------- 405

Query: 431 IPVDSSGTF---CFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + +D+ G     C AFA    G+ SIIGN+QQ+ F V++D+  S +GF P  C
Sbjct: 406 VDLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRPGAC 458


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 137/387 (35%), Positives = 184/387 (47%), Gaps = 24/387 (6%)

Query: 106 AVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQ 165
           +V++  RN S         S++ S ++    EY   L +GTPP  +Y   DTGSD+VW Q
Sbjct: 31  SVKLIRRNSSHDSYK---PSTIQSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQ 87

Query: 166 CAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSIT 224
           C PC KCY Q +P+FDP  S S+  + C +  C KLDSS C+  + TC Y  SY D SIT
Sbjct: 88  CIPCTKCYKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSIT 147

Query: 225 VGDFSTETLTFRGTRVARVA-----LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
            G  + ETLT   T    VA      GCGH+N G      GL+GLGRG LS  +Q G   
Sbjct: 148 QGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDREMGLIGLGRGPLSLISQIGSSL 207

Query: 280 ---NRKFSYCLVDRSTS-AKPSSMVFGDSA--VSRTARFTPLLANPKLDTFYYVELVGIS 333
                 FS CLV  +T  +  S M FG  +  +      TPL++  K  T Y+  L+GIS
Sbjct: 208 GAGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLIS--KDGTGYFATLLGIS 265

Query: 334 VGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF 393
           V   ++     S   L     G ++IDSGT++T L    Y  L +  R   +      D 
Sbjct: 266 VEDINLPFSNGS--SLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG 323

Query: 394 SLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSII 453
             ++ C+     T +  PT+ +HF G DV L      IPV     FCFA   T       
Sbjct: 324 --YELCYQT--PTNLNGPTLTIHFEGGDVLLTPAQMFIPVQDD-NFCFAVFDTNEEYVTY 378

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           GN  Q  + + +DL    + F    C 
Sbjct: 379 GNYAQSNYLIGFDLERQVVSFKATDCT 405


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 130/400 (32%), Positives = 185/400 (46%), Gaps = 38/400 (9%)

Query: 91  RDVLRVKSLTAFAE---SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP 147
           +D  R+K L+  A+   +AV + P  +    AN                 Y  R+ +GTP
Sbjct: 65  KDPERLKYLSTLADQKTTAVPIAPGQQVLKIAN-----------------YVVRVKLGTP 107

Query: 148 PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC- 206
            + ++MVLDT +D  W+   PC  C   +   F P  S +  ++ C    C ++    C 
Sbjct: 108 GQQMFMVLDTSNDAAWV---PCSGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCP 164

Query: 207 -NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLG 265
               + CL+  SYG  S        + +T     +     GC +   G  +   GLLGLG
Sbjct: 165 ATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLG 224

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
           RG +S  +Q G  ++  FSYCL    +     S+  G     ++ R TPLL NP   + Y
Sbjct: 225 RGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLY 284

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           YV L G+SVG   V  I +     DP    G IIDSGT +TR  +P Y A+RD FR   +
Sbjct: 285 YVNLTGVSVGRIKVP-IPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN 343

Query: 386 SLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF 443
                P  SL  FDTCF  +   E + P + LHF G ++ LP  N LI   S    C + 
Sbjct: 344 ----GPISSLGAFDTCF--AATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSM 397

Query: 444 AG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A       S L++I N+QQQ  R+++D   SR+G A   C
Sbjct: 398 AAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 141/390 (36%), Positives = 202/390 (51%), Gaps = 38/390 (9%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC----AP---CKKCYSQTD 177
           S + SG   G G+Y   +  GTPP+ V ++ DTGSD++W+QC    AP   C K      
Sbjct: 40  SPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR 99

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDS-----SGCNRRN--TCLYQVSYGDGSITVGDFST 230
           P F  +KS + + VPC +  C  + +       C+      C Y   Y DGS T G  + 
Sbjct: 100 PAFVASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLAR 159

Query: 231 ETLTFR-----GTRVARVALGCGHDNE-GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           +T T       G  V  VA GCG  N+ G F    G++GLG+G+LSFP Q+G  F + FS
Sbjct: 160 DTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFS 219

Query: 285 YCLVDRSTS--AKPSSMVFGDSAVSRTA-RFTPLLANPKLDTFYYVELVGISVGGAHVRG 341
           YCL+D       + SS +F      R A  +TPL++NP   TFYYV +V I VG   V  
Sbjct: 220 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGN-RVLP 278

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD----FSLFD 397
           +  S + +D  GNGG +IDSG+++T L   AY+ L  AF A +  L R P     F   +
Sbjct: 279 VPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAF-AASVHLPRIPSSATFFQGLE 337

Query: 398 TCFDLSGKTEVK-----VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMS--G 449
            C+++S  +         P + + F +G  + LP  NYL+ V +    C A   T+S   
Sbjct: 338 LCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFA 396

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +++GN+ QQG+ V +D A++RIGFA   C
Sbjct: 397 FNVLGNLMQQGYHVEFDRASARIGFARTEC 426


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 133/357 (37%), Positives = 200/357 (56%), Gaps = 35/357 (9%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G +   +  GTP   + ++LDTGS + W QC  C  C   ++  FD + S +++   C  
Sbjct: 126 GNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSC-- 183

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGL 254
                + S+  N      Y ++YGD S +VG++  +T+T   + V  +   GCG +N+G 
Sbjct: 184 -----IPSTVENN-----YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGD 233

Query: 255 FVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--R 311
           F +   G+LGLG+G+LS  +QT  +FN+ FSYCL +  +     S++FG+ A S+++  +
Sbjct: 234 FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIG---SLLFGEKATSQSSSLK 290

Query: 312 FTPLLANP---KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
           FT L+  P   +   +Y+V L  ISVG   +  I +S+F      + G IIDS T +TRL
Sbjct: 291 FTSLVNGPGTLQESGYYFVNLSDISVGNERLN-IPSSVF-----ASPGTIIDSRTVITRL 344

Query: 369 TRPAYIALRDAFRAGAS----SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVS 423
            + AY AL+ AF+   +    S  R     + DTC++LSG+ +V +P +VLHF  GADV 
Sbjct: 345 PQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVR 404

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           L  TN +   D+S   C AFAGT S L+IIGN QQ    V+YD+   RIGF   GC+
Sbjct: 405 LNGTNIVWGSDAS-RLCLAFAGT-SELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCS 459


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 128/358 (35%), Positives = 184/358 (51%), Gaps = 22/358 (6%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  GSGEY   + +GTPP     + DTGSD++W QC PC KCY Q+ P+FDP KS SF+ 
Sbjct: 85  LTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSH 144

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           VPC S  C+ +D S C  +  C Y  +YGD + T GD   E +T   + V  V +GCGH+
Sbjct: 145 VPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSVKSV-IGCGHE 203

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSR 308
           + G F  A+G++GLG G+LS  +Q  +    +R+FSYCL    + A    + FG +AV  
Sbjct: 204 SGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN-GKINFGQNAVVS 262

Query: 309 TARF--TPLLA-NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
                 TPL++ NP   T+YYV L  IS+G             +  A  G VIIDSGT++
Sbjct: 263 GPGVVSTPLISKNPV--TYYYVTLEAISIGNER---------HMASAKQGNVIIDSGTTL 311

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHFR-GADV 422
           + L +  Y  +  +      + +     + +D CFD  ++  T   +P +   F  GA+V
Sbjct: 312 SFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANV 371

Query: 423 S-LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + LP   +    ++        A       IIGN+    F + YDL A R+ F P  C
Sbjct: 372 NLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 429


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 126/355 (35%), Positives = 191/355 (53%), Gaps = 23/355 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY   L +GTPP  +  V DTGS+++W QC PC  CY+Q DP+FDP  S ++  V C S
Sbjct: 92  GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSS 151

Query: 196 PLCRKLDS-SGCNRRN-TCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARVALGCG 248
             C  L++ + C+  + TC Y VSY DGS T+G F+ +TLT      R  ++  + +GCG
Sbjct: 152 SQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCG 211

Query: 249 HDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
            +N   F   ++G++GLG G +S   Q G   + KFSYCLV  +   + S + FG +AV 
Sbjct: 212 QNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPEND--QTSKINFGTNAVV 269

Query: 308 R--TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
                  TPL+   + DTFYY+ L  ISVG  +++   +++        G ++IDSGT++
Sbjct: 270 SGPGTVSTPLVVKSR-DTFYYLTLKSISVGSKNMQTPDSNI-------KGNMVIDSGTTL 321

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLP 425
           T L    YI + +A  +  ++ K   +      C++ +   ++ +P + +HF GADV L 
Sbjct: 322 TLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATA--DLNIPVITMHFEGADVKLY 379

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             N    V +    C AF  +     I GN+ Q+ F V YD A+  + F P  CA
Sbjct: 380 PYNSFFKV-TEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDCA 433


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 133/351 (37%), Positives = 186/351 (52%), Gaps = 21/351 (5%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   +G+G+P     M +DTGSDV W+QC PC +C+S+ D +FDP+ S +++   C S 
Sbjct: 121 EYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSA 180

Query: 197 LCRKLDSS----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE 252
            C +L  S    GC   + C Y V+YGD S T G +S++TLT   + +     GC     
Sbjct: 181 PCAQLSQSQEGNGC-MSSQCQYIVNYGDSSSTTGTYSSDTLTLGSSAMTDFQFGCSQSES 239

Query: 253 GLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR 311
           G F     GL+GLG G  S  +QT   F   FSYCL   S S+   ++  G S   +   
Sbjct: 240 GGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSSGFLTLGTGSSGFVK--- 296

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRP 371
            TP+L + ++ T+Y V L  I VG   +  +  S+F      + G ++DSGT +TRL   
Sbjct: 297 -TPMLRSTQIPTYYVVLLESIKVGSQQLN-LPTSVF------SAGSLMDSGTIITRLPPT 348

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYL 430
           AY AL  AF+AG      A    + DTCFD SG++ + +PTV L F  GA V L     +
Sbjct: 349 AYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAFDGIM 408

Query: 431 IPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + + SS   C AF   G  S L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 409 LEISSS-IRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 204/371 (54%), Gaps = 41/371 (11%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           + L    G +   +  GTPP+   ++LDTGS + W QC  C  C   +   FD   S ++
Sbjct: 118 NNLFDEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTY 177

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGC 247
           +   C       + S+  N      Y ++YGD S +VG++  +T+T   + V  +   GC
Sbjct: 178 SFGSC-------IPSTVGN-----TYNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGC 225

Query: 248 GHDNEGLFVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           G +NEG F + A G+LGLG+G+LS  +QT  +F + FSYCL + ++     S++FG+ A 
Sbjct: 226 GRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIG---SLLFGEKAT 282

Query: 307 SRTA--RFTPLLANP-----KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           S+++  +FT L+  P     +   +Y+V+L+ ISVG   +  I +S+F      + G II
Sbjct: 283 SQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRL-NIPSSVF-----ASPGTII 336

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGAS----SLKRAPDFSLFDTCFDLSGKTEVKVPTVVL 415
           DSGT +TRL + AY AL+ AF+   +    S  R  +  + DTC++LSG+ +V +P  VL
Sbjct: 337 DSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVL 396

Query: 416 HF-RGADVSLPATNYLIPVDSSGTFCFAFAG----TMS-GLSIIGNIQQQGFRVVYDLAA 469
           HF  GADV L     +   D+S   C AFAG    TM+  L+IIGN QQ    V+YD+  
Sbjct: 397 HFGDGADVRLNGKRVVWGNDAS-RLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRG 455

Query: 470 SRIGFAPRGCA 480
            RIGF   GC+
Sbjct: 456 RRIGFGGNGCS 466


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 135/363 (37%), Positives = 184/363 (50%), Gaps = 27/363 (7%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRS 187
           G +  S EY   +G+GTP     ++LDTGS + W+QC PC   +CY Q  P+FDP  S S
Sbjct: 121 GSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSS 180

Query: 188 FATVPCRSPLCRKL----DSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTF-RGTRV 240
           ++ VPC S  CR L    D  GC       C Y++ YG G+   G++ST+ LT   G  V
Sbjct: 181 YSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIV 240

Query: 241 ARVALGCGHDNE-GLFVAAAGLLGLGRGRLSFPTQ-TGRRFNRKFSYCLVDRSTSAKPSS 298
            R   GCGH  + G F  A G+LGLGR   S   Q + RR    FS+CL    T      
Sbjct: 241 KRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCL--PPTGVSTGF 298

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           +  G    +    FTPLL       FY +    ISV G  +  I  ++F+       GVI
Sbjct: 299 LALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAG-QLLDIPPAVFR------EGVI 351

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
            DSGT ++ L   AY ALR AFR+  +    AP     DTCF+ +G   V VPTV L FR
Sbjct: 352 TDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFR 411

Query: 419 -GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLS-IIGNIQQQGFRVVYDLAASRIGFAP 476
            GA V L A++ ++ +D     C AF  +    + +IG++ Q+   V+YD+   ++GF  
Sbjct: 412 GGATVHLDASSGVL-MDG----CLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRT 466

Query: 477 RGC 479
             C
Sbjct: 467 GAC 469


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 190/354 (53%), Gaps = 23/354 (6%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRS 195
           E+   +G G+P +    + DTGSD+ WIQC PC   CY Q DPVFDPAKS S+A VPC +
Sbjct: 111 EFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCGT 170

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDNEGL 254
             C       CN   TC+Y V YGDGS T G  + ETLTF  +        GCG  N G 
Sbjct: 171 TECAAAGGE-CNG-TTCVYGVEYGDGSSTTGVLARETLTFSSSSEFTGFIFGCGETNLGD 228

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT--ARF 312
           F    GLLGLGRG LS  +Q    F   FSYCL   +T+  P  +  G + V+     ++
Sbjct: 229 FGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTT--PGYLSIGATPVTGQIPVQY 286

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           T ++  P   +FY++ELV I++GG +V  +  S F        G ++DSGT +T L  PA
Sbjct: 287 TAMVNKPDYPSFYFIELVSINIGG-YVLPVPPSEFT-----KTGTLLDSGTILTYLPPPA 340

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYL-- 430
           Y ALRD F+      K AP +   DTC+D +G++ + +P V  +F  +D ++   N+   
Sbjct: 341 YTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNF--SDGAVFNLNFFGI 398

Query: 431 --IPVDSSGTF-CFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              P D+     C AF    + +  S++G+  Q+   V+YD+ A +IGF P  C
Sbjct: 399 MTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 130/355 (36%), Positives = 173/355 (48%), Gaps = 63/355 (17%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVP 192
           GSG Y   +G+G+P R +  + DTGSD+ W QC PC   CY Q + +FDP+ S S++ V 
Sbjct: 85  GSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVS 144

Query: 193 CRSPLCRKLDSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALG 246
           C SP C KL+S+     GC+  +TCLY + YGDGS ++G F+ E L+   T V      G
Sbjct: 145 CDSPSCEKLESATGNSPGCSS-STCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQFG 203

Query: 247 CGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
           CG +N GLF   AGLLGL R  LS  +QT +++ + FSYCL   S+S    S   GD   
Sbjct: 204 CGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSSTGYLSFGSGDGD- 262

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           S+  +FTP L      T Y           + V+ +   L    P   G  I+D      
Sbjct: 263 SKAVKFTPRLP----PTVY-----------SSVQKVFRELMSDYPRVKGVSILD------ 301

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPA 426
                                          TC+DLS    VKVP ++L+F G      A
Sbjct: 302 -------------------------------TCYDLSKYKTVKVPKIILYFSGGAEMDLA 330

Query: 427 TNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +I V      C AFAG      ++IIGN+QQ+   VVYD A  R+GFAP GC
Sbjct: 331 PEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 131/385 (34%), Positives = 191/385 (49%), Gaps = 47/385 (12%)

Query: 114 RSRGRANGGFSSSVI----SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
           RS  RAN  + +++     S +    GEY     VGTPP  +Y + DTGSD+VW+QC PC
Sbjct: 59  RSINRANHFYKTALTNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPC 118

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFS 229
           K+CY+QT P F P+KS ++  +PC S LC+                 S   G+++V   +
Sbjct: 119 KECYNQTTPKFKPSKSSTYKNIPCSSDLCK-----------------SGQQGNLSVDTLT 161

Query: 230 TETLTFRGTRVARVALGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
            E+ T       +  +GCG DN   F  A++G++GLG G  S  TQ G   + KFSYCL+
Sbjct: 162 LESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLL 221

Query: 289 DRSTSAKPSSMV-FGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
                +  +S + FGD+AV        TP++    +  FYY+ L   SVG   +      
Sbjct: 222 PNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPI-VFYYLTLEAFSVGNKRI------ 274

Query: 346 LFKLDPAGNGG----VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS-LFDTCF 400
             + + + NGG    +IIDSGT++T +    Y  L  A       LKR  D + LF+ C+
Sbjct: 275 --EFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLE-LVKLKRVNDPTRLFNLCY 331

Query: 401 DLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGL-----SIIGN 455
            ++       P +  HF+GADV L   +  + V + G  C AFA T + +     SI GN
Sbjct: 332 SVTSD-GYDFPIITTHFKGADVKLHPISTFVDV-ADGIVCLAFATTSAFIPSDVVSIFGN 389

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           + QQ   V YDL    + F P  C+
Sbjct: 390 LAQQNLLVGYDLQQKIVSFKPTDCS 414


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 129/359 (35%), Positives = 179/359 (49%), Gaps = 24/359 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G++   + +GTPP  +  ++DTGSD++WIQCAPC  CY Q  P+FDP KS ++  + C S
Sbjct: 66  GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDS 125

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARVALGCGHD 250
           PLC KLD+  C+    C Y   YGD S+T G  + +T TF     +   ++R   GCGH+
Sbjct: 126 PLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHN 185

Query: 251 NEGLFV-AAAGLLGLGRGRLSFPTQTGRRF-NRKFSYCLVDRSTSAKPSS-MVFGDSA-- 305
           N G F     GL+GLG G  S  +Q G  F  +KFS CLV   T  K SS M FG  +  
Sbjct: 186 NTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQV 245

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTS 364
           +      TPL+   K DT Y+V L+GISV   +        F ++   G   +++DSGT 
Sbjct: 246 LGNGVVTTPLVPREK-DTSYFVTLLGISVEDTY--------FPMNSTIGKANMLVDSGTP 296

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSL 424
              L +  Y  +    R    +LK   D     T      +T +K PT+  HF GA+V L
Sbjct: 297 PILLPQQLYDKVFAEVR-NKVALKPITDDPSLGTQLCYRTQTNLKGPTLTFHFVGANVLL 355

Query: 425 PATNYLIP--VDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                 IP    + G FC A +  T S   + GN  Q  + + +DL    + F P  C 
Sbjct: 356 TPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDCT 414


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 128/357 (35%), Positives = 177/357 (49%), Gaps = 23/357 (6%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y   L +GTPP  +Y + DTGSD+ W  C PC  CY Q +P+FDP KS ++  + C S
Sbjct: 70  GHYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDS 129

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-----GCGHD 250
            LC KLD+  C+ +  C Y  +Y   +IT G  + ET+T   T+   V L     GCGH+
Sbjct: 130 KLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHN 189

Query: 251 NEGLFV-AAAGLLGLGRGRLSFPTQTGRRF-NRKFSYCLVDRSTSAKPSS-MVFGD-SAV 306
           N G F     G++GLG G +S  +Q G  F  ++FS CLV   T    SS M FG  S V
Sbjct: 190 NTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKV 249

Query: 307 S-RTARFTPLLANPKLDTFYYVELVGISVGGA--HVRGITASLFKLDPAGNGGVIIDSGT 363
           S +    TPL+A  +  T Y+V L+GISV     H  G + ++ K      G + +DSGT
Sbjct: 250 SGKGVVSTPLVAK-QDKTPYFVTLLGISVENTYLHFNGSSQNVEK------GNMFLDSGT 302

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 423
             T L    Y  +    R+   ++K   D            K  ++ P +  HF GADV 
Sbjct: 303 PPTILPTQLYDQVVAQVRSEV-AMKPVTDDPDLGPQLCYRTKNNLRGPVLTAHFEGADVK 361

Query: 424 L-PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           L P   ++ P D  G FC  F  T S   + GN  Q  + + +DL    + F P+ C
Sbjct: 362 LSPTQTFISPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDC 416


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 171/355 (48%), Gaps = 18/355 (5%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY  R  +GTPP     + DT SD++W+QC+PC+ C+ Q  P+F+P KS +FA + C S
Sbjct: 88  GEYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDS 147

Query: 196 PLCRKLDSSGCNRR-NTCLYQVSYGDGSITVGDFSTETLTFRGTRVA--RVALGCGHDNE 252
             C   +   C    N CLY  +YGDGS T G   TE++ F    V   +   GCG +N+
Sbjct: 148 QPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGSNND 207

Query: 253 GLFVAA---AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            +   +    G++GLG G LS  +Q G +   KFSYCL+  ++++        D+ ++  
Sbjct: 208 FMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTTITGN 267

Query: 310 ARF-TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
               TPL+ +P   ++Y++ LVGI++G   ++  T          NG +IID GT +T L
Sbjct: 268 GVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTD------HTNGNIIIDLGTVLTYL 321

Query: 369 TRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGADVSLPAT 427
               Y       R      +   D    FD CF    +  +  P +V  F GA V L   
Sbjct: 322 EVNFYHNFVTLLREALGISETKDDIPYPFDFCF--PNQANITFPKIVFQFTGAKVFLSPK 379

Query: 428 NYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           N     D     C A        G S+ GN+ Q  F+V YD    ++ FAP  C+
Sbjct: 380 NLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCS 434


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 135/374 (36%), Positives = 184/374 (49%), Gaps = 36/374 (9%)

Query: 53  SLPLPAPDAESSL--SLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVP 110
           +L LPA     ++   L+L HVD+   + T   L +  I R   RV +L    +SA  +P
Sbjct: 15  TLSLPAAHCNDNVGFQLKLTHVDA-GTSYTKLQLLSRAIARSKARVAAL----QSAAVLP 69

Query: 111 PRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK 170
           P       A    ++S        SGEY   L +GTPP Y   ++DTGSD++W QCAPC 
Sbjct: 70  PVVDPITAARVLVTAS--------SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCL 121

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFST 230
            C  Q  P FD  KS ++  +PCRS  C  L S  C ++  C+YQ  YGD + T G  + 
Sbjct: 122 LCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKK-MCVYQYYYGDTASTAGVLAN 180

Query: 231 ETLTFRG-----TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
           ET TF        R   +A GCG  N G    ++G++G GRG LS  +Q G     +FSY
Sbjct: 181 ETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGP---SRFSY 237

Query: 286 CLVDRSTSAKPSSMVFG--------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGA 337
           CL     SA PS + FG        +++     + TP + NP L   Y++ L  IS+ G 
Sbjct: 238 CLTSY-LSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISL-GT 295

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-F 396
            +  I   +F ++  G GGVIIDSGTS+T L + AY A+R      A  L    D  +  
Sbjct: 296 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGL-VSAIPLTAMNDTDIGL 354

Query: 397 DTCFDLSGKTEVKV 410
           DTCF       V V
Sbjct: 355 DTCFQWPPPPNVTV 368


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 129/362 (35%), Positives = 184/362 (50%), Gaps = 24/362 (6%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCY-SQTDPVFDPAKSRSFATVPCRS 195
            Y  R  +GTPP+ + + +D  +D  W+ C+ C  C    + P FDP +S ++  V C +
Sbjct: 99  SYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGA 158

Query: 196 PLCRKLD----SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV-----ALG 246
           P C ++     S       +C + +SY   ++       + L+   +  A V       G
Sbjct: 159 PQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA-VLGQDALSLSDSNGAAVPDDHYTFG 217

Query: 247 CGH--DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
           C       G  V   GL+G GRG LSF +QT   +   FSYCL    +S    ++  G +
Sbjct: 218 CLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPA 277

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGT 363
              R  + TPLL+NP   + YYV +VG+ V G  V  I AS   LD A G GG I+D+GT
Sbjct: 278 GQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVP-IPASALALDAATGRGGTIVDAGT 336

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADV 422
             TRL+ PAY ALR+AFR G S+   AP    FDTC+ ++G     VP V   F  GA V
Sbjct: 337 MFTRLSPPAYAALRNAFRRGVSA-PAAPALGGFDTCYYVNGTKS--VPAVAFVFAGGARV 393

Query: 423 SLPATNYLIPVDSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           +LP  N +I   S G  C A A     G  +GL+++ ++QQQ  RVV+D+   R+GF+  
Sbjct: 394 TLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRE 453

Query: 478 GC 479
            C
Sbjct: 454 LC 455


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 136/366 (37%), Positives = 187/366 (51%), Gaps = 29/366 (7%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  G  EY   L +GTPP     + DTGSD+ W QC PCK C+ Q  P++D   S SF+ 
Sbjct: 76  LRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSP 135

Query: 191 VPCRSPLCRKLDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
           +PC S  C  + SS C+  + TC Y+ +Y DG+     +S E     G  V  +A GCG 
Sbjct: 136 LPCSSATCLPIWSSRCSTPSATCRYRYAYDDGA-----YSPEC---AGISVGGIAFGCGV 187

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG------- 302
           DN GL   + G +GLGRG LS   Q G     KFSYCL D   ++  S + FG       
Sbjct: 188 DNGGLSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLSSPVFFGSLAELAA 244

Query: 303 --DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL-DPAGNGGVII 359
              SA +   + TPL+ +P   + YYV L GIS+G A +  I    F L D  G+GG+I+
Sbjct: 245 SSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLP-IPNGTFDLNDDDGSGGMIV 303

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF--DLSGKTEV-KVPTVVLH 416
           DSGT  T L    +  + D   AG          SL   CF    +G  E+  +P +VLH
Sbjct: 304 DSGTIFTILVETGFRVVVDHV-AGVLGQPVVNASSLDRPCFPAPAAGVQELPDMPDMVLH 362

Query: 417 FR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGF 474
           F  GAD+ L   NY+   +   +FC    GT S   S++GN QQQ  ++++D+   ++ F
Sbjct: 363 FAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSF 422

Query: 475 APRGCA 480
            P  C+
Sbjct: 423 MPTDCS 428


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 130/361 (36%), Positives = 175/361 (48%), Gaps = 20/361 (5%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           ++  +GEY  ++ +GTPP  VY + DTGSD++W QC PC  CY Q +P+FDP+KS SF  
Sbjct: 84  VSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKE 143

Query: 191 VPCRSPLCRKLDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARVA 244
           V C S  CR LD+  C++ +  C +   YGDGS+  G  +TETLT      + T +  + 
Sbjct: 144 VSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIV 203

Query: 245 LGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRF--NRKFSYCLVD-RSTSAKPSSMV 300
            GCGH+N G F     GL G G   LS  +Q        RKFS CLV  R+  +  S ++
Sbjct: 204 FGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKII 263

Query: 301 FGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           FG  A VS +   +  L      T+Y+V L GISVG       ++S      A  G V I
Sbjct: 264 FGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPM----ATKGNVFI 319

Query: 360 DSGTSVTRLTRPAYIALRDAFR-AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           D+GT  T L R  Y  L    + A      + PD      C+     T +  P +  HF 
Sbjct: 320 DAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP-QLCY--RSATLIDGPILTAHFD 376

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GADV L   N  I     G +CFA         I GN  Q  F + +DL   ++ F    
Sbjct: 377 GADVQLKPLNTFI-SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVD 435

Query: 479 C 479
           C
Sbjct: 436 C 436


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 136/369 (36%), Positives = 197/369 (53%), Gaps = 36/369 (9%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC---RK 200
           +GTPPR V +++DT S++ W+Q   C  C     P F+P  S SF + PC S +C    K
Sbjct: 5   IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64

Query: 201 LD-SSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFR-----GTRVARVALGCG-HDNE 252
           L   S CNR   +C +QV+Y DGS   G  + E  + +      + +  V  GC   D +
Sbjct: 65  LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRR----FNRKFSYCLVDRSTSAKPSS-MVFGDSAV- 306
                ++G LGL RG  SFP Q G R     + +FSYC  +R+     S  ++FGDS + 
Sbjct: 125 RPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDSGIP 184

Query: 307 SRTARFTPLLANPKLDT---FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           +   ++  L   P + +   FYYV L GISVGG  +  I  S FK+D  GNGG   DSGT
Sbjct: 185 AHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLH-IPRSAFKIDRLGNGGTYFDSGT 243

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKVPT---VVLHFR 418
           +V+ L  PA+ AL +AF      L R    DF+  + C+D++   + ++PT   V LHF+
Sbjct: 244 TVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTK-ELCYDVAAG-DARLPTAPLVTLHFK 301

Query: 419 -GADVSLPATNYLIPVDSSG---TFCFAF--AGTMS--GLSIIGNIQQQGFRVVYDLAAS 470
              D+ L   +  +P+  +    T C AF  AG ++  G+++IGN QQQ + + +DL  S
Sbjct: 302 NNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLERS 361

Query: 471 RIGFAPRGC 479
           RIGFAP  C
Sbjct: 362 RIGFAPANC 370


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 128/366 (34%), Positives = 178/366 (48%), Gaps = 32/366 (8%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC--YSQTDPVFDPAKSRSFATV 191
           G GEY   L +GTPP+ +  ++DTGSD+VW++C  C  C      + +F    S S+  +
Sbjct: 1   GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60

Query: 192 PCRSPLCRKLDSSGCNRR--NTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR------- 242
           PC S  C  + S+G   R   TC Y+  YGDGS T GD  ++ ++FR             
Sbjct: 61  PCNSTHCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFD 120

Query: 243 -VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
               GCG   +G +    GL+GLG+   S   Q G +   KFSYCLV   +     S +F
Sbjct: 121 GFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLF 180

Query: 302 -GDSAVSRTARF--TPLLANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            G SA  R      TP+L    LD T YYV+L  I+VG     G+   ++  +   N  V
Sbjct: 181 LGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVG-----GVPVVVYDKESGHNTSV 235

Query: 358 --------IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
                   +IDSGT+ T LT P Y A+R +       L    + +  D CF+ SG T   
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV-ILPTLGNSAGLDLCFNSSGDTSYG 294

Query: 410 VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
            P+V  +F     + LP  N +  V S    C +   +   LSIIGN+QQQ F ++YDL 
Sbjct: 295 FPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDLV 353

Query: 469 ASRIGF 474
           AS+I F
Sbjct: 354 ASQISF 359


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 129/361 (35%), Positives = 173/361 (47%), Gaps = 20/361 (5%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           ++  +GEY  ++ +GTPP  VY + DTGSD++W QC PC  CY Q +P+FDP+KS SF  
Sbjct: 84  VSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKE 143

Query: 191 VPCRSPLCRKLDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVA 244
           V C S  CR LD+  C++ +  C +   YGDGS+  G  +TETLT          +  + 
Sbjct: 144 VSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIV 203

Query: 245 LGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRF--NRKFSYCLVD-RSTSAKPSSMV 300
            GCGH+N G F     GL G G   LS  +Q        RKFS CLV  R+  +  S ++
Sbjct: 204 FGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKII 263

Query: 301 FGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           FG  A VS +   +  L      T+Y+V L GISVG       ++S      A  G V I
Sbjct: 264 FGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPM----ATKGNVFI 319

Query: 360 DSGTSVTRLTRPAYIALRDAFR-AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           D+GT  T L R  Y  L    + A      + PD      C+     T +  P +  HF 
Sbjct: 320 DAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP-QLCY--RSATLIDGPILTAHFD 376

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           GADV L   N  I     G +CFA         I GN  Q  F + +DL   ++ F    
Sbjct: 377 GADVQLKPLNTFI-SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVD 435

Query: 479 C 479
           C
Sbjct: 436 C 436


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 127/359 (35%), Positives = 179/359 (49%), Gaps = 19/359 (5%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           +   +G+Y  +L +G+PP  +Y ++DTGSD+VW QC PC  CY Q  P+F+P +S++++ 
Sbjct: 75  VTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSP 134

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVAL 245
           +PC S  C     S C+ +  C Y  SY D S+T G  + E +TF  T      V  +  
Sbjct: 135 IPCESEQCSFFGYS-CSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIF 193

Query: 246 GCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRF-NRKFSYCLVDRSTSAKPSSMV-FG 302
           GCGH N G F      +    G  LS  +Q G  + +++FS CLV   T A  S  + FG
Sbjct: 194 GCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFG 253

Query: 303 -DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
            +S VS     T  LA+ +  T Y V L GISVG   VR  ++          G ++IDS
Sbjct: 254 EESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLS-----KGNIMIDS 308

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD 421
           GT  T + +  Y  L +  +  +S L    D  L  T      +T ++ P +  HF GAD
Sbjct: 309 GTPATYIPQEFYERLVEELKVQSSLLPIEDDPDL-GTQLCYRSETNLEGPILTAHFEGAD 367

Query: 422 VS-LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           V  LP   ++ P D  G FCFA AG+  G  I GN  Q    + +DL    I F P  C
Sbjct: 368 VQLLPIQTFIPPKD--GVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDC 424


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  191 bits (485), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 174/349 (49%), Gaps = 14/349 (4%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  R  +GTPP+ + + +DT +D  WI C  C  C S    +F P KS +F  V C 
Sbjct: 75  SPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPEKSTTFKNVSCA 131

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C+++ + GC   ++C + ++YG  SI   +   +T+T     V     GC     G 
Sbjct: 132 APECKQVPNPGCG-VSSCNFNLTYGSSSI-AANLVQDTITLATDPVPSYTFGCVSKTTGT 189

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
                GLLGLGRG LS  +QT   +   FSYCL    +     S+  G  A  +  ++TP
Sbjct: 190 SAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTP 249

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV L  I V G  V  I  +    +P    G I DSGT  TRL  P Y+
Sbjct: 250 LLKNPRRSSLYYVNLEAIRV-GRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYV 308

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           A+RD FR              FDTC+++     + VPT+   F G +V+LP  N LI   
Sbjct: 309 AVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIFTGMNVTLPQDNILIHST 364

Query: 435 SSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +  T C A AG      S L++I N+QQQ  RV+YD+  SR+G A   C
Sbjct: 365 AGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELC 413


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 134/390 (34%), Positives = 197/390 (50%), Gaps = 32/390 (8%)

Query: 113 NRSRGRANGGFSSSVISG------LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           +RS  RAN    +SV +       +  G GEYF R+ +GTPP  V ++ DTGSD++W+QC
Sbjct: 63  HRSISRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISIGTPPIEVLVIADTGSDLIWVQC 122

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS--GCNRR---NTCLYQVSYGDG 221
            PC++CY Q  P+F+P +S ++  V C +  C  L+S    C+       C Y  SYGD 
Sbjct: 123 QPCQECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDH 182

Query: 222 SITVGDFSTETLTFRGTR--VARVALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRR 278
           S T+G  +TE      T   +  +A GCG+ N G F    +G++GLG G LS  +Q G +
Sbjct: 183 SFTMGYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTK 242

Query: 279 FNRKFSYCLVD--RSTSAKPSSMVFGDSAV---SRTARFTPLLANPKLDTFYYVELVGIS 333
            + KFSYCLV     ++     +VFGD++    S T   TPL++    +TFYY+ L  IS
Sbjct: 243 IDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEP-ETFYYLTLEAIS 301

Query: 334 VGGAHVRGITASLFKLDPAGN---GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA 390
           VG   +    +        GN   G +IIDSGT++T L    Y  L           + +
Sbjct: 302 VGNERLAYENSR-----NDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVS 356

Query: 391 PDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGL 450
               +F  CF    K  +++P + +HF  ADV L   N     +     CF    + +G+
Sbjct: 357 DPNGIFSICF--RDKIGIELPIITVHFTDADVELKPINTFAKAEED-LLCFTMIPS-NGI 412

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +I GN+ Q  F V YDL  + + F P  C+
Sbjct: 413 AIFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 132/349 (37%), Positives = 175/349 (50%), Gaps = 16/349 (4%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  R  +GTPP+ + + +DT +D  WI CA C  C + + P FDPA S S+ +VPC SPL
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPL 169

Query: 198 CRKLDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV 256
           C +  ++ C      C + ++Y D S+     S ++L   G  V     GC     G   
Sbjct: 170 CAQAPNAACPPGGKACGFSLTYADSSLQAA-LSQDSLAVAGDAVKTYTFGCLQKATGTAA 228

Query: 257 AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL 316
              GLLGLGRG LSF +QT   +   FSYCL    +     ++  G +      + TPLL
Sbjct: 229 PPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTPLL 288

Query: 317 ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
           ANP   + YYV + GI V G  V  I       DPA   G ++DSGT  TRL  PAY+A+
Sbjct: 289 ANPHRSSLYYVNMTGIRV-GRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVAV 347

Query: 377 RDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           RD  R        AP  SL  FDTCF+    T V  P V L F G  V+LP  N +I   
Sbjct: 348 RDEVRRRVG----APVSSLGGFDTCFN---TTAVAWPPVTLLFDGMQVTLPEENVVIHST 400

Query: 435 SSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                C A A    G    L++I ++QQQ  RV++D+   R+GFA   C
Sbjct: 401 YGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 449


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 180/370 (48%), Gaps = 21/370 (5%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SGL    GEYF  + +GTPP  V+ + DTGSD+ W+QC PC++CY Q  P+FD  KS ++
Sbjct: 76  SGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTY 135

Query: 189 ATVPCRSPLCRKLD--SSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR--- 242
            T  C S  C+ L     GC+  ++ C Y+ SYGD S T GD +TET++   +  +    
Sbjct: 136 KTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSF 195

Query: 243 --VALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
                GCG++N G F      +    G  LS  +Q G    +KFSYCL   + +   +S+
Sbjct: 196 PGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSV 255

Query: 300 V-FGDSAV-----SRTARFTPLLANPKLDTFYYVELVGISVGGAHV--RGITASLFKLDP 351
           +  G +++       +A  T  L     +T+Y++ L  ++VG   +   G    L     
Sbjct: 256 INLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSS 315

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFDLSGKTEVKV 410
              G +IIDSGT++T L    Y     A     +  KR  D   L   CF  SG  E+ +
Sbjct: 316 KRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFK-SGDKEIGL 374

Query: 411 PTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           P + +HF  ADV L   N  + ++   T C +   T   ++I GN+ Q  F V YDL   
Sbjct: 375 PAITMHFTNADVKLSPINAFVKLNED-TVCLSMIPTTE-VAIYGNMVQMDFLVGYDLETK 432

Query: 471 RIGFAPRGCA 480
            + F    C+
Sbjct: 433 TVSFQRMDCS 442


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 133/368 (36%), Positives = 187/368 (50%), Gaps = 33/368 (8%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           S L   +GEY   L +GTPP     + DTGSD++W+QC+PC+ C+ Q  P+F+P KS +F
Sbjct: 83  SLLIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTF 142

Query: 189 ATVPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA-- 244
               C S  C  +  S   C +   C+Y  SYGD S TVG   TETL+F  T  A+    
Sbjct: 143 KAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSF 202

Query: 245 ----LGCGHDNEGLFVAA---AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
                GCG  N   F  +    GL+GLG G LS  +Q G +   KFSYCL+  S+++  S
Sbjct: 203 PSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPFSSNST-S 261

Query: 298 SMVFGDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG-- 353
            + FG  A+  T     TPL+  P   +FY++ L  +++G            K+ P G  
Sbjct: 262 KLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQ-----------KVVPTGRT 310

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPT 412
           +G +IIDSGT +T L +  Y     + +    S++ A D    F  CF     T   +P 
Sbjct: 311 DGNIIIDSGTVLTYLEQTFYNNFVASLQE-VLSVESAQDLPFPFKFCFPYRDMT---IPV 366

Query: 413 VVLHFRGADVSLPATNYLIPVDSSGTFCFAFA-GTMSGLSIIGNIQQQGFRVVYDLAASR 471
           +   F GA V+L   N LI +      C A    ++SG+SI GN+ Q  F+VVYDL   +
Sbjct: 367 IAFQFTGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKK 426

Query: 472 IGFAPRGC 479
           + FAP  C
Sbjct: 427 VSFAPTDC 434


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 133/384 (34%), Positives = 192/384 (50%), Gaps = 34/384 (8%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD--PVFD 181
           S +V + L  G+G Y   + +GTPP    +++DTGS+++W QCAPC +C+ +    PV  
Sbjct: 77  SVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQ 136

Query: 182 PAKSRSFATVPCRSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           PA+S +F+ +PC    C+ L +S     CN    C Y  +YG G  T G  +TETLT   
Sbjct: 137 PARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGD 195

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
               +VA GC  +N      ++G++GLGRG LS  +Q       +FSYCL         S
Sbjct: 196 GTFPKVAFGCSTENG--VDNSSGIVGLGRGPLSLVSQLAV---GRFSYCLRSDMADGGAS 250

Query: 298 SMVFGDSA---VSRTARFTPLLANPKLD--TFYYVELVGISVGGAHVRGITASLFKLDPA 352
            ++FG  A        + TPLL NP L   T YYV L GI+V    +  +T S F     
Sbjct: 251 PILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELP-VTGSTFGFTQT 309

Query: 353 G-NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS----LFDTCFDLS---G 404
           G  GG I+DSGT++T L +  Y  ++ AF++  ++L +    S      D C+  S   G
Sbjct: 310 GLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGG 369

Query: 405 KTEVKVPTVVLHFR-GADVSLPATNYL--IPVDSSGTFCFAFAGTMSG-----LSIIGNI 456
              V+VP + L F  GA  ++P  NY   +  DS G    A    +       +SIIGN+
Sbjct: 370 GKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNL 429

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
            Q    ++YD+      FAP  CA
Sbjct: 430 MQMDMHLLYDIDGGMFSFAPADCA 453


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 150/436 (34%), Positives = 209/436 (47%), Gaps = 38/436 (8%)

Query: 58  APDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRG 117
           +PD   SL+L +H    LS    P H        D  R+++  AF+ S  RV   N  + 
Sbjct: 29  SPDPGFSLNL-IHRDSPLSPLYNPNH-------TDFDRLRN--AFSRSISRV---NVFKT 75

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
           +A     +S  + L    GEYF ++ +GTP   V ++ DTGSD+ W+QC PC  CY Q  
Sbjct: 76  KAVD--INSFQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKS 133

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSS--GCNRR-NTCLYQVSYGDGSITVGDFSTETLT 234
           P+FDP++S S+  + C S  C  LD S   C    N C Y  SYGD S T G+ +TE  T
Sbjct: 134 PLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFT 193

Query: 235 F-----RGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
                 R   ++ +  GCG  N G F    +G++GLG G LS  +Q       KFSYCLV
Sbjct: 194 IGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLV 253

Query: 289 DRSTSAKPSSMV-FG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
             S  +  +S + FG DS +S     +  L + + DT+YYV L  ISVG   +      L
Sbjct: 254 PLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLL 313

Query: 347 FKLDPAGN---GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLS 403
                 GN   G VIIDSGT++T L    +  L         + + +    LF  CF  +
Sbjct: 314 -----NGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSA 368

Query: 404 GKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
           G  ++ +P + +HF  ADV L   N  +  D     CF    + + + I GN+ Q  F V
Sbjct: 369 G--DIDLPVIAVHFNDADVKLQPLNTFVKADED-LLCFTMISS-NQIGIFGNLAQMDFLV 424

Query: 464 VYDLAASRIGFAPRGC 479
            YDL    + F P  C
Sbjct: 425 GYDLEKRTVSFKPTDC 440


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 129/357 (36%), Positives = 176/357 (49%), Gaps = 30/357 (8%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK- 200
           +GVGTPP+   ++LD GSD++W QC+       Q +PVFD A+S SF+ +PC S LC   
Sbjct: 111 VGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAG 170

Query: 201 -LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR--VARVALGCGHDNEGLFVA 257
              +  C  R  C Y+  YG  + T G  +TET TF       A +  GCG    G    
Sbjct: 171 TFTNKTCTDRK-CAYENDYGIMTAT-GVLATETFTFGAHHGVSANLTFGCGKLANGTIAE 228

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV---DRSTSAKPSSMVFGDSA------VSR 308
           A+G+LGL  G LS   Q       KFSYCL    DR T    S ++FG  A       + 
Sbjct: 229 ASGILGLSPGPLSMLKQLAI---TKFSYCLTPFADRKT----SPVMFGAMADLGKYKTTG 281

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
             +  PLL NP  D +YYV +VG+SVG   +  +      + P G GG ++DS T++  L
Sbjct: 282 KVQTIPLLKNPVEDIYYYVPMVGMSVGSKRL-DVPQETLAIKPDGTGGTVLDSATTLAYL 340

Query: 369 TRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLS---GKTEVKVPTVVLHFRG-ADVSL 424
             PA+  L+ A   G            +  CF+L        V+VP +VLHF G A++SL
Sbjct: 341 VEPAFTELKKAVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSL 400

Query: 425 PATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           P  NY     S G  C A   A      ++IGN+QQQ   V+YD+   +  +AP  C
Sbjct: 401 PRDNYFQE-PSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 134/384 (34%), Positives = 194/384 (50%), Gaps = 34/384 (8%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD--PVFD 181
           S +V + L  G+G Y   + +GTPP    +++DTGS+++W QCAPC +C+ +    PV  
Sbjct: 77  SVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQ 136

Query: 182 PAKSRSFATVPCRSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           PA+S +F+ +PC    C+ L +S     CN    C Y  +YG G  T G  +TETLT   
Sbjct: 137 PARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGD 195

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
               +VA GC  +N      ++G++GLGRG LS  +Q       +FSYCL         S
Sbjct: 196 GTFPKVAFGCSTENG--VDNSSGIVGLGRGPLSLVSQLAV---GRFSYCLRSDMADGGAS 250

Query: 298 SMVFGDSA--VSRT-ARFTPLLANPKLD--TFYYVELVGISVGGAHVRGITASLFKLDPA 352
            ++FG  A    R+  + TPLL NP L   T YYV L GI+V    +  +T S F     
Sbjct: 251 PILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELP-VTGSTFGFTQT 309

Query: 353 G-NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS----LFDTCFDLS---G 404
           G  GG I+DSGT++T L +  Y  ++ AF++  ++L +    S      D C+  S   G
Sbjct: 310 GLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGG 369

Query: 405 KTEVKVPTVVLHFR-GADVSLPATNYL--IPVDSSGTFCFAFAGTMSG-----LSIIGNI 456
              V+VP + L F  GA  ++P  NY   +  DS G    A    +       +SIIGN+
Sbjct: 370 GKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNL 429

Query: 457 QQQGFRVVYDLAASRIGFAPRGCA 480
            Q    ++YD+      FAP  CA
Sbjct: 430 MQMDMHLLYDIDGGMFSFAPADCA 453


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 133/375 (35%), Positives = 196/375 (52%), Gaps = 42/375 (11%)

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC-- 198
           +LG+G+  + +  ++DTGS+ V +QC       S++ PVFDPA S+S+  VPC S LC  
Sbjct: 103 QLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLA 156

Query: 199 -RKLDSSG-----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-------RVAL 245
            ++  S+G      N   TC Y +SYGD   + GDFS + +    T  +        VA 
Sbjct: 157 VQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAF 216

Query: 246 GCGHDNEGLFV--AAAGLLGLGRGRLSFPTQTGRRF-NRKFSYCLVDRSTSAKPSSMVF- 301
           GC H  +G  V   + G++G  RG LS P+Q   R    KFSYC   +    + + ++F 
Sbjct: 217 GCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFL 276

Query: 302 GDSAVSRT-ARFTPLLAN---PKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGG 356
           GDS +S++   +TPLL N   P     YYV L  ISV G  +  I  S FKLDP+ G+GG
Sbjct: 277 GDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTL-AIPESAFKLDPSTGDGG 335

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLS-GKTEVKVPTV 413
            ++DSGT+ TR+   AY A R+AF A   S   K+    + FD C+++S G +   VP V
Sbjct: 336 TVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 395

Query: 414 VLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTMSG----LSIIGNIQQQGFRVVY 465
            L  +    + L   +  +PV ++G   T C A   +       ++++GN QQ  + V Y
Sbjct: 396 RLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEY 455

Query: 466 DLAASRIGFAPRGCA 480
           D   SR+GF    C+
Sbjct: 456 DNERSRVGFERADCS 470


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 129/346 (37%), Positives = 196/346 (56%), Gaps = 37/346 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G +   +  GTPP+   ++LDTGS + W QC PC +C   +   FDP+ S +++   C  
Sbjct: 160 GNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSC-- 217

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGL 254
                + S+  N      Y ++YGD S +VG++  +T+T   + V  +   GCG +NEG 
Sbjct: 218 -----IPSTVGNT-----YNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFGCGRNNEGD 267

Query: 255 FVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--R 311
           F + A G+LGLG+G+LS  +QT  +F + FSYCL +  +     S++FG+ A S+++  +
Sbjct: 268 FGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIG---SLLFGEKATSQSSSLK 324

Query: 312 FTPLLANP-----KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           FT L+  P     +   +Y+V+L+ ISVG   +  I +S+F      + G IIDSGT +T
Sbjct: 325 FTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLN-IPSSVF-----ASPGTIIDSGTVIT 378

Query: 367 RLTRPAYIALRDAFRAGAS----SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGAD 421
           RL + AY AL+ AF+   +    S  R     + DTC++LSG+ +V +P +VLHF  GAD
Sbjct: 379 RLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGAD 438

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDL 467
           V L     +   D+S   C AFAG  S L+IIGN QQ    V+YD+
Sbjct: 439 VRLNGKRVIWGNDAS-RLCLAFAGN-SELTIIGNRQQVSLTVLYDI 482


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 133/364 (36%), Positives = 187/364 (51%), Gaps = 23/364 (6%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  G G Y   + VGTP     +V DTGSD++W QCAPC KC+ Q  P F PA S +F+ 
Sbjct: 79  LENGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138

Query: 191 VPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           +PC S  C+ L +S   CN    C+Y   YG G  T G  +TETL         VA GC 
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATG-CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCS 196

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSA-- 305
            +N G+  + +G+ GLGRG LS   Q G     +FSYCL  RS SA  +S ++FG  A  
Sbjct: 197 TEN-GVGNSTSGIAGLGRGALSLIPQLGV---GRFSYCL--RSGSAAGASPILFGSLANL 250

Query: 306 VSRTARFTPLLANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAG-NGGVIIDSGT 363
                + TP + NP +  ++YYV L GI+VG   +  +T S F     G  GG I+DSGT
Sbjct: 251 TDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLP-VTTSTFGFTQNGLGGGTIVDSGT 309

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD-LSGKTEVKVPTVVLHFR-GAD 421
           ++T L +  Y  ++ AF +  +++         D CF    G   + VP++VL F  GA+
Sbjct: 310 TLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAE 369

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSG-----LSIIGNIQQQGFRVVYDLAASRIGFAP 476
            ++P     +  DS G+   A    +       +S+IGN+ Q    ++YDL      F+P
Sbjct: 370 YAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSP 429

Query: 477 RGCA 480
             CA
Sbjct: 430 ADCA 433


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 126/366 (34%), Positives = 177/366 (48%), Gaps = 32/366 (8%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC--YSQTDPVFDPAKSRSFATV 191
           G GEY   L +GTPP+ +  ++DTGSD+VW++C  C  C      + +F    S S+  +
Sbjct: 1   GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60

Query: 192 PCRSPLCRKLDSSGCNRR--NTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR------- 242
           PC S  C  + S+G   R   TC Y+  YGDGS T GD  ++ ++FR             
Sbjct: 61  PCNSTHCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFD 120

Query: 243 -VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
               GC    +G +    GL+GLG+   S   Q G +   KFSYCLV   +     S +F
Sbjct: 121 GFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLF 180

Query: 302 -GDSAVSRTARF--TPLLANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            G SA  R      TP+L    LD T YYV+L  I++G     G+   ++  +   N  V
Sbjct: 181 LGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIG-----GVPVVVYDKESGHNTSV 235

Query: 358 --------IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
                   +IDSGT+ T LT P Y A+R +       L    + +  D CF+ SG T   
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV-ILPTLGNSAGLDLCFNSSGDTSYG 294

Query: 410 VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
            P+V  +F     + LP  N +  V S    C +   +   LSIIGN+QQQ F ++YDL 
Sbjct: 295 FPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDLV 353

Query: 469 ASRIGF 474
           AS+I F
Sbjct: 354 ASQISF 359


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 134/365 (36%), Positives = 186/365 (50%), Gaps = 24/365 (6%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  G G Y   + VGTP     +V DTGSD++W QCAPC KC+ Q  P F PA S +F+ 
Sbjct: 79  LENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138

Query: 191 VPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           +PC S  C+ L +S   CN    C+Y   YG G  T G  +TETL         VA GC 
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATG-CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCS 196

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSA-- 305
            +N G+  + +G+ GLGRG LS   Q G     +FSYCL  RS SA  +S ++FG  A  
Sbjct: 197 TEN-GVGNSTSGIAGLGRGALSLIPQLGV---GRFSYCL--RSGSAAGASPILFGSLANL 250

Query: 306 VSRTARFTPLLANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAG-NGGVIIDSGT 363
                + TP + NP +  ++YYV L GI+VG   +  +T S F     G  GG I+DSGT
Sbjct: 251 TDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLP-VTTSTFGFTQNGLGGGTIVDSGT 309

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHFR-GA 420
           ++T L +  Y  ++ AF +  + +         D CF     G   + VP++VL F  GA
Sbjct: 310 TLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGA 369

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSG-----LSIIGNIQQQGFRVVYDLAASRIGFA 475
           + ++P     +  DS G+   A    +       +S+IGN+ Q    ++YDL      FA
Sbjct: 370 EYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFA 429

Query: 476 PRGCA 480
           P  CA
Sbjct: 430 PADCA 434


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/358 (33%), Positives = 172/358 (48%), Gaps = 28/358 (7%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY  R  +GTP      + DTGSD+ W+QC PCK CY Q  P+FDP +S ++  VPC S
Sbjct: 86  GEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCES 145

Query: 196 PLCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-------GTRVARVALG 246
             C     +   C     C+Y   YG  S T+G    +T++F        G    +   G
Sbjct: 146 QPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFG 205

Query: 247 CGHDNEGLF---VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
           C   +   F     A G +GLG G LS  +Q G +   KFSYC+V  S+++    + FG 
Sbjct: 206 CAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTST-GKLKFGS 264

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
            A +     TP + NP   ++Y + L GI+VG   V  +T  +        G +IIDS  
Sbjct: 265 MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--LTGQI-------GGNIIIDSVP 315

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDF-SLFDTCFDLSGKTEVKVPTVVLHFRGADV 422
            +T L +  Y     + +  A +++ A D  + F+ C  +   T +  P  V HF GADV
Sbjct: 316 ILTHLEQGIYTDFISSVKE-AINVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHFTGADV 372

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            L   N  I +D++   C     +  G+SI GN  Q  F+V YDL   ++ FAP  C+
Sbjct: 373 VLGPKNMFIALDNN-LVCMTVVPS-KGISIFGNWAQVNFQVEYDLGEKKVSFAPTNCS 428


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 181/369 (49%), Gaps = 21/369 (5%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDP 182
           F+ +  S +    GEY  +  +GTP   +  + DTGSD++W QC PC +CY Q  P+FDP
Sbjct: 77  FTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDP 136

Query: 183 AKSRSFATVPCRSPLCRKL-DSSGCNRR--NTCLYQVSYGDGSITVGDFSTETLTF---- 235
             S ++  + C +  C  L + + C+     TC Y  SYGD S T G+ + +T+T     
Sbjct: 137 KSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTS 196

Query: 236 -RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGR-LSFPTQTGRRFNRKFSYCLVDRSTS 293
            R   + +  +GCGH+N G F      +    G  +S  +Q G   + KFSYCLV  S++
Sbjct: 197 GRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSN 256

Query: 294 AKPSSMV-FGDSAVSRTA--RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
           A  SS + FG + +      + TPL++    DTFY++ L  +SVG   ++   +S     
Sbjct: 257 ATNSSKLNFGSNGIVSGGGVQSTPLISKDP-DTFYFLTLEAVSVGSERIKFPGSSF---- 311

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
               G +IIDSGT++T      +  L  A +   +         +   C+ +    ++K 
Sbjct: 312 GTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDA--DLKF 369

Query: 411 PTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
           P++  HF GADV L   N  + V S    CFAF    SG +I GN+ Q  F V YDL   
Sbjct: 370 PSITAHFDGADVKLNPLNTFVQV-SDTVLCFAFNPINSG-AIFGNLAQMNFLVGYDLEGK 427

Query: 471 RIGFAPRGC 479
            + F P  C
Sbjct: 428 TVSFKPTDC 436


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 131/349 (37%), Positives = 175/349 (50%), Gaps = 20/349 (5%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  R  +GTPP+ + + +DT +D  WI CA C  C + +   FDPA S S+ TVPC SPL
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPL 171

Query: 198 CRKLDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV 256
           C +  ++ C      C + ++Y D S+     S ++L   G  V     GC     G   
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSLQAA-LSQDSLAVAGNAVKAYTFGCLQRATGTAA 230

Query: 257 AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL 316
              GLLGLGRG LSF +QT   +   FSYCL    +     ++  G +   +  + TPLL
Sbjct: 231 PPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLL 290

Query: 317 ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
           ANP   + YYV + GI VG   V      +   DPA   G ++DSGT  TRL  PAY+A+
Sbjct: 291 ANPHRSSLYYVNMTGIRVGRKVV-----PIPAFDPATGAGTVLDSGTMFTRLVAPAYVAV 345

Query: 377 RDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           RD  R        AP  SL  FDTCF+    T V  P V L F G  V+LP  N +I   
Sbjct: 346 RDEVRRRVG----APVSSLGGFDTCFN---TTAVAWPPVTLLFDGMQVTLPEENVVIHST 398

Query: 435 SSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                C A A    G    L++I ++QQQ  RV++D+   R+GFA   C
Sbjct: 399 YGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 130/366 (35%), Positives = 192/366 (52%), Gaps = 40/366 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQC----APCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +G+GTPP+   +++DTGSD++W QC    +        + PV+DP +S +FA +PC   L
Sbjct: 95  VGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRL 154

Query: 198 CRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA--RVALGCGHDNEG 253
           C++   S   C  +N C+Y+  YG  +  VG  ++ET TF   R    R+  GCG  + G
Sbjct: 155 CQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGARRAVSLRLGFGCGALSAG 213

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA------VS 307
             + A G+LGL    LS  TQ      ++FSYCL   +   K S ++FG  A       +
Sbjct: 214 SLIGATGILGLSPESLSLITQLK---IQRFSYCLTPFA-DKKTSPLLFGAMADLSRHKTT 269

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVR-GITASLFKLDPAGNGGVIIDSGTSVT 366
           R  + T +++NP    +YYV LVGIS+G  H R  + A+   + P G GG I+DSG++V 
Sbjct: 270 RPIQTTAIVSNPVKTVYYYVPLVGISLG--HKRLAVPAASLAMRPDGGGGTIVDSGSTVA 327

Query: 367 RLTRPAYIALRDAFRAGASSLKRAP----DFSLFDTCFDLSGKT------EVKVPTVVLH 416
            L   A+ A+++A       + R P        ++ CF L  +T       V+VP +VLH
Sbjct: 328 YLVEAAFEAVKEAVM----DVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLH 383

Query: 417 FR-GADVSLPATNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIG 473
           F  GA + LP  NY      +G  C A   T   SG+SIIGN+QQQ   V++D+   +  
Sbjct: 384 FDGGAAMVLPRDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFS 442

Query: 474 FAPRGC 479
           FAP  C
Sbjct: 443 FAPTQC 448


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 129/375 (34%), Positives = 190/375 (50%), Gaps = 34/375 (9%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA-PCK--KCYSQT------DPVFDPAK 184
           G G+YF    VGTP +   +V DTGSD+ W+ C   C+   C ++         VF    
Sbjct: 79  GIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANL 138

Query: 185 SRSFATVPCRSPLCR-----KLDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF--- 235
           S SF T+PC + +C+         + C    T C Y   Y DGS  +G F+ ET+T    
Sbjct: 139 SSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELK 198

Query: 236 --RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
             R  ++  V +GC    +G  F AA G++GLG  + SF  +   +F  KFSYCLVD  +
Sbjct: 199 EGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLS 258

Query: 293 SAKPSS-MVFGDS----AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
               S+ + FG S    A+     +T L+    +++FY V ++GIS+GGA ++ I + ++
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLK-IPSEVW 316

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKT 406
             D  G GG I+DSG+S+T LT PAY  +  A R      ++   D    + CF+ +G  
Sbjct: 317 --DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFE 374

Query: 407 EVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQQGFRVV 464
           E  VP +V HF  GA+   P  +Y+I   + G  C  F      G S++GNI QQ     
Sbjct: 375 ESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSVVGNIMQQNHLWE 433

Query: 465 YDLAASRIGFAPRGC 479
           +DL   ++GFAP  C
Sbjct: 434 FDLGLKKLGFAPSSC 448


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  188 bits (477), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 140/425 (32%), Positives = 206/425 (48%), Gaps = 40/425 (9%)

Query: 87  LRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGT 146
           +R++   +  K      E   R   R   R  + GG ++ +  G   G  +Y     +G 
Sbjct: 23  IRLELTHVDAKEHYTVEERVRRATERTHRRLASMGGVTAPIHWG---GQSQYIAEYLIGD 79

Query: 147 PPRYVYMVLDTGSDVVWIQCAPCK-KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG 205
           PP+    ++DTGS+++W QC+ C+  C+ Q  P +DP++SR+   V C    C     + 
Sbjct: 80  PPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALGSETQ 139

Query: 206 CNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GHDNEGLFVAAAGL 261
           C   N TC     YG G+I  G  +TE LTF+   V+ V  GC      + G    A+G+
Sbjct: 140 CLSDNKTCAVVTGYGAGNI-AGTLATENLTFQSETVSLV-FGCIVVTKLSPGSLNGASGI 197

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSA--VSRTARFTPLLAN 318
           +GLGRG+LS P+Q G   + +FSYCL      + +PS MV G SA  ++ +A  TP+   
Sbjct: 198 IGLGRGKLSLPSQLG---DTRFSYCLTPYFEDTIEPSHMVVGASAGLINGSASSTPVTTV 254

Query: 319 P--------KLDTFYYVELVGISVGGAHVRGITAS--LFKLDPAGNGGVIIDSGTSVTRL 368
           P           TFYY+ L GI+ G   +   +A+  L ++ P    G  IDSG  +T L
Sbjct: 255 PFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSL 314

Query: 369 TRPAYIALRD--AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-----RGAD 421
              AY ALR   A + GA+ ++     + FD C  L    E  VP +VLHF      G D
Sbjct: 315 VDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALK-DAERLVPPLVLHFGGGSGTGTD 373

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGT------MSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           + +P  NY  PVDS+      F+        M+  ++IGN  QQ   V+YDLA   + F 
Sbjct: 374 LVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQ 433

Query: 476 PRGCA 480
           P  C+
Sbjct: 434 PADCS 438


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 132/374 (35%), Positives = 194/374 (51%), Gaps = 42/374 (11%)

Query: 141 RLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC-- 198
           +LG+G+  + +  ++DTGS+ V +QC       S++ PVFDPA S+S+  VPC S LC  
Sbjct: 2   QLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLA 55

Query: 199 -RKLDSSG-----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-------VAL 245
            ++  S+G      N    C Y +SYGD   + GDFS + +    T  +        VA 
Sbjct: 56  VQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAF 115

Query: 246 GCGHDNEGLFV--AAAGLLGLGRGRLSFPTQTGRRF-NRKFSYCLVDRSTSAKPSSMVF- 301
           GC H  +G  V   + G++G  RG LS P+Q   R    KFSYC   +    + + ++F 
Sbjct: 116 GCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFL 175

Query: 302 GDSAVSRT-ARFTPLLAN---PKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGG 356
           GDS +S++   +TPLL N   P     YYV L  ISV G  +  I  S FKLDP+ G+GG
Sbjct: 176 GDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTL-AIPESAFKLDPSTGDGG 234

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLS-GKTEVKVPTV 413
            ++DSGT+ TR+   AY A R+AF A   S   K+    + FD C+++S G +   VP V
Sbjct: 235 TVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 294

Query: 414 VLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTMSG----LSIIGNIQQQGFRVVY 465
            L  +    + L   +  +PV ++G   T C A   +       ++++GN QQ  + V Y
Sbjct: 295 RLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEY 354

Query: 466 DLAASRIGFAPRGC 479
           D   SR+GF    C
Sbjct: 355 DNERSRVGFERADC 368


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 132/351 (37%), Positives = 175/351 (49%), Gaps = 20/351 (5%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  R  +GTP + + + +DT +D  WI C+ C  C   T   F+PA S S+  VPC SP 
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGC--PTSSPFNPAASASYRPVPCGSPQ 164

Query: 198 CRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV 256
           C    +  C+    +C + +SY D S+     S +TL   G  V     GC     G   
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSYADSSLQAA-LSQDTLAVAGDVVKAYTFGCLQRATGTAA 223

Query: 257 AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL 316
              GLLGLGRG LSF +QT   +   FSYCL    +     ++  G +   R  + TPLL
Sbjct: 224 PPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLL 283

Query: 317 ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
           ANP   + YYV + GI V G  V  I AS    DPA   G ++DSGT  TRL  P Y+AL
Sbjct: 284 ANPHRSSLYYVNMTGIRV-GKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLAL 342

Query: 377 RDAFR----AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIP 432
           RD  R    AGA+++        FDTC++    T V  P V L F G  V+LP  N +I 
Sbjct: 343 RDEVRRRVGAGAAAVS---SLGGFDTCYN----TTVAWPPVTLLFDGMQVTLPEENVVIH 395

Query: 433 VDSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                T C A A    G    L++I ++QQQ  RV++D+   R+GFA   C
Sbjct: 396 TTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/361 (32%), Positives = 181/361 (50%), Gaps = 30/361 (8%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVP 192
           +G Y  R+ +GTP      + DTGSD+ W+QC+PC   KC++Q  P++DP  S +F  +P
Sbjct: 93  NGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLP 152

Query: 193 CRSPLCRKLDSSG--CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGC 247
           C S  C +L  S   C+    C+Y  +YGD S + G  S++++     ++   +++  GC
Sbjct: 153 CDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFGC 212

Query: 248 GHDNEGLFVA-----AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG 302
           G  N+  F A       G++GLG G LS  +Q G     KFSYCL+  S+++  S + FG
Sbjct: 213 GFQNK--FTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSN-SKLKFG 269

Query: 303 DSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVR-GITASLFKLDPAGNGGVII 359
           ++A+ +      TPL+  P L  FYY+ L GI+VG   V+ G T          +G +II
Sbjct: 270 EAAIVQGNGVVSTPLIIKPDL-PFYYLNLEGITVGAKTVKTGQT----------DGNIII 318

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           DSG+++T L    Y       +   +  +       FD CF          P VV HF G
Sbjct: 319 DSGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP-PDVVFHFTG 377

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            DV L   N L+ ++ +            G++I GN+ Q  F V YD+   ++ FAP  C
Sbjct: 378 GDVVLKPMNTLVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDC 437

Query: 480 A 480
           +
Sbjct: 438 S 438


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 138/376 (36%), Positives = 194/376 (51%), Gaps = 35/376 (9%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L   +G Y   L +GTPP    ++ DTGS ++W QCAPC +C ++  P F PA S +F+ 
Sbjct: 83  LDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSK 142

Query: 191 VPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           +PC S LC+ L S    CN    C+Y   YG G  T G  +TETL   G     VA GC 
Sbjct: 143 LPCASSLCQFLTSPYLTCNATG-CVYYYPYGMG-FTAGYLATETLHVGGASFPGVAFGCS 200

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA--V 306
            +N G+  +++G++GLGR  LS  +Q G     +FSYCL      A  S ++FG  A   
Sbjct: 201 TEN-GVGNSSSGIVGLGRSPLSLVSQVGV---GRFSYCL-RSDADAGDSPILFGSLAKVT 255

Query: 307 SRTARFTPLLANPKL--DTFYYVELVGISVGGAHVRGITASLFKLDPAGN----GGVIID 360
               + TPLL NP++   ++YYV L GI+VG   +  +T++ F           GG I+D
Sbjct: 256 GGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLP-VTSTTFGFTRGAGAGLVGGTIVD 314

Query: 361 SGTSVTRLTRPAYIALRDAF-----RAGASSLKRAPDFSLFDTCFDLS---GKTEVKVPT 412
           SGT++T L +  Y  ++ AF      A  ++      F  FD CFD +   G + V VPT
Sbjct: 315 SGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG-FDLCFDATAAGGGSGVPVPT 373

Query: 413 VVLHFR-GADVSLPATNY--LIPVDSSG---TFCFAF--AGTMSGLSIIGNIQQQGFRVV 464
           +VL F  GA+ ++   +Y  ++ VDS G     C     A     +SIIGN+ Q    V+
Sbjct: 374 LVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVL 433

Query: 465 YDLAASRIGFAPRGCA 480
           YDL      FAP  CA
Sbjct: 434 YDLDGGMFSFAPADCA 449


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 124/369 (33%), Positives = 177/369 (47%), Gaps = 31/369 (8%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           S L   +GEY  R  +GTPP       DTGSD++W+QC+PC  C+ Q+ P+F P KS +F
Sbjct: 81  SVLILHNGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTF 140

Query: 189 ATVPCRSPLCRKL--DSSGCNRRNTCLYQVSYGDG-SITVGDFSTETLTFRGT-RVARVA 244
               CRS  C  L  +  GC +   C+Y   YGD  S + G  STETL F     V  VA
Sbjct: 141 MPTTCRSQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVA 200

Query: 245 -----LGCG-HDNEGLF--VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP 296
                 GCG ++N  +F      G++GLG G LS  +Q G +   KFSYCL+   +++  
Sbjct: 201 FPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTST- 259

Query: 297 SSMVFGDSAV--SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG- 353
           S + FG+ ++        TP++  P L T+Y++ L  ++V    V           P G 
Sbjct: 260 SKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTV-----------PTGS 308

Query: 354 -NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
            +G VIIDSGT +T L    Y     + +   +        S    CF    +     P 
Sbjct: 309 TDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPY--RDNFVFPE 366

Query: 413 VVLHFRGADVSLPATNYLIPVDSSGTFCFAFA-GTMSGLSIIGNIQQQGFRVVYDLAASR 471
           +   F GA VSL   N  +  +   T C   A  ++SG+SI G+  Q  F+V YDL   +
Sbjct: 367 IAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKK 426

Query: 472 IGFAPRGCA 480
           + F P  C+
Sbjct: 427 VSFQPTDCS 435


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 132/351 (37%), Positives = 175/351 (49%), Gaps = 20/351 (5%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  R  +GTP + + + +DT +D  WI C+ C  C   T   F+PA S S+  VPC SP 
Sbjct: 54  YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGC--PTSSPFNPAASASYRPVPCGSPQ 111

Query: 198 CRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV 256
           C    +  C+    +C + +SY D S+     S +TL   G  V     GC     G   
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYADSSLQAA-LSQDTLAVAGDVVKAYTFGCLQRATGTAA 170

Query: 257 AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL 316
              GLLGLGRG LSF +QT   +   FSYCL    +     ++  G +   R  + TPLL
Sbjct: 171 PPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLL 230

Query: 317 ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
           ANP   + YYV + GI V G  V  I AS    DPA   G ++DSGT  TRL  P Y+AL
Sbjct: 231 ANPHRSSLYYVNMTGIRV-GKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLAL 289

Query: 377 RDAFR----AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIP 432
           RD  R    AGA+++        FDTC++    T V  P V L F G  V+LP  N +I 
Sbjct: 290 RDEVRRRVGAGAAAVS---SLGGFDTCYN----TTVAWPPVTLLFDGMQVTLPEENVVIH 342

Query: 433 VDSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                T C A A    G    L++I ++QQQ  RV++D+   R+GFA   C
Sbjct: 343 TTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 133/340 (39%), Positives = 168/340 (49%), Gaps = 27/340 (7%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           M +DT  DV WIQCAPC   +CY Q DP+FDP  S + A V CRSP CR L    +GC+ 
Sbjct: 150 MAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSN 209

Query: 209 RNT---CLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEGLFV-AAAGLLG 263
           R+    C Y + Y D   T G + T+TLT  GT   R    GC H   G F    AG + 
Sbjct: 210 RSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGCSHAVRGRFSDLTAGTMS 269

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF--TPLLANPKL 321
           LG G  S   QT R     FSYC+   S S   S  + G +  + T  F  TPL+ +   
Sbjct: 270 LGGGAQSLLAQTARSLGNAFSYCVPQASASGFLS--IGGPATTNSTTVFATTPLVRSAIN 327

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
            + Y V L GI V G  + GI    F      + G ++DS   +T+L   AY ALR AFR
Sbjct: 328 PSLYLVRLQGIVVAGRRL-GIPPVAF------SAGAVMDSSAVITQLPPTAYRALRRAFR 380

Query: 382 AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCF 441
               +  R+      DTC+D  G T V+VP V L F G  V +     L P       C 
Sbjct: 381 NAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVV-----LDPPAVMIGGCL 435

Query: 442 AFAGTMSGLSI--IGNIQQQGFRVVYDLAASRIGFAPRGC 479
           AF  T S L++  IGN+QQQ   V+YD+AA  +GF    C
Sbjct: 436 AFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 132/377 (35%), Positives = 198/377 (52%), Gaps = 38/377 (10%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L +  GEY+T + +G+P +   +++DTGS++ W++C PCK C    D ++D A+S S+  
Sbjct: 93  LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKP 152

Query: 191 VPC-RSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTF------RGTR 239
           V C  S LC    S G    C R + C +   YGDGS + G  ST+TL        +   
Sbjct: 153 VTCNNSQLCSN-SSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVT 211

Query: 240 VARVALGCGH-DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
           V   A GC   D E +   A+G+LGL  G+++ P Q G+RF  KFS+C  DRS+    + 
Sbjct: 212 VQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTG 271

Query: 299 MV-FGDSAV-SRTARFTPL-LANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
           +V FG++ +     ++T + L N +L   FY+V L G+S        I +    L P G+
Sbjct: 272 VVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVS--------INSHELVLLPRGS 323

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFD--TCFDLSG----KTE 407
             VI+DSG+S +   RP +  LR+AF +    SLK     S  D  TCF +S     +  
Sbjct: 324 -VVILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELH 382

Query: 408 VKVPTVVLHFR-GADVSLPATNYLIPV---DSSGTFCFAFA-GTMSGLSIIGNIQQQGFR 462
             +P++ L F  G  + +P+   L+PV    +    CFAF  G  + +++IGN QQQ   
Sbjct: 383 RTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLW 442

Query: 463 VVYDLAASRIGFAPRGC 479
           V YD+  SR+GFA   C
Sbjct: 443 VEYDIQRSRVGFARASC 459


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 126/345 (36%), Positives = 172/345 (49%), Gaps = 39/345 (11%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           MVLDT SDV W+QC+PC    CY Q D ++DP KS S     C SP C +L   ++GC  
Sbjct: 171 MVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTN 230

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFV---AAAGLLGL 264
            N C Y+V Y DG+ T G + ++ LT    T V     GC H  +G F    +AAG++ L
Sbjct: 231 NNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMAL 290

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF----TPLLANPK 320
           G G  S  +QT   + R FS+C         P+   F    V R A +    TP+L NP 
Sbjct: 291 GGGPESLVSQTAATYGRVFSHCF------PPPTRRGFFTLGVPRVAAWRYVLTPMLKNPA 344

Query: 321 LD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
           +  TFY V L  I+V G  +  +  ++F        G  +DS T++TRL   AY ALR A
Sbjct: 345 IPPTFYMVRLEAIAVAGQRI-AVPPTVFA------AGAALDSRTAITRLPPTAYQALRQA 397

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF 439
           FR   +  + AP     DTC+D++G     +P + L F          N  + +D SG  
Sbjct: 398 FRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFD--------KNAAVELDPSGVL 449

Query: 440 ---CFAF-AGTMSGLS-IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              C AF AG    +  IIGNIQ Q   V+Y++ A+ +GF    C
Sbjct: 450 FQGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 132/377 (35%), Positives = 197/377 (52%), Gaps = 38/377 (10%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L +  GEY+T + +G+P +   +++DTGS++ W+QC PCK C    D ++D A+S S+  
Sbjct: 93  LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRP 152

Query: 191 VPC-RSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTF------RGTR 239
           V C  S LC    S G    C R + C +   YGDGS + G  ST+TL        +   
Sbjct: 153 VTCNNSQLCSN-SSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVT 211

Query: 240 VARVALGCGH-DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
           V   A GC   D E +   A+G+LGL  G+++ P Q G+RF  KFS+C  DRS+    + 
Sbjct: 212 VQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTG 271

Query: 299 MV-FGDSAV-SRTARFTPL-LANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
           +V FG++ +     ++T + L N +L   FY+V L G+S        I +      P G+
Sbjct: 272 VVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVS--------INSHELVFLPRGS 323

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFD--TCFDLSG----KTE 407
             VI+DSG+S +   RP +  LR+AF +    SLK     S  D  TCF +S     +  
Sbjct: 324 -VVILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELH 382

Query: 408 VKVPTVVLHFR-GADVSLPATNYLIPV---DSSGTFCFAFA-GTMSGLSIIGNIQQQGFR 462
             +P++ L F  G  + +P+   L+PV    +    CFAF  G  + +++IGN QQQ   
Sbjct: 383 RTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQQQNLW 442

Query: 463 VVYDLAASRIGFAPRGC 479
           V YD+  SR+GFA   C
Sbjct: 443 VEYDIQRSRVGFARASC 459


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 131/369 (35%), Positives = 186/369 (50%), Gaps = 22/369 (5%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+ V + +   +G+Y  +L +GTPP  VY ++DTGSD+VW QC PC+ CY Q  P+F+P 
Sbjct: 36  SNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPL 95

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR---- 239
           +S ++  +PC S  C  L    C+ +  C Y  +Y D S+T G  + ET+TF  T     
Sbjct: 96  RSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPV 155

Query: 240 -VARVALGCGHDNEGLFVAA-AGLLGLGRGRLSFPTQTGRRF-NRKFSYCLVDRSTSAKP 296
            V  +  GCGH N G F     G++GLG G LS  +Q G  + +++FS CLV     A P
Sbjct: 156 VVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLV--PFHADP 213

Query: 297 SSM---VFGD-SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
            ++    FGD S VS        L + +  T Y V L GISVG   V   ++ +      
Sbjct: 214 HTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLS---- 269

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
             G ++IDSGT  T L +  Y  L    +  ++ L    D  L  T      +T ++ P 
Sbjct: 270 -KGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDL-GTQLCYRSETNLEGPI 327

Query: 413 VVLHFRGADVSL-PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           ++ HF GADV L P   ++ P D  G FCFA AGT  G  I GN  Q    + +DL    
Sbjct: 328 LIAHFEGADVQLMPIQTFIPPKD--GVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKT 385

Query: 472 IGFAPRGCA 480
           + F    C+
Sbjct: 386 VSFKATDCS 394


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 133/358 (37%), Positives = 174/358 (48%), Gaps = 48/358 (13%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   L +GTPP+ V + LDTGSD++W QC PC  C+ Q  P FDP+ S + +   C S 
Sbjct: 88  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 147

Query: 197 LCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV 256
           LC+ L  +   R +                     T    G  V  VA GCG  N G+F 
Sbjct: 148 LCQGLPVASLPRSDKF-------------------TFVGAGASVPGVAFGCGLFNNGVFK 188

Query: 257 A-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF---------GDSAV 306
           +   G+ G GRG LS P+Q        FS+C     T A PS+++          G  AV
Sbjct: 189 SNETGIAGFGRGPLSLPSQLKV---GNFSHCFT-TITGAIPSTVLLDLPADLFSNGQGAV 244

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
               + TPL+ NP   TFYY+ L GI+VG   +  +  S F L   G GG IIDSGT++T
Sbjct: 245 ----QTTPLIQNPANPTFYYLSLKGITVGSTRLP-VPESEFALK-NGTGGTIIDSGTAMT 298

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFS--LFDTCFDLSGKTEVK--VPTVVLHFRGADV 422
            L    Y  +RDAF A      + P  S    D  F LS     K  VP +VLHF GA +
Sbjct: 299 SLPTRVYRLVRDAFAAQV----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATM 354

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            LP  NY+  V+ +G+     A    G ++ IGN QQQ   V+YDL  S++ F P  C
Sbjct: 355 DLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 153/445 (34%), Positives = 207/445 (46%), Gaps = 39/445 (8%)

Query: 48  SESESSLPLPAPDAESSLSLRLHHVDSLS----FNRTPEHLFNLRIQ--RDVLRVKSLTA 101
           S S SS P   PDA ++L +  H     S        P     L  Q  RD  R+  L +
Sbjct: 29  SHSRSSCPATPPDAGNTLQVS-HAFGPCSPLGPGTAAPSWAGFLADQASRDASRLLYLDS 87

Query: 102 FAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDV 161
            A        R R+R  A       ++  L      Y  R  +GTPP+ + + +DT +D 
Sbjct: 88  LAV-------RGRARAYAPIASGRQLLQTL-----TYVVRASLGTPPQQLLLAVDTSNDA 135

Query: 162 VWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR-RNTCLYQVSYGD 220
            WI CA C  C + +   FDPA S S+ TVPC SPLC +  ++ C      C + ++Y D
Sbjct: 136 SWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYAD 195

Query: 221 GSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN 280
            S+     S ++L   G  V     GC     G      GLLGLGRG LSF +QT   + 
Sbjct: 196 SSLQAA-LSQDSLAVAGNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYE 254

Query: 281 RKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
             FSYCL    +     ++  G +   +  + TPLLANP   + YYV + G+ VG   V 
Sbjct: 255 ATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVV- 313

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDT 398
                +   DPA   G ++DSGT  TRL  PAY+A+RD  R        AP  SL  FDT
Sbjct: 314 ----PIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVG----APVSSLGGFDT 365

Query: 399 CFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG----LSIIG 454
           CF+    T V  P + L F G  V+LP  N +I        C A A    G    L++I 
Sbjct: 366 CFN---TTAVAWPPMTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIA 422

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGC 479
           ++QQQ  RV++D+   R+GFA   C
Sbjct: 423 SMQQQNHRVLFDVPNGRVGFARERC 447


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 127/349 (36%), Positives = 176/349 (50%), Gaps = 18/349 (5%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  R  +GTPP+ + + +DT +D  WI C+ C  C + T   F+PA S+S+  VPC SP 
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPA 165

Query: 198 CRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV 256
           C +  +  C+    +C + ++Y D S+     S ++L      V     GC     G   
Sbjct: 166 CSRAPNPSCSLNTKSCGFSLTYADSSLEAA-LSQDSLAVANDVVKSYTFGCLQKATGTAT 224

Query: 257 AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL 316
              GLLGLGRG LSF +QT   +   FSYCL    +     ++  G        + TPLL
Sbjct: 225 PPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLRIKTTPLL 284

Query: 317 ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
            NP   + YYV + GI V G  V  I  +    DPA   G ++DSGT  TRL  PAY+A+
Sbjct: 285 VNPHRSSLYYVSMTGIRV-GKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAV 343

Query: 377 RDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           RD  R     ++ AP  SL  FDTC++    T VK P V   F G  V+LPA N +I   
Sbjct: 344 RDEVR---RRIRGAPLSSLGGFDTCYN----TTVKWPPVTFMFTGMQVTLPADNLVIHST 396

Query: 435 SSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              T C A A    G    L++I ++QQQ  R+++D+   R+GFA   C
Sbjct: 397 YGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQC 445


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 141/440 (32%), Positives = 212/440 (48%), Gaps = 48/440 (10%)

Query: 60  DAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRA 119
           +AE+ L ++L HVD      T E          VLR  +++           R + + R 
Sbjct: 28  EAEAGLRMKLAHVDDKGGYTTEER---------VLRAVAVS-----------RQQQQQRL 67

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC---KKCYSQT 176
             G    V + + + + +Y     +G+PP+    ++DTGSD++W QCA     K C  Q 
Sbjct: 68  MAGAEDDVSAQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQG 127

Query: 177 DPVFDPAKSRSFATVPC--RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
            P ++ ++S +F  VPC  ++  C       C    +C +  SYG G + +G   TE+  
Sbjct: 128 LPYYNLSQSSTFVPVPCADKAGFCAANGVHLCGLDGSCTFIASYGAGRV-IGSLGTESFA 186

Query: 235 FRGTRVARVALGC---GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
           F  +    +A GC        G    A+GL+GLGRGRLS  +Q G     +FSYCL    
Sbjct: 187 FE-SGTTSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIG---ATRFSYCLTPYF 242

Query: 292 TSAKPSSMVF--GDSAVSRTARFTPLLANPK---LDTFYYVELVGISVGGAHVRGITASL 346
            S+  SS +F    +++       P + +PK     TFYY+ L GI+VG   +  + ++ 
Sbjct: 243 HSSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTT 302

Query: 347 FKL----DPAGNGGVIIDSGTSVTRLTRPAYIALRD--AFRAGASSLKRAPDFSLFDTCF 400
           F+L         GGVIID+G+ +T+L   AY AL++  A + G  SL  AP+ S  + C 
Sbjct: 303 FQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCV 362

Query: 401 DLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
              G  +V VP +V HF  GAD+++PA +Y  PVD +        G     SIIGN QQQ
Sbjct: 363 AREGFQKV-VPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYD--SIIGNFQQQ 419

Query: 460 GFRVVYDLAASRIGFAPRGC 479
              ++YDL   R  F    C
Sbjct: 420 DMHLLYDLRRGRFSFQTADC 439


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 126/345 (36%), Positives = 172/345 (49%), Gaps = 39/345 (11%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           MVLDT SDV W+QC+PC    CY Q D ++DP KS S     C SP C +L   ++GC  
Sbjct: 146 MVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTN 205

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFV---AAAGLLGL 264
            N C Y+V Y DG+ T G + ++ LT    T V     GC H  +G F    +AAG++ L
Sbjct: 206 NNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMAL 265

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF----TPLLANPK 320
           G G  S  +QT   + R FS+C         P+   F    V R A +    TP+L NP 
Sbjct: 266 GGGPESLVSQTAATYGRVFSHCF------PPPTRRGFFTLGVPRVAAWRYVLTPMLKNPA 319

Query: 321 LD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
           +  TFY V L  I+V G  +  +  ++F        G  +DS T++TRL   AY ALR A
Sbjct: 320 IPPTFYMVRLEAIAVAGQRI-AVPPTVFA------AGAALDSRTAITRLPPTAYQALRQA 372

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF 439
           FR   +  + AP     DTC+D++G     +P + L F          N  + +D SG  
Sbjct: 373 FRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFD--------KNAAVELDPSGVL 424

Query: 440 ---CFAF-AGTMSGLS-IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              C AF AG    +  IIGNIQ Q   V+Y++ A+ +GF    C
Sbjct: 425 FQGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 125/358 (34%), Positives = 179/358 (50%), Gaps = 22/358 (6%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           SGE+   + +GTPP  V  + DTGSD+ W QC PC++C++Q+ P+F+P +S S+  V C 
Sbjct: 87  SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCA 146

Query: 195 SPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
           S  CR L+S  C     +C Y  SYGD S T GD +++ +T    ++ +  +GCGH N G
Sbjct: 147 SDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGG 206

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRF---NRKFSYCLVDRSTSAKPSSMV-FGDSAV--S 307
            F      +    G         R       +FSYCL    ++A  +  + FG  AV   
Sbjct: 207 TFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSG 266

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVR---GITASLFKLDPAGNGGVIIDSGTS 364
           R    TPL+     DTFY++ L  ISVG    +   GI+A         +G +IIDSGT+
Sbjct: 267 RQVVSTPLVPRSP-DTFYFLTLEAISVGKKRFKAANGISAM------TNHGNIIIDSGTT 319

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFS-LFDTCFDLSGKTEVKVPTVVLHFR-GADV 422
           +T L R  Y  +     A     KR  D S + + C+      ++ +P +  HF  GADV
Sbjct: 320 LTLLPRSLYYGVFSTL-ARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADV 378

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            L   N   PV  + T C  FA   + ++I GN+ Q  F V YDL   R+ F P+ CA
Sbjct: 379 KLLPVNTFAPVADNVT-CLTFA-PATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 146/445 (32%), Positives = 211/445 (47%), Gaps = 66/445 (14%)

Query: 63  SSLSLRLHHVDSLSFNRTPEHLFNLRI--------------QRDVLRVKSLTAFAESAVR 108
           SS  L L     LS  +T  H FN+ +              +  + R+ S+  ++ + VR
Sbjct: 5   SSFVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVR 64

Query: 109 VPPRNRSRGRANGGFSSSVISGLA----QGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
                 S       FS + I  +      G+G Y     +GTPP  +Y ++DTG+D +W 
Sbjct: 65  YLNHVFS-------FSPNKIQDVPLSSFMGAG-YVMSYSIGTPPFQLYSLIDTGNDNIWF 116

Query: 165 QCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSIT 224
           QC PCK C +QT P+F P+KS ++ T+PC SP+C+  D                G  ++T
Sbjct: 117 QCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPICKNADGH------------YLGVDTLT 164

Query: 225 VGDFSTETLTFRGTRVARVALGCGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKF 283
           +   +   ++F+      + +GCGH N+G L    +G +GL RG LSF +Q       KF
Sbjct: 165 LNSNNGTPISFK-----NIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKF 219

Query: 284 SYCLVDRSTSAKPSSMV-FGD-SAVSRTARF-TPLLANPKLDTFYYVELVGISVGGAHVR 340
           SYCLV   +    SS + FGD S VS      TP+    K +  Y+V L   SVG     
Sbjct: 220 SYCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPI----KEENGYFVSLEAFSVGD---- 271

Query: 341 GITASLFKLDPAGN-GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS-LFDT 398
                + KL+ + N G  IIDSGT++T L +  Y  L ++       LKR  D S  F+ 
Sbjct: 272 ----HIIKLENSDNRGNSIIDSGTTMTILPKDVYSRL-ESVVLDMVKLKRVKDPSQQFNL 326

Query: 399 CFDLSGKTEV-KVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFA--GTMSGLSIIGN 455
           C+  +  T + KV  +  HF G++V L A N   P+ +    CFAF   G  S L+I GN
Sbjct: 327 CYQTTSTTLLTKVLIITAHFSGSEVHLNALNTFYPI-TDEVICFAFVSGGNFSSLAIFGN 385

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           + QQ F V +DL    I F P  C 
Sbjct: 386 VVQQNFLVGFDLNKKTISFKPTDCT 410


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 128/373 (34%), Positives = 182/373 (48%), Gaps = 28/373 (7%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V S +  G GEY  R+ +G P   +  + DTGSD++W+QC PC+ CY Q  P+FDP +S 
Sbjct: 82  VQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSS 141

Query: 187 SFATVPCRSPLCRKLDSSG--CNRR---NTCLYQVSYGDGSITVGDFSTETLTFRGTR-- 239
           S+  V C +  C KLD     C+ R    TC Y  SYGD S + G  + E      T   
Sbjct: 142 SYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSN 201

Query: 240 -------VARVALGCGHDNEGLF-VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
                     VA GCG  N G F    +G++GLG G +S  +Q G + + KFSYCLV  S
Sbjct: 202 TSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTS 261

Query: 292 TSAKPSSMV-FGD----SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
             +  +S + FG+    S  +     TPLL   K +T+YY+ L  ISV   + R    +L
Sbjct: 262 EQSNYTSKINFGNDINISGSNYNVVSTPLLPK-KPETYYYLTLEAISV--ENKRLPYTNL 318

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT 406
           +  +    G +IIDSGT++T L    +  L  A        + +    LF+ CF    + 
Sbjct: 319 WNGE-VEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICF--KDEK 375

Query: 407 EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYD 466
            +++P +  HF GADV L   N    V+     CF    + + ++I GN+ Q  F V YD
Sbjct: 376 AIELPIITAHFTGADVELQPVNTFAKVEED-LLCFTMIPS-NDIAIFGNLAQMNFLVGYD 433

Query: 467 LAASRIGFAPRGC 479
           L    + F P  C
Sbjct: 434 LEKKAVSFLPTDC 446


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 139/388 (35%), Positives = 196/388 (50%), Gaps = 48/388 (12%)

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD 177
            A+GG  S +I+     S EY   + VGTPP  +  + DTGSD+VW+ C+        +D
Sbjct: 84  EADGGVESKIITR----SFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASD 139

Query: 178 P--VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF 235
              VF P++S +++ + C+S  C+ L  + C+  + C YQ +YGDGS T+G  STET +F
Sbjct: 140 GAVVFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSF 199

Query: 236 RG--------TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTG--RRFNRKFSY 285
                      RV RV+ GC   + G F  + GL+GLG G LS  +Q G   R  R+FSY
Sbjct: 200 AAAGGGGEGQVRVPRVSFGCSTGSAGSF-RSDGLVGLGAGALSLVSQLGAAARIARRFSY 258

Query: 286 CLVDRSTSAKPSS-MVFGDSAVSRT--ARFTPLLANPKLDTFYYVELVGISVGGAHVRGI 342
           CLV    +A  SS + FG  AV     A  TPL+ + ++D++Y V L  ++V G  V   
Sbjct: 259 CLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPS-EVDSYYTVALESVAVAGQDVASA 317

Query: 343 TASLFKLDPAGNGGVIIDSGTSVT----RLTRPAYIALRDAFRAGASSLKRA-PDFSLFD 397
            +S           +I+DSGT++T     L RP    L    R     L RA P   L  
Sbjct: 318 NSSR----------IIVDSGTTLTFLDPALLRPLVAELERRIR-----LPRAQPPEQLLQ 362

Query: 398 TCFDLSGKTEVK---VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSG--LS 451
            C+D+ GK++ +   +P V L F  GA V+L   N    ++  GT C           +S
Sbjct: 363 LCYDVQGKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVS 421

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           I+GNI QQ F V YDL A  + FA   C
Sbjct: 422 ILGNIAQQNFHVGYDLDARTVTFAAVDC 449


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 128/375 (34%), Positives = 189/375 (50%), Gaps = 34/375 (9%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA-PCK--KCYSQT------DPVFDPAK 184
           G G+Y     VGTP +   +V DTGSD+ W+ C   C+   C ++         VF    
Sbjct: 79  GIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANL 138

Query: 185 SRSFATVPCRSPLCR-----KLDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF--- 235
           S SF T+PC + +C+         + C    T C Y   Y DGS  +G F+ ET+T    
Sbjct: 139 SSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELK 198

Query: 236 --RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
             R  ++  V +GC    +G  F AA G++GLG  + SF  +   +F  KFSYCLVD  +
Sbjct: 199 EGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLS 258

Query: 293 SAKPSS-MVFGDS----AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
               S+ + FG S    A+     +T L+    +++FY V ++GIS+GGA ++ I + ++
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLK-IPSEVW 316

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKT 406
             D  G GG I+DSG+S+T LT PAY  +  A R      ++   D    + CF+ +G  
Sbjct: 317 --DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFE 374

Query: 407 EVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQQGFRVV 464
           E  VP +V HF  GA+   P  +Y+I   + G  C  F      G S++GNI QQ     
Sbjct: 375 ESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSVVGNIMQQNHLWE 433

Query: 465 YDLAASRIGFAPRGC 479
           +DL   ++GFAP  C
Sbjct: 434 FDLGLKKLGFAPSSC 448


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 134/396 (33%), Positives = 199/396 (50%), Gaps = 29/396 (7%)

Query: 96  VKSLTAFAESAVR-VPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
           +++L A + + VR +  R  S   ++   ++ V S L    G Y   + VGTP +    +
Sbjct: 12  IRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAI 71

Query: 155 LDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLY 214
            DTGSD+VW+Q  PC  C   T  +FDP +S +F  + C S LC +L  S     +TC Y
Sbjct: 72  ADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSY 129

Query: 215 QVSYGDGSITVGDFSTETLTFRGT-----RVARVALGCGHDNEGLFVAAAGLLGLGRGRL 269
              YG G  T G+F+ +T++   T     +    A+GCG  N G F    GL+GLG+G +
Sbjct: 130 SYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSG-FDGVDGLVGLGQGPV 187

Query: 270 SFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTF--YYV 327
           S  +Q     + KFSYCLVD ++ ++ S ++FG SA             P  DT+  YY+
Sbjct: 188 SLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYL 247

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
               ++V G  V G T           G  IIDSGT++T +    Y  +     +   +L
Sbjct: 248 ----LTVNGIAVAGQTM-------GSPGTTIIDSGTTLTYVPSGVYGRVLSRMES-MVTL 295

Query: 388 KRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSG-TFCFAFAG 445
            R    S+  D C+D S     K P + +   GA ++ P++NY + VD SG T C A  G
Sbjct: 296 PRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTVCLAM-G 354

Query: 446 TMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + SGL  SIIGN+ QQG+ ++YD  +S + F    C
Sbjct: 355 SASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 127/299 (42%), Positives = 165/299 (55%), Gaps = 21/299 (7%)

Query: 201 LDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGT---------RVARVALGCGHD 250
           L ++ C   N TC Y   YGD S T GDF+ ET T   T         RV  V  GCGH 
Sbjct: 62  LVTNPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHW 121

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFG---DSAV 306
           N GLF  AAGLLGLGRG LSF +Q    +   FSYCLVDR++ A  SS ++FG   D   
Sbjct: 122 NRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLS 181

Query: 307 SRTARFTPLLA---NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
                FT L+A   NP +DTFYYV++  I VGG  V  I    +++   G+GG IIDSGT
Sbjct: 182 HPELNFTTLVAGKENP-VDTFYYVQIKSIVVGG-EVVNIPEEKWQIATDGSGGTIIDSGT 239

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADV 422
           +++    PAY  +++AF A         DF + + C++++G  +  +P   + F  GA  
Sbjct: 240 TLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVW 299

Query: 423 SLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           + P  NY I ++     C A  GT  S LSIIGN QQQ F ++YD   SR+GFAP  CA
Sbjct: 300 NFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCA 358


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 128/375 (34%), Positives = 189/375 (50%), Gaps = 34/375 (9%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA-PCK--KCYSQT------DPVFDPAK 184
           G G+Y     VGTP +   +V DTGSD+ W+ C   C+   C ++         VF    
Sbjct: 8   GIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANL 67

Query: 185 SRSFATVPCRSPLCR-----KLDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF--- 235
           S SF T+PC + +C+         + C    T C Y   Y DGS  +G F+ ET+T    
Sbjct: 68  SSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELK 127

Query: 236 --RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
             R  ++  V +GC    +G  F AA G++GLG  + SF  +   +F  KFSYCLVD  +
Sbjct: 128 EGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLS 187

Query: 293 SAKPSS-MVFGDS----AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
               S+ + FG S    A+     +T L+    +++FY V ++GIS+GGA ++ I + ++
Sbjct: 188 HKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLK-IPSEVW 245

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-DFSLFDTCFDLSGKT 406
             D  G GG I+DSG+S+T LT PAY  +  A R      ++   D    + CF+ +G  
Sbjct: 246 --DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFE 303

Query: 407 EVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGT-MSGLSIIGNIQQQGFRVV 464
           E  VP +V HF  GA+   P  +Y+I   + G  C  F      G S++GNI QQ     
Sbjct: 304 ESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSVVGNIMQQNHLWE 362

Query: 465 YDLAASRIGFAPRGC 479
           +DL   ++GFAP  C
Sbjct: 363 FDLGLKKLGFAPSSC 377


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 128/344 (37%), Positives = 175/344 (50%), Gaps = 35/344 (10%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS--SGCNR 208
           +++D+GSDV W+QC PC    C+ Q DP+FDPA S ++A VPC S  C +L     GC+ 
Sbjct: 170 VIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSA 229

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEG--LFVAAAGLLGLG 265
              C + ++YGDGS   G +S + LT     V R    GC H + G       AG L LG
Sbjct: 230 NAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 289

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTS-------AKPSSMVFGDSAVSRTARFTPLLAN 318
            G  S   QT  R+ R FSYCL   ++S         P       S VS     TPLL++
Sbjct: 290 GGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS-----TPLLSS 344

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
               TFY V L  I V G  +  +  ++F      +   +IDS T ++RL   AY ALR 
Sbjct: 345 SMAPTFYRVLLRAIIVAGRPL-AVPPAVF------SASSVIDSSTIISRLPPTAYQALRA 397

Query: 379 AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSG 437
           AFR+  +  + AP  S+ DTC+D +G   + +P++ L F  GA V+L A   L+     G
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----G 452

Query: 438 TFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + C AFA T S      IGN+QQ+   VVYD+ A  + F    C
Sbjct: 453 S-CLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 123/284 (43%), Positives = 157/284 (55%), Gaps = 20/284 (7%)

Query: 205 GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEGLFVAAAGLLG 263
           GC+  + CLY V YGDGS T+G F+ +TLT       +    GCG  NEGLF  AAGLLG
Sbjct: 15  GCSGGH-CLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLG 73

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS-AVSRTARFTPLLANPKLD 322
           LGRG+ S P QT  ++   F++C   RS+         G S AVS     TP+L +    
Sbjct: 74  LGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLSTTPMLIDTG-P 132

Query: 323 TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA 382
           TFYYV + GI VGG  +  I  S+F        G I+DSGT +TRL   AY +LR AF A
Sbjct: 133 TFYYVGMTGIRVGGKLLP-IPQSVFA-----AAGTIVDSGTVITRLPPAAYSSLRSAFAA 186

Query: 383 --GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA---DVSLPATNYLIPVDSSG 437
              A   KRAP  SL DTC+DL+G +EV +PTV L F+G    DV      Y   V  + 
Sbjct: 187 SMAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAASVSQA- 245

Query: 438 TFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             C  FAG  +   ++I+GN Q + F VVYD+A+  +GF P  C
Sbjct: 246 --CLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 170/343 (49%), Gaps = 18/343 (5%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  R  +GTPP+ + + +DT +D  WI C  C  C S    +F P KS +F  V C 
Sbjct: 90  SPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPEKSTTFKNVSCA 146

Query: 195 SPLCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE 252
           +P C+++ + GC  + RN   + ++YG  SI   +   +T+T     V     GC     
Sbjct: 147 APECKQVPNPGCGVSSRN---FNLTYGSSSI-AANLVQDTITLATDPVPSYTFGCVSKTT 202

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G      GLLGLGRG LS  +QT   +   FSYCL    +     S+  G  A  +  ++
Sbjct: 203 GTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKY 262

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           TPLL NP+  + YYV L  I V G  V  I  +    +P    G I DSGT  TRL  P 
Sbjct: 263 TPLLKNPRRSSLYYVNLEAIRV-GRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPV 321

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIP 432
           Y+A+RD FR              FDTC+++     + VPT+   F G +V+LP  N LI 
Sbjct: 322 YVAVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIFTGMNVTLPQDNILIH 377

Query: 433 VDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASR 471
             +  T C A AG      S L++I N+QQQ  RV+YD+  SR
Sbjct: 378 STAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 420


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 132/356 (37%), Positives = 183/356 (51%), Gaps = 30/356 (8%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  G G Y     +GTPP+ +  + DTGSD++W +C  C +C  Q  P + P KS SF+ 
Sbjct: 75  LDSGGGAYDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSK 134

Query: 191 VPCRSPLCRKLDSSGCNRRNT-CLYQVSYGDGS----ITVGDFSTETLTFRGTRVARVAL 245
           +PC   LC  L SS C+     C Y+ SYG  S     T G   +ET T     V  +  
Sbjct: 135 LPCSGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGF 194

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           GC   +EG + + +GL+GLGRG LS  +Q        FSYCL   S +AK S ++FG  A
Sbjct: 195 GCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNV---GAFSYCLT--SDAAKTSPLLFGSGA 249

Query: 306 VSRTA-RFTPLLANPKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           ++    + TPLL   +  T+YY V L  IS+G A   G           G+ G+I DSGT
Sbjct: 250 LTGAGVQSTPLL---RTSTYYYTVNLESISIGAATTAGT----------GSSGIIFDSGT 296

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 423
           +V  L  PAY   ++A  +  ++L  A     ++ CF  SG      P++VLHF G D+ 
Sbjct: 297 TVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQTSGAV---FPSMVLHFDGGDMD 353

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           LP  NY   VD S + C+    + S LSI+GNI Q  + + YD+  S + F P  C
Sbjct: 354 LPTENYFGAVDDSVS-CWIVQKSPS-LSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 123/356 (34%), Positives = 176/356 (49%), Gaps = 25/356 (7%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
            Y  R  +GTP + + + +D  +D  W+ CA          P FDP +S ++  V C +P
Sbjct: 106 SYVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPVRCGAP 163

Query: 197 LCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGT--RVARVALGCGHDNE 252
            C +  +  C     ++C + +SY   S        + L        VA    GC H   
Sbjct: 164 QCSQAPAPSCPGGLGSSCAFNLSYA-ASTFQALLGQDALALHDDVDAVAAYTFGCLHVVT 222

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G  V   GL+G GRG LSFP+QT   +   FSYCL    +S    ++  G +   +  + 
Sbjct: 223 GGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKT 282

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           TPLL+NP   + YYV +VGI VGG  V  + AS    DP    G I+D+GT  TRL+ P 
Sbjct: 283 TPLLSNPHRPSLYYVNMVGIRVGGRPVP-VPASALAFDPTSGRGTIVDAGTMFTRLSAPV 341

Query: 373 YIALRDAFRAGASSLKRAP---DFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATN 428
           Y A+RD FR    S  RAP       FDTC++++    + VPTV   F G   V+LP  N
Sbjct: 342 YAAVRDVFR----SRVRAPVAGPLGGFDTCYNVT----ISVPTVTFSFDGRVSVTLPEEN 393

Query: 429 YLIPVDSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +I   S G  C A A     G  + L+++ ++QQQ  RV++D+A  R+GF+   C
Sbjct: 394 VVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 449


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 133/361 (36%), Positives = 184/361 (50%), Gaps = 36/361 (9%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD--PVFDPAKSRS 187
           G A  + EY   +G+G+P     M++DTGSDV W++C       + TD   +FDP+KS +
Sbjct: 121 GSALDTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRC-------NSTDGLTLFDPSKSTT 173

Query: 188 FATVPCRSPLCRKLDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTFRGT-RVARVAL 245
           +A   C S  C +L ++G    N+ C Y+V YGDGS T G +S++TL    +  V     
Sbjct: 174 YAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASDTVTDFHF 233

Query: 246 GCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD- 303
           GC H  E        GL+GLG    S  +QT   + + FSYCL    T+     + FG  
Sbjct: 234 GCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCL--PPTNRTSGFLTFGAP 291

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           +  S     TP+L  PK  T Y V L  ISVGG  + GI  S+       + G ++DSGT
Sbjct: 292 NGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPL-GIQPSVL------SNGSVMDSGT 344

Query: 364 SVTRLTRPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGAD 421
            +T L R AY AL  AFR+  + L+  RA    + DTC+D +G   V +P V L   G  
Sbjct: 345 VITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGA 404

Query: 422 VSLPATNYLIPVDSSGTF---CFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           V        + +D +G     C AFA T SG SIIGN+QQ+ F V++D+     GF    
Sbjct: 405 V--------VDLDGNGIMIQDCLAFAAT-SGDSIIGNVQQRTFEVLHDVGQGVFGFRSGA 455

Query: 479 C 479
           C
Sbjct: 456 C 456


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 133/354 (37%), Positives = 180/354 (50%), Gaps = 30/354 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCR 194
           EY   +  GTP     +V+DTGSD+ W+QC PC   +C  Q DP+FDP+ S +++ VPC 
Sbjct: 111 EYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCA 170

Query: 195 SPLCRKLDS----SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGH 249
           S  C+KL +    SGC+    C + +SY DG+ TVG +  + LT   G  V     GCGH
Sbjct: 171 SGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLAPGAIVKDFYFGCGH 230

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
               L     GLLGLGR   S   Q        FSYCL   + ++KP  + FG       
Sbjct: 231 SKSSLPGLFDGLLGLGRLSESLGAQ--YGGGGGFSYCL--PAVNSKPGFLAFGAGRNPSG 286

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD---PAGNGGVIIDSGTSVT 366
             FTP+   P   TF  V L GI+VGG           KLD    A +GG+I+DSGT VT
Sbjct: 287 FVFTPMGRVPGQPTFSTVTLAGITVGGK----------KLDLRPSAFSGGMIVDSGTVVT 336

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLP 425
            L    Y ALR AFR    + +        DTC+DL+G   V VP + L F  GA ++L 
Sbjct: 337 VLQSTVYRALRAAFREAMKAYRLV--HGDLDTCYDLTGYKNVVVPKIALTFSGGATINLD 394

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             N ++    +G   FA  G      ++GN+ Q+ F V++D +AS+ GF  + C
Sbjct: 395 VPNGIL---VNGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 124/311 (39%), Positives = 162/311 (52%), Gaps = 27/311 (8%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   L +GTPP+ V + LDTGSD++W QC PC  C+ Q  P FDP+ S + +   C S 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 197 LCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVARVALGCGH 249
           LC+ L  + C         TC+Y  SYGD S+T G    +  TF   G  V  VA GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 250 DNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-VS 307
            N G+F +   G+ G GRG LS P+Q        FS+C    +   KPS+++    A + 
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVN-GLKPSTVLLDLPADLY 256

Query: 308 RTAR----FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           ++ R     TPL+ NP   TFYY+ L GI+VG   +  +  S F L   G GG IIDSGT
Sbjct: 257 KSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLP-VPESEFALK-NGTGGTIIDSGT 314

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFS--LFDTCFDLSGKTEVK--VPTVVLHFRG 419
           ++T L    Y  +RDAF A      + P  S    D  F LS     K  VP +VLHF G
Sbjct: 315 AMTSLPTRVYRLVRDAFAAQV----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 420 ADVSLPATNYL 430
           A + LP  NY+
Sbjct: 371 ATMDLPRENYV 381


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 115/344 (33%), Positives = 171/344 (49%), Gaps = 20/344 (5%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS 203
           +GTPP     + DTGSD+ W QC PC KCY Q  P+F+P KS SF+ VPC +  C  +D 
Sbjct: 86  IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD 145

Query: 204 SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLG 263
             C  +  C Y  +YGD + + GD   E +T   + V  V +GCGH + G F  A+G++G
Sbjct: 146 GHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSVKSV-IGCGHASSGGFGFASGVIG 204

Query: 264 LGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF--TPLLANP 319
           LG G+LS  +Q  +    +R+FSYCL    + A    + FG +AV        TPL++  
Sbjct: 205 LGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN-GKINFGQNAVVSGPGVVSTPLISKN 263

Query: 320 KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
            + T+YY+ L  IS+G             +  A  G VIIDSGT+++ L +  Y  +  +
Sbjct: 264 TV-TYYYITLEAISIGNER---------HMAFAKQGNVIIDSGTTLSFLPKELYDGVVSS 313

Query: 380 FRAGASSLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHFR-GADVS-LPATNYLIPVDS 435
                 + +     + +D CFD  ++  T   +P +   F  GA+V+ LP   +    ++
Sbjct: 314 LLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANN 373

Query: 436 SGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                   A       IIGN+    F + YDL A R+ F P  C
Sbjct: 374 VNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 135/398 (33%), Positives = 199/398 (50%), Gaps = 45/398 (11%)

Query: 91  RDVLRVKSLTA--FAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           +D  RV+S+ A  F + + +          +  G+S   +  L +  G +   +G GTP 
Sbjct: 90  QDRSRVRSINAKIFGQYSTQ---------ESKDGWSPESMDTLNE-DGLFLVNVGFGTPQ 139

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           +   +++DTGSD  WIQC  C          F+P+ S S++   C       + S+  N 
Sbjct: 140 QKFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSC-------IPSTDTN- 191

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRG- 267
                Y + Y D S + G F  + +T +     +   GCG    G F  A+G+LGL +G 
Sbjct: 192 -----YTMKYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGEFGTASGVLGLAKGE 246

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--RFTPLLANPKLDTFY 325
           + S  +QT  +F +KFSYC   +  +    S++FG+ A+S +   +FT LL NP     Y
Sbjct: 247 QYSLISQTASKFKKKFSYCFPPKEHTL--GSLLFGEKAISASPSLKFTQLL-NPPSGLGY 303

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA--- 382
           +VEL+GISV    +  +++SLF      + G IIDSGT +TRL   AY ALR AF+    
Sbjct: 304 FVELIGISVAKKRLN-VSSSLF-----ASPGTIIDSGTVITRLPTAAYEALRTAFQQEML 357

Query: 383 GASSLKRAPDFSLFDTCFDLS--GKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTF 439
              S+   P   L DTC++L   G   +K+P +VLHF G  DVSL  +  L         
Sbjct: 358 HCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQA 417

Query: 440 CFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           C AFA     S ++IIGN QQ   +VVYD+   R+GF 
Sbjct: 418 CLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGFG 455


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 130/368 (35%), Positives = 194/368 (52%), Gaps = 39/368 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-------CYSQTDPVFDPAKSRSFATVPCR 194
           +G+GTPP+   +++DTGSD++W QC+   +          Q +P+++P +S SFA +PC 
Sbjct: 88  VGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCS 147

Query: 195 SPLCR--KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL----GCG 248
             LC+  +     C R N C+Y   YG      G  ++ET TF G   A+V+L    GCG
Sbjct: 148 DRLCQEGQFSYKNCARNNRCMYDELYGSAEAG-GVLASETFTF-GVN-AKVSLPLGFGCG 204

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSR 308
             + G  V A+GL+GL  G +S  +Q       +FSYCL   +   K S ++FG  A  R
Sbjct: 205 ALSAGDLVGASGLMGLSPGIMSLVSQLSVP---RFSYCLTPFA-ERKTSPLLFGAMADLR 260

Query: 309 ------TARFTPLLANPKLDT-FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
                 T + T +L NP ++T +YYV LVG+S+G   +     SL  + P G+GG I+DS
Sbjct: 261 RYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDS 320

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRA----PDFSLFDTCFDLS---GKTEVKVPTVV 414
           G++++ L   A+ A++ A    A  L  A     D+  ++ CF L        VK P +V
Sbjct: 321 GSTMSYLEETAFRAVKKAV-VEAVRLPVANGTDEDYDDYELCFALPTGVAMEAVKTPPLV 379

Query: 415 LHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASR 471
           LHF  GA ++LP  NY      +G  C A   +    G+SIIGN+QQQ   V++D+   +
Sbjct: 380 LHFDGGAAMTLPRDNYF-QEPRAGLMCLAVGTSPDGFGVSIIGNVQQQNMHVLFDVRNQK 438

Query: 472 IGFAPRGC 479
             FAP  C
Sbjct: 439 FSFAPTKC 446


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 133/397 (33%), Positives = 193/397 (48%), Gaps = 32/397 (8%)

Query: 101 AFAESAVRVPPRNRSRGRANGGFSSSVISG-----LAQG-----SGEYFTRLGVGTPPRY 150
           ++AES +++  ++++R +    F +S+++G     +A G     S  Y  R  +GTPP+ 
Sbjct: 54  SWAESVLQLQAKDQARLQ----FLASMVAGRSIVPIASGRQIIQSPTYIVRAKIGTPPQT 109

Query: 151 VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN 210
           + + +DT +D  WI C  C  C   T  +F P KS +F  V C SP C K+ S  C   +
Sbjct: 110 LLLAIDTSNDAAWIPCTACDGC---TSTLFAPEKSTTFKNVSCGSPECNKVPSPSCGT-S 165

Query: 211 TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
            C + ++YG  SI   +   +T+T     +     GC     G      GLLGLGRG LS
Sbjct: 166 ACTFNLTYGSSSI-AANVVQDTVTLATDPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPLS 224

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELV 330
             +QT   +   FSYCL    +     S+  G  A     ++TPLL NP+  + YYV L 
Sbjct: 225 LLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLF 284

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA 390
            I VG   +  I  +    + A   G + DSGT  TRL  P Y A+RD FR   +   +A
Sbjct: 285 AIRVG-RKIVDIPPAALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKA 343

Query: 391 ----PDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFA-- 444
                    FDTC+ +     +  PT+   F G +V+LP  N LI   +  T C A A  
Sbjct: 344 NLTVTSLGGFDTCYTV----PIVAPTITFMFSGMNVTLPQDNILIHSTAGSTSCLAMASA 399

Query: 445 --GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                S L++I N+QQQ  RV+YD+  SR+G A   C
Sbjct: 400 PDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELC 436


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 132/396 (33%), Positives = 196/396 (49%), Gaps = 29/396 (7%)

Query: 96  VKSLTAFAESAVR-VPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
           ++ L A + + VR +  R  S   ++   ++ V S L    G Y   + VGTP +    +
Sbjct: 12  IRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAI 71

Query: 155 LDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLY 214
            DTGSD+VW+Q  PC  C   T  +FDP +S +F  + C S LC +L  S     + C Y
Sbjct: 72  ADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSY 129

Query: 215 QVSYGDGSITVGDFSTETLTFRGT-----RVARVALGCGHDNEGLFVAAAGLLGLGRGRL 269
              YG G  T G+F+ +T++   T     +    A+GCG  N G F    GL+GLG+G +
Sbjct: 130 SYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSG-FDGVDGLVGLGQGPV 187

Query: 270 SFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTF--YYV 327
           S  +Q     + KFSYCLVD ++ ++ S ++FG SA             P  DT+  YY+
Sbjct: 188 SLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYL 247

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
               ++V G  V G T           G  IIDSGT++T +    Y  +     +   +L
Sbjct: 248 ----LTVNGIAVAGQTM-------GSPGTTIIDSGTTLTYVPSGVYGRVLSRMES-MVTL 295

Query: 388 KRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSG-TFCFAFAG 445
            R    S+  D C+D S     K P + +   GA ++ P++NY + VD SG T C A  G
Sbjct: 296 PRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTVCLAM-G 354

Query: 446 TMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +  GL  SIIGN+ QQG+ ++YD  +S + F    C
Sbjct: 355 SAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 127/382 (33%), Positives = 181/382 (47%), Gaps = 28/382 (7%)

Query: 118 RANGGFSSSVISGLAQGS-----GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
           R +   SS+ I  + Q       G+Y   L +GTPP  +   +DTGSD++W+QC PC  C
Sbjct: 39  RKSSHLSSNNIQDIVQAPINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGC 98

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
           Y+Q +P+FDP KS ++  + C SPLC K     C+    C Y   Y D S+T G  + ET
Sbjct: 99  YNQINPMFDPLKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQET 158

Query: 233 LTFRGTRVARVAL-----GCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRF-NRKFSY 285
           +T        ++L     GCGH+N G F     GL+GLG G  S  +Q G  F  +KFS 
Sbjct: 159 VTLTSNTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQ 218

Query: 286 CLVDRSTSAKPSS-MVFGD--SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGI 342
           CLV   T    SS M FG     +      TPL+   +  T YYV L+GISV   ++  +
Sbjct: 219 CLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLP-M 277

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF-DTCFD 401
            +++ K      G +++DSGT    L +  Y  +    +          D SL    C+ 
Sbjct: 278 NSTIEK------GNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCY- 330

Query: 402 LSGKTEVKVPTVVLHFRGADVSLPATNYLIP--VDSSGTFCFAFAGTM-SGLSIIGNIQQ 458
              +T +K PT+  HF GA++ L      IP   ++ G FC A      S   I GN  Q
Sbjct: 331 -RTQTNLKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQ 389

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
             + + +DL    + F P  C 
Sbjct: 390 TNYLIGFDLDRQIVSFKPTDCT 411


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 132/397 (33%), Positives = 194/397 (48%), Gaps = 32/397 (8%)

Query: 101 AFAESAVRVPPRNRSRGRANGGFSSSVISG-----LAQG-----SGEYFTRLGVGTPPRY 150
           ++AES +++  ++++R +    F +S+++G     +A G     S  Y  R  +G+PP+ 
Sbjct: 55  SWAESVLQLQAKDQARLQ----FLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQT 110

Query: 151 VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN 210
           + + +DT +D  WI C  C  C   T  +F P KS +F  V C SP C ++ +  C   +
Sbjct: 111 LLLAMDTSNDAAWIPCTACDGC---TSTLFAPEKSTTFKNVSCGSPQCNQVPNPSCGT-S 166

Query: 211 TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
            C + ++YG  SI   +   +T+T     +     GC     G      GLLGLGRG LS
Sbjct: 167 ACTFNLTYGSSSI-AANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLS 225

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELV 330
             +QT   +   FSYCL    +     S+  G  A     ++TPLL NP+  + YYV LV
Sbjct: 226 LLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLV 285

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA 390
            I VG   V  I       + A   G + DSGT  TRL  PAY A+RD F+   +   +A
Sbjct: 286 AIRVG-RKVVDIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKA 344

Query: 391 ----PDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFA-- 444
                    FDTC+ +     +  PT+   F G +V+LP  N LI   +  T C A A  
Sbjct: 345 NLTVTSLGGFDTCYTV----PIVAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMASA 400

Query: 445 --GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                S L++I N+QQQ  RV+YD+  SR+G A   C
Sbjct: 401 PDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELC 437


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 128/368 (34%), Positives = 184/368 (50%), Gaps = 37/368 (10%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDP 182
           S V++  A G G      GV        +VLD+ SDV W+QC PC    C+ Q D  +DP
Sbjct: 138 SGVVNASAAGGGSRSKLPGV-----IQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDP 192

Query: 183 AKSRSFATVPCRSPLCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTR 239
           ++S S A   C SP C  L   ++GC   N C Y V Y DGS T G +  + LT   G  
Sbjct: 193 SRSPSSAPFSCSSPTCTALGPYANGC-ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA 251

Query: 240 VARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
           V+    GC H  +G F A AAG++ LG G  S  +QT  R+   FSYC+   ++ +   +
Sbjct: 252 VSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFT 311

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           +     A SR    TP++   +  TFY V L  I+VGG  + G+  ++F        G +
Sbjct: 312 LGVPRRASSRYV-VTPMVRFRQAATFYGVLLRTITVGGQRL-GVAPAVFA------AGSV 363

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           +DS T++TRL   AY ALR AFR+  +  + AP     DTC+D +G   +++P + L F 
Sbjct: 364 LDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD 423

Query: 419 GADVSLPATNYLIPVDSSGTF---CFAFAGT----MSGLSIIGNIQQQGFRVVYDLAASR 471
                    N ++P+D SG     C AF       M G  ++G++QQQ   V+YD+    
Sbjct: 424 --------RNAVLPLDPSGILFNDCLAFTSNADDRMPG--VLGSVQQQTIEVLYDVGGGA 473

Query: 472 IGFAPRGC 479
           +GF    C
Sbjct: 474 VGFRQGAC 481


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 125/347 (36%), Positives = 179/347 (51%), Gaps = 40/347 (11%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS--SGCNR 208
           +++D+GSDV W+QC PC    C+ Q DP+FDPA S ++A VPC S  C +L     GC  
Sbjct: 83  VIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCLA 142

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-GCGHDNEG--LFVAAAGLLGLG 265
            + C + ++Y +G+   G +S++ LT     V R  L GC H ++G       AG L LG
Sbjct: 143 NSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQGSTFSYDVAGTLALG 202

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG---------DSAVSRTARFTPLL 316
            G  SF  QT  +++R FSYC+        PS+  FG          +A+  T   TPLL
Sbjct: 203 GGSQSFVQQTASQYSRVFSYCV-------PPSTSSFGFIMFGVPPQRAALVPTFVSTPLL 255

Query: 317 ANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIA 375
           ++  +  TFY V L  I V G  +  +  ++F      +   +IDS T ++R+   AY A
Sbjct: 256 SSSTMSPTFYRVLLRSIIVAGRPLP-VPPTVF------SASSVIDSATVISRIPPTAYQA 308

Query: 376 LRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVD 434
           LR AFR+  +  + AP  S+ DTC+D SG   + +P++ L F  GA V+L A   L+   
Sbjct: 309 LRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL--- 365

Query: 435 SSGTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                C AFA T S      IGN+QQ+   VVYD+    I F    C
Sbjct: 366 ---QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 126/346 (36%), Positives = 174/346 (50%), Gaps = 41/346 (11%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           MV+DT SDV W+QCAPC    C++QTD ++DP+KS S A  PC SP CR L   ++GC  
Sbjct: 158 MVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTP 217

Query: 209 R-NTCLYQVSYGDGSITVGDFSTETLTFRGTR----VARVALGCGHD--NEGLFV-AAAG 260
             + C Y+V Y DGS + G + ++ LT    +    ++    GC H     G F    +G
Sbjct: 218 AGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFSNKTSG 277

Query: 261 LLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK 320
           ++ LGRG  S PTQT   +   FSYCL      +    +     A SR A  TP+L +  
Sbjct: 278 IMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYA-VTPMLRSKA 336

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
               Y V L+ I V G  +  +  ++F        G ++DS T VTRL   AY+ALR AF
Sbjct: 337 APMLYLVRLIAIEVAGKRLP-VPPAVFA------AGAVMDSRTIVTRLPPTAYMALRAAF 389

Query: 381 RAGASSLKRAPDFSLFDTCFDLS-----GKTEVKVPTVVLHFRGADVSLPATNYLIPVDS 435
            A   + + A      DTC+D S     G   VK+P + L F G        N  + +D 
Sbjct: 390 VAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDG-------PNGAVELDP 442

Query: 436 SGTF---CFAFA----GTMSGLSIIGNIQQQGFRVVYDLAASRIGF 474
           SG     C AFA      M+G  IIGN+QQQ   V+Y++  + +GF
Sbjct: 443 SGVLLDGCLAFAPNTDDQMTG--IIGNVQQQALEVLYNVDGATVGF 486


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 102/277 (36%), Positives = 151/277 (54%), Gaps = 17/277 (6%)

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           C Y ++YGDGS T G+   E L F    V     GCG +N+GLF   +GL+GLGR  LS 
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSL 192

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA---RFTPLLANPKLDTFYYVE 328
            +QT   F   FSYCL         S ++ G+S+V R +    +  ++ NP+L  FY++ 
Sbjct: 193 ISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFIN 252

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           L GIS+GG  ++  +         G   +++DSGT +TRL    Y AL+  F    +   
Sbjct: 253 LTGISIGGVALQAPS--------VGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 304

Query: 389 RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATN--YLIPVDSSGTFCFAFAG 445
            AP FS+ DTCF+LS   EV +PT+ +HF G A++++  T   Y +  D+S   C A A 
Sbjct: 305 PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDAS-QVCLALAS 363

Query: 446 T--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                 ++I+GN QQ+  RV+YD   +++GFA   C+
Sbjct: 364 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 122/380 (32%), Positives = 187/380 (49%), Gaps = 42/380 (11%)

Query: 114 RSRGRANGGFSSSVISG----LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
           RS  R N  +  S+ S     +    GEY     +GTPP  V+  +DTGSD+VW+QC PC
Sbjct: 60  RSINRVNHFYKYSLTSTPQSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPC 119

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFS 229
           K+CY Q  P+FDP+ S S+  +PC S  C  + ++ C+ R           G ++V   +
Sbjct: 120 KQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSCDVR-----------GYLSVETLT 168

Query: 230 TETLTFRGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCL- 287
            ++ T       +  +GCG+ N G F   ++G++GLG G +S P+Q G     KFSYCL 
Sbjct: 169 LDSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLG 228

Query: 288 --VDRSTSA---KPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR-- 340
             +  STS      +++V+GD A++     TP++      + YY+ L   SVG   +   
Sbjct: 229 PWLPNSTSKLNFGDAAIVYGDGAMT-----TPIVKK-DAQSGYYLTLEAFSVGNKLIEFG 282

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS-LFDTC 399
           G T           G ++IDSGT+ T L    Y     A  A   +L+   D +  F  C
Sbjct: 283 GPTYG------GNEGNILIDSGTTFTFLPYDVYYRFESAV-AEYINLEHVEDPNGTFKLC 335

Query: 400 FDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
           ++++     + P +  HF+GAD+ L   +  I V S G  C AF  + +  +I GN+ QQ
Sbjct: 336 YNVAYH-GFEAPLITAHFKGADIKLYYISTFIKV-SDGIACLAFIPSQT--AIFGNVAQQ 391

Query: 460 GFRVVYDLAASRIGFAPRGC 479
              V Y+L  + + F P  C
Sbjct: 392 NLLVGYNLVQNTVTFKPVDC 411


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 134/392 (34%), Positives = 191/392 (48%), Gaps = 38/392 (9%)

Query: 107 VRVPPRNRSR--------GRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTG 158
            R   R+R R        G A+ G + S +  +  G G Y     +GTPP+ +  + DTG
Sbjct: 43  TRAAHRSRERLSILATRLGAASAGSAQSPLQ-MDSGGGAYDMTFSMGTPPQTLSALADTG 101

Query: 159 SDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG---CN----RRNT 211
           SD++W +C  CK+C  +    + P KS SF+ +PC S LCR L+S     C     R   
Sbjct: 102 SDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCRTLESQSLATCGGTRARGAV 161

Query: 212 CLYQVSYGDGS----ITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
           C Y+ SYG  S     T G   +ET T     V  +  GC   +EG + + +GL+GLGRG
Sbjct: 162 CSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFGCTTMSEGGYGSGSGLVGLGRG 221

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYV 327
           +LS   Q        FSYCL    +++ P  ++FG  A++     +  L N K  TFY V
Sbjct: 222 KLSLVRQLKV---GAFSYCLTSDPSTSSP--LLFGAGALTGPGVQSTPLVNLKTSTFYTV 276

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
            L  IS+G A   G           G  G+I DSGT++T L  PAY        +  ++L
Sbjct: 277 NLDSISIGAAKTPG----------TGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNL 326

Query: 388 KRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTM 447
            R P    ++ CF  SG      P++VLHF G D++L   NY   V+ S + C+    + 
Sbjct: 327 TRVPGTDGYEVCFQTSGG--AVFPSMVLHFDGGDMALKTENYFGAVNDSVS-CWLVQKSP 383

Query: 448 SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           S +SI+GNI Q  + + YDL  S + F P  C
Sbjct: 384 SEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 123/350 (35%), Positives = 174/350 (49%), Gaps = 13/350 (3%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  +  R  +GTP + + + LDT +D  WI C+ C  C S T  VF   KS SF  +PC+
Sbjct: 23  SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSDKSSSFRPLPCQ 80

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           SP C ++ +  C+  + C + ++YG  ++   D   + LT     V     GC     G 
Sbjct: 81  SPQCNQVPNPSCS-GSACGFNLTYGSSTV-AADLVQDNLTLATDSVPSYTFGCIRKATGS 138

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GLLGLGRG LS   Q+   +   FSYCL    +     S+  G  A     ++TP
Sbjct: 139 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTP 198

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV L+ I V G  +  I  S    + A   G +IDSGT+ TRL  PAY 
Sbjct: 199 LLRNPRRSSLYYVNLISIRV-GRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYT 257

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           A+RD FR              FDTC+ +     +  PT+   F G +V+LP  N+LI   
Sbjct: 258 AVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMFAGMNVTLPPDNFLIHST 313

Query: 435 SSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           S  T C A A       S L++I ++QQQ  R+++D+  SR+G A   C+
Sbjct: 314 SGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCS 363


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 123/350 (35%), Positives = 174/350 (49%), Gaps = 13/350 (3%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  +  R  +GTP + + + LDT +D  WI C+ C  C S T  VF   KS SF  +PC+
Sbjct: 100 SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSDKSSSFRPLPCQ 157

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           SP C ++ +  C+  + C + ++YG  ++   D   + LT     V     GC     G 
Sbjct: 158 SPQCNQVPNPSCS-GSACGFNLTYGSSTV-AADLVQDNLTLATDSVPSYTFGCIRKATGS 215

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GLLGLGRG LS   Q+   +   FSYCL    +     S+  G  A     ++TP
Sbjct: 216 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTP 275

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV L+ I VG   V  I  S    + A   G +IDSGT+ TRL  PAY 
Sbjct: 276 LLRNPRRSSLYYVNLISIRVGRKIV-DIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYT 334

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           A+RD FR              FDTC+ +     +  PT+   F G +V+LP  N+LI   
Sbjct: 335 AVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMFAGMNVTLPPDNFLIHST 390

Query: 435 SSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +  T C A A       S L++I ++QQQ  R+++D+  SR+G A   C+
Sbjct: 391 AGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCS 440


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 124/355 (34%), Positives = 183/355 (51%), Gaps = 40/355 (11%)

Query: 153 MVLDTGSDVVWIQC----APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR--KLDSSGC 206
           +++DTGSD++W QC    +        + PV+DP +S +FA +PC   LC+  +     C
Sbjct: 28  LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNC 87

Query: 207 NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA--RVALGCGHDNEGLFVAAAGLLGL 264
             +N C+Y+  YG  +  VG  ++ET TF   R    R+  GCG  + G  + A G+LGL
Sbjct: 88  TSKNRCVYEDVYGSAA-AVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGL 146

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA------VSRTARFTPLLAN 318
               LS  TQ   +   +FSYCL   +   K S ++FG  A       +R  + T +++N
Sbjct: 147 SPESLSLITQLKIQ---RFSYCLTPFA-DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSN 202

Query: 319 PKLDTFYYVELVGISVGGAHVR-GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
           P    +YYV LVGIS+G  H R  + A+   + P G GG I+DSG++V  L   A+ A++
Sbjct: 203 PVETVYYYVPLVGISLG--HKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVK 260

Query: 378 DAFRAGASSLKRAP----DFSLFDTCFDLSGKTE------VKVPTVVLHFRG-ADVSLPA 426
           +A       + R P        ++ CF L  +T       V+VP +VLHF G A + LP 
Sbjct: 261 EAVM----DVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPR 316

Query: 427 TNYLIPVDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            NY      +G  C A   T   SG+SIIGN+QQQ   V++D+   +  FAP  C
Sbjct: 317 DNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 124/349 (35%), Positives = 169/349 (48%), Gaps = 13/349 (3%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +   GTPP+ + + LDT SD  WI C+ C  C   T   F P KS SF  V C 
Sbjct: 94  SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC--STSKPFAPIKSTSFRNVSCG 151

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           SP C+++ +  C   + C +  +YG  SI       +TLT     +     GC +   G 
Sbjct: 152 SPHCKQVPNPTCG-GSACAFNFTYGSSSI-AASVVQDTLTLAADPIPGYTFGCVNKTTGS 209

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
                GLLGLGRG LS  +Q+   +   FSYCL    +     S+  G     +  ++TP
Sbjct: 210 SAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPVYQPKRIKYTP 269

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV LV I V G  +  I  +    +P    G I DSGT  TRL  P Y 
Sbjct: 270 LLRNPRRSSLYYVNLVAIKV-GRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYT 328

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           A+R+ FR              FDTC+++     + VPT+   F G +V+LP  N +I   
Sbjct: 329 AVRNEFRRRVGPKLPVTTLGGFDTCYNV----PIVVPTITFLFSGMNVALPPDNIVIHST 384

Query: 435 SSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +  T C A AG      S L++I N+QQQ  RV++D+  SRIG A   C
Sbjct: 385 AGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 137/403 (33%), Positives = 194/403 (48%), Gaps = 35/403 (8%)

Query: 95  RVKSLT-AFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYM 153
           R + LT AF  SA RV      R R +   S  + S L   +GEY   L +GTPP  V  
Sbjct: 53  RTERLTDAFHRSASRV-----GRFRQSAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIA 107

Query: 154 VLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL-DSSGCNRRNTC 212
           ++DTGSD+ W QC PC  CY Q  P FDP  S ++    C +  C  L +   C     C
Sbjct: 108 IVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKC 167

Query: 213 LYQVSYGDGSITVGDFSTETLTFRGTRVARV-----ALGCGHDNEGLF-VAAAGLLGLGR 266
            +  SY DGS T G+ + ETLT   T    V     A GC H + G+F   ++G++GLG 
Sbjct: 168 TFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGV 227

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV-FGDSAVSRTARF--TPLLANPKLDT 323
             LS  +Q     N +FSYCL+   T +  SS + FG S +   A    TPL+     DT
Sbjct: 228 AELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGP-DT 286

Query: 324 FYY-VELVGISVGGAHV--RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
           +YY + L G SVG   +  +G +    K      G +I+DSGT+ T L    Y+ L ++ 
Sbjct: 287 YYYLITLEGFSVGKKRLSYKGFS----KKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESV 342

Query: 381 RAGASSLK----RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSS 436
              A S+K    R P+  +   C++ +   ++  P +  HF+ A+V L   N  + +   
Sbjct: 343 ---AHSIKGKRVRDPN-GISSLCYNTT-VDQIDAPIITAHFKDANVELQPWNTFLRMQED 397

Query: 437 GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              CF    T S + I+GN+ Q  F V +DL   R+ F    C
Sbjct: 398 -LVCFTVLPT-SDIGILGNLAQVNFLVGFDLRKKRVSFKAADC 438


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 131/381 (34%), Positives = 185/381 (48%), Gaps = 38/381 (9%)

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP- 178
           +GG  S +I+     S EY   + VGTPP  +  + DTGSD+VW+ C+      +  D  
Sbjct: 89  DGGVESKIITR----SFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAG 144

Query: 179 ---VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF 235
              VF P +S +++ + C+S  C+ L  + C+  + C YQ SYGDGS T+G  STET +F
Sbjct: 145 GNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSF 204

Query: 236 -----RG-TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTG--RRFNRKFSYCL 287
                +G  RV RV  GC   + G F  + GL+GLG G  S  +Q G     +RK SYCL
Sbjct: 205 VDGGGKGQVRVPRVNFGCSTASAGTF-RSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCL 263

Query: 288 VDRSTSAKPSSMVFGDSAVSRT--ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           +    +   S++ FG  AV     A  TPL+ +  +D++Y V L  ++VGG  V      
Sbjct: 264 IPSYDANSSSTLNFGSRAVVSEPGAASTPLVPS-DVDSYYTVALESVAVGGQEV------ 316

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
                   +  +I+DSGT++T L       L           +  P   L   C+D+ GK
Sbjct: 317 -----ATHDSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGK 371

Query: 406 TEVK---VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQ 459
           +E     +P V L F  GA V+L   N    +   GT C           +SI+GNI QQ
Sbjct: 372 SETDNFGIPDVTLRFGGGAAVTLRPENTF-SLLQEGTLCLVLVPVSESQPVSILGNIAQQ 430

Query: 460 GFRVVYDLAASRIGFAPRGCA 480
            F V YDL A  + FA   CA
Sbjct: 431 NFHVGYDLDARTVTFAAADCA 451


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 138/373 (36%), Positives = 195/373 (52%), Gaps = 39/373 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCR 194
           GEY   L +GTPP+    + DTGSD+VW QCAPC ++C+ Q  P+++P+ S +F  +PC 
Sbjct: 90  GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 149

Query: 195 SP--LC---RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVA 244
           S   LC    +L  +       C Y  +YG G  T G   +ET TF  +     RV  +A
Sbjct: 150 SALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIA 208

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GC + +   +  +AGL+GLGRG LS  +Q        FSYCL     +   S+++ G +
Sbjct: 209 FGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGM---FSYCLTPFQDTKSKSTLLLGPA 265

Query: 305 AVS--------RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
           A +        R+  F P  + P + T+YY+ L GISVG A +  I    F L   G GG
Sbjct: 266 AAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALP-IPPGAFALRADGTGG 324

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-----DFSLFDTCFDL--SGKTEVK 409
           +IIDSGT++T L   AY  +R A R    SL + P     + +  D CF L  S      
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAAVR----SLVKLPVTDGSNATGLDLCFALPSSSAPPAT 380

Query: 410 VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDL 467
           +P++ LHF  GAD+ LP  NY+I +D  G +C A      G LS +GN QQQ   ++YD+
Sbjct: 381 LPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDV 438

Query: 468 AASRIGFAPRGCA 480
               + FAP  C+
Sbjct: 439 QKETLSFAPAKCS 451


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 125/339 (36%), Positives = 164/339 (48%), Gaps = 42/339 (12%)

Query: 116 RGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQ 175
           R R   G  ++   G+A  + EY   L VGTPPR V + LDTGSD+VW QCAPC+ C+ Q
Sbjct: 67  RARVRAGLVAAA-GGIA--TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQ 123

Query: 176 TDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF 235
             P+ DPA S ++A +PC +P CR L  + C  R+ C+Y   YGD S+TVG  +T+  TF
Sbjct: 124 GIPLLDPAASSTYAALPCGAPRCRALPFTSCGGRS-CVYVYHYGDKSVTVGKIATDRFTF 182

Query: 236 --RGTR--------VARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFS 284
              G R          R+  GCGH N+G+F +   G+ G GRGR S P+Q        FS
Sbjct: 183 GDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLN---ATSFS 239

Query: 285 YCLVDRSTSAKPSSMVFGDS-------AVSRTARFTPLLANPKLDTFYYVELVGISVGGA 337
           YC      S K S +  G +       A S   R TPL  NP   + Y++ L GISVG  
Sbjct: 240 YCFTSMFDS-KSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKT 298

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD 397
            +  +  + F+         IIDSG S+T L    Y A++  F A         + S  D
Sbjct: 299 RLP-VPETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALD 350

Query: 398 TCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSS 436
            CF L        P   L  R A  SL    +  P  SS
Sbjct: 351 VCFAL--------PVSALWRRPAVPSLTRCTWRAPTGSS 381


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 124/349 (35%), Positives = 169/349 (48%), Gaps = 13/349 (3%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +   GTPP+ + + LDT SD  WI C+ C  C   T   F P KS SF  V C 
Sbjct: 94  SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC--STSKPFAPIKSTSFRNVSCG 151

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           SP C+++ +  C   + C +  +YG  SI       +TLT     +     GC +   G 
Sbjct: 152 SPHCKQVPNPTCG-GSACAFNFTYGSSSI-AASVVQDTLTLATDPIPGYTFGCVNKTTGS 209

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
                GLLGLGRG LS  +Q+   +   FSYCL    +     S+  G     +  ++TP
Sbjct: 210 SAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPVYQPKRIKYTP 269

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV LV I V G  +  I  +    +P    G I DSGT  TRL  P Y 
Sbjct: 270 LLRNPRRSSLYYVNLVAIKV-GRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYT 328

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
           A+R+ FR              FDTC+++     + VPT+   F G +V+LP  N +I   
Sbjct: 329 AVRNEFRRRVGPKLPVTTLGGFDTCYNV----PIVVPTITFLFSGMNVTLPPDNIVIHST 384

Query: 435 SSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +  T C A AG      S L++I N+QQQ  RV++D+  SRIG A   C
Sbjct: 385 AGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 200/379 (52%), Gaps = 51/379 (13%)

Query: 111 PRNRSRGRANGGFSSSVIS---GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSD-VVWIQC 166
           P+N+    A GG     I+   G   GSG          PP    ++ +   D + W QC
Sbjct: 51  PKNKCSASARGGSQGLPITQKYGPCSGSGH-------SQPPSPQEILAEMNPDSITWTQC 103

Query: 167 APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVG 226
            PC +C   +   FDP+ S +++   C       + S+  N      Y ++YGD S +VG
Sbjct: 104 KPCVRCLKDSHRHFDPSASLTYSLGSC-------IPSTVGNT-----YNMTYGDKSTSVG 151

Query: 227 DFSTETLTFRGTRV-ARVALGCGHDNEGLFVAAA-GLLGLGRGRLSFPTQTGRRFNRKFS 284
           ++  +T+T   + V  +   GCG +NEG F + A G+LGLG+G+LS  +QT  +F + FS
Sbjct: 152 NYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFS 211

Query: 285 YCLVDRSTSAKPSSMVFGDSAVSRTA-RFTPLLANP-----KLDTFYYVELVGISVGGAH 338
           YCL +  +     S++FG+ A S+++ +FT L+  P     +   +Y+V+L+ ISVG   
Sbjct: 212 YCLPEEDSIG---SLLFGEKATSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKR 268

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS----SLKRAPDFS 394
           +  + +S+F      + G IIDSGT +T L + AY AL  AF+   +    S  R     
Sbjct: 269 LN-VPSSVF-----ASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGD 322

Query: 395 LFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAG----TM-S 448
           + DTC++LSG+ +V +P +VLHF  GADV L     +   D+S   C AFAG    TM S
Sbjct: 323 ILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDAS-RLCLAFAGNSKSTMNS 381

Query: 449 GLSIIGNIQQQGFRVVYDL 467
            L+IIGN QQ    V+YD+
Sbjct: 382 ELTIIGNRQQVSLTVLYDI 400


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  177 bits (450), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 176/345 (51%), Gaps = 32/345 (9%)

Query: 153 MVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCN- 207
           MV+DT SDV W+QCAPC +  CY+Q+D ++DP KS   A  PC SP CR L   ++GC  
Sbjct: 176 MVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTG 235

Query: 208 --RRNTCLYQVSYGDGSITVGDFSTETLTFRGT---RVARVALGCGHD--NEGLFV-AAA 259
                TC Y+V Y DGS T G + ++ LT        V++   GC H     G F    A
Sbjct: 236 AGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKTA 295

Query: 260 GLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLA 317
           G + LGRG  S  +QT   F++   FSYCL    +     S+     A SR A  TP+L 
Sbjct: 296 GFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYA-VTPMLK 354

Query: 318 NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
           +      Y V L+GI V G  +  +  ++F  + A      +DS T +TRL   AY+ALR
Sbjct: 355 SKMAPMIYMVRLIGIDVAGQRLP-VPPAVFAANAA------MDSRTIITRLPPTAYMALR 407

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSS 436
            AFRA   + +        DTC+D +G   V++P V L F R A V L  +  ++  DS 
Sbjct: 408 AAFRAQMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML--DS- 464

Query: 437 GTFCFAFAGTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              C AFA   +     IIGN+QQQ   V+Y++  + +GF    C
Sbjct: 465 ---CLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 132/353 (37%), Positives = 177/353 (50%), Gaps = 43/353 (12%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFAT 190
           G+  Y     +GTP     M +DTGSD+ W+QC PC     CYSQ DP+FDPA+S S+A 
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAA 195

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           VPC  P+C  L           +Y              ++     +   V     GCGH 
Sbjct: 196 VPCGGPVCAGLG----------IYA-------------ASACSAAQCGAVQGFFFGCGHA 232

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM-VFGDSAVSRT 309
             GLF    GLLGLGR + S   QT   +   FSYCL  + ++A   ++ V G S  +  
Sbjct: 233 QSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG 292

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T LL +P   T+Y V L GISVGG  +  + AS F          ++D+GT VTRL 
Sbjct: 293 FSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-VPASAFAGG------TVVDTGTVVTRLP 345

Query: 370 RPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPA 426
             AY ALR AFR+G +S     AP   + DTC++ +G   V +P V L F  GA V+L A
Sbjct: 346 PTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGA 405

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              L    S G   FA +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 406 DGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 138/373 (36%), Positives = 195/373 (52%), Gaps = 39/373 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCR 194
           GEY   L +GTPP+    + DTGSD+VW QCAPC ++C+ Q  P+++P+ S +F  +PC 
Sbjct: 95  GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 154

Query: 195 SP--LC---RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVA 244
           S   LC    +L  +       C Y  +YG G  T G   +ET TF  +     RV  +A
Sbjct: 155 SALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIA 213

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GC + +   +  +AGL+GLGRG LS  +Q        FSYCL     +   S+++ G +
Sbjct: 214 FGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGM---FSYCLTPFQDTKSKSTLLLGPA 270

Query: 305 AVS--------RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
           A +        R+  F P  + P + T+YY+ L GISVG A +  I    F L   G GG
Sbjct: 271 AAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALP-IPPGAFALRADGTGG 329

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-----DFSLFDTCFDL--SGKTEVK 409
           +IIDSGT++T L   AY  +R A R    SL + P     + +  D CF L  S      
Sbjct: 330 LIIDSGTTITSLVDAAYKRVRAAVR----SLVKLPVTDGSNATGLDLCFALPSSSAPPAT 385

Query: 410 VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDL 467
           +P++ LHF  GAD+ LP  NY+I +D  G +C A      G LS +GN QQQ   ++YD+
Sbjct: 386 LPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDV 443

Query: 468 AASRIGFAPRGCA 480
               + FAP  C+
Sbjct: 444 QKETLSFAPAKCS 456


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 129/355 (36%), Positives = 174/355 (49%), Gaps = 25/355 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +  VGTP +   M LDT +D  WI C  C  C S    VF+   S +F T+ C 
Sbjct: 87  SPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSVTSTTFKTLGCD 143

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C+++ +  C   +TC +  +YG GS  + + + +T+      V     GC     G 
Sbjct: 144 APQCKQVPNPTCG-GSTCTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGS 201

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GLLGLGRG LSF +QT   +   FSYCL    T     ++  G +      + TP
Sbjct: 202 SVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLRIKTTP 261

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV L+GI V G  +  I AS    +P    G I DSGT  TRL  P Y 
Sbjct: 262 LLKNPRRSSLYYVNLIGIRV-GRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAPVYT 320

Query: 375 ALRDAFR-----AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
           A+RD FR     A  SSL        FDTC+       +  PT+   F G +V+LP  N 
Sbjct: 321 AVRDEFRKRVGNAIVSSLGG------FDTCY----TGPIVAPTMTFMFSGMNVTLPTDNL 370

Query: 430 LIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LI   +  T C A A       S L++I N+QQQ  R+++D+  SRIG A   C+
Sbjct: 371 LIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 138/373 (36%), Positives = 195/373 (52%), Gaps = 39/373 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDPVFDPAKSRSFATVPCR 194
           GEY   L +GTPP+    + DTGSD+VW QCAPC ++C+ Q  P+++P+ S +F  +PC 
Sbjct: 90  GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 149

Query: 195 SP--LC---RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVA 244
           S   LC    +L  +       C Y  +YG G  T G   +ET TF  +     RV  +A
Sbjct: 150 SALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIA 208

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GC + +   +  +AGL+GLGRG LS  +Q        FSYCL     +   S+++ G +
Sbjct: 209 FGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGM---FSYCLTPFQDTKSKSTLLLGPA 265

Query: 305 AVS--------RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
           A +        R+  F P  + P + T+YY+ L GISVG A +  I    F L   G GG
Sbjct: 266 AAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALP-IPPGAFALRADGTGG 324

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP-----DFSLFDTCFDL--SGKTEVK 409
           +IIDSGT++T L   AY  +R A R    SL + P     + +  D CF L  S      
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAAVR----SLVKLPVTDGSNATGLDLCFALPSSSAPPAT 380

Query: 410 VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDL 467
           +P++ LHF  GAD+ LP  NY+I +D  G +C A      G LS +GN QQQ   ++YD+
Sbjct: 381 LPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDV 438

Query: 468 AASRIGFAPRGCA 480
               + FAP  C+
Sbjct: 439 QKETLSFAPAKCS 451


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 129/355 (36%), Positives = 174/355 (49%), Gaps = 25/355 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +  VGTP +   M LDT +D  WI C  C  C S    VF+   S +F T+ C 
Sbjct: 87  SPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSVTSTTFKTLGCD 143

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C+++ +  C   +TC +  +YG GS  + + + +T+      V     GC     G 
Sbjct: 144 APQCKQVPNPTCG-GSTCTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGS 201

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GLLGLGRG LSF +QT   +   FSYCL    T     ++  G +      + TP
Sbjct: 202 SVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLRIKTTP 261

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV L+GI V G  +  I AS    +P    G I DSGT  TRL  P Y 
Sbjct: 262 LLKNPRRSSLYYVNLIGIRV-GRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAPVYT 320

Query: 375 ALRDAFR-----AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
           A+RD FR     A  SSL        FDTC+       +  PT+   F G +V+LP  N 
Sbjct: 321 AVRDEFRKRVGNAIVSSLGG------FDTCY----TGPIVAPTMTFMFSGMNVTLPPDNL 370

Query: 430 LIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LI   +  T C A A       S L++I N+QQQ  R+++D+  SRIG A   C+
Sbjct: 371 LIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 132/353 (37%), Positives = 177/353 (50%), Gaps = 43/353 (12%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK---KCYSQTDPVFDPAKSRSFAT 190
           G+  Y     +GTP     M +DTGSD+ W+QC PC     CYSQ DP+FDPA+S S+A 
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAA 195

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           VPC  P+C  L           +Y              ++     +   V     GCGH 
Sbjct: 196 VPCGGPVCAGLG----------IYA-------------ASACSAAQCGAVQGFFFGCGHA 232

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM-VFGDSAVSRT 309
             GLF    GLLGLGR + S   QT   +   FSYCL  + ++A   ++ V G S  +  
Sbjct: 233 QSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG 292

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T LL +P   T+Y V L GISVGG  +  + AS F          ++D+GT VTRL 
Sbjct: 293 FSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-VPASAFAGG------TVVDTGTVVTRLP 345

Query: 370 RPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPA 426
             AY ALR AFR+G +S     AP   + DTC++ +G   V +P V L F  GA V+L A
Sbjct: 346 PTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGA 405

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              L    S G   FA +G+  G++I+GN+QQ+ F V  D   + +GF P  C
Sbjct: 406 DGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 120/340 (35%), Positives = 175/340 (51%), Gaps = 32/340 (9%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           +VLD+ SDV W+QC PC    C+ Q D  +DP++S + A   C SP C  L   ++GC  
Sbjct: 31  VVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGC-A 89

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVA-AAGLLGLGR 266
            N C Y V Y DGS T G +  + LT   G  V+    GC H  +G F A AAG++ LG 
Sbjct: 90  NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALGG 149

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYY 326
           G  S  +QT  R+   FSYC+   ++ +   ++     A SR    TP++   +  TFY 
Sbjct: 150 GPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYV-VTPMVRFRQAATFYG 208

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           V L  I+VGG  + G+  ++F        G ++DS T++TRL   AY ALR AFR+  + 
Sbjct: 209 VLLRTITVGGQRL-GVAPAVFA------AGSVLDSRTAITRLPPTAYQALRAAFRSSMTM 261

Query: 387 LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---CFAF 443
            + AP     DTC+D +G   +++P + L F          N ++P+D SG     C AF
Sbjct: 262 YRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD--------RNAVLPLDPSGILFNDCLAF 313

Query: 444 AGT----MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                  M G  ++G++QQQ   V+YD+    +GF    C
Sbjct: 314 TSNADDRMPG--VLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 147/443 (33%), Positives = 212/443 (47%), Gaps = 54/443 (12%)

Query: 61  AESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRAN 120
           + + + ++L HVD+      PE     R++R +   + +             N +  RA 
Sbjct: 30  SNTGIRMKLTHVDAKGNYTAPE-----RVRRAIALSRQI-------------NLASTRAE 71

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDP 178
           GG  S+ +    +   +Y     VG PP+    ++DTGS ++W QC  C  K C  Q  P
Sbjct: 72  GGGVSAPVHWATR---QYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLP 128

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT 238
            F+ + S SFA VPC+   C       C    TC ++V+YG G I +G   T+  TF+ +
Sbjct: 129 YFNASSSGSFAPVPCQDKACAGNYLHFCALDGTCTFRVTYGAGGI-IGFLGTDAFTFQ-S 186

Query: 239 RVARVALGCGHDNE----GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
             A +A GC          +   A+GL+GLGRGRLS  +QTG    ++FSYCL     + 
Sbjct: 187 GGATLAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGA---KRFSYCLTPYFHNN 243

Query: 295 KPSSMVFGDSAVSRTARFTPLLA-----NPK---LDTFYYVELVGISVGGAHVRGITASL 346
             SS +F  +A S +     +++     +PK     TFYY+ LVGI+VG   +  I ++ 
Sbjct: 244 GASSHLFVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKL-AIPSTA 302

Query: 347 FKLDPA----GNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAP--DFSLFDTC 399
           F L         GGVIIDSG+  T L   AY  L     R    SL   P  D      C
Sbjct: 303 FDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALC 362

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFA-GTMSGLSIIGNIQ 457
               G  +  VPT+VLHF  GAD++LP  NY  P++ S T C A   G +   SIIGN Q
Sbjct: 363 V-ARGDLDRVVPTLVLHFSGGADMALPPENYWAPLEKS-TACMAIVRGYLQ--SIIGNFQ 418

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
           QQ   +++D+   R+ F    C+
Sbjct: 419 QQNMHILFDVGGGRLSFQNADCS 441


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/363 (38%), Positives = 182/363 (50%), Gaps = 31/363 (8%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS--QTDPVFDPAKSRS 187
           G + G+ +Y   + +GTP     + +DTGSDV W+QCAPC       Q D +FDPAKS S
Sbjct: 492 GHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSS 551

Query: 188 FATVPCRSPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVA 244
           ++ VPC +  C +L +   GC   + C Y VSYGDGS T G + ++TLT      V    
Sbjct: 552 YSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLTDADAVTGFL 611

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQT-GRRFNRKFSYCLVDRSTSAKPSSMVF-- 301
            GCGH   GLF    GLL LGR  +S  +QT G      FSYCL        PSS  F  
Sbjct: 612 FGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCL-----PPSPSSTGFLT 666

Query: 302 --GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
             G S+ S  A  T LL    + TFY V L GI VGG  + G+ AS F       GG ++
Sbjct: 667 LGGPSSASGFAT-TGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFA------GGTVV 719

Query: 360 DSGTSVTRL--TRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           D+GT +TRL  T  A +               AP   + DTC++ +    V +PTV L F
Sbjct: 720 DTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTF 779

Query: 418 R-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
             GA + L A  +L    SSG   FA        +I+GN+QQ+ F V +D   S +GF P
Sbjct: 780 SGGATLKLDAPGFL----SSGCLAFATNSGDGDPAILGNVQQRSFAVRFD--GSSVGFMP 833

Query: 477 RGC 479
             C
Sbjct: 834 HSC 836


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/424 (33%), Positives = 198/424 (46%), Gaps = 39/424 (9%)

Query: 87  LRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGT 146
           LR++   +  K   +  E   R   R   R  + G  S+ V    +Q   EY     +G 
Sbjct: 24  LRLELTHVDAKQNCSTEERMRRATERTHRRLASMGEASAPVHWAESQYIAEYL----IGD 79

Query: 147 PPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS 204
           PP+    ++DTGS+++W QC+ C+   C+SQ    +DP++SR+   V C    C     +
Sbjct: 80  PPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDTACALGSET 139

Query: 205 GCNRRN-TCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGC---GHDNEGLFVAAA 259
            C R N  C    +YG G I  G   TE  TF+  +    +A GC        G    A+
Sbjct: 140 RCARDNKACAVLTAYGAGVIG-GVLGTEAFTFQPQSENVSLAFGCIAATRLTPGSLDGAS 198

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF-----GDSAVSRTARFTP 314
           G++GLGRG LS  +Q G   + KFSYCL    + +  +S +F     G S+    A   P
Sbjct: 199 GIIGLGRGNLSLVSQLG---DNKFSYCLTPYFSQSTNTSRLFVGASAGLSSGGAPATSVP 255

Query: 315 LLANPKLD---TFYYVELVGISVGGAHVRGITASLFKLDPAGNG---GVIIDSGTSVTRL 368
            L NP +D   TFYY+ L GI+VG A +  +  + F L     G   G +IDSG+  T L
Sbjct: 256 FLKNPDVDPFSTFYYLPLTGITVGDAKL-AVPEAAFDLRQVATGLWAGTLIDSGSPFTSL 314

Query: 369 TRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLS-GKTEVKVPTVVLHF--RGADVS 423
              AY ALRD    + GAS +         D C  ++ G     VP +VLHF   G DV+
Sbjct: 315 VDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLVLHFGSGGGDVA 374

Query: 424 LPATNYLIPVDSSGTFCFAFAG-------TMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
           +P  NY  PVD S      F+         M+  +IIGN  QQ   ++YDL    + F P
Sbjct: 375 VPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLYDLEKGMLSFQP 434

Query: 477 RGCA 480
             C+
Sbjct: 435 ADCS 438


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 146/440 (33%), Positives = 214/440 (48%), Gaps = 48/440 (10%)

Query: 63  SSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGG 122
           + L ++L HVD  +   T E     R++R V   +   A+ +         + + RA+G 
Sbjct: 26  AGLRMKLTHVDDKAGYTTEE-----RVRRAVAVSRERLAYTQ--------QQQQLRASGD 72

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC---KKCYSQTDPV 179
            S+ V     Q   EY     +G PP+    ++DTGS+++W QC      K C  Q  P 
Sbjct: 73  VSAPVHLATRQYIAEYL----IGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPY 128

Query: 180 FDPAKSRSFATVPC--RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           ++ ++S +FA VPC   + LC       C    +C +  SYG GS+  G   TE  TF+ 
Sbjct: 129 YNLSRSSTFAAVPCADSAKLCAANGVHLCGLDGSCTFAASYGAGSV-FGSLGTEAFTFQ- 186

Query: 238 TRVARVALGC---GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
           +  A++  GC       +G    A+GL+GLGRGRLS  +QTG     KFSYCL     + 
Sbjct: 187 SGAAKLGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTG---ATKFSYCLTPYLRNH 243

Query: 295 KPSSMVFGDSAVSRTA-----RFTPLLANPK---LDTFYYVELVGISVGGAHVRGITASL 346
             SS +F  ++ S +         P + +P+     TFYY+ LVGISVG   +  I ++ 
Sbjct: 244 GASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLP-IPSAA 302

Query: 347 FKLD--PAG--NGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDTCFD 401
           F+L    AG  +GGVIID+G+ VT L   AY AL D   R    SL + P  +  D C  
Sbjct: 303 FELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCVA 362

Query: 402 LSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQG 460
                +V VP +V HF  GAD+++ A +Y  PVD S        G     ++IGN QQQ 
Sbjct: 363 RQDVDKV-VPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLIEEGGYE--TVIGNFQQQD 419

Query: 461 FRVVYDLAASRIGFAPRGCA 480
             ++YD+    + F    C+
Sbjct: 420 VHLLYDIGKGELSFQTADCS 439


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 153/446 (34%), Positives = 205/446 (45%), Gaps = 81/446 (18%)

Query: 63  SSLSLRLHHVDSLSFNRTPEHLFNLR--IQRDVLRVKSLTAFAESAVRVPPRNRSRGRAN 120
           ++L L+L HVD+    R   H   LR   QR   R   L +  + +         RGR+ 
Sbjct: 22  ANLRLQLSHVDA---GRGLTHWELLRRMAQRSKARATHLLSAQDQS--------GRGRS- 69

Query: 121 GGFSSSVISGLAQGSG----EYFTRLGVGTPPRYVYMVLDTGSDVVWIQC--APCKKCYS 174
              +S+ ++  A   G    EY   L  GTPP+ V + LDTGSD+ W QC   P   C++
Sbjct: 70  ---ASAPVNPGAYDDGFPFTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFN 126

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNT---CLYQVSYGDGSITVGDFSTE 231
           QT P+FDP+ S SFA++PC SP C      G     T   C Y +SYGDGS++ G+   E
Sbjct: 127 QTLPLFDPSASSSFASLPCSSPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGRE 186

Query: 232 TLTF-----RGTRVARVAL--GCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKF 283
             TF      G+  A   L  GCGH N G+F +   G+ G GRG LS P+Q        F
Sbjct: 187 VFTFASGTGEGSSAAVPGLVFGCGHANRGVFTSNETGIAGFGRGSLSLPSQLKV---GNF 243

Query: 284 SYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGIT 343
           S+C     T +K S+++ G   V+      P  A+P          +G   G    R   
Sbjct: 244 SHCFTT-ITGSKTSAVLLGLPGVA------PPSASP----------LGRRRGSYRCRSTP 286

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD-TCFD- 401
            S              +SGTS+T L    Y A+R+ F A    L   P  +    TCF  
Sbjct: 287 RS-------------SNSGTSITSLPPRTYRAVREEF-AAQVKLPVVPGNATDPFTCFSA 332

Query: 402 -LSGKTEVKVPTVVLHFRGADVSLPATNYLIPV-------DSSGTFCFAFAGTMSGLSII 453
            L G  +  VPT+ LHF GA + LP  NY+  V       +SS   C A      G  I+
Sbjct: 333 PLRGP-KPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAV--IEGGEIIL 389

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGC 479
           GNIQQQ   V+YDL  S++ F P  C
Sbjct: 390 GNIQQQNMHVLYDLQNSKLSFVPAQC 415


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 189/370 (51%), Gaps = 43/370 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VG+PP+ V MVLDTGS++ W+ C      +S    VFDP +S S++ +PC SP CR  
Sbjct: 67  LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTR 122

Query: 202 DSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
                    C+++  C   +SY D S   G+ +++T     + +     GC       N 
Sbjct: 123 TRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNS 182

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS--RTA 310
                  GL+G+ RG LSF TQ G    +KFSYC+  + +S     ++FG+S+ S  +  
Sbjct: 183 DEDSKTTGLIGMNRGSLSFVTQMGL---QKFSYCISGQDSSGI---LLFGESSFSWLKAL 236

Query: 311 RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           ++TPL+      P  D   Y V+L GI V  + ++ +  S++  D  G G  ++DSGT  
Sbjct: 237 KYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQ-LPKSVYAPDHTGAGQTMVDSGTQF 295

Query: 366 TRLTRPAYIALRDAF-RAGASSLK--RAPDFSL---FDTCF--DLSGKTEVKVPTVVLHF 417
           T L  P Y AL++ F R   +SLK    P+F      D C+   L+ +T   +PTV L F
Sbjct: 296 TFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 355

Query: 418 RGADVSLPATNYLIPV-----DSSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAA 469
           RGA++S+ A   +  V      S   +CF F  + + G+   IIG+  QQ   + +DLA 
Sbjct: 356 RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAK 415

Query: 470 SRIGFAPRGC 479
           SR+GFA   C
Sbjct: 416 SRVGFAEVRC 425


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 189/370 (51%), Gaps = 43/370 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VG+PP+ V MVLDTGS++ W+ C      +S    VFDP +S S++ +PC SP CR  
Sbjct: 60  LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTR 115

Query: 202 DSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
                    C+++  C   +SY D S   G+ +++T     + +     GC       N 
Sbjct: 116 TRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNS 175

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS--RTA 310
                  GL+G+ RG LSF TQ G    +KFSYC+  + +S     ++FG+S+ S  +  
Sbjct: 176 DEDSKTTGLIGMNRGSLSFVTQMGL---QKFSYCISGQDSSGI---LLFGESSFSWLKAL 229

Query: 311 RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           ++TPL+      P  D   Y V+L GI V  + ++ +  S++  D  G G  ++DSGT  
Sbjct: 230 KYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQ-LPKSVYAPDHTGAGQTMVDSGTQF 288

Query: 366 TRLTRPAYIALRDAF-RAGASSLK--RAPDFSL---FDTCF--DLSGKTEVKVPTVVLHF 417
           T L  P Y AL++ F R   +SLK    P+F      D C+   L+ +T   +PTV L F
Sbjct: 289 TFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 348

Query: 418 RGADVSLPATNYLIPV-----DSSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAA 469
           RGA++S+ A   +  V      S   +CF F  + + G+   IIG+  QQ   + +DLA 
Sbjct: 349 RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAK 408

Query: 470 SRIGFAPRGC 479
           SR+GFA   C
Sbjct: 409 SRVGFAEVRC 418


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 132/372 (35%), Positives = 174/372 (46%), Gaps = 47/372 (12%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+ V SG  Q    Y  R G+GTP + + + LDT +D  W  CAPC  C + +   F PA
Sbjct: 67  SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122

Query: 184 KSRSFATVPCRS---PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV 240
            S S+A++PC S   PL R+    G   R               VG  +   L    +R 
Sbjct: 123 SSSSYASLPCASDWCPLFRRPAVPGEPGR---------------VGAAADVRLLQAASRT 167

Query: 241 ARVALGCGHDNEGLFVAAAGLLGLGR--------GRLSFPTQTGRRFNRKFSYCLVDRST 292
            R             V AA   G  R        G +S  +QTG R+N  FSYCL    +
Sbjct: 168 PRSG-----------VLAATRCGWARTPSPATRSGPMSLLSQTGSRYNGVFSYCLPSYRS 216

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
                S+  G +   R  R+TPLL NP   + YYV + G+SVG A V+    S F  DP+
Sbjct: 217 YYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGS-FAFDPS 275

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
              G +IDSGT +TR T P Y ALRD FR   ++         FDTCF+         P 
Sbjct: 276 TGAGTVIDSGTVITRWTAPVYAALRDEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPP 335

Query: 413 VVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA----GTMSGLSIIGNIQQQGFRVVYDL 467
           V LH  G  D++LP  N LI   ++   C A A       S ++++ N+QQQ  RVV D+
Sbjct: 336 VTLHMGGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDV 395

Query: 468 AASRIGFAPRGC 479
           A SR+GFA   C
Sbjct: 396 AGSRVGFAREPC 407


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 174/348 (50%), Gaps = 43/348 (12%)

Query: 153 MVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGC-- 206
           M+LDT SDV W+QC PC   +CY+QTD ++DP+KSRS  +  C SP CR+L   ++GC  
Sbjct: 184 MLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCSS 243

Query: 207 --NRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDNEGLFV--AAAGL 261
             N    C Y+V Y DGS T G    + L+   T +V +   GC H   G F     AG+
Sbjct: 244 SSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHAARGSFSRSKTAGI 303

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG--DSAVSRTARFTPLLANP 319
           + LGRG  S  +QT  ++ + FSYC     T++     V G    + SR A  TP+L  P
Sbjct: 304 MALGRGVQSLVSQTSTKYGQVFSYCF--PPTASHKGFFVLGVPRRSSSRYA-VTPMLKTP 360

Query: 320 KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
            L   Y V L  I+V G  +  +  ++F        G  +DS T +TRL   AY ALR A
Sbjct: 361 ML---YQVRLEAIAVAGQRLD-VPPTVFA------AGAALDSRTVITRLPPTAYQALRSA 410

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF--RGADVSLPATNYLIPVDSSG 437
           FR   S  + A      DTC+D +G + + +PT+ L F   GA V L         D SG
Sbjct: 411 FRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQL---------DPSG 461

Query: 438 TF---CFAFAGTMS---GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                C AFA T        IIG +Q Q   V+Y++A   +GF    C
Sbjct: 462 VLFGSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  175 bits (443), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 131/341 (38%), Positives = 170/341 (49%), Gaps = 31/341 (9%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           M +DT  D+ WIQCAPC   +CY Q + +FDP +SR+ A VPC S  C +L    +GC+ 
Sbjct: 164 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCS- 222

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAA-AGLLGLGR 266
            N C Y V YGDG  T G +  + LT    T V     GC H   G F A+ +G + LG 
Sbjct: 223 NNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGG 282

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS-SMVFGDSAVSRTARFTPLLANPK-LDTF 324
           GR S  +QT   F   FSYC+ D S+S   S           R AR TPL+ NP  + T 
Sbjct: 283 GRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFAR-TPLVRNPSIIPTL 341

Query: 325 YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGA 384
           Y V L GI VGG  +  +   +F       GG ++DS   +T+L   AY ALR AFR+  
Sbjct: 342 YLVRLRGIEVGGRRLN-VPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAM 394

Query: 385 SSLKR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---C 440
           ++  R A   +  DTC+D    T V VP V L F G  V        + +D+ G     C
Sbjct: 395 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV--------VRLDAMGVMVEGC 446

Query: 441 FAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            AF  T     L  IGN+QQQ   V+YD+    +GF    C
Sbjct: 447 LAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 131/341 (38%), Positives = 170/341 (49%), Gaps = 31/341 (9%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           M +DT  D+ WIQCAPC   +CY Q + +FDP +SR+ A VPC S  C +L    +GC+ 
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCS- 206

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAA-AGLLGLGR 266
            N C Y V YGDG  T G +  + LT    T V     GC H   G F A+ +G + LG 
Sbjct: 207 NNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGG 266

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS-SMVFGDSAVSRTARFTPLLANPK-LDTF 324
           GR S  +QT   F   FSYC+ D S+S   S           R AR TPL+ NP  + T 
Sbjct: 267 GRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFAR-TPLVRNPSIIPTL 325

Query: 325 YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGA 384
           Y V L GI VGG  +  +   +F       GG ++DS   +T+L   AY ALR AFR+  
Sbjct: 326 YLVRLRGIEVGGRRLN-VPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAM 378

Query: 385 SSLKR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---C 440
           ++  R A   +  DTC+D    T V VP V L F G  V        + +D+ G     C
Sbjct: 379 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV--------VRLDAMGVMVEGC 430

Query: 441 FAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            AF  T     L  IGN+QQQ   V+YD+    +GF    C
Sbjct: 431 LAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 131/365 (35%), Positives = 178/365 (48%), Gaps = 40/365 (10%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  G G Y   + VGTP     +V DTGSD++W QCAPC KC+ Q  P F PA S +F+ 
Sbjct: 79  LENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138

Query: 191 VPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           +PC S  C+ L +S   CN    C+Y   YG G  T G  +TETL         VA GC 
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATG-CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCS 196

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSA-- 305
            +N GL     G L LG GR              FSYCL  RS SA  +S ++FG  A  
Sbjct: 197 TEN-GL-----GQLDLGVGR--------------FSYCL--RSGSAAGASPILFGSLANL 234

Query: 306 VSRTARFTPLLANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAG-NGGVIIDSGT 363
                + TP + NP +  ++YYV L GI+VG   +  +T S F     G  GG I+DSGT
Sbjct: 235 TDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLP-VTTSTFGFTQNGLGGGTIVDSGT 293

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHFR-GA 420
           ++T L +  Y  ++ AF +  + +         D CF     G   + VP++VL F  GA
Sbjct: 294 TLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGA 353

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSG-----LSIIGNIQQQGFRVVYDLAASRIGFA 475
           + ++P     +  DS G+   A    +       +S+IGN+ Q    ++YDL      FA
Sbjct: 354 EYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFA 413

Query: 476 PRGCA 480
           P  CA
Sbjct: 414 PADCA 418


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 128/356 (35%), Positives = 176/356 (49%), Gaps = 34/356 (9%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY   + +G+P     M +DTGSDV W++C            ++DP  S ++A   C +P
Sbjct: 130 EYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------SRLYDPGTSSTYAPFSCSAP 180

Query: 197 LCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR---VARVALGCGHDN 251
            C +L    +GC+  +TC+Y V YGDGS T G + ++TLT  GT    ++    GC    
Sbjct: 181 ACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSEPLISGFQFGCSAVE 240

Query: 252 EGLFVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
            G       GL+GLG    SF +QT   +   FSYCL     S+   ++    S+ S   
Sbjct: 241 HGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWNSSGFLTLGAPSSSTSAAF 300

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
             TP+L + +  TFY + L GISVGG  +  I +S+F      + G I+DSGT +TRL  
Sbjct: 301 STTPMLRSKQAATFYGLLLRGISVGGKTLE-IPSSVF------SAGSIVDSGTVITRLPP 353

Query: 371 PAYIALRDAFRAGASSLKRAPDF--SLFDTCFDLSGKTE---VKVPTVVLHFRGADVSLP 425
            AY AL  AFR G +  +  P     L DTCFD +G  E     VP+V L   G  V   
Sbjct: 354 TAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDGGAVVDL 413

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             N ++        C AFA T       IIGN+QQ+ F V+YD+  S  GF P  C
Sbjct: 414 HPNGIVQ-----DGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 116/348 (33%), Positives = 162/348 (46%), Gaps = 34/348 (9%)

Query: 91  RDVLRVKSLTAFAE---SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP 147
           +D  R+K L+  A+   +AV + P  +    AN                 Y  R+ +GTP
Sbjct: 12  KDPERLKYLSTLADQKTTAVPIAPGQQVLKIAN-----------------YVVRVKLGTP 54

Query: 148 PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC- 206
            + ++MVLDT +D  W+ C+ C  C S T   F P  S +  ++ C    C ++    C 
Sbjct: 55  GQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCP 111

Query: 207 -NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLG 265
               + CL+  SYG  S        + +T     +     GC +   G  +   GLLGLG
Sbjct: 112 ATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLG 171

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
           RG +S  +Q G  ++  FSYCL    +     S+  G     ++ R TPLL NP   + Y
Sbjct: 172 RGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLY 231

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           YV L G+SVG   V  I +     DP    G IIDSGT +TR  +P Y A+RD FR   +
Sbjct: 232 YVNLTGVSVGRIKVP-IPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN 290

Query: 386 SLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
                P  SL  FDTCF  +   E + P V LHF G ++ LP  N LI
Sbjct: 291 ----GPISSLGAFDTCF--AATNEAEAPAVTLHFEGLNLVLPMENSLI 332


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 131/368 (35%), Positives = 183/368 (49%), Gaps = 32/368 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  R  +GTPP+ + + +DT +D  W+ CA C  C + T P F+PA S +F  VPC +P 
Sbjct: 94  YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCPT-TAPSFNPASSATFRPVPCGAPP 152

Query: 198 CRKLDSSGC----NRRNTCLYQVSYGDGSITVGDFSTETL--TFRGTRVARVALGCGHDN 251
           C +  +  C      +N+C + +SYGD S+     S + L  T  G  +     GC   +
Sbjct: 153 CSQAPNPSCTSLAKSKNSCGFSLSYGDSSLDA-TLSQDNLAVTANGGVIKGYTFGCLTKS 211

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD--RSTSAKPSSMVFGDSA--VS 307
            G    A GLLGLGRG L F  QT   +   FSYCL    RS +    S+  G       
Sbjct: 212 NGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKGQPAP 271

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
              + TPLLA+P   + YYV + G+ +G   V  I  S    D A   G ++DSGT   R
Sbjct: 272 EKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVP-IPPSALAFDAATGAGTVLDSGTMFAR 330

Query: 368 LTRPAYIALRDAFR---AGASSLKRAPDFSL-------FDTCFDLSGKTEVKVPTVVLHF 417
           L +PAY A+RD  R   AG+   +     S+       FDTC+++S    V  P V L F
Sbjct: 331 LAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVS---TVAWPAVTLVF 387

Query: 418 RGA-DVSLPATNYLIPVDSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASR 471
            G  +V LP  N +I      T C A A     G  + L++IG++QQQ  RV++D+  +R
Sbjct: 388 GGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNAR 447

Query: 472 IGFAPRGC 479
           +GFA   C
Sbjct: 448 VGFARERC 455


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 116/348 (33%), Positives = 162/348 (46%), Gaps = 34/348 (9%)

Query: 91  RDVLRVKSLTAFAE---SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP 147
           +D  R+K L+  A+   +AV + P  +    AN                 Y  R+ +GTP
Sbjct: 12  KDPERLKYLSTLADQKTTAVPIAPGQQVLKIAN-----------------YVVRVKLGTP 54

Query: 148 PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC- 206
            + ++MVLDT +D  W+ C+ C  C S T   F P  S +  ++ C    C ++    C 
Sbjct: 55  GQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCP 111

Query: 207 -NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLG 265
               + CL+  SYG  S        + +T     +     GC +   G  +   GLLGLG
Sbjct: 112 ATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLG 171

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
           RG +S  +Q G  ++  FSYCL    +     S+  G     ++ R TPLL NP   + Y
Sbjct: 172 RGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLY 231

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           YV L G+SVG   V  I +     DP    G IIDSGT +TR  +P Y A+RD FR   +
Sbjct: 232 YVNLTGVSVGRIKVP-IPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN 290

Query: 386 SLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
                P  SL  FDTCF  +   E + P V LHF G ++ LP  N LI
Sbjct: 291 ----GPISSLGAFDTCF--AETNEAEAPAVTLHFEGLNLVLPMENSLI 332


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 125/358 (34%), Positives = 185/358 (51%), Gaps = 28/358 (7%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCR-- 199
           + +GTPP+   ++LDTGSD++W QC        +  P++DPAKS SFA  PC   LC   
Sbjct: 93  VSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCETG 152

Query: 200 KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL--GCGHDNEGLFVA 257
             ++  C+ RN C+Y  +YG  + T G+ ++ET TF   R   V+L  GCG    G    
Sbjct: 153 SFNTKNCS-RNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSLDFGCGKLTSGSLPG 210

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCL---VDRSTSAKPSSMVFGDSAVSRTA---R 311
           A+G+LG+   RLS  +Q       +FSYCL   +DR+T++        D +  RT    +
Sbjct: 211 ASGILGISPDRLSLVSQLQI---PRFSYCLTPFLDRNTTSHIFFGAMADLSKYRTTGPIQ 267

Query: 312 FTPLLANPK-LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
            T L+ NP   + +YYV L+GISVG   +  +  S F +   G+GG  +DSG +   L  
Sbjct: 268 TTSLVTNPDGSNYYYYVPLIGISVGTKRLN-VPVSSFAIGRDGSGGTFVDSGDTTGMLPS 326

Query: 371 PAYIALRDAF-RAGASSLKRAPDFSL-FDTCFDL------SGKTEVKVPTVVLHFR-GAD 421
               AL++A   A    +  A D    ++ CF L      + +T V+VP +V HF  GA 
Sbjct: 327 VVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAA 386

Query: 422 VSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + L   +Y++ V S+G  C   +    G +IIGN QQQ   V++D+      FAP  C
Sbjct: 387 MLLRRDSYMVEV-SAGRMCLVISSGARG-AIIGNYQQQNMHVLFDVENHEFSFAPTQC 442


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 170/355 (47%), Gaps = 23/355 (6%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +     VG PP    + +DTGSD++W+QC PC  C+ Q+ P+FDP+KS ++  +   SP+
Sbjct: 59  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF----RGT-RVARVALGCGHDNE 252
           C        N  N C+Y  SY DGS + G+ +TE + F    +GT  V+ V  GCGH N 
Sbjct: 119 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178

Query: 253 GLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAVSRTA 310
           G F    +G+LGL  G  S  ++ G R    FSYC+ D        + +V GD  V    
Sbjct: 179 GRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLGD-GVKMEG 233

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
             TP       + FYYV L GISVG   +  I   +F+   +G GGV++DSGT+ T L +
Sbjct: 234 SSTPF---HTFNGFYYVTLEGISVGETRL-DINPEVFQRTESGQGGVVMDSGTTATFLAK 289

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV---PTVVLHF-RGADVSLPA 426
             +  L +  +       +   +          G+    +   P +  HF  GAD+ L A
Sbjct: 290 DGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA 349

Query: 427 TNYLIPVDSSGTFCFA-FAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            N L    +   FC A     +  + S+IG + QQ + V YDL   R+ F    C
Sbjct: 350 -NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 170/355 (47%), Gaps = 23/355 (6%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +     VG PP    + +DTGSD++W+QC PC  C+ Q+ P+FDP+KS ++  +   SP+
Sbjct: 91  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 150

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF----RGT-RVARVALGCGHDNE 252
           C        N  N C+Y  SY DGS + G+ +TE + F    +GT  V+ V  GCGH N 
Sbjct: 151 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 210

Query: 253 GLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAVSRTA 310
           G F    +G+LGL  G  S  ++ G R    FSYC+ D        + +V GD  V    
Sbjct: 211 GRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLGD-GVKMEG 265

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
             TP       + FYYV L GISVG   +  I   +F+   +G GGV++DSGT+ T L +
Sbjct: 266 SSTPF---HTFNGFYYVTLEGISVGETRL-DINPEVFQRTESGQGGVVMDSGTTATFLAK 321

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV---PTVVLHF-RGADVSLPA 426
             +  L +  +       +   +          G+    +   P +  HF  GAD+ L A
Sbjct: 322 DGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA 381

Query: 427 TNYLIPVDSSGTFCFA-FAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            N L    +   FC A     +  + S+IG + QQ + V YDL   R+ F    C
Sbjct: 382 -NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/329 (33%), Positives = 170/329 (51%), Gaps = 24/329 (7%)

Query: 65  LSLRLHHVDS--LSFNRTPEHLFNLRIQRDVLRVKSLTA-FAESAVRVPPRNRSRG--RA 119
           + + +HHV     S    P   F+  +  D  RVK+L +       R P    ++   R 
Sbjct: 40  VQMTIHHVHGPGSSLAPQPPVSFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRF 99

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDP 178
               S  +  G + GSG Y+ ++G G+P RY  M++DTGS + W+QC PC   C+ Q DP
Sbjct: 100 PKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADP 159

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNR------RNTCLYQVSYGDGSITVGDFSTET 232
           +FDP+ S+++ ++ C S  C  L  +  N        N C+Y  SYGD S ++G  S + 
Sbjct: 160 LFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDL 219

Query: 233 LTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
           LT   ++ +     GCG D++GLF  AAG+LGLGR +LS   Q   +F   FSYCL  R 
Sbjct: 220 LTLAPSQTLPGFVYGCGQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRG 279

Query: 292 TSAKPSSMVFGDSAVSRTA-RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
                  +  G ++++ +A +FTP+  +P   + Y++ L  I+VGG    G+ A+ +++ 
Sbjct: 280 GGG---FLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGG-RALGVAAAQYRVP 335

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
                  IIDSGT +TRL    Y   + A
Sbjct: 336 ------TIIDSGTVITRLPMSVYTPFQQA 358


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 170/355 (47%), Gaps = 23/355 (6%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +     VG PP    + +DTGSD++W+QC PC  C+ Q+ P+FDP+KS ++  +   SP+
Sbjct: 59  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF----RGT-RVARVALGCGHDNE 252
           C        N  N C+Y  SY DGS + G+ +TE + F    +GT  V+ V  GCGH N 
Sbjct: 119 CPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178

Query: 253 GLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAVSRTA 310
           G F    +G+LGL  G  S  ++ G R    FSYC+ D        + +V GD  V    
Sbjct: 179 GRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLGD-GVKMEG 233

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
             TP       + FYYV L GISVG   +  I   +F+   +G GGV++DSGT+ T L +
Sbjct: 234 SSTPF---HTFNGFYYVTLEGISVGETRL-DINPEVFQRTESGQGGVVMDSGTTATFLAK 289

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV---PTVVLHF-RGADVSLPA 426
             +  L +  +       +   +          G+    +   P +  HF  GAD+ L A
Sbjct: 290 DGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA 349

Query: 427 TNYLIPVDSSGTFCFA-FAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            N L    +   FC A     +  + S+IG + QQ + V YDL   R+ F    C
Sbjct: 350 -NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 123/354 (34%), Positives = 182/354 (51%), Gaps = 18/354 (5%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  ++ +GTP + + + +DT SDV WI C+ C  C S T   F PAKS SF  V C 
Sbjct: 96  STTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNT--AFSPAKSTSFKNVSCS 153

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD--NE 252
           +P C+++ +  C  R  C + ++YG  SI   + S +T+      +     GC +     
Sbjct: 154 APQCKQVPNPACGAR-ACSFNLTYGSSSI-AANLSQDTIRLAADPIKAFTFGCVNKVAGG 211

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G      GLLGLGRG LS  +Q    +   FSYCL    +     S+  G ++  +  ++
Sbjct: 212 GTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKY 271

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           T LL NP+  + YYV LV I V G  V  +  +    +P+   G I DSGT  TRL +P 
Sbjct: 272 TQLLRNPRRSSLYYVNLVAIRV-GRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPV 330

Query: 373 YIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYL 430
           Y A+R+ FR        A   SL  FDTC+  SG  +VKVPT+   F+G ++++PA N +
Sbjct: 331 YEAVRNEFRKRVKP-PTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGVNMTMPADNLM 385

Query: 431 IPVDSSGTFCFAFA----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +   +  T C A A       S +++I ++QQQ  RV+ D+   R+G A   C+
Sbjct: 386 LHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 121/324 (37%), Positives = 166/324 (51%), Gaps = 35/324 (10%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS--SGCNR 208
           +++D+GSDV W+QC PC    C+ Q DP+FDPA S ++A VPC S  C +L     GC+ 
Sbjct: 170 VIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSA 229

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEG--LFVAAAGLLGLG 265
              C + ++YGDGS   G +S + LT     V R    GC H + G       AG L LG
Sbjct: 230 NAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 289

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTS-------AKPSSMVFGDSAVSRTARFTPLLAN 318
            G  S   QT  R+ R FSYCL   ++S         P       S VS     TPLL++
Sbjct: 290 GGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS-----TPLLSS 344

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
               TFY V L  I V G  +  +  ++F      +   +IDS T ++RL   AY ALR 
Sbjct: 345 SMAPTFYRVLLRAIIVAGRPL-AVPPAVF------SASSVIDSSTIISRLPPTAYQALRA 397

Query: 379 AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSG 437
           AFR+  +  + AP  S+ DTC+D +G   + +P++ L F  GA V+L A   L+     G
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----G 452

Query: 438 TFCFAFAGTMSGL--SIIGNIQQQ 459
           + C AFA T S      IGN+QQ+
Sbjct: 453 S-CLAFAPTASDRMPGFIGNVQQK 475



 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 126/284 (44%), Gaps = 51/284 (17%)

Query: 205 GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGL 264
           GC+    C + ++YGDGS   G +S + LT     V         D +GL          
Sbjct: 479 GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV---------DRQGL---------- 519

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF-------GDSAVSRTARFTPLLA 317
                  P +T  ++ R FSYC+        PSS+ F         +A+  T   TPLL+
Sbjct: 520 -------PLRTATQYGRVFSYCI-----PPSPSSLGFITLGVPPQRAALVPTFVSTPLLS 567

Query: 318 NPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
           +  +  TFY V L  I V G  +  +  ++F          +I S T ++RL   AY AL
Sbjct: 568 SSSMPPTFYRVLLRAIIVAGRPLP-VPPTVFSTS------SVIASTTVISRLPPTAYQAL 620

Query: 377 RDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDS 435
           R AFR   +  + AP  S+ DTC+D +G   + +P++ L F  GA V+L A   L+    
Sbjct: 621 RAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 676

Query: 436 SGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            G   FA   T      IGN+QQ+   VVYD+    I F    C
Sbjct: 677 QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 132/363 (36%), Positives = 178/363 (49%), Gaps = 26/363 (7%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRSFATVPCR 194
           +Y     +G PP+    ++DTGSD+VW QC+ C  K C  Q  P ++ + S +FA VPC 
Sbjct: 89  QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148

Query: 195 SPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GH 249
           + +C   D     C+    C     YG G +  G   TE   F+ +  A +A GC     
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAG-VVAGTLGTEAFAFQ-SGTAELAFGCVTFTR 206

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
             +G    A+GL+GLGRGRLS  +QTG     KFSYCL     +   +  +F  ++ S  
Sbjct: 207 IVQGALHGASGLIGLGRGRLSLVSQTGA---TKFSYCLTPYFHNNGATGHLFVGASASLG 263

Query: 310 AR----FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG----NGGVIIDS 361
                  T  +  PK   FYY+ L+G++VG   +  I A++F L        +GGVIIDS
Sbjct: 264 GHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLP-IPATVFDLREVAPGLFSGGVIIDS 322

Query: 362 GTSVTRLTRPAYIALRD--AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR- 418
           G+  T L   AY AL    A R   S +   PD      C        V VP VV HFR 
Sbjct: 323 GSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRV-VPAVVFHFRG 381

Query: 419 GADVSLPATNYLIPVDS-SGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           GAD+++PA +Y  PVD  +     A AG     S+IGN QQQ  RV+YDLA     F P 
Sbjct: 382 GADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPA 441

Query: 478 GCA 480
            C+
Sbjct: 442 DCS 444


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  171 bits (433), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 172/351 (49%), Gaps = 15/351 (4%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +  +GTP + + + +DT +D  W+ C  C  C S T P F PAKS +F  V C 
Sbjct: 95  SPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGC-STTTP-FAPAKSTTFKKVGCG 152

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +  C+++ +  C+  + C +  +YG  S+       +T+T     V   A GC     G 
Sbjct: 153 ASQCKQVRNPTCD-GSACAFNFTYGTSSV-AASLVQDTVTLATDPVPAYAFGCIQKVTGS 210

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
            V   GLLGLGRG LS   QT + +   FSYCL    T     S+  G  A  +  +FTP
Sbjct: 211 SVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGSLRLGPVAQPKRIKFTP 270

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP+  + YYV LV I V G  +  I       +     G + DSGT  TRL  PAY 
Sbjct: 271 LLKNPRRSSLYYVNLVAIRV-GRRIVDIPPEALAFNANTGAGTVFDSGTVFTRLVEPAYN 329

Query: 375 ALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIP 432
           A+R+ FR   +  K+    SL  FDTC+       +  PT+   F G +V+LP  N LI 
Sbjct: 330 AVRNEFRRRIAVHKKLTVTSLGGFDTCY----TAPIVAPTITFMFSGMNVTLPPDNILIH 385

Query: 433 VDSSGTFCFAFA----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +    C A A       S L++I N+QQQ  RV++D+  SR+G A   C
Sbjct: 386 STAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELC 436


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score =  171 bits (433), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 114/358 (31%), Positives = 175/358 (48%), Gaps = 26/358 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +  +GTP + + + +DT SDV WI C+ C  C S T   F PAKS SF  V C 
Sbjct: 112 STTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNT--AFSPAKSTSFKNVSCS 169

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +P C+++ +  C  R  C + ++YG  SI   + S +T+      +     GC +     
Sbjct: 170 APQCKQVPNPTCGAR-ACSFNLTYGSSSI-AANLSQDTIRLAADPIKAFTFGCVNK---- 223

Query: 255 FVAAAGLLGLGRGRLSFP-------TQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
            VA  G +   +G L          +Q    +   FSYCL    +     S+  G ++  
Sbjct: 224 -VAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQP 282

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
           +  ++T LL NP+  + YYV LV I V G  V  +  +    +P+   G I DSGT  TR
Sbjct: 283 QRVKYTQLLRNPRRSSLYYVNLVAIRV-GRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTR 341

Query: 368 LTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPA 426
           L +P Y A+R+ FR     +         FDTC+  SG  +VKVPT+   F+G ++++PA
Sbjct: 342 LAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGVNMTMPA 397

Query: 427 TNYLIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            N ++   +  T C A A       S +++I ++QQQ  RV+ D+   R+G A   C+
Sbjct: 398 DNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  171 bits (432), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 121/324 (37%), Positives = 166/324 (51%), Gaps = 35/324 (10%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS--SGCNR 208
           +++D+GSDV W+QC PC    C+ Q DP+FDPA S ++A VPC S  C +L     GC+ 
Sbjct: 79  VIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSA 138

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-VALGCGHDNEG--LFVAAAGLLGLG 265
              C + ++YGDGS   G +S + LT     V R    GC H + G       AG L LG
Sbjct: 139 NAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 198

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTS-------AKPSSMVFGDSAVSRTARFTPLLAN 318
            G  S   QT  R+ R FSYCL   ++S         P       S VS     TPLL++
Sbjct: 199 GGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS-----TPLLSS 253

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
               TFY V L  I V G  +  +  ++F      +   +IDS T ++RL   AY ALR 
Sbjct: 254 SMAPTFYRVLLRAIIVAGRPL-AVPPAVF------SASSVIDSSTIISRLPPTAYQALRA 306

Query: 379 AFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSG 437
           AFR+  +  + AP  S+ DTC+D +G   + +P++ L F  GA V+L A   L+     G
Sbjct: 307 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----G 361

Query: 438 TFCFAFAGTMSGL--SIIGNIQQQ 459
           + C AFA T S      IGN+QQ+
Sbjct: 362 S-CLAFAPTASDRMPGFIGNVQQK 384



 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 127/284 (44%), Gaps = 51/284 (17%)

Query: 205 GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGL 264
           GC+    C + ++YGDGS   G +S + LT     V         D +GL          
Sbjct: 388 GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV---------DRQGL---------- 428

Query: 265 GRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF-------GDSAVSRTARFTPLLA 317
                  P +T  ++ R FSYC+        PSS+ F         +A+  T   TPLL+
Sbjct: 429 -------PLRTATQYGRVFSYCI-----PPSPSSLGFITLGVPPQRAALVPTFVSTPLLS 476

Query: 318 NPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
           +  +  TFY V L  I V G  +  +  ++F      +   +I S T ++RL   AY AL
Sbjct: 477 SSSMPPTFYRVLLRAIIVAGRPLP-VPPTVF------STSSVIASTTVISRLPPTAYQAL 529

Query: 377 RDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDS 435
           R AFR   +  + AP  S+ DTC+D +G   + +P++ L F  GA V+L A   L+    
Sbjct: 530 RAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 585

Query: 436 SGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            G   FA   T      IGN+QQ+   VVYD+    I F    C
Sbjct: 586 QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score =  171 bits (432), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 175/352 (49%), Gaps = 18/352 (5%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y  R+ +GTP ++++MVLDT +D  W+ C+ C  C S T        S ++ ++ C  
Sbjct: 95  GNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFST---NTSSTYGSLDCSM 151

Query: 196 PLCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
             C ++    C     ++C++  SYG  S        ++L      +   A GC +   G
Sbjct: 152 AQCTQVRGFSCPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVNDVIPNFAFGCINSISG 211

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFT 313
             V   GLLGLGRG LS   Q+G  ++  FSYCL    +     S+  G +   ++ R+T
Sbjct: 212 GSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKSIRYT 271

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           PLL NP   + YYV L G+SVG   V  I   L   +P    G IIDSGT +TR  +P Y
Sbjct: 272 PLLRNPHRPSLYYVNLTGVSVGRTLVP-IAPELLAFNPNTGAGTIIDSGTVITRFVQPIY 330

Query: 374 IALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
            A+RD FR   +     P  SL  FDTCF  +   E   P V LHF G ++ LP  N LI
Sbjct: 331 TAIRDEFRKQVA----GPFSSLGAFDTCF--AATNEAVAPAVTLHFTGLNLVLPMENSLI 384

Query: 432 PVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +    C A A       S L++I N+QQQ  R+++D+  SR+G A   C
Sbjct: 385 HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 179/353 (50%), Gaps = 16/353 (4%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  +  +GTP + + + +DT SDV WI C+ C  C S T   F PAKS SF  V C 
Sbjct: 96  STTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNT--AFSPAKSTSFKNVSCS 153

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD--NE 252
           +P C+++ +  C  R  C + ++YG  SI   + S +T+      +     GC +     
Sbjct: 154 APQCKQVPNPTCGAR-ACSFNLTYGSSSI-AANLSQDTIRLAADPIKAFTFGCVNKVAGG 211

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G      GLLGLGRG LS  +Q    +   FSYCL    +     S+  G ++  +  ++
Sbjct: 212 GTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKY 271

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           T LL NP+  + YYV LV I V G  V  +  +    +P+   G I DSGT  TRL +P 
Sbjct: 272 TQLLRNPRRSSLYYVNLVAIRV-GRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPV 330

Query: 373 YIALRDAFRAGAS-SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
           Y A+R+ FR     +         FDTC+  SG  +VKVPT+   F+G ++++PA N ++
Sbjct: 331 YEAVRNEFRKRVKPTTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGVNMTMPADNLML 386

Query: 432 PVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              +  T C A A       S +++I ++QQQ  RV+ D+   R+G A   C+
Sbjct: 387 HSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 135/345 (39%), Positives = 178/345 (51%), Gaps = 32/345 (9%)

Query: 153 MVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           M +DT  DV WIQC PC   +CY Q +  FDP +S + A V C S  CR L   ++GC++
Sbjct: 161 MAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCSK 220

Query: 209 RNT---CLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVA-AAGLLG 263
            N+   CLY++ Y D  +T+G + T+TLT    T       GC H   G F A A+G + 
Sbjct: 221 PNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQASGTMS 280

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS--SMVFGDSAVSRTA-RFTPLL--AN 318
           LG G  S  +QT R +   FSYC+   S +   S    V GD      A   TPL+  AN
Sbjct: 281 LGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRSAN 340

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
               T Y V L GI V G  +  +   +F      +GG ++DS   +T+L   AY ALR 
Sbjct: 341 VINPTIYVVRLQGIEVAGRRLN-VPPVVF------SGGTVMDSSAVITQLPPTAYRALRL 393

Query: 379 AFRAGASSLK-RAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSS 436
           AFR    + K RAP  +L DTCFD  G ++V VPTV L F  GA + L   + L+  DS 
Sbjct: 394 AFRNAMRAYKTRAPTGNL-DTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLL--DS- 449

Query: 437 GTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              C AFA   +   L  IGN+QQQ   V+YD+A   +GF    C
Sbjct: 450 ---CLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 124/320 (38%), Positives = 170/320 (53%), Gaps = 33/320 (10%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATV 191
           +Q  G+Y  +  +G PP  ++  +DTGSD++W++C+PC  C     P++DPA+SRS   +
Sbjct: 81  SQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKL 140

Query: 192 PCRSPLCRKLD-----SSGC-NRRNTCLYQVSYGDGS--ITVGDFSTETLTFRGTRVA-R 242
           PC S LC+ L      S  C +    C Y  +YG      T G   TET TF    VA  
Sbjct: 141 PCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANN 200

Query: 243 VALGCGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP---SS 298
           V+ G     +G  F   AGL+GLGRG LS  +Q G     +F+YCL     +A P   S+
Sbjct: 201 VSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGA---GRFAYCL-----AADPNVYST 252

Query: 299 MVFGDSAVSRTA----RFTPLLANPK--LDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           ++FG  A   T+      TPL+ NPK   DT YYV L GISVGG+ +  I    F ++  
Sbjct: 253 ILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLP-IKDGTFAINSD 311

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEV-KVP 411
           G+GGV  DSG   T L   AY  +R A     S ++R    +  DTCF  + +  V ++P
Sbjct: 312 GSGGVFFDSGAIDTSLKDAAYQVVRQAI---TSEIQRLGYDAGDDTCFVAANQQAVAQMP 368

Query: 412 TVVLHF-RGADVSLPATNYL 430
            +VLHF  GAD+SL   NYL
Sbjct: 369 PLVLHFDDGADMSLNGRNYL 388


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 127/362 (35%), Positives = 188/362 (51%), Gaps = 66/362 (18%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G +   +  GTPP+   ++LDTGS + W QC   K C  + +                  
Sbjct: 126 GNFLVDVAFGTPPQNFTLILDTGSSITWTQC---KACTVENN------------------ 164

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGL 254
                             Y ++YGD S +VG++  +T+T   + V  +   G G +N+G 
Sbjct: 165 ------------------YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGRNNKGD 206

Query: 255 FVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--R 311
           F +   G+LGLG+G+LS  +QT  +FN+ FSYCL +  +     S++FG+ A S+++  +
Sbjct: 207 FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIG---SLLFGEKATSQSSSLK 263

Query: 312 FTPLLANP---KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
           FT L+  P   +   +Y+V L  ISVG   +  I +S+F      + G IIDS T +TRL
Sbjct: 264 FTSLVNGPGTLQESGYYFVNLSDISVGNERLN-IPSSVF-----ASPGTIIDSRTVITRL 317

Query: 369 TRPAYIALRDAFRAGAS----SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVS 423
            + AY AL+ AF+   +    S  R     + DTC++LSG+ +V +P +VLHF  GADV 
Sbjct: 318 PQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVR 377

Query: 424 LPATNYLIPVDSSGTFCFAFAG----TMS-GLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           L  TN +   D S   C AFAG    TM+  L+IIGN QQ    V+YD+   RIGF   G
Sbjct: 378 LNGTNIVWGSDES-RLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNG 436

Query: 479 CA 480
           C+
Sbjct: 437 CS 438


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 142/432 (32%), Positives = 196/432 (45%), Gaps = 44/432 (10%)

Query: 87  LRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGT 146
           LR++   +  K      E   R   R   R  +  G      + +     +Y     +G 
Sbjct: 33  LRLELTHVDAKQNCTTKERMRRATERTHRRLASMAGGGGEASAPIHWNETQYIAEYLIGD 92

Query: 147 PPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS 204
           PP+    ++DTGS+++W QC+ C+   C+ Q    +DP++SR+   V C    C     +
Sbjct: 93  PPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACLLGSET 152

Query: 205 GCNRR-NTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR----VALGC---GHDNEGLFV 256
            C R    C    +YG G+I  G   TE  TF   + +     +A GC        G   
Sbjct: 153 RCARDGKACAVLTAYGAGAIG-GFLGTEVFTFGHGQSSENNVSLAFGCITASRLTPGSLD 211

Query: 257 AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF-----GDSAVSRTAR 311
            A+G++GLGRG+LS P+Q G   + KFSYCL    + A  +S +F     G S     A 
Sbjct: 212 GASGIIGLGRGKLSLPSQLG---DNKFSYCLTPYFSDAANTSTLFVGASAGLSGGGAPAT 268

Query: 312 FTPLLANPK---LDTFYYVELVGISVGGAHVRGITASLFKL---DPAGNGGVIIDSGTSV 365
             P L NP     D+FYY+ L GI+VG A +  + A+ F L    PA  GG +IDSG+  
Sbjct: 269 SVPFLKNPDDDPFDSFYYLPLTGITVGTAKLD-VPAAAFDLREVAPAKWGGTLIDSGSPF 327

Query: 366 TRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFD--LSGKTEVKVPTVVLHF---- 417
           T L   AY ALRD    + GAS +         D C      G     VP +VLHF    
Sbjct: 328 TSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLVPPLVLHFGSGG 387

Query: 418 -RGADVSLPATNYLIPVDSSGTFC---FAFAGTMSGL-----SIIGNIQQQGFRVVYDLA 468
             G DV +P  NY  PVD S T C   F+  G  S L     +IIGN  QQ   ++YDL 
Sbjct: 388 GGGGDVVVPPENYWGPVDDS-TACMVVFSSGGPNSTLPLNETTIIGNYMQQDMHLLYDLG 446

Query: 469 ASRIGFAPRGCA 480
              + F P  C+
Sbjct: 447 QGVLSFQPADCS 458


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 130/400 (32%), Positives = 198/400 (49%), Gaps = 28/400 (7%)

Query: 94  LRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVIS-----GLAQG-----SGEYFTRLG 143
            R K   ++ ES +++  ++++R +    F SS+++      +A G     +  Y  R  
Sbjct: 51  FRPKEPLSWEESVLQMQAKDKARLQ----FLSSLVARKSVVPIASGRQIVQNPTYIVRAK 106

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS 203
           +GTP + + M +DT SDV WI   PC  C   +  +F+   S ++ ++ C++  C+++  
Sbjct: 107 IGTPAQTMLMAMDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPK 163

Query: 204 SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLG 263
             C     C + ++YG GS    + S +T+T     V   + GC     G  + A GLLG
Sbjct: 164 PTCGG-GVCSFNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLG 221

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDT 323
           LGRG LS  +QT   +   FSYCL    +     S+  G     +  ++TPLL NP+  +
Sbjct: 222 LGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPS 281

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
            Y+V L+ + VG   V     S F  +P+   G I DSGT  TRL  PAYIA+RDAFR  
Sbjct: 282 LYFVNLMAVRVGRRVVDVPPGS-FTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNR 340

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF 443
                       FDTC+ +     +  PT+   F G +V+LP  N LI   +  T C A 
Sbjct: 341 VGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTGMNVTLPPDNLLIHSTAGSTTCLAM 396

Query: 444 AG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           A       S L++I N+QQQ  R++YD+  SR+G A   C
Sbjct: 397 AAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 436


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 138/383 (36%), Positives = 184/383 (48%), Gaps = 50/383 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA---PCKKC--YSQTDPVFDPAKSRSFAT 190
           G Y   L  GTPP+ +  V+DTGS  VW  C     C  C   S+  P F P  S S   
Sbjct: 75  GGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISP-FLPKHSSSSKI 133

Query: 191 VPCRSPLCR-----KLDSSGC--NRRNTCL----YQVSYGDGSITVGDFSTETLTFRGTR 239
           + C++P C       L  + C  N RN       Y + YG G+ T G   +ETL   G  
Sbjct: 134 IGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGT-TGGVALSETLHLHGLI 192

Query: 240 VARVALGCGHDNEGLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS--TSAK 295
           V    +GC      +F +   AG+ G GRG  S P+Q G     KFSYCL+      + +
Sbjct: 193 VPNFLVGC-----SVFSSRQPAGIAGFGRGPSSLPSQLGL---TKFSYCLLSHKFDDTQE 244

Query: 296 PSSMVFGDSAVS--RTA--RFTPLLANPKLD------TFYYVELVGISVGGAHVRGITAS 345
            SS+V    + S  +TA   +TPL+ NPK+        +YYV L  IS+GG  V+ I   
Sbjct: 245 SSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVK-IPYK 303

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA---PDFSLFDTCFDL 402
               D  GNGG IIDSGT+ T ++  A+  L + F +   + +RA      S    CF++
Sbjct: 304 YLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNV 363

Query: 403 SGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF----AGTMSGLS-IIGNI 456
           SG  E+++P + LHF+ GADV LP  NY   + S    CF      A   SG   I+GN 
Sbjct: 364 SGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNF 423

Query: 457 QQQGFRVVYDLAASRIGFAPRGC 479
           Q Q F V YDL   R+GF    C
Sbjct: 424 QMQNFYVEYDLQNERLGFKKESC 446


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 98/268 (36%), Positives = 145/268 (54%), Gaps = 17/268 (6%)

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           C Y ++YGDGS T G+   E L F    V     GCG +N+GLF   +GL+GLGR  LS 
Sbjct: 76  CNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSL 135

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA---RFTPLLANPKLDTFYYVE 328
            +QT   F   FSYCL         S ++ G+S+V R +    +  ++ NP+L  FY++ 
Sbjct: 136 ISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFIN 195

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           L GIS+GG  ++  +         G   +++DSGT +TRL    Y AL+  F    +   
Sbjct: 196 LTGISIGGVALQAPS--------VGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 247

Query: 389 RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATN--YLIPVDSSGTFCFAFAG 445
            AP FS+ DTCF+LS   EV +PT+ +HF G A++++  T   Y +  D+S   C A A 
Sbjct: 248 PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDAS-QVCLALAS 306

Query: 446 T--MSGLSIIGNIQQQGFRVVYDLAASR 471
                 ++I+GN QQ+  RV+YD   ++
Sbjct: 307 LEYQDEVAILGNYQQKNLRVIYDTKETK 334


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 129/366 (35%), Positives = 177/366 (48%), Gaps = 19/366 (5%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           ++ + SG A G G Y  R+ +G+P +  +MVLDT +D  W+ C  C  C S +   + P 
Sbjct: 94  AAPIASGQAFGIGSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTGCTGC-SSSSTYYSPQ 152

Query: 184 KSRSF-ATVPCRSPLCRKLDSS-GC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR 239
            S ++   V C +P C +   +  C       C +  SY  GS        ++L      
Sbjct: 153 ASTTYGGAVACYAPRCAQARGALPCPYTGSKACTFNQSYA-GSTFSATLVQDSLRLGIDT 211

Query: 240 VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
           +   A GC +   G  + A GLLGLGRG LS P+Q+ + ++  FSYCL    +S    S+
Sbjct: 212 LPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPSFQSSYFSGSL 271

Query: 300 VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
             G +   R  R TPLL NP+  + YYV L G++VG   V  +       DP    G I+
Sbjct: 272 KLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVKVP-LPIEYLAFDPNKGSGTIL 330

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHF 417
           DSGT +TR   P Y A+RD FR       + P FS   FDTCF      E   P + L F
Sbjct: 331 DSGTVITRFVGPVYSAIRDEFRNQV----KGPFFSRGGFDTCF--VKTYENLTPLIKLRF 384

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIG 473
            G DV+LP  N LI     G  C A A       S L++I N QQQ  RV++D   +R+G
Sbjct: 385 TGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRVLFDTVNNRVG 444

Query: 474 FAPRGC 479
            A   C
Sbjct: 445 IARELC 450


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 125/389 (32%), Positives = 182/389 (46%), Gaps = 48/389 (12%)

Query: 101 AFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSD 160
           AF  S  RV      R R     S  + S +   +GEY   L +GTPP  V  ++DTGSD
Sbjct: 60  AFRRSVSRV-----GRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSD 114

Query: 161 VVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS-GCNRRNTCLYQVSYG 219
           + W QC PC  CY Q  P+FDP  S ++    C +  C  L     C++   C ++ SY 
Sbjct: 115 LTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYA 174

Query: 220 DGSITVGDFSTETLTFRGTRVARV-----ALGCGHDNEGLF-VAAAGLLGLGRGRLSFPT 273
           DGS T G+ ++ETLT   T    V     A GCGH + G+F  +++G++GLG G LS  +
Sbjct: 175 DGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLIS 234

Query: 274 QTGRRFNRKFSYCLVDRSTSAKPSSMV-FGDSAVSRTARFTPLLANPKLDTFYYVELVGI 332
           Q     N  FSYCL+  ST +  SS + FG S                        + G 
Sbjct: 235 QLKSTINGLFSYCLLPVSTDSSISSRINFGASG----------------------RVSGY 272

Query: 333 SVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK---- 388
                 +R       K      G +I+DSGT+ T L +  Y  L    ++ A+S+K    
Sbjct: 273 GTVSTPLRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLE---KSVANSIKGKRV 329

Query: 389 RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMS 448
           R P+  +F  C++ +   E+  P +  HF+ A+V L   N  + +      CF  A T S
Sbjct: 330 RDPN-GIFSLCYNTTA--EINAPIITAHFKDANVELQPLNTFMRMQED-LVCFTVAPT-S 384

Query: 449 GLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
            + ++GN+ Q  F V +DL   R GF+ +
Sbjct: 385 DIGVLGNLAQVNFLVGFDLRKKR-GFSKK 412



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 62/130 (47%), Gaps = 11/130 (8%)

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK----RAPDFSLFDTCFDLSGKTEVK 409
            G +I+DSGT+ T L    Y+ L ++    A S+K    R P+  +   C++ +   ++ 
Sbjct: 417 EGNIIVDSGTTYTYLPLEFYVKLEESV---AHSIKGKRVRDPN-GISSLCYNTT-VDQID 471

Query: 410 VPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
            P +  HF+ A+V L   N  + +      CF    T S + I+GN+ Q  F V +DL  
Sbjct: 472 APIITAHFKDANVELQPWNTFLRMQED-LVCFTVLPT-SDIGILGNLAQVNFLVGFDLRK 529

Query: 470 SRIGFAPRGC 479
            R+ F    C
Sbjct: 530 KRVSFKAADC 539


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 134/371 (36%), Positives = 189/371 (50%), Gaps = 33/371 (8%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SG   G+G+YF +L VGTP +   +V DTGSD+ W++CA      S    VF P  SRS+
Sbjct: 107 SGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGA----SPPGRVFRPKTSRSW 162

Query: 189 ATVPCRSPLCRKLDS----SGCNR-RNTCLYQVSYGDGSITV-GDFSTE--TLTFRGTRV 240
           A +PC S  C KLD     + C+   + C Y   Y +GS    G   TE  T+   G +V
Sbjct: 163 APIPCSSDTC-KLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKV 221

Query: 241 AR---VALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP 296
           A+   V LGC   ++G  F +A G+L LG  ++SF TQ   RF   FSYCLVD       
Sbjct: 222 AQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNA 281

Query: 297 SS-MVFGDSAVSRT-ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
           +  + FG   V RT A  T L  +P++  FY V++  I V G  +  I A ++    A +
Sbjct: 282 TGYLAFGPGQVPRTPATQTKLFLDPEM-PFYGVKVDAIHVAGKALD-IPAEVWD---AKS 336

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLS----GKTEVKV 410
           GGVI+DSG ++T L  PAY A+  A       + +   F  F+ C++ +    G  E+ +
Sbjct: 337 GGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKV-SFPPFEHCYNWTARRPGAPEI-I 394

Query: 411 PTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDLA 468
           P + + F G A +  PA +Y+I V   G  C     G   GLS+IGNI QQ     +DL 
Sbjct: 395 PKLAVQFAGSARLEPPAKSYVIDV-KPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLK 453

Query: 469 ASRIGFAPRGC 479
             ++ F    C
Sbjct: 454 NMQVRFKQSNC 464


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 173/346 (50%), Gaps = 14/346 (4%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  R  +GTP + + M +DT SDV WI   PC  C   +  +F+   S ++ ++ C++  
Sbjct: 36  YIVRAKIGTPAQTMLMAMDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQ 92

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA 257
           C+++    C     C + ++YG GS    + S +T+T     V   + GC     G  + 
Sbjct: 93  CKQVPKPTCG-GGVCSFNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLP 150

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLA 317
           A GLLGLGRG LS  +QT   +   FSYCL    +     S+  G     +  ++TPLL 
Sbjct: 151 AQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLK 210

Query: 318 NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
           NP+  + Y+V L+ + VG   V     S F  +P+   G I DSGT  TRL  PAYIA+R
Sbjct: 211 NPRRPSLYFVNLMAVRVGRRVVDVPPGS-FTFNPSTGAGTIFDSGTVFTRLVTPAYIAVR 269

Query: 378 DAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSG 437
           DAFR              FDTC+ +     +  PT+   F G +V+LP  N LI   +  
Sbjct: 270 DAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTGMNVTLPPDNLLIHSTAGS 325

Query: 438 TFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           T C A A       S L++I N+QQQ  R++YD+  SR+G A   C
Sbjct: 326 TTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 371


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 126/361 (34%), Positives = 181/361 (50%), Gaps = 33/361 (9%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L +    +   L +G PP  VY+VLDTGSD+ WIQC PC  CY Q DP+++  KS S+  
Sbjct: 99  LIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTE 158

Query: 191 VPCRSPLCRKLDSSG-CNRRNTCLYQVSYGDGSITVGDFSTETLTFRG-----TRVARVA 244
           + C  P C  L   G C+   +CLYQ SY DGS T G  S E + F        + A+V 
Sbjct: 159 MLCNEPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVG 218

Query: 245 LGCGHDNEGLFVAA--AGLLGLGRGRLSFPTQTGR--RFNRKFSYCLVDRSTSAKPSSMV 300
            GCG  N     ++   G+LGLG G +S  +Q     + ++ F+YC  + S       +V
Sbjct: 219 FGCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLV 278

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR-GITASLFKLDPAGNGGVII 359
           FGD A       TP++    +  FYYV L+GI +G    R  I +S F+  P G+GGVII
Sbjct: 279 FGD-ATYLNGDMTPMV----IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVII 333

Query: 360 DSGTSVTRLTRPAYIALR----DAFRAG--ASSLKRAPDFSLFDTCFDLS-GKTEVKVPT 412
           DSG++++      Y  +R    D  + G   S L  +PD      CF+   G+     PT
Sbjct: 334 DSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD------CFEGKIGRDLPLFPT 387

Query: 413 VVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           +VL+     + +   + +L   D    FC  F  +  GLSIIG + QQ ++  Y+L  S 
Sbjct: 388 LVLYLESTGILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYNLELST 444

Query: 472 I 472
           +
Sbjct: 445 L 445


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 165/356 (46%), Gaps = 40/356 (11%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY  +L +GTPP  +  VLDTGS+ +W QC PC  CY+QT P+FDP+KS +F  + C + 
Sbjct: 64  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 122

Query: 197 LCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDN 251
                        ++C Y++ YG  S T G   TET+T   T      +    +GCG +N
Sbjct: 123 -----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN 171

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDSAVSR 308
            G     AG++GL RG  S  TQ G  +    SYC   + TS      +++V GD  VS 
Sbjct: 172 SGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGDGVVST 231

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
           T     +        FYY+ L  +SVG   +  +      L     G ++IDSG+++T  
Sbjct: 232 T-----VFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHAL----KGNIVIDSGSTLTYF 282

Query: 369 TRPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLP 425
                  +R A     ++++  R+     +    D+        P + +HF  GAD+ L 
Sbjct: 283 PESYCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDI-------FPVITMHFSGGADLVLD 335

Query: 426 ATNYLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             N  +  ++ G FC A    +    +I GN  Q  F V YD ++  + F P  C+
Sbjct: 336 KYNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 165/353 (46%), Gaps = 34/353 (9%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK-SRSFATVPCRSPLCRKLD 202
           +GTPP  V + L+ G++++W    P  +C+ Q  P F+P   SR      C SP      
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKFWP-- 58

Query: 203 SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--RGTRVARVALGCGHDNEGLFVA-AA 259
                   TC+Y  SYGD S+T G    +  TF   G  V  VA GCG  N G+F +   
Sbjct: 59  ------NQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNET 112

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF---------GDSAVSRTA 310
           G+ G GRG LS P+Q        FS+C     T A PS+++          G  AV  T 
Sbjct: 113 GIAGFGRGPLSLPSQLKV---GNFSHCFT-TITGAIPSTVLLDLPADLFSNGQGAVQTTP 168

Query: 311 --RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
             ++    ANP   T YY+ L GI+VG   +  +  S F L   G GG IIDSGTS+T L
Sbjct: 169 LIQYAKNEANP---TLYYLSLKGITVGSTRLP-VPESAFALT-NGTGGTIIDSGTSITSL 223

Query: 369 TRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATN 428
               Y  +RD F A         + +   TCF    + +  VP +VLHF GA + LP  N
Sbjct: 224 PPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPREN 283

Query: 429 YL--IPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           Y+  +P D+  +            +IIGN QQQ   V+YDL  + + F    C
Sbjct: 284 YVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 165/356 (46%), Gaps = 40/356 (11%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY  +L +GTPP  +  VLDTGS+ +W QC PC  CY+QT P+FDP+KS +F  + C + 
Sbjct: 58  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 116

Query: 197 LCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDN 251
                        ++C Y++ YG  S T G   TET+T   T      +    +GCG +N
Sbjct: 117 -----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN 165

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDSAVSR 308
            G     AG++GL RG  S  TQ G  +    SYC   + TS      +++V GD  VS 
Sbjct: 166 SGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGDGVVST 225

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
           T     +        FYY+ L  +SVG   +  +      L     G ++IDSG+++T  
Sbjct: 226 T-----VFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHAL----KGNIVIDSGSTLTYF 276

Query: 369 TRPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLP 425
                  +R A     ++++  R+     +    D+        P + +HF  GAD+ L 
Sbjct: 277 PESYCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDI-------FPVITMHFSGGADLVLD 329

Query: 426 ATNYLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             N  +  ++ G FC A    +    +I GN  Q  F V YD ++  + F P  C+
Sbjct: 330 KYNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 143/468 (30%), Positives = 198/468 (42%), Gaps = 65/468 (13%)

Query: 56  LPAPDAESSLSL-RLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNR 114
           LP P     L + R+   D+ S N T   L    IQR   R+ S+      A R+ P + 
Sbjct: 17  LPVPRQSYHLDIARVDASDTESLNLTDHELLRRAIQRSRDRLASI------APRLLPTS- 69

Query: 115 SRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS 174
           SR +     +  + +G     GEY  +LG+GTP       +DT SD++W QC PC KCY 
Sbjct: 70  SRNKVVVAEAPVLSAG-----GEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYK 124

Query: 175 QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR------RNTCLYQVSYGDGSITVGDF 228
           Q DPVF+P  S S+A VPC S  C +LD+  C R       + C Y  SYG  + T G  
Sbjct: 125 QLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGIL 184

Query: 229 STETLTFRGTRVARVALGCGHDNE-GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
           + + L         V  GC   +  G     +G++GLGRG LS  +Q      R+F YCL
Sbjct: 185 AVDRLAIGDDVFRGVVFGCSSSSVGGPPPQVSGVVGLGRGALSLVSQLS---VRRFMYCL 241

Query: 288 ---VDRSTSAKPSSMVFGDSAVS--RTAR---FTPLLANPKLDTFYYVELVGISVGGAHV 339
              V RS       +V G  A +  R A      P+    +  ++YY+ L GIS+G   +
Sbjct: 242 PPPVSRSA----GRLVLGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAM 297

Query: 340 RGITASLFKLDPAGNG------------------------GVIIDSGTSVTRLTRPAYIA 375
              + +       G                          G+IID  +++T L    Y  
Sbjct: 298 SFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEE 357

Query: 376 LRDAFRAGASSLKRAPDFSL-FDTCFDLSG---KTEVKVPTVVLHFRGADVSLPATNYLI 431
           + D        L R     L  D CF L      + V  P V L F G  + L      +
Sbjct: 358 MVDDLEEEI-RLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMFV 416

Query: 432 PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +SG  C    G   G+SI+GN QQQ  +V+Y+L   RI F    C
Sbjct: 417 EDRASGMMCL-MVGKTDGVSILGNYQQQNMQVMYNLRRGRITFIKTAC 463


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 182/363 (50%), Gaps = 37/363 (10%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L +    +   L +G PP  VY+VLDTGSD+ WIQC PC  CY Q DP+++  KS S+  
Sbjct: 86  LIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTE 145

Query: 191 VPCRSPLCRKLDSSG-CNRRNTCLYQVSYGDGSITVGDFSTETLTFRG-----TRVARVA 244
           + C  P C  L   G C+   +CLYQ +Y DG+ T G  S E + F        + A+V 
Sbjct: 146 MLCNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVG 205

Query: 245 LGCGHDNEGLFVAA--AGLLGLGRGRLSFPTQTGR--RFNRKFSYCLVDRSTSAKPSSMV 300
            GCG  N     +    G+LGLG G +S  +Q     + ++ F+YC  + S       +V
Sbjct: 206 FGCGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLV 265

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR-GITASLFKLDPAGNGGVII 359
           FGD A       TP++    +  FYYV L+GI +G    R  I +S F+  P G+GGVII
Sbjct: 266 FGD-ATYLNGDMTPMV----IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVII 320

Query: 360 DSGTSVTRLTRPAYIALR----DAFRAG--ASSLKRAPDFSLFDTCFDLSGKTEVKV--- 410
           DSG++++      Y  +R    D  + G   S L  +PD      CF+  GK E  +   
Sbjct: 321 DSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD------CFE--GKIERDLPLF 372

Query: 411 PTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
           PT+VL+     + +   + +L   D    FC  F  +  GLSIIG + QQ ++  Y+L  
Sbjct: 373 PTLVLYLESTGILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYNLEL 429

Query: 470 SRI 472
           S +
Sbjct: 430 STL 432


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 165/353 (46%), Gaps = 35/353 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  +L VGTPP  +   +DTGSD++W QC PC  CYSQ  P+FDP+ S +F    C    
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG-- 118

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDNE 252
                       N+C Y++ Y D + + G  +TET+T   T      +    +GCGH++ 
Sbjct: 119 ------------NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSS 166

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDSAVSRT 309
                 +G++GL  G  S  TQ G  +    SYC   + TS      +++V GD  VS T
Sbjct: 167 WFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVVSTT 226

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T   A P L   YY+ L  +SVG  HV  +  +   L+    G +IIDSGT++T   
Sbjct: 227 MFLT--TAKPGL---YYLNLDAVSVGDTHVETMGTTFHALE----GNIIIDSGTTLTYFP 277

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATN 428
                 +R+A     ++++ A        C+     T    P + +HF  GAD+ L   N
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDMLCY--YTDTIDIFPVITMHFSGGADLVLDKYN 335

Query: 429 YLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             I   + GTFC A         +I GN  Q  F V YD ++  + F+P  C+
Sbjct: 336 MYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 133/370 (35%), Positives = 179/370 (48%), Gaps = 40/370 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VGTPP+ V MVLDTGS++ W+ CAP       +   F P  S +FA VPC S  CR  
Sbjct: 89  LAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSR 148

Query: 202 D---SSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GHDNEGL 254
           D      C+  ++ C   +SY DGS + G  +T+          R A GC     D+   
Sbjct: 149 DLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAAFGCMSSAFDSSPD 208

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS------AVSR 308
            VA+AGLLG+ RG LSF +Q      R+FSYC+ DR  +     ++ G S       ++ 
Sbjct: 209 GVASAGLLGMNRGALSFVSQAS---TRRFSYCISDRDDAGV---LLLGHSDLPTFLPLNY 262

Query: 309 TARFTPLLANPKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
           T  + P L  P  D   Y V+L+GI VGG H+  I AS+   D  G G  ++DSGT  T 
Sbjct: 263 TPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLP-IPASVLAPDHTGAGQTMVDSGTQFTF 321

Query: 368 LTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDL-SGKTE--VKVPTVVLHFR 418
           L   AY AL+  F   A  L  A   P F+    FDTCF +  G++    ++P V L F 
Sbjct: 322 LLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFN 381

Query: 419 GADVSLPATNYLIPVDSS-----GTFCFAFAGTMSGLSI----IGNIQQQGFRVVYDLAA 469
           GA++++     L  V        G +C  F G    + I    IG+  Q    V YDL  
Sbjct: 382 GAEMAVAGDRLLYKVPGERRGGDGVWCLTF-GNADMVPIMAYVIGHHHQMNVWVEYDLER 440

Query: 470 SRIGFAPRGC 479
            R+G AP  C
Sbjct: 441 GRVGLAPVRC 450


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/385 (31%), Positives = 182/385 (47%), Gaps = 32/385 (8%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP--- 178
            F+  + SG   G+G+YF +  VGTP +   +V DTGSD+ W++C   +       P   
Sbjct: 94  AFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLAS 153

Query: 179 --VFDPAKSRSFATVPCRSPLCRKLD-------SSGCNRRNTCLYQVSYGDGSITVGDFS 229
             VF PA S+S+A +PC S  C+          S+G      C Y   Y D S   G   
Sbjct: 154 PRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVG 213

Query: 230 TETLTF--------RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFN 280
           T+  T         R  ++  V LGC    +G  F ++ G+L LG   +SF ++   RF 
Sbjct: 214 TDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFG 273

Query: 281 RKFSYCLVDRSTSAKPSS-MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
            +FSYCLVD       +S + FG    + +   TPLL + ++  FY V +  +SV G  +
Sbjct: 274 GRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKAL 333

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FD 397
             I A ++  D   NGG I+DSGTS+T L  PAY A+  A    +  L R P  ++  F+
Sbjct: 334 -NIPAEVW--DVKKNGGAILDSGTSLTILATPAYKAVVAAL---SKQLARVPRVTMDPFE 387

Query: 398 TCFDLSG-KTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGN 455
            C++ +  +    VP + + F G+    P T   +   + G  C     G   G+S+IGN
Sbjct: 388 YCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPGVSVIGN 447

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           I QQ     +DLA   + F    CA
Sbjct: 448 ILQQEHLWEFDLANRWLRFQESRCA 472


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 167/353 (47%), Gaps = 35/353 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  +L VGTPP  +  ++DTGS++ W QC PC  CY Q  P+FDP+KS +F    C    
Sbjct: 65  YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCDG-- 122

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDNE 252
                       ++C Y+V Y D + T+G  +TET+T   T      +    +GCGH+N 
Sbjct: 123 ------------HSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNS 170

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDSAVSRT 309
               + +G++GL  G  S  TQ G  +    SYC   + TS      +++V GD  VS T
Sbjct: 171 WFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGTSKINFGANAIVAGDGVVSTT 230

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T   A P    FYY+ L  +SVG   +  +  +   L+    G ++IDSGT++T   
Sbjct: 231 MFMT--TAKPG---FYYLNLDAVSVGNTRIETMGTTFHALE----GNIVIDSGTTLTYFP 281

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATN 428
                 +R A     ++++ A        C++    T    P + +HF  G D+ L   N
Sbjct: 282 VSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN--SDTIDIFPVITMHFSGGVDLVLDKYN 339

Query: 429 YLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             +  ++ G FC A    + +  +I GN  Q  F V YD ++  + F+P  C+
Sbjct: 340 MYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 134/398 (33%), Positives = 203/398 (51%), Gaps = 43/398 (10%)

Query: 91  RDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRY 150
           +D  RV+S+ A      R+  +  +    +GG   S+ S      G +   +G G P + 
Sbjct: 90  QDRSRVRSINA------RILGQYSTEESKDGGSPESMHS--LNEDGFFLVNVGFGKPQQN 141

Query: 151 VYMVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
           + +++DTGSD  WI+C  C    C+++  P F+P+ S S++   C       + S+  N 
Sbjct: 142 LNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSC-------IPSTKTN- 193

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRG- 267
                Y ++Y D S + G F  + +T +     +   GCG    G F +A+G+LGL +G 
Sbjct: 194 -----YTMNYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGDFGSASGVLGLAQGE 248

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--RFTPLLANPKLDTFY 325
           + S  +QT  +F +KFSYC      +    S++FG+ A+S +   +FT LL NP   + Y
Sbjct: 249 QYSLISQTASKFKKKFSYCFPHNENTR--GSLLFGEKAISASPSLKFTRLL-NPSSGSVY 305

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA--- 382
           +VEL+GISV    +  +++SLF      + G IIDSGT +T L   AY ALR AF+    
Sbjct: 306 FVELIGISVAKKRLN-VSSSLF-----ASPGTIIDSGTVITHLPTAAYEALRTAFQQEML 359

Query: 383 GASSLKRAPDFSLFDTCFDLS--GKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTF 439
              S+   P     DTC++L   G   +K+P +VLHF G  DVSL  +  L         
Sbjct: 360 HCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQA 419

Query: 440 CFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           C AFA     S ++IIGN QQ   +VVYD+   R+GF 
Sbjct: 420 CLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGFG 457


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 111/351 (31%), Positives = 163/351 (46%), Gaps = 38/351 (10%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
            Y  R G+GTP + + + +D  +D  W+ C+ C  C + + P F P +S ++ TVPC SP
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159

Query: 197 LCRKLDSSGC--NRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C ++ S  C     ++C + ++Y   +        ++L      V     GC     G 
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQ-AVLGQDSLALENNVVVSYTFGCLRVVNGN 218

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTP 314
             AAAG                 R   + +  LV             G     +  + TP
Sbjct: 219 SRAAAG---------------AHRLRPRAALLLVADQGH-------LGPIGQPKRIKTTP 256

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           LL NP   + YYV ++GI VG   V+ +  S    +P    G IID+GT  TRL  P Y 
Sbjct: 257 LLYNPHRPSLYYVNMIGIRVGSKVVQ-VPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYA 315

Query: 375 ALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPV 433
           A+RDAFR G      AP    FDTC++++    V VPTV   F GA  V+LP  N +I  
Sbjct: 316 AVRDAFR-GRVRTPVAPPLGGFDTCYNVT----VSVPTVTFMFAGAVAVTLPEENVMIHS 370

Query: 434 DSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            S G  C A A     G  + L+++ ++QQQ  RV++D+A  R+GF+   C
Sbjct: 371 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 421


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 165/353 (46%), Gaps = 35/353 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  +L VGTPP  +   +DTGSD++W QC PC  CYSQ  P+FDP+ S +F    C    
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG-- 118

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDNE 252
                       N+C Y++ Y D + + G  +TET+T   T      +    +GCGH++ 
Sbjct: 119 ------------NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSS 166

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDSAVSRT 309
                 +G++GL  G  S  TQ G  +    SYC   + TS      +++V GD  VS T
Sbjct: 167 WFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVVSTT 226

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T   A P L   YY+ L  +SVG  HV  +  +   L+    G +IIDSGT++T   
Sbjct: 227 MFLT--TAKPGL---YYLNLDAVSVGDTHVETMGTTFHALE----GNIIIDSGTTLTYFP 277

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATN 428
                 +R+A     ++++ A        C+     T    P + +HF  GAD+ L   N
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDMLCY--YTDTIDIFPVITMHFSGGADLVLDKYN 335

Query: 429 YLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             I   + GTFC A         +I GN  Q  F V YD ++  + F+P  C+
Sbjct: 336 MYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 145/414 (35%), Positives = 191/414 (46%), Gaps = 44/414 (10%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           +QR   R+  L A A S     P              S  + L +GSG+Y    G+GTP 
Sbjct: 55  VQRSRSRLSMLAARAVSNAGAAP------------GESAQTPLKKGSGDYAMSFGIGTPA 102

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
             +    DTGSD++W +C  C +C  +  P + P  S S A V C    C +L    C+ 
Sbjct: 103 TGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSN 162

Query: 209 -------RNTCLYQVSYGDG----SITVGDFSTETLTFRGTRVA--RVALGCGHDNEGLF 255
                     C Y  +YG+       T G   TET TF     A   +A GC   +EG F
Sbjct: 163 VAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGF 222

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG---DSAVSRTARF 312
              +GL+GLGRG+LS  TQ        F Y L   S  + PS + FG   D        F
Sbjct: 223 GTGSGLVGLGRGKLSLVTQLNV---EAFGYRL--SSDLSAPSPISFGSLADVTGGNGDSF 277

Query: 313 --TPLLANPKLDT--FYYVELVGISVGGAHVRGITASLFKLD-PAGNGGVIIDSGTSVTR 367
             TPLL NP +    FYYV L GISVGG  V+ I +  F  D   G GGVI DSGT++T 
Sbjct: 278 MSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ-IPSGTFSFDRSTGAGGVIFDSGTTLTM 336

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPA 426
           L  PAY  +RD   +     K  P  +  D      G +    P++VLHF  GAD+ L  
Sbjct: 337 LPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLST 396

Query: 427 TNYLIPV---DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA-SRIGFAP 476
            NYL  +   +     C++   +   L+IIGNI Q  F VV+DL+  +R+ F P
Sbjct: 397 ENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLFQP 450


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 142/446 (31%), Positives = 199/446 (44%), Gaps = 58/446 (13%)

Query: 64  SLSLRLHHVDSLS---FN--RTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGR 118
           S +  L H+DS +   FN   T  H     +QR   RV  L   + S             
Sbjct: 37  SFTAELIHIDSPNSPFFNASETTTHRLAKALQRSANRVARLNPLSNS------------- 83

Query: 119 ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP 178
            + G  +S+ SG     G Y  +L +GTPP  ++  +DTGS+V+WI C  CK C++Q+  
Sbjct: 84  -DEGVHASIFSG----DGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSS 138

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLY------QVSYGDGSITVGDFSTET 232
           +F+P  S ++   PC S  C    SS C   N CLY      Q++  +G I V   +  +
Sbjct: 139 IFNPLASSTYQDAPCDSYQCETT-SSSCQSDNVCLYSCDEKHQLNCPNGRIAVDTMTLTS 197

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
              R   +      CG+     F A  G++GLGRG LS  ++     + KFSYCL D   
Sbjct: 198 SDGRPFPLPYSDFVCGNSIYKTF-AGVGVIGLGRGALSLTSKLYHLSDGKFSYCLADY-Y 255

Query: 293 SAKPSSMVFG-DSAVSRT--ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
           S +PS + FG  S +S       +  L + +    YYV L GISVG          L+ +
Sbjct: 256 SKQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKR-----QDLYYV 310

Query: 350 D-----PAGNGGVIIDSGTSVTRLTRPAYIALRDAFR-AGASSLKRAPDFSLFDTCFDLS 403
           D     P GN  ++IDSGT  T L +  Y  L      A   + +  P  S F    D +
Sbjct: 311 DDPFAPPVGN--MLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNT 368

Query: 404 GKT--------EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSII-G 454
            K         E+K P + +HF  ADV L   N  I V +    CFAFA T  G S + G
Sbjct: 369 LKLSPCFWYYPELKFPKITIHFTDADVELSDDNSFIRV-AEDVVCFAFAATQPGQSTVYG 427

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGCA 480
           + QQ  F + YDL    + F    C+
Sbjct: 428 SWQQMNFILGYDLKRGTVSFKRTDCS 453


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 134/395 (33%), Positives = 187/395 (47%), Gaps = 47/395 (11%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP-------- 178
           + SG   G G+YF R  VGTP +   +V DTGSD+ W++C       S   P        
Sbjct: 86  LTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPG 145

Query: 179 -VFDPAKSRSFATVPCRSPLCRK---LDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETL 233
             F P  SR++A + C S  C K      + C    + C Y   Y DGS   G   TE+ 
Sbjct: 146 RAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESA 205

Query: 234 TF-------RGTRVARVALGCGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSY 285
           T        R  ++  + LGC     G  F A+ G+L LG   +SF +    RF  +FSY
Sbjct: 206 TIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSY 265

Query: 286 CLVDR-STSAKPSSMVFG-DSAVS-------------RTARFTPLLANPKLDTFYYVELV 330
           CLVD  S     S + FG + AVS               AR TPLL + ++  FY V L 
Sbjct: 266 CLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLK 325

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA 390
            ISV G  ++ I  +++ ++    GGVI+DSGTS+T L +PAY A+  A   G + L R 
Sbjct: 326 AISVAGEFLK-IPRAVWDVE--AGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRV 382

Query: 391 PDFSLFDTCFDL---SGK-TEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-A 444
                F+ C++    SGK  +V VP + +HF G A +  P  +Y+I   + G  C     
Sbjct: 383 -TMDPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDA-APGVKCIGLQE 440

Query: 445 GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           G   G+S+IGNI QQ     +D+   R+ F    C
Sbjct: 441 GPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 475


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 159/318 (50%), Gaps = 30/318 (9%)

Query: 185 SRSFATVPCRSPLCRK---LDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRGTR- 239
           S +F  V C  P+CR    +  S C   N  C Y  SYGD SIT G    +T TF     
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61

Query: 240 ----VARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRST 292
               V+ +A GCG  N GLFV+  +G+ G GRG  S P+Q   GR     FSYCL    T
Sbjct: 62  VPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKVGR-----FSYCLT-LVT 115

Query: 293 SAKPSSMVFG-----DSAVSRTA---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITA 344
            +K S ++ G     D   + T    + TP++ NP + TFYY+ L GI+VG   +     
Sbjct: 116 ESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLP-FDK 174

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD-TCFDL- 402
           S+F L   G+GG +IDSGTS+T L    +  L++   A     +      + D  CF   
Sbjct: 175 SVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEVGDRLCFRRP 234

Query: 403 SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTM-SGLSIIGNIQQQGF 461
            G  +V VP ++LH  GAD+ LP  NY +    SG  C    G   + + +IGN QQQ  
Sbjct: 235 KGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQNM 294

Query: 462 RVVYDLAASRIGFAPRGC 479
            VVYD+  +++ FAP  C
Sbjct: 295 HVVYDVENNKLLFAPAQC 312


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 145/414 (35%), Positives = 191/414 (46%), Gaps = 44/414 (10%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           +QR   R+  L A A S     P              S  + L +GSG+Y    G+GTP 
Sbjct: 55  VQRSRSRLSMLAARAVSNAGAAP------------GESAQTPLKKGSGDYAMSFGIGTPA 102

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR 208
             +    DTGSD++W +C  C +C  +  P + P  S S A V C    C +L    C+ 
Sbjct: 103 TGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSN 162

Query: 209 -------RNTCLYQVSYGDG----SITVGDFSTETLTFRGTRVA--RVALGCGHDNEGLF 255
                     C Y  +YG+       T G   TET TF     A   +A GC   +EG F
Sbjct: 163 VAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGF 222

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG---DSAVSRTARF 312
              +GL+GLGRG+LS  TQ        F Y L   S  + PS + FG   D        F
Sbjct: 223 GTGSGLVGLGRGKLSLVTQLNV---EAFGYRL--SSDLSAPSPISFGSLADVTGGNGDSF 277

Query: 313 --TPLLANPKLDT--FYYVELVGISVGGAHVRGITASLFKLD-PAGNGGVIIDSGTSVTR 367
             TPLL NP +    FYYV L GISVGG  V+ I +  F  D   G GGVI DSGT++T 
Sbjct: 278 MSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ-IPSGTFSFDRSTGAGGVIFDSGTTLTM 336

Query: 368 LTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPA 426
           L  PAY  +RD   +     K  P  +  D      G +    P++VLHF  GAD+ L  
Sbjct: 337 LPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLST 396

Query: 427 TNYLIPV---DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAA-SRIGFAP 476
            NYL  +   +     C++   +   L+IIGNI Q  F VV+DL+  +R+ F P
Sbjct: 397 ENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLFQP 450


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 119/371 (32%), Positives = 169/371 (45%), Gaps = 34/371 (9%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           GEY  +LG+GTP  Y    +DT SD+VW+QC PC  CY Q DP+F+P  S S+A VPC S
Sbjct: 86  GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSS 145

Query: 196 PLCRKLDSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDN-E 252
             C +LD   C+  +   C Y   Y   ++T G  + + L   G     V LGC   +  
Sbjct: 146 DTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVLGCSDSSVG 205

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-----DSAVS 307
           G    A+GL+GL RG LS  +Q      R+F YCL     S  P  +V G     D+  +
Sbjct: 206 GPPPQASGLVGLARGPLSLLSQLS---VRRFMYCLPP-PMSRTPGKLVLGAGAGADAVRN 261

Query: 308 RTARFTPLLANP-KLDTFYYVELVGISVGG---AHVRGITA-----------SLFKLDPA 352
            + R T  +++  +  ++YY+   G++VG      +R  T+                  A
Sbjct: 262 VSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGA 321

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLS---GKTEV 408
              G+I+D  ++++ L    Y  L D         +  P   L  D CF L    G   V
Sbjct: 322 NAYGMIVDVASTISFLEASLYDELADDLEEEIRLPRATPSTRLGLDLCFILPEGVGIDRV 381

Query: 409 KVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
            VPTV + F G  + L      +     G       G  SG+SI+GN QQQ   V+Y+L 
Sbjct: 382 YVPTVSMSFDGRWLELERDRLFL---EDGRMMCLMIGRTSGVSILGNYQQQNMHVLYNLR 438

Query: 469 ASRIGFAPRGC 479
             +I FA   C
Sbjct: 439 RGKITFAKASC 449


>gi|300078619|gb|ADJ67210.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
          Length = 84

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 77/83 (92%), Positives = 80/83 (96%)

Query: 397 DTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNI 456
           DTCFDLSGKTEVKVPTV LHFRGADVSLPA+NYLIPVDS G+FCFAFAGTMSGLSIIGNI
Sbjct: 1   DTCFDLSGKTEVKVPTVALHFRGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGNI 60

Query: 457 QQQGFRVVYDLAASRIGFAPRGC 479
           QQQGFRVVYDLA SR+GFAPRGC
Sbjct: 61  QQQGFRVVYDLAGSRVGFAPRGC 83


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/308 (35%), Positives = 156/308 (50%), Gaps = 27/308 (8%)

Query: 193 CRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-------VAL 245
           C   LC  +    C R +TC Y+ +YGDG++TVG ++TE  TF  +           +  
Sbjct: 3   CAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGF 62

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD-- 303
           GCG  N G     +G++G GR  LS  +Q      R+FSYCL   + S + S+++FG   
Sbjct: 63  GCGSVNVGSLNNGSGIVGFGRNPLSLVSQLS---IRRFSYCLTSYA-SRRQSTLLFGSLS 118

Query: 304 -----SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
                 A  R  + TPLL +P+  TFYYV   G++VG   +R I  S F L P G+GGVI
Sbjct: 119 DGVYGDATGRV-QTTPLLQSPQNPTFYYVHFTGLTVGARRLR-IPESAFALRPDGSGGVI 176

Query: 359 IDSGTSVTRLTRPAYIALRDAFR-------AGASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
           +DSGT++T L       +  AFR       A   + +    F +       S  +++ VP
Sbjct: 177 VDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVP 236

Query: 412 TVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
            +VLHF+GAD+ LP  NY++     G  C   A +    S IGN+ QQ  RV+YDL A  
Sbjct: 237 RMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAET 296

Query: 472 IGFAPRGC 479
           +  AP  C
Sbjct: 297 LSIAPARC 304


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 123/387 (31%), Positives = 173/387 (44%), Gaps = 50/387 (12%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDPV----FDPAKSRS 187
           G Y T L  GTP + ++++ DTGS +VW  C     C +C + + DP     F P  S S
Sbjct: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138

Query: 188 FATVPCRSPLCR-------KLDSSGCNRR-----NTC-LYQVSYGDGSITVGDFSTETLT 234
              V C++P C        K     CN +      TC  Y V YG GS T G   +ETL 
Sbjct: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 197

Query: 235 FRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
           F   ++    +GC   +       +G+ G GRG  S P+Q G    +KF+YCL  R    
Sbjct: 198 FPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAYCLASRKFDD 251

Query: 295 KPSS--MVFGDSAVSRTA-RFTPLLANPKLDT-----FYYVELVGISVGGAHVRGITASL 346
            P S  ++   + V  +   +TP   NP +       +YY+ +  I VG   V+ +    
Sbjct: 252 SPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVK-VPYKF 310

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD---TCFDLS 403
               P GNGG IIDSG++ T + +P    +   F    ++  RA D         CFD+S
Sbjct: 311 LVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDIS 370

Query: 404 GKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF-AFAGTMSGLS--------II 453
            +  VK P ++  F+ GA  +LP  NY   V SSG  C       M            I+
Sbjct: 371 KEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVIL 430

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G  QQQ F V YDL   R+GF  + C+
Sbjct: 431 GAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 129/398 (32%), Positives = 183/398 (45%), Gaps = 34/398 (8%)

Query: 91  RDVLRVKSL-TAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPR 149
           +D +RVK L T  ++  V   P               + SG A   G Y  R+ +GTP +
Sbjct: 66  KDPVRVKYLSTLVSQKTVSTAP---------------IASGQAFNIGNYVVRVKLGTPGQ 110

Query: 150 YVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRR 209
            ++MVLDT +D  ++ C+ C  C   +D  F P  S S+  + C  P C ++    C   
Sbjct: 111 LLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPKASTSYGPLDCSVPQCGQVRGLSCPAT 167

Query: 210 NT--CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
            T  C +  SY   S +      + L      +   + GC +   G  V A GLLGLGRG
Sbjct: 168 GTGACSFNQSYAGSSFS-ATLVQDALRLATDVIPYYSFGCVNAITGASVPAQGLLGLGRG 226

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYV 327
            LS  +Q+G  ++  FSYCL    +     S+  G     ++ R TPLL +P   + YYV
Sbjct: 227 PLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYV 286

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR--AGAS 385
              GISVG   V    +     +P    G IIDSGT +TR   P Y A+R+ FR   G +
Sbjct: 287 NFTGISVGRVLVP-FPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGT 345

Query: 386 SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAG 445
           +         FDTCF      E   P + LHF G D+ LP  N LI   +    C A A 
Sbjct: 346 TFT---SIGAFDTCF--VKTYETLAPPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAA 400

Query: 446 ----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                 S L++I N QQQ  R+++D+  +++G A   C
Sbjct: 401 APDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAREVC 438


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 125/380 (32%), Positives = 184/380 (48%), Gaps = 35/380 (9%)

Query: 127 VISGLAQGSGEYFTRLGVGTP-PRYVYMVLDTGSDVVWIQCA-PCKKCYSQTDP----VF 180
           + SG   G  +YF  + +GTP P+   +V DTGSD+ W+ C   CK C  + +P    VF
Sbjct: 108 IHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSC-PKPNPHPGRVF 166

Query: 181 DPAKSRSFATVPCRSPLCR-----KLDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLT 234
               S SF T+PC S  C+         + C   N  CL+   Y +G   +G F+ ET+T
Sbjct: 167 RANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVT 226

Query: 235 -----FRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD 289
                 +  R+  V +GC            G++GLG  + S   +    F  KFSYCLVD
Sbjct: 227 VGLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVD 286

Query: 290 R-STSAKPSSMVFGDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
             S+S   + + FGD    +  +   T LL    ++ FY V + GISVGG+ +  I++ +
Sbjct: 287 HLSSSNHKNFLSFGDIPEMKLPKMQHTELLLG-YINAFYPVNVSGISVGGSML-SISSDI 344

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA-----PDFSLFDTCFD 401
           + +   G GG+I+DSGTS+T L   AY  + DA +      K+      P+ + F  CF+
Sbjct: 345 WNV--TGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNF--CFE 400

Query: 402 LSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQ 459
             G     VP +++HF  GA    P  +Y+I V + G  C         G SI+GN+ QQ
Sbjct: 401 DKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDV-AEGIKCLGIIKADFPGSSILGNVMQQ 459

Query: 460 GFRVVYDLAASRIGFAPRGC 479
                YDL   ++GF P  C
Sbjct: 460 NHLWEYDLGRGKLGFGPSSC 479


>gi|300078594|gb|ADJ67200.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
          Length = 84

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 76/83 (91%), Positives = 79/83 (95%)

Query: 397 DTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNI 456
           DTCFDLSGKTEVKVPTV LHFRG DVSLPA+NYLIPVDS G+FCFAFAGTMSGLSIIGNI
Sbjct: 1   DTCFDLSGKTEVKVPTVALHFRGVDVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGNI 60

Query: 457 QQQGFRVVYDLAASRIGFAPRGC 479
           QQQGFRVVYDLA SR+GFAPRGC
Sbjct: 61  QQQGFRVVYDLAGSRVGFAPRGC 83


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 132/398 (33%), Positives = 199/398 (50%), Gaps = 35/398 (8%)

Query: 106 AVRVPPRNRSRGRANGGFSSS------VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGS 159
           + ++P R   R R     +SS      + SG   G+G+YF ++ VGTP +   +V DTGS
Sbjct: 53  SAQLPSRRGGRQRVAAEVASSSAVSLPMSSGAYAGTGQYFVKVLVGTPAQEFTLVADTGS 112

Query: 160 DVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD-----SSGCNRRNTCLY 214
           ++ W++CA      S    VF P  S+S+A VPC S  C KLD     ++  +  + C Y
Sbjct: 113 ELTWVKCA---GGASPPGLVFRPEASKSWAPVPCSSDTC-KLDVPFSLANCSSSASPCSY 168

Query: 215 QVSYGDGS---ITVGDFSTETLTFRGTRVAR---VALGCGHDNEGL-FVAAAGLLGLGRG 267
              Y +GS   + V    + T+   G +VA+   V LGC   ++G  F +  G+L LG  
Sbjct: 169 DYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNA 228

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSAVSRT-ARFTPLLANPKLDTFY 325
           ++SF ++   RF   FSYCLVD       +  + FG   V RT A  T L  +P +  FY
Sbjct: 229 KISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAM-PFY 287

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
            V++  + V G     I A ++  DP  +GGVI+DSGT++T L  PAY A+  A     +
Sbjct: 288 GVKVDAVHVAG-QALDIPAEVW--DPK-SGGVILDSGTTLTVLATPAYKAVVAALTKLLA 343

Query: 386 SLKRAPDFSLFDTCFDLSGKT--EVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFA 442
            + +  DF  F+ C++ +       ++P + + F G A +  PA +Y+I V   G  C  
Sbjct: 344 GVPKV-DFPPFEHCYNWTAPRPGAPEIPKLAVQFTGCARLEPPAKSYVIDV-KPGVKCIG 401

Query: 443 F-AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              G   G+S+IGNI QQ     +DL    + F P  C
Sbjct: 402 LQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 139/448 (31%), Positives = 195/448 (43%), Gaps = 52/448 (11%)

Query: 64  SLSLRLHHVDSLSF-NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGG 122
           SL L L  VD+ +  N T + L    +QR + R   +              RS G A   
Sbjct: 28  SLHLELARVDAAAAANLTDQELIRRAVQRSLDRPGIVA-------------RSGGGAADE 74

Query: 123 FSSSVISG--LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
              +V S   L  G GEY  +LG GTP  +    +DT SD+VW+QC PC  CY Q DPVF
Sbjct: 75  AGKAVASEAPLVPGGGEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVF 134

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGT 238
           +P  S S+A VPC S  C +LD   C+  +   C Y   Y    +T G  + + L   G 
Sbjct: 135 NPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGD 194

Query: 239 RVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
               V  GC   + G   A A+GL+GLGRG LS  +Q       +F YCL     S    
Sbjct: 195 VFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSV---HRFMYCL-PPPMSRTSG 250

Query: 298 SMVFG---DSAVSRTARFTPLLANP-KLDTFYYVELVGISVGG---AHVRGITA------ 344
            +V G   D+  + + R T  +++  +  ++YY+ L G++VG       R  T+      
Sbjct: 251 KLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGA 310

Query: 345 ---------SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
                     +     A   G+I+D  ++++ L    Y  L D         +  P   L
Sbjct: 311 GGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIRLPRATPSLRL 370

Query: 396 -FDTCFDLS---GKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLS 451
             D CF L    G   V VPTV L F G  + L      +   + G       G  SG+S
Sbjct: 371 GLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFV---TDGRMMCLMIGRTSGVS 427

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           I+GN Q Q  RV+++L   +I FA   C
Sbjct: 428 ILGNFQLQNMRVLFNLRRGKITFAKASC 455


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 132/412 (32%), Positives = 200/412 (48%), Gaps = 40/412 (9%)

Query: 95  RVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVIS-----GLAQG-----SGEYFTRLGV 144
           R K   ++ ES +++  ++++R +    F SS+++      +A G     +  Y  R  +
Sbjct: 52  RPKEPLSWEESVLQMQAKDKARLQ----FLSSLVARKSVVPIASGRQIVQNPTYIVRAKI 107

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK---- 200
           GTP + + M +DT SDV WI   PC  C   +  +F+   S ++ ++ C++  C++    
Sbjct: 108 GTPAQTMLMAMDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVLHL 164

Query: 201 ----LDSSGCNRRNT-----CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDN 251
               L S     + T     C + ++YG GS    + S +T+T     V   + GC    
Sbjct: 165 LSPLLTSPSVVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKA 223

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR 311
            G  + A GLLGLGRG LS  +QT   +   FSYCL    +     S+  G     +  +
Sbjct: 224 TGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIK 283

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRP 371
           +TPLL NP+  + Y+V L+ + VG   V     S F  +P+   G I DSGT  TRL  P
Sbjct: 284 YTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGS-FTFNPSTGAGTIFDSGTVFTRLVTP 342

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLI 431
           AYIA+RDAFR              FDTC+ +     +  PT+   F G +V+LP  N LI
Sbjct: 343 AYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTGMNVTLPPDNLLI 398

Query: 432 PVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +  T C A A       S L++I N+QQQ  R++YD+  SR+G A   C
Sbjct: 399 HSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 450


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 141/432 (32%), Positives = 199/432 (46%), Gaps = 75/432 (17%)

Query: 115 SRGRANGG-----FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC--- 166
           SRGR         F+  + SG   G+G+YF R  VGTP +   +V DTGSD+ W++C   
Sbjct: 59  SRGRRRAAETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRA 118

Query: 167 ---------------APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK---LDSSGC-N 207
                          AP      +T   F P KSR++A +PC S  CR+      + C  
Sbjct: 119 AAAASASPRNASSLPAPAPASPRRT---FRPDKSRTWAPIPCSSATCRESLPFSLAACAT 175

Query: 208 RRNTCLYQVSYGDGSI---TVG-DFSTETLTFRGTRVAR---VALGCGHDNEGL-FVAAA 259
             N C Y   Y DGS    TVG D +T  L+ R  R A+   V LGC     G  F+A+ 
Sbjct: 176 PANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASD 235

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSAV------------ 306
           G+L LG   +SF ++   RF  +FSYCLVD       +S + FG +              
Sbjct: 236 GVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIAS 295

Query: 307 -------------SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
                        +  AR TPL+ + +   FY V + G+SV G  ++ I  +++ ++   
Sbjct: 296 CKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGELLK-IPRAVWDVE--Q 352

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV--- 410
            GG I+DSGTS+T L +PAY A+  A     + L R      FD C++ +  +   V   
Sbjct: 353 GGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRV-TMDPFDYCYNWTSPSGSDVAAP 411

Query: 411 -PTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDL 467
            P + +HF G A +  PA +Y+I   + G  C     G   GLS+IGNI QQ     YDL
Sbjct: 412 LPMLAVHFAGSARLEPPAKSYVIDA-APGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDL 470

Query: 468 AASRIGFAPRGC 479
              R+ F    C
Sbjct: 471 KNRRLRFKRSRC 482


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 144/410 (35%), Positives = 191/410 (46%), Gaps = 55/410 (13%)

Query: 118 RANGGFSSSVISGLAQGS-------GEYFTRLGVGTP-PRYVYMVLDTGSDVVWIQCAPC 169
           R  G  S +V + LA+G+        EY   L +GTP P+ V + LDTGSD+VW QCA C
Sbjct: 73  RPAGAGSHAVTAPLARGTVGDADIDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-C 131

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCR--KLDSSGCN-RRNTCLYQVSYGDGSITVG 226
             C++Q  P FD   S++   VPC  P+C   K   SGC    NTC Y   Y D SIT G
Sbjct: 132 HVCFAQPFPTFDALASQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSG 191

Query: 227 DFSTETLTFR------------GTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPT 273
               +T TFR            G  V  V  GCG  N+G+F +  +G+ G  RG +S P+
Sbjct: 192 RIVEDTFTFRSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPS 251

Query: 274 QTGRRFNRKFSYC---LVDRSTS-----AKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
           Q       +FS+C   + D  TS       P     G  A     + TP  AN    + Y
Sbjct: 252 QLKVA---RFSHCFTAIADARTSPVFLGGAPGPDNLGAHATG-PVQSTPF-ANSN-GSLY 305

Query: 326 YVELVGISVGGAHV-RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA-- 382
           Y+ L GI+VG   +     A   K   +G+GG IIDSGT +  L  P Y +LR AF A  
Sbjct: 306 YLTLKGITVGKTRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARV 365

Query: 383 -------GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPV-- 433
                   A+  +    F    +           +P VVLH  GAD  LP  +Y++ +  
Sbjct: 366 KLPVANESAADAESTLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLE 425

Query: 434 --DSSGT-FCFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             D SG+  C    +   S L+IIGN QQQ   V YDL  +++ F P  C
Sbjct: 426 DEDGSGSGLCLVMNSAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARC 475


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 123/387 (31%), Positives = 172/387 (44%), Gaps = 50/387 (12%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDPV----FDPAKSRS 187
           G Y T L  GTP + ++++ DTGS +VW  C     C +C + + DP     F P  S S
Sbjct: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138

Query: 188 FATVPCRSPLCR-------KLDSSGCNRR-----NTC-LYQVSYGDGSITVGDFSTETLT 234
              V C++P C        K     CN +      TC  Y V YG GS T G   +ETL 
Sbjct: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 197

Query: 235 FRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA 294
           F    +    +GC   +       +G+ G GRG  S P+Q G    +KF+YCL  R    
Sbjct: 198 FPDKXIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAYCLASRKFDD 251

Query: 295 KPSS--MVFGDSAVSRTA-RFTPLLANPKLDT-----FYYVELVGISVGGAHVRGITASL 346
            P S  ++   + V  +   +TP   NP +       +YY+ +  I VG   V+ +    
Sbjct: 252 SPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVK-VPYKF 310

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD---TCFDLS 403
               P GNGG IIDSG++ T + +P    +   F    ++  RA D         CFD+S
Sbjct: 311 LVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDIS 370

Query: 404 GKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF-AFAGTMSGLS--------II 453
            +  VK P ++  F+ GA  +LP  NY   V SSG  C       M            I+
Sbjct: 371 KEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVIL 430

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G  QQQ F V YDL   R+GF  + C+
Sbjct: 431 GAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 122/375 (32%), Positives = 181/375 (48%), Gaps = 50/375 (13%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSF 188
           SG     GEYF  + VG+P +  ++V+DTGS+  W+ C                  S+SF
Sbjct: 104 SGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------------SKSF 145

Query: 189 ATVPCRSPLCRKLDSSG------CNR-RNTCLYQVSYGDGSITVGDFSTETLTF-----R 236
             V C S  C K+D S       C +  + CLY +SY DGS   G F T+++T      +
Sbjct: 146 EAVTCASRKC-KVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGK 204

Query: 237 GTRVARVALGCGH---DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD---- 289
             ++  + +GC     +         G+LGLG  + SF  +   ++  KFSYCLVD    
Sbjct: 205 QGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSH 264

Query: 290 RSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
           RS S+  +     ++ +    R T L+  P    FY V +VGIS+GG  ++ I   ++  
Sbjct: 265 RSVSSNLTIGGHHNAKLLGEIRRTELILFPP---FYGVNVVGISIGGQMLK-IPPQVWDF 320

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTE 407
           +    GG +IDSGT++T L  PAY A+ +A     + +KR    DF   + CFD  G  +
Sbjct: 321 N--AEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDAEGFDD 378

Query: 408 VKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVV 464
             VP +V HF  GA    P  +Y+I V +    C        + G S+IGNI QQ     
Sbjct: 379 SVVPRLVFHFAGGARFEPPVKSYIIDV-APLVKCIGIVPIDGIGGASVIGNIMQQNHLWE 437

Query: 465 YDLAASRIGFAPRGC 479
           +DL+ + +GFAP  C
Sbjct: 438 FDLSTNTVGFAPSTC 452


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 146/494 (29%), Positives = 209/494 (42%), Gaps = 70/494 (14%)

Query: 1   MEGKARNHLLLLFSFFFTA------AASLQYQTFVLNSLPTPSTLSWPESVSVSESESSL 54
           ME K    ++L+FS  +          + Q     LN +P  S  S              
Sbjct: 1   MEAKLATTIILIFSVIWLMRVNGIDPCASQADNSDLNVIPIYSKCS-------------- 46

Query: 55  PLPAPDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSL-TAFAESAVRVPPRN 113
           P   P ++SS   R+ ++ S                +D LR K L T   +  V   P  
Sbjct: 47  PFKPPKSDSSWDNRIINMAS----------------KDPLRFKYLSTLVGQKTVSTAP-- 88

Query: 114 RSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCY 173
                        + SG     G Y  R+ +GTP + ++MVLDT +D  ++ C+ C  C 
Sbjct: 89  -------------IASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC- 134

Query: 174 SQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTE 231
             +D  F P  S S+  + C  P C ++    C    T  C +  SY   S +      +
Sbjct: 135 --SDTTFSPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQD 191

Query: 232 TLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
           +L      +   + GC +   G  V A GLLGLGRG LS  +Q+G  ++  FSYCL    
Sbjct: 192 SLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFK 251

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           +     S+  G     ++ R TPLL +P   + YYV   GISVG   V    +     +P
Sbjct: 252 SYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVP-FPSEYLGFNP 310

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFR--AGASSLKRAPDFSLFDTCFDLSGKTEVK 409
               G IIDSGT +TR   P Y A+R+ FR   G ++         FDTCF      E  
Sbjct: 311 NTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGTTFT---SIGAFDTCF--VKTYETL 365

Query: 410 VPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQGFRVVY 465
            P + LHF G D+ LP  N LI   +    C A A       S L++I N QQQ  R+++
Sbjct: 366 APPITLHFEGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILF 425

Query: 466 DLAASRIGFAPRGC 479
           D   +++G A   C
Sbjct: 426 DTVNNKVGIAREVC 439


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 126/365 (34%), Positives = 174/365 (47%), Gaps = 20/365 (5%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+ + SG A   G Y  R+ +GTP + ++MVLDT +D  +I  + C  C + T   F P 
Sbjct: 84  SAPIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATT---FSPN 140

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
            S S+  + C  P C ++    C    +  C +  SY  GS        ++L      + 
Sbjct: 141 ASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYA-GSTYSATLVQDSLRLATDVIP 199

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
             + G  +   G  + A GLLGLGRG LS  +QTG  ++  FSYCL    +     S+  
Sbjct: 200 SYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKL 259

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           G     ++ R TPLL NP+  + Y+V L GI+VG  +V      L   D     G IIDS
Sbjct: 260 GPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVP-FPKELLAFDVNTGSGTIIDS 318

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRG 419
           GT +TR   P Y A+RD FR   +     P  SL  FDTCF      E   P + LHF  
Sbjct: 319 GTVITRFVEPVYNAVRDEFRKQVT----GPFSSLGAFDTCF--VKNYETLAPAITLHFTD 372

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGT-----MSGLSIIGNIQQQGFRVVYDLAASRIGF 474
            D+ LP  N LI   S    C A A T      + L++I N QQQ  RV++D   +++G 
Sbjct: 373 LDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKVGI 432

Query: 475 APRGC 479
           A   C
Sbjct: 433 ARELC 437


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 125/360 (34%), Positives = 175/360 (48%), Gaps = 25/360 (6%)

Query: 131 LAQGSGE-YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFA 189
           LA+ S E Y   +G+GTPP+   ++ DT SD+ W QC        Q +P+FDPAKS SFA
Sbjct: 83  LARISDEGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFA 142

Query: 190 TVPCRSPLCRKLDSSGCNR--RNTCLYQVSYGDGSITVGDFSTETLTFRGTR---VARVA 244
            V C S LC + D+ G  R    TC Y   Y       G  + E+ T             
Sbjct: 143 FVTCSSKLCTE-DNPGTKRCSNKTCRYVYPYVSVE-AAGVLAYESFTLSDNNQHICMSFG 200

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GCG   +G  + A+G+LG+    LS  +Q       KFSYCL    T  K S + FG  
Sbjct: 201 FGCGALTDGNLLGASGILGMSPAILSMVSQLAI---PKFSYCLTPY-TDRKSSPLFFGAW 256

Query: 305 A-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           A + R     P+     L  +YYV LVG+S+G   +  + A+ F L     GG ++D G 
Sbjct: 257 ADLGRYKTTGPI--QKSLTFYYYVPLVGLSLGTRRLD-VPAATFALK---QGGTVVDLGC 310

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT---EVKVPTVVLHFR-G 419
           +V +L  PA+ AL++A     +          +  CF L        V+ P +VL+F  G
Sbjct: 311 TVGQLAEPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGG 370

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           AD+ LP  NY     ++G  C A      G+SIIGN+QQQ F +++D+  S+  FAP  C
Sbjct: 371 ADMVLPRDNYF-QEPTAGLMCLALVPG-GGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 166/352 (47%), Gaps = 51/352 (14%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN----- 207
           +++DTGSD+ W+QC PC  CY+Q DP+FDP+ S S+A VPC +  C     +        
Sbjct: 124 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 183

Query: 208 ----------RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA 257
                     +   C Y ++YGDGS + G  +T+T+   G  V     GCG  N GL   
Sbjct: 184 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLRRP 243

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA---RFTP 314
            +          S PT +               S  A  S  + GD++  R A    +T 
Sbjct: 244 GSAA--------SSPTASPP-----------GTSGDAAGSLSLGGDTSSYRNATPVSYTR 284

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           ++A+P    FY++ + G SVGGA V                 V++DSGT +TRL    Y 
Sbjct: 285 MIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN--------VLLDSGTVITRLAPSVYR 336

Query: 375 ALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLI 431
           A+R  F  + GA     AP FSL D C++L+G  EVKVP + L    GAD+++ A   L 
Sbjct: 337 AVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLF 396

Query: 432 PVDSSGT-FCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                G+  C A A         IIGN QQ+  RVVYD   SR+GFA   C+
Sbjct: 397 MARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 129/438 (29%), Positives = 186/438 (42%), Gaps = 54/438 (12%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVI-SGLAQGSG 136
           N T   L    IQR   R+  +               +RG A     + V  + +    G
Sbjct: 41  NLTEHELLRRAIQRSRYRLAGI-------------GMARGEAASARKAVVAETPIMPAGG 87

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY  +LG+GTPP      +DT SD++W QC PC  CY Q DP+F+P  S ++A +PC S 
Sbjct: 88  EYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSD 147

Query: 197 LCRKLDSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C +LD   C   +  +C Y  +Y   + T G  + + L         VA GC   + G 
Sbjct: 148 TCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGG 207

Query: 255 F--VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG---DSAVSRT 309
                A+G++GLGRG LS  +Q      R+F+YCL     S  P  +V G   D+A + T
Sbjct: 208 APPPQASGVVGLGRGPLSLVSQLS---VRRFAYCLPP-PASRIPGKLVLGADADAARNAT 263

Query: 310 ARF-TPLLANPKLDTFYYVELVGISVGGAHV----------------------RGITASL 346
            R   P+  +P+  ++YY+ L G+ +G   +                          A+ 
Sbjct: 264 NRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATA 323

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGK 405
             +  A   G+IID  +++T L    Y  L +        L R    SL  D CF L   
Sbjct: 324 VAVGDANRYGMIIDIASTITFLEASLYDELVNDLEV-EIRLPRGTGSSLGLDLCFILPDG 382

Query: 406 T---EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGF 461
                V VP V L F G  + L           SG  C       +G +SI+GN QQQ  
Sbjct: 383 VAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNM 442

Query: 462 RVVYDLAASRIGFAPRGC 479
           +V+Y+L   R+ F    C
Sbjct: 443 QVLYNLRRGRVTFVQSPC 460


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 124/431 (28%), Positives = 194/431 (45%), Gaps = 46/431 (10%)

Query: 61  AESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRAN 120
           A  +L L L H   + ++R P H+++++ +  V R++ L A                +  
Sbjct: 26  ASPTLVLNLVHSYHI-YSRKPPHVYHIK-EASVERLEYLKA----------------KTT 67

Query: 121 GGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVF 180
           G   + +   +      +   + +G+PP    + +DT SD++WIQC PC  CY+Q+ P+F
Sbjct: 68  GDIIAHLSPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIF 127

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR---- 236
           DP++S +     CR+               +C Y + Y D + + G  + E L F     
Sbjct: 128 DPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYD 187

Query: 237 ---GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
                 +  V  GCGHDN G  +   G+LGLG G  S       RF +KFSYC       
Sbjct: 188 ESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLV----HRFGKKFSYCFGSLDDP 243

Query: 294 AKPSS-MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD-P 351
           + P + +V GD   +     TPL  +   + FYYV +  ISV G  +  I   +F  +  
Sbjct: 244 SYPHNVLVLGDDGANILGDTTPLEIH---NGFYYVTIEAISVDGI-ILPIDPRVFNRNHQ 299

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALR----DAFRAGASSLKRAPDFSLFDTCFDLSGK-- 405
            G GG IID+G S+T L   AY  L+    D F    ++   + D  +   C++ + +  
Sbjct: 300 TGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERD 359

Query: 406 -TEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRV 463
             E   P V  HF  GA++SL   +  + + S   FC A   T   L+ IG   QQ + +
Sbjct: 360 LVESGFPIVTFHFSEGAELSLDVKSLFMKL-SPNVFCLAV--TPGNLNSIGATAQQSYNI 416

Query: 464 VYDLAASRIGF 474
            YDL A  + F
Sbjct: 417 GYDLEAMEVSF 427


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 129/438 (29%), Positives = 186/438 (42%), Gaps = 54/438 (12%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVI-SGLAQGSG 136
           N T   L    IQR   R+  +               +RG A     + V  + +    G
Sbjct: 41  NLTEHELLRRAIQRSRYRLAGI-------------GMARGEAASARKAVVAETPIMPAGG 87

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY  +LG+GTPP      +DT SD++W QC PC  CY Q DP+F+P  S ++A +PC S 
Sbjct: 88  EYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSD 147

Query: 197 LCRKLDSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C +LD   C   +  +C Y  +Y   + T G  + + L         VA GC   + G 
Sbjct: 148 TCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGG 207

Query: 255 F--VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG---DSAVSRT 309
                A+G++GLGRG LS  +Q      R+F+YCL     S  P  +V G   D+A + T
Sbjct: 208 APPPQASGVVGLGRGPLSLVSQLS---VRRFAYCLPP-PASRIPGKLVLGADADAARNAT 263

Query: 310 ARF-TPLLANPKLDTFYYVELVGISVGGAHV----------------------RGITASL 346
            R   P+  +P+  ++YY+ L G+ +G   +                          A+ 
Sbjct: 264 NRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATA 323

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGK 405
             +  A   G+IID  +++T L    Y  L +        L R    SL  D CF L   
Sbjct: 324 VAVGDANRYGMIIDIASTITFLEASLYDELVNDLEV-EIRLPRGTGSSLGLDLCFILPDG 382

Query: 406 T---EVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGF 461
                V VP V L F G  + L           SG  C       +G +SI+GN QQQ  
Sbjct: 383 VAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNM 442

Query: 462 RVVYDLAASRIGFAPRGC 479
           +V+Y+L   R+ F    C
Sbjct: 443 QVLYNLRRGRVTFVQSPC 460


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 117/356 (32%), Positives = 155/356 (43%), Gaps = 47/356 (13%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           ++  +GEY  ++ +GTPP  VY + DTGSD++W QC PC  CY Q +P+FDP+KS SF  
Sbjct: 17  VSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKE 76

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           V C S  CR LD+                                  T +  +  GCGH+
Sbjct: 77  VSCESQQCRLLDTP---------------------------------TSILNIVFGCGHN 103

Query: 251 NEGLFVA-AAGLLGLGRGRLSFPTQTGRRF--NRKFSYCLVD-RSTSAKPSSMVFGDSA- 305
           N G F     GL G G   LS  +Q        RKFS CLV  R+  +  S ++FG  A 
Sbjct: 104 NSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAE 163

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           VS +   +  L      T+Y+V L GISVG       ++S      A  G V ID+GT  
Sbjct: 164 VSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPM----ATKGNVFIDAGTPP 219

Query: 366 TRLTRPAYIALRDAFR-AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSL 424
           T L R  Y  L    + A      + PD      C+     T +  P +  HF GADV L
Sbjct: 220 TLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP-QLCY--RSATLIDGPILTAHFDGADVQL 276

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              N  I     G +CFA         I GN  Q  F + +DL   ++ F    C 
Sbjct: 277 KPLNTFI-SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 331


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 124/381 (32%), Positives = 179/381 (46%), Gaps = 32/381 (8%)

Query: 111 PRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK 170
           P++ S  + +   + +V   +  G G Y     +GTPP+ +  + DTGSD++W +C    
Sbjct: 73  PQSSSASQLSNNDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGG 132

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNR----RNTCLYQVSYG---DGSI 223
                    + P  S +F  +PC   LC  L S    R       C Y+ +YG   D   
Sbjct: 133 GAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDF 192

Query: 224 TVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKF 283
           T G   +ET T  G  V  V  GC    EG +   AGL+GLGRG LS  +Q        F
Sbjct: 193 TQGFLGSETFTLGGDAVPGVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAG---TF 249

Query: 284 SYCLVDRSTSAKPSSMVFGDSAVSRTA----RFTPLLANPKLDTFYYVELVGISVGGAHV 339
            YCL   ++ A P  ++FG  A    A    + T LLA+    TFY V L  I++G A  
Sbjct: 250 MYCLTADASKASP--LLFGALATMTGAGAGVQSTGLLAST---TFYAVNLRSITIGSATT 304

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC 399
            G+               + DSGT++T L  PAY   + AF +  +SL        F+ C
Sbjct: 305 AGVGGPGGV---------VFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYGFEAC 355

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQ 458
           ++      + +P +VLHF  GAD++LP  NY++ VD  G  C+    + S LSIIGNI Q
Sbjct: 356 YEKPDSARL-IPAMVLHFDGGADMALPVANYVVEVD-DGVVCWVVQRSPS-LSIIGNIMQ 412

Query: 459 QGFRVVYDLAASRIGFAPRGC 479
             + V++D+  S + F P  C
Sbjct: 413 MNYLVLHDVRKSVLSFQPANC 433


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 130/369 (35%), Positives = 173/369 (46%), Gaps = 38/369 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCR 199
           L VGTPP+ V MVLDTGS++ W+ CAP             F P  S +FA+VPC S  CR
Sbjct: 70  LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCR 129

Query: 200 KLD---SSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GHDNE 252
             D      C+     C   +SY DGS + G  +TE  T       R A GC     D  
Sbjct: 130 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTS 189

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS-----AVS 307
              VA AGLLG+ RG LSF +Q      R+FSYC+ DR  +     ++ G S      ++
Sbjct: 190 PDGVATAGLLGMNRGALSFVSQAS---TRRFSYCISDRDDAG---VLLLGHSDLPFLPLN 243

Query: 308 RTARFTPLLANPKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
            T  + P +  P  D   Y V+L+GI VGG  +  I AS+   D  G G  ++DSGT  T
Sbjct: 244 YTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLP-IPASVLAPDHTGAGQTMVDSGTQFT 302

Query: 367 RLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDL-SGKT-EVKVPTVVLHFR 418
            L   AY AL+  F         A   P+F+    FDTCF +  G+    ++P V L F 
Sbjct: 303 FLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFN 362

Query: 419 GADVSLPATNYLIPVDSS-----GTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAAS 470
           GA +++     L  V        G +C  F    M  ++  +IG+  Q    V YDL   
Sbjct: 363 GAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERG 422

Query: 471 RIGFAPRGC 479
           R+G AP  C
Sbjct: 423 RVGLAPIRC 431


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 126/364 (34%), Positives = 174/364 (47%), Gaps = 19/364 (5%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+ + SG     G Y  R+ +GTP + ++MVLDT +D  ++  + C  C + T   F P 
Sbjct: 84  SAPIASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGCSATT---FYPN 140

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
            S SF  + C  P C ++    C    +  C +  SY  GS        ++L      + 
Sbjct: 141 VSTSFVPLDCSVPQCGQVRGLSCPATGSGACSFNQSYA-GSTFSATLVQDSLRLATDVIP 199

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
             + G  +   G  V A GLLGLGRG LS  +Q+G  ++  FSYCL    +     S+  
Sbjct: 200 SYSFGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLPSFKSYYFSGSLKL 259

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           G     ++ R TPLL NP   + YYV L  ISVG  +V  + + L   +P+   G IIDS
Sbjct: 260 GPVGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVP-LPSELLAFNPSTGAGTIIDS 318

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRG 419
           GT +TR   P Y A+RD FR   +     P  SL  FDTCF      E   P + LHF  
Sbjct: 319 GTVITRFVEPIYNAVRDEFRKQVT----GPFSSLGAFDTCF--VKNYETLAPAITLHFTD 372

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFA 475
            D+ LP  N LI   S    C A A   S     L++I N QQQ  RV++D   +++G A
Sbjct: 373 LDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTVNNKVGIA 432

Query: 476 PRGC 479
              C
Sbjct: 433 RELC 436


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 131/376 (34%), Positives = 176/376 (46%), Gaps = 47/376 (12%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKK------CYSQTDPVFDPAKSRSFATVPCRS 195
           L VGTPP+ V MVLDTGS++ W+ CA  ++        +     F P  S +FA VPC S
Sbjct: 67  LAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGS 126

Query: 196 PLCRKLD------SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC-- 247
             C   D        G +R+  C   +SY DGS + G  +T+          R A GC  
Sbjct: 127 TQCSSRDLPAPPSCDGASRQ--CHVSLSYADGSASDGALATDVFAVGEAPPLRSAFGCMS 184

Query: 248 -GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS-- 304
             +D+    VA AGLLG+ RG LSF TQ      R+FSYC+ DR  +     ++ G S  
Sbjct: 185 TAYDSSPDGVATAGLLGMNRGTLSFVTQAS---TRRFSYCISDRDDAG---VLLLGHSDL 238

Query: 305 ---AVSRTARFTPLLANPKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
               ++ T  + P L  P  D   Y V+L+GI VGG  +  I AS+   D  G G  ++D
Sbjct: 239 PFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALP-IPASVLAPDHTGAGQTMVD 297

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDLSG---KTEVKVP 411
           SGT  T L   AY AL+  F      L RA   P F+     DTCF +         ++P
Sbjct: 298 SGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLP 357

Query: 412 TVVLHFRGADVSLPATNYLIPVD-----SSGTFCFAFAGT-MSGLS--IIGNIQQQGFRV 463
            V L F GA++S+     L  V      + G +C  F    M  L+  +IG+  Q    V
Sbjct: 358 PVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWV 417

Query: 464 VYDLAASRIGFAPRGC 479
            YDL   R+G AP  C
Sbjct: 418 EYDLERGRVGLAPVKC 433


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 165/360 (45%), Gaps = 28/360 (7%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +   + +G+PP    + +DT SD++W+QC PC  CY+Q+ P+FDP++S +     CR+  
Sbjct: 85  FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-------GTRVARVALGCGHD 250
                     +  +C Y + Y DG+ + G  + E L F           +  V  GCGHD
Sbjct: 145 YSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHD 204

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSAVSRT 309
           N G  +   G+LGLG G  S       RF  KFSYC       + P + +V GD   +  
Sbjct: 205 NYGEPLVGTGILGLGYGEFSLV----HRFGTKFSYCFGSLDDPSYPHNVLVLGDDGANIL 260

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD-PAGNGGVIIDSGTSVTRL 368
              TPL      + FYYV +  ISV G  +  I   +F  +   G GG IID+G S+T L
Sbjct: 261 GDTTPL---EIYNGFYYVTIEAISVDGI-ILPIDPWVFNRNHQTGLGGTIIDTGNSLTSL 316

Query: 369 TRPAYIALRDAFRAGASSLKRAPDFSLFDT----CFDLSGK---TEVKVPTVVLHFR-GA 420
              AY  L++           A D +  D     C++ + +    E   P V  HF  GA
Sbjct: 317 VEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDGA 376

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           ++SL   +  + + S   FC A   T   ++ IG   QQ + + YDL A +I F    C 
Sbjct: 377 ELSLDVKSVFMKL-SPNVFCLAV--TPGNMNSIGATAQQSYNIGYDLEAKKISFERIDCG 433


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 130/369 (35%), Positives = 173/369 (46%), Gaps = 38/369 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCR 199
           L VGTPP+ V MVLDTGS++ W+ CAP             F P  S +FA+VPC S  CR
Sbjct: 69  LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCR 128

Query: 200 KLD---SSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GHDNE 252
             D      C+     C   +SY DGS + G  +TE  T       R A GC     D  
Sbjct: 129 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTS 188

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS-----AVS 307
              VA AGLLG+ RG LSF +Q      R+FSYC+ DR  +     ++ G S      ++
Sbjct: 189 PDGVATAGLLGMNRGALSFVSQAS---TRRFSYCISDRDDAG---VLLLGHSDLPFLPLN 242

Query: 308 RTARFTPLLANPKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
            T  + P +  P  D   Y V+L+GI VGG  +  I AS+   D  G G  ++DSGT  T
Sbjct: 243 YTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLP-IPASVLAPDHTGAGQTMVDSGTQFT 301

Query: 367 RLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDL-SGKT-EVKVPTVVLHFR 418
            L   AY AL+  F         A   P+F+    FDTCF +  G+    ++P V L F 
Sbjct: 302 FLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFN 361

Query: 419 GADVSLPATNYLIPVDSS-----GTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAAS 470
           GA +++     L  V        G +C  F    M  ++  +IG+  Q    V YDL   
Sbjct: 362 GAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERG 421

Query: 471 RIGFAPRGC 479
           R+G AP  C
Sbjct: 422 RVGLAPIRC 430


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 122/358 (34%), Positives = 173/358 (48%), Gaps = 49/358 (13%)

Query: 143 GVGTPPRYVYMVLDTGSDVVWIQCAPC--KKCYSQTDPVFDPAKSRSFATVPCRSPLCRK 200
           G  +PP  V +VLDT  DV W++C PC   +C       +DP +S +++  PC S  C++
Sbjct: 157 GSSSPP--VTVVLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQ 209

Query: 201 LD--SSGCNRRNTCLYQV-SYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFV 256
           L   ++GC+    C Y V + GD   T G +S++ LT   G RV     GC  + +G F 
Sbjct: 210 LGRYANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGDRVEGFRFGCSQNEQGSFE 269

Query: 257 AAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF--- 312
             A G++ LGRG  S   QT   +   FSYCL    T+       F    V   A +   
Sbjct: 270 NQADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTETTKG-----FFQIGVPIGASYRFV 324

Query: 313 -TPLL-----ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
            TP+L     A+    T Y   L+ I+V G  +  + A +F        G ++DS T +T
Sbjct: 325 TTPMLKERGGASAAAATLYRALLLAITVDGKELN-VPAEVFA------AGTVMDSRTIIT 377

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPA 426
           RL   AY ALR AFR      + AP     DTC+DL+G    ++P + L F G       
Sbjct: 378 RLPVTAYGALRAAFR-NRMRYRVAPPQEELDTCYDLTGVRYPRLPRIALVFDG------- 429

Query: 427 TNYLIPVDSSGTF---CFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            N ++ +D SG     C AFA     S  SI+GN+QQQ  +V++D+   RIGF    C
Sbjct: 430 -NAVVEMDRSGILLNGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 112/342 (32%), Positives = 160/342 (46%), Gaps = 78/342 (22%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC---KKCYSQTDPVFDPAKSRSFATVPC 193
           EY   +G+G+P     +V+DTGSDV W+QC PC     C++    +FDPA S ++A   C
Sbjct: 105 EYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNC 164

Query: 194 RSPLCRKL----DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH 249
            +  C +L    +++GC+ ++ C Y V YGDGS T G       T      +   LG G 
Sbjct: 165 SAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTG-------TGFQFGCSHAELGAGM 217

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
           D++       GL+GLG    S  +QT  R                               
Sbjct: 218 DDK-----TDGLIGLGGDAQSLVSQTAAR------------------------------- 241

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
                   + K+ T+Y+  L  I+VGG  + G++ S+F        G ++DSGT +TRL 
Sbjct: 242 --------SKKVPTYYFAALEDIAVGGKKL-GLSPSVFA------AGSLVDSGTVITRLP 286

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNY 429
             AY AL  AFRAG +   RA    + DTCF+ +G  +V +PTV L F G  V       
Sbjct: 287 PAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAV------- 339

Query: 430 LIPVDSSGTF---CFAFAGTMS--GLSIIGNIQQQGFRVVYD 466
            + +D+ G     C AFA T        IGN+QQ+ F V+YD
Sbjct: 340 -VDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 128/383 (33%), Positives = 181/383 (47%), Gaps = 41/383 (10%)

Query: 105 SAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
           S  +VP ++ +   +NG F+      +   +G+Y  +L +GTPP  VY ++DT SD+VW 
Sbjct: 6   SFYQVPKKSYA---SNGPFTR-----VTSNNGDYLMKLTLGTPPVDVYGLVDTDSDLVWA 57

Query: 165 QCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSIT 224
           QC PC+ CY Q +P+FDP K             C       C+    C Y  +Y D S T
Sbjct: 58  QCTPCQGCYKQKNPMFDPLKE------------CNSFFDHSCSPEKACDYVYAYADDSAT 105

Query: 225 VGDFSTETLTFRGTR----VARVALGCGHDNEGLFVAA-AGLLGLGRGRLSFPTQTGRRF 279
            G  + E  TF  T     V  +  GCGH+N G+F     GL+GLG G LS  +Q G  +
Sbjct: 106 KGMLAKEIATFSSTDGKPIVESIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLY 165

Query: 280 -NRKFSYCLVDRSTSAKPSSMV-FGD-SAVSRTARFTPLLANPKLDTFYYVELVGISVGG 336
            +++FS CLV        S  +  G+ S VS     T  L + +  T Y V L GISVG 
Sbjct: 166 GSKRFSQCLVPFHADPHTSGTISLGEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGD 225

Query: 337 AHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS--SLKRAPDFS 394
             V   ++ +        G ++IDSGT  T L +  Y  L +  +   +   +   PD  
Sbjct: 226 TFVPFNSSEMLS-----KGNIMIDSGTPETYLPQEFYDRLVEELKVQINLPPIHVDPDLG 280

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFRGADVS-LPATNYLIPVDSSGTFCFAFAGTMSGLSII 453
               C+    +T ++ P +  HF GADV  LP   ++ P D  G FCFA  GT  GL I 
Sbjct: 281 T-QLCY--KSETNLEGPILTAHFEGADVKLLPLQTFIPPKD--GVFCFAMTGTTDGLYIF 335

Query: 454 GNIQQQGFRVVYDLAASRIGFAP 476
           GN  Q    + +DL    + F P
Sbjct: 336 GNFAQSNVLIGFDLDKRIVFFKP 358


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 161/353 (45%), Gaps = 35/353 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  +L VGTPP  +  V+DTGS++ W QC PC  CY Q  P+FDP+KS +F    C    
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEKRCHD-- 437

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDNE 252
                       ++C Y+V Y D + T G  +T+T+T   T      +A   +GCG +N 
Sbjct: 438 ------------HSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNS 485

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDSAVSRT 309
               +  G +GL  G LS  TQ G  +    SYC     TS      +++V G   VS T
Sbjct: 486 WFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKINFGTNAIVGGGGVVSTT 545

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              T   A P    FYY+ L  +SVG   +  +      L+    G ++IDSGT++T   
Sbjct: 546 MFVT--TARPG---FYYLNLDAVSVGDTRIETLGTPFHALE----GNIVIDSGTTLTYFP 596

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATN 428
                 +R A      ++  A D +  D     S  TE+  P + +HF  GAD+ L   N
Sbjct: 597 ESYCNLVRQAVEHVVPAVPAA-DPTGNDLLCYYSNTTEI-FPVITMHFSGGADLVLDKYN 654

Query: 429 YLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             +   S G FC A      +  +I GN  Q  F V YD ++  + F P  C+
Sbjct: 655 MFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707



 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/339 (31%), Positives = 152/339 (44%), Gaps = 53/339 (15%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY  +L +GTPP  V  VLDTGS+++W QC PC  CY Q  P+FDP+KS +F    C +P
Sbjct: 64  EYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNTP 123

Query: 197 LCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDN 251
                        ++C Y++ Y D S T G  +TET+T   T      +    +GC  +N
Sbjct: 124 ------------DHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNN 171

Query: 252 --EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
              G   +++G++GL RG LS  +Q G                 A P     GD  VS T
Sbjct: 172 SGSGFRPSSSGIVGLSRGSLSLISQMG----------------GAYP-----GDGVVSTT 210

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
                + A       YY+ L  +SVG   +  +      L    NG ++IDSGT +T   
Sbjct: 211 -----MFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHAL----NGNIVIDSGTPLTYFP 261

Query: 370 RPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATN 428
                 +R A     ++  R  D S  D     S   E+  P + +HF  GAD+ L   N
Sbjct: 262 VSYCNLVRKAVERVVTA-DRVVDPSRNDMLCYYSNTIEI-FPVITVHFSGGADLVLDKYN 319

Query: 429 YLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYD 466
             + ++  G FC A      + ++I GN  Q  F V YD
Sbjct: 320 MYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 131/359 (36%), Positives = 181/359 (50%), Gaps = 35/359 (9%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC--APCKKCYSQTDPVFDPAKSRSFATVPC 193
           G Y     +GTPP+ +  + DTGSD++W +C  A    C  Q  P + P  S +FA +PC
Sbjct: 89  GAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPC 148

Query: 194 RSPLCRKLDS---SGCNRRNT-CLYQVSYG----DGSITVGDFSTETLTFRGTRVARVAL 245
              LC  L S   + C      C Y+ SYG    D   T G  + ET T     V  V  
Sbjct: 149 SDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAVPSVRF 208

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           GC   +EG + + +GL+GLGRG LS  +Q        F YCL   ++ A P  ++FG  A
Sbjct: 209 GCTTASEGGYGSGSGLVGLGRGPLSLVSQLNA---STFMYCLTSDASKASP--LLFGSLA 263

Query: 306 VSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
               A+   T LLA+    TFY V L  IS+G A   G+       +P    GV+ DSGT
Sbjct: 264 SLTGAQVQSTGLLAS---TTFYAVNLRSISIGSATTPGVG------EPE---GVVFDSGT 311

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK---TEVKVPTVVLHFRGA 420
           ++T L  PAY   + AF +  +SL +  D   F+ CF        +   VPT+VLHF GA
Sbjct: 312 TLTYLAEPAYSEAKAAFLS-QTSLDQVEDTDGFEACFQKPANGRLSNAAVPTMVLHFDGA 370

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           D++LP  NY++ V+  G  C+    + S LSIIGNI Q  + V++D+  S + F P  C
Sbjct: 371 DMALPVANYVVEVE-DGVVCWIVQRSPS-LSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 177/369 (47%), Gaps = 46/369 (12%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK- 200
           L VG PP+ + MVLDTGS++ W+ C       S    VF+P  S +++ VPC SP+CR  
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPICRTR 124

Query: 201 -----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DN 251
                + +S   + + C   +SY D +   G+ + ET             GC       N
Sbjct: 125 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSN 184

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA- 310
                 + GL+G+ RG LSF  Q G     KFSYC+   S S     ++ GD++ S    
Sbjct: 185 SEEDAKSTGLMGMNRGSLSFVNQLGF---SKFSYCI---SGSDSSGFLLLGDASYSWLGP 238

Query: 311 -RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
            ++TPL+      P  D   Y V+L GI V G+ +  +  S+F  D  G G  ++DSGT 
Sbjct: 239 IQYTPLVLQSTPLPYFDRVAYTVQLEGIRV-GSKILSLPKSVFVPDHTGAGQTMVDSGTQ 297

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDLSGKTEVK---VPTVVL 415
            T L  P Y AL++ F     S+ R    PDF      D C+ +   T      +P V L
Sbjct: 298 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 357

Query: 416 HFRGADVSLPATNYLIPVDSSGT------FCFAFAGT-MSGLS--IIGNIQQQGFRVVYD 466
            FRGA++S+     L  V+ +G+      +CF F  + + G+   +IG+  QQ   + +D
Sbjct: 358 MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFD 417

Query: 467 LAASRIGFA 475
           LA SR+GFA
Sbjct: 418 LAKSRVGFA 426


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 98/262 (37%), Positives = 137/262 (52%), Gaps = 16/262 (6%)

Query: 108 RVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA 167
           ++ PRN S+   N    +++ S ++    +Y   L +GTPP  +Y   DTGSD++W+QC 
Sbjct: 32  KLIPRNSSKDFFN---RNTIQSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCI 88

Query: 168 PCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVG 226
           PC  CY Q +P+FD   S +F+ + C S  C KL S+ C+  +  C Y  SY DGS T G
Sbjct: 89  PCTNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQG 148

Query: 227 DFSTETLTFRGTRVARVA-----LGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRF- 279
             + ETLT   T    VA      GCGH+N G F     G++GLGRG LS  +Q G    
Sbjct: 149 VLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSLG 208

Query: 280 NRKFSYCLVDRSTS---AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGG 336
              FS CLV  +T+   + P S   G   +      TPL++     +FY+V L+GISV  
Sbjct: 209 GNMFSQCLVPFNTNPSISSPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISVED 268

Query: 337 AHVRGITASLFKLDPAGNGGVI 358
            ++     S   L+PA  G VI
Sbjct: 269 INLPFNAGS--SLEPAAKGNVI 288


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 130/393 (33%), Positives = 190/393 (48%), Gaps = 54/393 (13%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP-----------VFDP 182
           G G+YF R  VGTP +   +V DTGSD+ W++C P K   + T+             F P
Sbjct: 91  GIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRP 150

Query: 183 AKSRSFATVPCRSPLCRK---LDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF--- 235
            KS+++A +PC S  C K      S C    + C Y   Y DGS   G   TE+ T    
Sbjct: 151 EKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALS 210

Query: 236 ----------RGTRVARVALGCGHDNEG-LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
                     +  ++  + LGC     G  F A+ G+L LG   +SF +    RF  +FS
Sbjct: 211 SSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFS 270

Query: 285 YCLVDR-STSAKPSSMVFG-DSAVSRT--------ARFTPLLANPKLDTFYYVELVGISV 334
           YCLVD  S     S + FG +SA+S          AR TPL+ + ++  FY V +  ISV
Sbjct: 271 YCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISV 330

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
            G  ++ I   ++++D  G GGVI+DSGTS+T L +PAY A+  A       L R P  +
Sbjct: 331 DGELLK-IPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAAL---GKKLARFPRVA 384

Query: 395 L--FDTCFDLSGKTEV----KVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-AGT 446
           +  F+ C++ +  +       +P + +HF G A +  P+ +Y+I   + G  C     G 
Sbjct: 385 MDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDA-APGVKCIGVQEGP 443

Query: 447 MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             G+S+IGNI QQ     +DL   R+ F    C
Sbjct: 444 WPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 117/365 (32%), Positives = 173/365 (47%), Gaps = 54/365 (14%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y++ + +G+PP+   +V+DTGSD+ W++C PC    S T   FD   S ++  + C  
Sbjct: 1   GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSST---FDRLASNTYKALTCAD 57

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV------ALGCGH 249
                             Y   YGDGS T GD S +TL   G     +        GCG 
Sbjct: 58  D-----------------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGS 100

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS--TSAKPSSMVFGDSAVS 307
             +GL     G+L L  G LSFP+Q G ++  KFSYCL+ ++   S K S MVFG++AV 
Sbjct: 101 LLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVE 160

Query: 308 ---------RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
                    +  ++TP+  +     +Y V L GISVG   +  ++ S F      +   I
Sbjct: 161 LKEPGSGKLQELQYTPIGES---SIYYTVRLDGISVGNQRLD-LSPSAFL--NGQDKPTI 214

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---FDTCFDLSGKTEVKVPTVVL 415
            DSGT++T L       + D+ +   +S+    +F      D CF +   +   +P +  
Sbjct: 215 FDSGTTLTMLPP----GVCDSIKQSLASMVSGAEFVAIKGLDACFRVPPSSGQGLPDITF 270

Query: 416 HFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGF 474
           HF  GAD     +NY+I  D     C  F  T + +SI GN+QQQ F V++D+   RIGF
Sbjct: 271 HFNGGADFVTRPSNYVI--DLGSLQCLIFVPT-NEVSIFGNLQQQDFFVLHDMDNRRIGF 327

Query: 475 APRGC 479
               C
Sbjct: 328 KETDC 332


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 177/369 (47%), Gaps = 46/369 (12%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK- 200
           L VG PP+ + MVLDTGS++ W+ C       S    VF+P  S +++ VPC SP+CR  
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPICRTR 124

Query: 201 -----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DN 251
                + +S   + + C   +SY D +   G+ + ET             GC       N
Sbjct: 125 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSN 184

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA- 310
                 + GL+G+ RG LSF  Q G     KFSYC+    +S     ++ GD++ S    
Sbjct: 185 SEEDAKSTGLMGMNRGSLSFVNQLGF---SKFSYCISGSDSSV---FLLLGDASYSWLGP 238

Query: 311 -RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
            ++TPL+      P  D   Y V+L GI V G+ +  +  S+F  D  G G  ++DSGT 
Sbjct: 239 IQYTPLVLQSTPLPYFDRVAYTVQLEGIRV-GSKILSLPKSVFVPDHTGAGQTMVDSGTQ 297

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDLSGKTEVK---VPTVVL 415
            T L  P Y AL++ F     S+ R    PDF      D C+ +   T      +P V L
Sbjct: 298 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 357

Query: 416 HFRGADVSLPATNYLIPVDSSGT------FCFAFAGT-MSGLS--IIGNIQQQGFRVVYD 466
            FRGA++S+     L  V+ +G+      +CF F  + + G+   +IG+  QQ   + +D
Sbjct: 358 MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFD 417

Query: 467 LAASRIGFA 475
           LA SR+GFA
Sbjct: 418 LAKSRVGFA 426


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 110/192 (57%), Gaps = 9/192 (4%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL-----DSSGCN 207
           +++DTGSD+ W+QC PC  CY+Q  PVF P+ S S+ ++PC S  C+ L     ++  C 
Sbjct: 158 VIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACE 217

Query: 208 RR-NTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGR 266
              + C Y V+YGDGS T G+   E L+F G  V+    GCG +N+GLF   +GL+GLGR
Sbjct: 218 SNPSNCSYAVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGLMGLGR 277

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT---ARFTPLLANPKLDT 323
             LS  +QT   F   FSYCL      A  S  +  +S+V +      +T ++ NP+L  
Sbjct: 278 SNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSN 337

Query: 324 FYYVELVGISVG 335
           FY + L GI VG
Sbjct: 338 FYMLNLTGIDVG 349


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 124/396 (31%), Positives = 172/396 (43%), Gaps = 57/396 (14%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKCYSQTDPV------FD 181
            +   G Y   L  GTPP+ +  ++DTGSD+VW  C     CK C   +         F 
Sbjct: 60  FSHSYGGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFI 119

Query: 182 PAKSRSFATVPCRSPLCRKLDSSGCNRRNTCL-----------YQVSYGDGSITVGDFST 230
           P +S S   + C++P C  +  S  N    C            Y + YG G+ T G   +
Sbjct: 120 PKESSSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGT-TGGVALS 178

Query: 231 ETLTFRGTRVARVALGCGHDNEGLFVAA--AGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           ETL           +GC      +F +   AG+ G GRG  S P+Q G     KFSYCL+
Sbjct: 179 ETLHLHSLSKPNFLVGC-----SVFSSHQPAGIAGFGRGLSSLPSQLGL---GKFSYCLL 230

Query: 289 DR---STSAKPSSMVFGDSAVSRTAR-----FTPLLANPKLDT------FYYVELVGISV 334
                  + K SS+V     +    +     +TP + NPK+D       +YY+ L  I+V
Sbjct: 231 SHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITV 290

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
           GG HV+ +          GNGGVIIDSGT+ T + R A+  L D F       +R  +  
Sbjct: 291 GGHHVK-VPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIE 349

Query: 395 L---FDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGL 450
                  CF++S    V  P + L+F+ GADV+LP  NY   V             ++G 
Sbjct: 350 DAIGLRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGP 409

Query: 451 S-------IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                   I+GN Q Q F V YDL   R+GF    C
Sbjct: 410 ERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/372 (32%), Positives = 174/372 (46%), Gaps = 45/372 (12%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK- 200
           L VGTPP+ V MVLDTGS++ W+ C    K     + VF+P  S S+  +PC SP+C+  
Sbjct: 74  LTVGTPPQSVTMVLDTGSELSWLHC----KKQQNINSVFNPHLSSSYTPIPCMSPICKTR 129

Query: 201 ----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
               L    C+  N C   VSY D +   G+ +++T    G+    +  G        N 
Sbjct: 130 TRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIFGSMDSGFSSNA 189

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-- 310
                  GL+G+ RG LSF TQ G     KFSYC+  +  S     ++FGD+        
Sbjct: 190 NEDSKTTGLMGMNRGSLSFVTQMGF---PKFSYCISGKDASG---VLLFGDATFKWLGPL 243

Query: 311 RFTPLL-ANPKLDTF----YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
           ++TPL+  N  L  F    Y V L+GI VG   ++ +   +F  D  G G  ++DSGT  
Sbjct: 244 KYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQ-VPKEIFAPDHTGAGQTMVDSGTRF 302

Query: 366 TRLTRPAYIALRDAFRA---GASSLKRAPDFSL---FDTCFDL-SGKTEVKVPTVVLHFR 418
           T L    Y ALR+ F A   G  +L   P+F      D CF +  G     VP V + F 
Sbjct: 303 TFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVTMVFE 362

Query: 419 GADVSLPATNYLIPVDSSG--------TFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDL 467
           GA++S+     L  V   G         +C  F  + + G+   +IG+  QQ   + +DL
Sbjct: 363 GAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQNVWMEFDL 422

Query: 468 AASRIGFAPRGC 479
             SR+GFA   C
Sbjct: 423 VNSRVGFADTKC 434


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 131/370 (35%), Positives = 173/370 (46%), Gaps = 42/370 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VGTPP+ V MVLDTGS++ W+ CA  +   +  D  F P  S +FA VPC S  C   
Sbjct: 65  LAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCSSR 123

Query: 202 D------SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GHDNE 252
           D          +RR  C   +SY DGS + G  +T+          R A GC    +D+ 
Sbjct: 124 DLPAPPSCDAASRR--CRVSLSYADGSASDGALATDVFAVGDAPPLRSAFGCMSAAYDSS 181

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS-RTAR 311
              VA AGLLG+ RG LSF TQ      R+FSYC+ DR  +     ++ G S +      
Sbjct: 182 PDAVATAGLLGMNRGALSFVTQAS---TRRFSYCISDRDDAG---VLLLGHSDLPFLPLN 235

Query: 312 FTPLL-ANPKLDTF----YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +TPL    P L  F    Y V+L+GI VGG  +  I  S+   D  G G  ++DSGT  T
Sbjct: 236 YTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLP-IPPSVLAPDHTGAGQTMVDSGTQFT 294

Query: 367 RLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDLSG---KTEVKVPTVVLHF 417
            L   AY A++  F      L  A   P F+    FDTCF +         ++P V L F
Sbjct: 295 FLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLF 354

Query: 418 RGADVSLPATNYLIPVD-----SSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAA 469
            GA +S+     L  V      + G +C  F    M  L+  +IG+  Q    V YDL  
Sbjct: 355 NGAQMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYDLER 414

Query: 470 SRIGFAPRGC 479
            R+G AP  C
Sbjct: 415 GRVGLAPVKC 424


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 172/369 (46%), Gaps = 33/369 (8%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV-------FDPA 183
           L    GEY     +G P   V   LDT + ++W+QC+ C    SQ +P        F  +
Sbjct: 68  LVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCN---SQCEPEKRGLTTKFLSS 124

Query: 184 KSRSFATVPCRSPLCRKLDS-SGCNRRNT-CLYQVSYGDGSITVGDFSTETLTFRGTR-- 239
           KS ++   PC S  C  L     CN  +  C Y++ YGD   T G  S+++  F  +   
Sbjct: 125 KSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGM 184

Query: 240 ---VARVALGCGHDN-EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAK 295
              V  +  GC      G   +  G +GL +  LS  +Q G +   KFSYCLV  +    
Sbjct: 185 LVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIK---KFSYCLVPFNNLGS 241

Query: 296 PSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGA--HVRGITASLFKLDPAG 353
            S M FG   V+   + TPLL  P  D  YYV+++GIS+G    H  G+       D   
Sbjct: 242 TSKMYFGSLPVTSGGQ-TPLLY-PNSDA-YYVKVLGISIGNDEPHFDGVFDVYEVRD--- 295

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD-FSLFDTCFDLSGKTEVK-VP 411
             G IID+G + + L   A+ +L   F       +R  D    F+ CF+L    +++  P
Sbjct: 296 --GWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFP 353

Query: 412 TVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
            V +HF GAD+ L   +  + ++  G FC A   + S +SI+GN Q Q + V YDL A  
Sbjct: 354 DVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQV 413

Query: 472 IGFAPRGCA 480
           I FAP  CA
Sbjct: 414 ISFAPVDCA 422


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 135/437 (30%), Positives = 208/437 (47%), Gaps = 41/437 (9%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
           ++S+ L+L H D+L             + + + R++ +    +    +  R R+   +  
Sbjct: 46  DTSVRLKLAHRDTL-------------LPKPLSRIEDVIGADQKRHSLISRKRN---STV 89

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           G    + SG+  G+ +YFT + VGTP +   +V+DTGS++ W+ C    +       VF 
Sbjct: 90  GVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFR 148

Query: 182 PAKSRSFATVPCRSPLCRK-----LDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF 235
             +S+SF TV C +  C+         + C   +T C Y   Y DGS   G F+ ET+T 
Sbjct: 149 ADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 208

Query: 236 RGT--RVARV---ALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD 289
             T  R+AR+    +GC     G  F  A G+LGL     SF +     +  KFSYCLVD
Sbjct: 209 GLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVD 268

Query: 290 RSTSAKPSS-MVFGDSAVSRTA--RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
             ++   S+ ++FG S  ++TA  R TPL    ++  FY + ++GIS+ G  +  I + +
Sbjct: 269 HLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAINVIGISL-GYDMLDIPSQV 326

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA-PDFSLFDTCFDL-SG 404
           +  D    GG I+DSGTS+T L   AY  +          LKR  P+    + CF   SG
Sbjct: 327 W--DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSG 384

Query: 405 KTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQQQGFR 462
               K+P +  H +G     P     +   + G  C  F  AGT    ++IGNI QQ + 
Sbjct: 385 FNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGT-PATNVIGNIMQQNYL 443

Query: 463 VVYDLAASRIGFAPRGC 479
             +DL AS + FAP  C
Sbjct: 444 WEFDLMASTLSFAPSAC 460


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 185/370 (50%), Gaps = 48/370 (12%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK- 200
           L VG+PP+ + MVLDTGS++ W+ C       S    VF+P  S +++ VPC SP+CR  
Sbjct: 65  LAVGSPPQNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPICRTR 120

Query: 201 -----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTET-----LTFRGTRVARVALGCGHD 250
                + +S   + + C   +SY D +   G+ + +T     +T  GT    +  G   D
Sbjct: 121 TRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTRPGTLFGCMDSGLSSD 180

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           +E     + GL+G+ RG LSF  Q G     KFSYC+   S S     ++ GD++ S   
Sbjct: 181 SEE-DAKSTGLMGMNRGSLSFVNQLGFS---KFSYCI---SGSDSSGILLLGDASYSWLG 233

Query: 311 --RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
             ++TPL+      P  D   Y V+L GI VG + +  +  S+F  D  G G  ++DSGT
Sbjct: 234 PIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVG-SKILSLPKSVFVPDHTGAGQTMVDSGT 292

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCFDLSGKTE---VKVPTVV 414
             T L  P Y AL++ F A   S+ R    P+F      D C+ +   T      +P + 
Sbjct: 293 QFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGLPVIS 352

Query: 415 LHFRGADVSLPATNYLIPVDSSGT------FCFAFAGT-MSGLS--IIGNIQQQGFRVVY 465
           L FRGA++S+     L  V+ +G+      +CF F  + + G+   +IG+  QQ   + +
Sbjct: 353 LMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEF 412

Query: 466 DLAASRIGFA 475
           DLA SR+GFA
Sbjct: 413 DLAKSRVGFA 422


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 172/362 (47%), Gaps = 20/362 (5%)

Query: 124 SSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA 183
           S+ + SG A   G Y  R+ +GTP + ++MVLDT +D  +I  + C  C + T   F P 
Sbjct: 84  SAPIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATT---FSPN 140

Query: 184 KSRSFATVPCRSPLCRKLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
            S S+  + C  P C ++    C    +  C +  SY  GS        ++L      + 
Sbjct: 141 ASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYA-GSTYSATLVQDSLRLATDVIP 199

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
             + G  +   G  + A GLLGLGRG LS  +QTG  ++  FSYCL    +     S+  
Sbjct: 200 SYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKL 259

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           G     ++ R TPLL NP+  + Y+V L GI+VG  +V      L   D     G IIDS
Sbjct: 260 GPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVP-FPKELLAFDVNTGSGTIIDS 318

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRG 419
           GT +TR   P Y A+RD FR   +     P  SL  FDTCF      E   P + LHF  
Sbjct: 319 GTVITRFVEPVYNAVRDEFRKQVT----GPFSSLGAFDTCF--VKNYETLAPAITLHFTD 372

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAGT-----MSGLSIIGNIQQQGFRVVYDLAASRIGF 474
            D+ LP  N LI   S    C A A T      + L++I N QQQ  RV++D   ++  +
Sbjct: 373 LDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKGWY 432

Query: 475 AP 476
            P
Sbjct: 433 CP 434


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 125/386 (32%), Positives = 187/386 (48%), Gaps = 34/386 (8%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC--APCKKCYSQTDP-- 178
           F+  + SG   G+G+YF R  VGTP +   +V DTGSD+ W++C  A          P  
Sbjct: 86  FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPAR 145

Query: 179 VFDPAKSRSFATVPCRSPLCRK---LDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLT 234
           VF  A S+S+A + C S  C        + C+   + C Y   Y DGS   G   T++ T
Sbjct: 146 VFRTAASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSAT 205

Query: 235 F----------------RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGR 277
                            R  ++  V LGC    +G  F ++ G+L LG   +SF ++   
Sbjct: 206 IALSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAA 265

Query: 278 RFNRKFSYCLVDRSTSAKPSS-MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGG 336
           RF  +FSYCLVD       +S + FG  A +  A+ TPLL + ++  FY V +  + V G
Sbjct: 266 RFGGRFSYCLVDHLAPRNATSYLTFGPGATAPAAQ-TPLLLDRRMTPFYAVTVDAVYVAG 324

Query: 337 AHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF 396
             +  I A ++ +D   NGG I+DSGTS+T L  PAY A+  A     + L R      F
Sbjct: 325 EAL-DIPADVWDVD--RNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRV-TMDPF 380

Query: 397 DTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIG 454
           + C++ +    +++P + +HF G A +  PA +Y+I   + G  C     G+  G+S+IG
Sbjct: 381 EYCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDA-APGVKCIGVQEGSWPGVSVIG 439

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGCA 480
           NI QQ     +DL    + F    CA
Sbjct: 440 NILQQEHLWEFDLRDRWLRFKHTRCA 465


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 166/362 (45%), Gaps = 38/362 (10%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           S  Y  R   GTP + + + +DT +D  W+ C  C  C S T P F P KS +F  V C 
Sbjct: 103 SPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGC-STTTP-FAPPKSTTFKKVGCG 160

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
           +  C+++ +  C+  + C +  +YG  S+       +T+T     V     GC     G 
Sbjct: 161 ASQCKQVRNPTCD-GSACAFNFTYGTSSV-AASLVQDTVTLATDPVPAYTFGCIQKATGS 218

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-----------VDRSTSAKPSSMVFGD 303
            +   GLLGLGRG LS   QT + +   FSYCL            D    A+P   V+  
Sbjct: 219 SLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGHXDLXPVAQPRDQVY-- 276

Query: 304 SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
                     P   NP+  + YYV LV I V G  +  I       +P    G + DSGT
Sbjct: 277 ----------PSFKNPRRSSLYYVNLVAIRV-GRRIVDIPPEALAFNPXTGAGTVFDSGT 325

Query: 364 SVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFDLSGKTEVKVPTVVLHFRGAD 421
             TRL  PAY A+R+ FR   S  K+    SL  FDTC+ +     +  PT+   F G +
Sbjct: 326 VFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTV----PIVAPTITFMFSGMN 381

Query: 422 VSLPATNYLIPVDSSGTFCFAFA----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
           V+LP  N LI   +    C A A       S L++I N+QQQ  RV++D+  SR+G A  
Sbjct: 382 VTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARE 441

Query: 478 GC 479
            C
Sbjct: 442 LC 443


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 126/393 (32%), Positives = 181/393 (46%), Gaps = 52/393 (13%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDPV----FDPA 183
           A+  G Y   L  GTP + +  V DTGS +VW+ C     C  C +S  DP     F P 
Sbjct: 84  AKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPK 143

Query: 184 KSRSFATVPCRSPLCR-----KLDSSGC--NRRNTCL----YQVSYGDGSITVGDFSTET 232
            S S   + C+SP C+      +   GC  N RN  +    Y + YG GS T G   TE 
Sbjct: 144 NSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEK 202

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS- 291
           L F    V    +GC   +       AG+ G GRG +S P+Q      ++FS+CLV R  
Sbjct: 203 LDFPDLTVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNL---KRFSHCLVSRRF 256

Query: 292 -----TSAKPSSMVFGDSAVSRTA--RFTPLLANPKLDT-----FYYVELVGISVGGAHV 339
                T+        G ++ S+T    +TP   NP +       +YY+ L  I VG  HV
Sbjct: 257 DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---F 396
           + I          G+GG I+DSG++ T + RP +  + + F +  S+  R  D       
Sbjct: 317 K-IPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGL 375

Query: 397 DTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF--------AGTM 447
             CF++SGK +V VP ++  F+ GA + LP +NY   V ++ T C           +G  
Sbjct: 376 GPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGT 435

Query: 448 SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
               I+G+ QQQ + V YDL   R GFA + C+
Sbjct: 436 GPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 130/406 (32%), Positives = 193/406 (47%), Gaps = 44/406 (10%)

Query: 93  VLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVY 152
           VL   S+TA A +A RV  R  +     GG   +V+      +  Y     +GTPP+   
Sbjct: 10  VLCFISVTARA-AAFRVHGRLLADAATEGG---AVVPIHWTQAMNYVANFTIGTPPQPAS 65

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL--DSSGCNRRN 210
            V+D   ++VW QC  C +C+ Q  P+FDP  S ++   PC +PLC  +  DS  C+  N
Sbjct: 66  AVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPSDSRNCS-GN 124

Query: 211 TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC----GHDNEGLFVAAAGLLGLGR 266
            C YQ S   G  T G   T+T    GT  A +A GC      D  G     +G++GLGR
Sbjct: 125 VCAYQASTNAGD-TGGKVGTDTFAV-GTAKASLAFGCVVASDIDTMG---GPSGIVGLGR 179

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA------VSRTARFTPLLANPK 320
              S  TQTG      FSYCL     + + S++  G SA       + +  F  +  N  
Sbjct: 180 TPWSLVTQTGV---AAFSYCLAPHD-AGRNSALFLGSSAKLAGGGKAASTPFVNISGNGN 235

Query: 321 -LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
            L  +Y V+L G+  G A        +  L P+G+  V++D+ + ++ L   AY A++ A
Sbjct: 236 DLSNYYKVQLEGLKAGDA--------MIPLPPSGS-TVLLDTFSPISFLVDGAYQAVKKA 286

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGT 438
             A   +   A     FD CF  SG +    P +V  FR GA +++PATNYL+    +GT
Sbjct: 287 VTAAVGAPPMATPVEPFDLCFPKSGASGA-APDLVFTFRGGAAMTVPATNYLLDY-KNGT 344

Query: 439 FCFAF-----AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            C A        + + LS++G++QQ+    ++DL    + F P  C
Sbjct: 345 VCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 114/361 (31%), Positives = 175/361 (48%), Gaps = 35/361 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +     +G P      ++DTGS+++W++CAPCK+C  Q  P+ DP+KS ++A++PC + +
Sbjct: 99  FLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTM 158

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDNE 252
           C    S+ CNR N C Y +SY  G  + G  +TE L F  +      V  V  GC H+N 
Sbjct: 159 CHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHEN- 217

Query: 253 GLFVAA--AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP----SSMVFGDSAV 306
           G +      G+ GLG+G  SF T+ G     KFSYCL      A P    + +VFG+ A 
Sbjct: 218 GDYKDRRFTGVFGLGKGITSFVTRMG----SKFSYCL---GNIADPHYGYNQLVFGEKA- 269

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +     TPL     ++  YYV L GISVG   +  I ++ F +        +IDSGT++T
Sbjct: 270 NFEGYSTPL---KVVNGHYYVTLEGISVGEKRLD-IDSTAFSMK-GNEKSALIDSGTALT 324

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTE-VKVPTVVLHFR-GADVSL 424
            L   A+ AL +  R     +   P +     C+  +   + +  P V  HF  GAD+ L
Sbjct: 325 WLAESAFRALDNEVRQLLDGV-LMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDL 383

Query: 425 PATNYLIPVDSSGTFCF------AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
             T  +    +    C       A+       S+IG + QQ + + YDL ++++ F    
Sbjct: 384 D-TESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRID 442

Query: 479 C 479
           C
Sbjct: 443 C 443


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 135/437 (30%), Positives = 208/437 (47%), Gaps = 41/437 (9%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
           ++S+ L+L H D+L             + + + R++ +    +    +  R R+   +  
Sbjct: 24  DTSVRLKLAHRDTL-------------LPKPLSRIEDVIGADQKRHSLISRKRN---STV 67

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFD 181
           G    + SG+  G+ +YFT + VGTP +   +V+DTGS++ W+ C    +       VF 
Sbjct: 68  GVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFR 126

Query: 182 PAKSRSFATVPCRSPLCRK-----LDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTF 235
             +S+SF TV C +  C+         + C   +T C Y   Y DGS   G F+ ET+T 
Sbjct: 127 ADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 186

Query: 236 RGT--RVARV---ALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVD 289
             T  R+AR+    +GC     G  F  A G+LGL     SF +     +  KFSYCLVD
Sbjct: 187 GLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVD 246

Query: 290 RSTSAKPSS-MVFGDSAVSRTA--RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
             ++   S+ ++FG S  ++TA  R TPL    ++  FY + ++GIS+ G  +  I + +
Sbjct: 247 HLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAINVIGISL-GYDMLDIPSQV 304

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA-PDFSLFDTCFDL-SG 404
           +  D    GG I+DSGTS+T L   AY  +          LKR  P+    + CF   SG
Sbjct: 305 W--DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSG 362

Query: 405 KTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQQQGFR 462
               K+P +  H +G     P     +   + G  C  F  AGT    ++IGNI QQ + 
Sbjct: 363 FNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGT-PATNVIGNIMQQNYL 421

Query: 463 VVYDLAASRIGFAPRGC 479
             +DL AS + FAP  C
Sbjct: 422 WEFDLMASTLSFAPSAC 438


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 177/366 (48%), Gaps = 40/366 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCR 199
           L +GTPP+   MVLDTGS + WIQC   KK  ++  P   FDP+ S +F+T+PC  P+C+
Sbjct: 101 LPIGTPPQVQPMVLDTGSQLSWIQCH--KKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158

Query: 200 ------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNE 252
                  L +S C++   C Y   Y DG+   G+   E  TF R      + LGC  ++ 
Sbjct: 159 PRIPDFTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATES- 216

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL---VDRSTSAKPSSMVFGDSAVSRT 309
                  G+LG+ RGRLSF +Q+      KFSYC+   V R       S   G +  S T
Sbjct: 217 ---TDPRGILGMNRGRLSFASQSKI---TKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNT 270

Query: 310 ARFTPLLA------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            R+  +L        P LD   Y V L GI +GG  +  I+ ++F+ D  G+G  ++DSG
Sbjct: 271 FRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKL-NISPAVFRADAGGSGQTMLDSG 329

Query: 363 TSVTRLTRPAYIALR-DAFRAGASSLKRAPDF-SLFDTCFDLSGKTEVK--VPTVVLHF- 417
           +  T L   AY  +R +  RA    +K+   +  + D CFD     E+   +  +V  F 
Sbjct: 330 SEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFD-GNAIEIGRLIGDMVFEFE 388

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTM---SGLSIIGNIQQQGFRVVYDLAASRIGF 474
           +G  + +P    L  V+  G  C   A +    +  +IIGN  QQ   V +DL   R+GF
Sbjct: 389 KGVQIVVPKERVLATVE-GGVHCIGIANSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGF 447

Query: 475 APRGCA 480
               C+
Sbjct: 448 GTADCS 453


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 119/393 (30%), Positives = 179/393 (45%), Gaps = 53/393 (13%)

Query: 113 NRSRGRANGGFSSSVISGLAQGSGEY----------FTRLGVGTPPRYVYMVLDTGSDVV 162
           NR++     G+ SS  S L QG+  Y            +L VGTPP  +   +DTGSD++
Sbjct: 388 NRAQNNFLVGYDSS--SLLLQGASPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDII 445

Query: 163 WIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGS 222
           W QC PC  CYSQ  P+FDP+KS +F    C                N+C Y++ Y D +
Sbjct: 446 WTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNG--------------NSCHYEIIYADKT 491

Query: 223 ITVGDFSTETLTFRGTR-----VARVALGCGHDN-----EGLFVAAAGLLGLGRGRLSFP 272
            + G  +TET+T   T      +A   +GCG DN      G   +++G++GL  G LS  
Sbjct: 492 YSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLI 551

Query: 273 TQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVEL 329
           +Q    +    SYC   + TS      +++V GD  V+            K + FYY+ L
Sbjct: 552 SQMDLPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIK------KDNPFYYLNL 605

Query: 330 VGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR 389
             +SV       + A+L     A +G + IDSGT++T         +R+A     +++K 
Sbjct: 606 DAVSVE----DNLIATLGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVK- 660

Query: 390 APDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMS 448
            PD    D        T    P + +HF  GAD+ L   N  +   + G FC A      
Sbjct: 661 VPDMGS-DNLLCYYSDTIDIFPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDP 719

Query: 449 GL-SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            + ++ GN  Q  F V YD +++ I F+P  C+
Sbjct: 720 SMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCS 752



 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 107/344 (31%), Positives = 160/344 (46%), Gaps = 41/344 (11%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  +L VGTPP  +   +DTGSD++W QC PC  CYSQ DP+FDP+KS +F    C    
Sbjct: 82  YLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHG-- 139

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCG-H-- 249
                        +C Y++ Y D + + G  +TET+T   T      +A   +GCG H  
Sbjct: 140 ------------KSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNT 187

Query: 250 --DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDS 304
             DN G   +++G++GL  G  S  +Q    +    SYC   + TS      +++V GD 
Sbjct: 188 DLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGTSKINFGTNAIVAGDG 247

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
            V+            K + FYY+ L  +SV    +  +         A +G ++IDSG++
Sbjct: 248 TVAADMFIK------KDNPFYYLNLDAVSVEDNRIETLGTPFH----AEDGNIVIDSGST 297

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           VT         +R A     +++ R PD S  D     S   ++  P + +HF  GAD+ 
Sbjct: 298 VTYFPVSYCNLVRKAVEQVVTAV-RVPDPSGNDMLCYFSETIDI-FPVITMHFSGGADLV 355

Query: 424 LPATNYLIPVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYD 466
           L   N  +  +S G FC A    + +  +I GN  Q  F V YD
Sbjct: 356 LDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYD 399


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 129/423 (30%), Positives = 194/423 (45%), Gaps = 49/423 (11%)

Query: 101 AFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSD 160
           AF  S  R   R  + G +   F   + SG   G G+YF R  VGTP +   +V DTGSD
Sbjct: 57  AFIASHGRRRARETAAGSSAAAFEMPLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSD 116

Query: 161 VVWIQC----APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK---LDSSGCNRRNT-C 212
           + W++C    A   +  S +   F P  SR++A + C S  C K      + C    + C
Sbjct: 117 LTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPC 176

Query: 213 LYQVSYGDGSITVGDFSTETLTF---------RGTRVARVALGCGHDNEG-LFVAAAGLL 262
            Y   Y DGS   G   TE+ T          R  ++  + LGC     G  F  + G+L
Sbjct: 177 AYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDGVL 236

Query: 263 GLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFGDSAVSRTARF--------- 312
            LG   +SF +    RF  +FSYCLVD  +    +S + FG +    ++           
Sbjct: 237 SLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASC 296

Query: 313 -------------TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
                        TPLL + ++  FY V +  +SV G  ++ I  +++ +D    GGVI+
Sbjct: 297 TAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQFLK-IPRAVWDVD--AGGGVIL 353

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT-EVKVPTVVLHFR 418
           DSGTS+T L +PAY A+  A   G + L R      F+ C++ +  + +V +P + +HF 
Sbjct: 354 DSGTSLTVLAKPAYRAVVAALSEGLAGLPRV-TMDPFEYCYNWTSPSGDVTLPKMAVHFA 412

Query: 419 G-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
           G A +  P  +Y+I   + G  C     G   G+S+IGNI QQ     +D+   R+ F  
Sbjct: 413 GAARLEPPGKSYVIDA-APGVKCIGLQEGPWPGISVIGNILQQEHLWEFDIKNRRLKFQR 471

Query: 477 RGC 479
             C
Sbjct: 472 SRC 474


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 193/386 (50%), Gaps = 33/386 (8%)

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-----APCKKCYSQT 176
            F+  + SG   G+G+YF RL VGTP +   +V DTGSD+ W++C     +      S  
Sbjct: 88  AFAMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPP 147

Query: 177 DPVFDPAKSRSFATVPCRSPLCRK---LDSSGCNR-RNTCLYQVSYGDGSITVG----DF 228
             VF PA S+S++ +PC S  C+       + C+   + C Y   Y D S   G    D 
Sbjct: 148 QRVFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDS 207

Query: 229 STETLTFR-GTRVAR---VALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKF 283
           +T +L+   GTR A+   V LGC    +G  F ++ G+L LG   +SF ++   RF  +F
Sbjct: 208 ATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRF 267

Query: 284 SYCLVDRSTSAKPSS-MVFGD----SAVSRTARFTP--LLANPKLDTFYYVELVGISVGG 336
           SYCLVD       +S + FG+         ++R TP  LL + +   FY+V +  ++V G
Sbjct: 268 SYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAG 327

Query: 337 AHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF 396
             +  I   ++  D   NGG I+DSGTS+T L  PAY A+  A     + + R  +   F
Sbjct: 328 ERLE-ILPDVW--DFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRV-NMDPF 383

Query: 397 DTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAFA-GTMSGLSIIG 454
           + C++ +G    ++P + L F G A ++ P  +Y+I   + G  C     G   G+S+IG
Sbjct: 384 EYCYNWTG-VSAEIPRMELRFAGAATLAPPGKSYVIDT-APGVKCIGVVEGAWPGVSVIG 441

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGCA 480
           NI QQ     +DLA   + F    CA
Sbjct: 442 NILQQEHLWEFDLANRWLRFKQSRCA 467


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 126/383 (32%), Positives = 175/383 (45%), Gaps = 51/383 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA------PCKKC-YSQTDPVFDPA----K 184
           G Y     +GTPP+ V +VLDTGS +VW  C        C+ C +S  DP   P     K
Sbjct: 72  GGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNK 131

Query: 185 SRSFATVPCRSPLCRKLDSS--GCNRRNTC-LYQVSYGDGSITVGDFSTETLTF-RGTRV 240
           S +  ++PCRSP C  +  S   C+    C  Y + YG GS T G   ++ L   +  R+
Sbjct: 132 SSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLNRI 190

Query: 241 ARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV 300
                GC   +        G+ G GRG  S P Q G     KFSYCLV       P S  
Sbjct: 191 PDFLFGCSLVSN---RQPEGIAGFGRGLASIPAQLGL---TKFSYCLVSHRFDDTPQS-- 242

Query: 301 FGDSAVSRTAR----------FTPLLANPKL---DTFYYVELVGISVGGAHVRGITASLF 347
            GD  + R  R          + P   +P L     +YY+ L  I VGG  V  I     
Sbjct: 243 -GDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVP-IPPRYL 300

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA---PDFSLFDTCFDLSG 404
                G+GG+I+DSG++ T + R  +  +        +  KRA    D S    C++++G
Sbjct: 301 VPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITG 360

Query: 405 KTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF------AGTMSGLSII-GNI 456
           ++EV VP +   F+ GA++ LP T+Y   V + G  C          G+ +G +II GN 
Sbjct: 361 QSEVDVPKLTFSFKGGANMDLPLTDYFSLV-TDGVVCMTVLTDPDEPGSTTGPAIILGNY 419

Query: 457 QQQGFRVVYDLAASRIGFAPRGC 479
           QQQ F + YDL   R GF P+ C
Sbjct: 420 QQQNFYIEYDLKKQRFGFKPQQC 442


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score =  154 bits (390), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 137/432 (31%), Positives = 190/432 (43%), Gaps = 81/432 (18%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC------APCKKCYSQT 176
           F+  + SG   G+G+YF R  VGTP R   +V DTGSD+ W++C      AP    Y   
Sbjct: 92  FAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPG-YGYA 150

Query: 177 DP----------------------VFDPAKSRSFATVPCRSPLCRK---LDSSGCNRRNT 211
            P                      VF P +SR++A +PC S  C        + C    +
Sbjct: 151 APASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGS 210

Query: 212 -CLYQVSYGDGSITVGDFSTETLTF-----------RGTRVARVALGCGHDNEG-LFVAA 258
            C Y   Y DGS   G   T++ T            R  ++  V LGC     G  F+A+
Sbjct: 211 PCAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLAS 270

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFG-DSAVSRT------- 309
            G+L LG   +SF ++   RF  +FSYCLVD       +S + FG + AVS +       
Sbjct: 271 DGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTAC 330

Query: 310 ---------------ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
                          AR TPLL + ++  FY V + GISV G  +R         D A  
Sbjct: 331 AGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLR---IPRLVWDVAKG 387

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT-----EVK 409
           GG I+DSGTS+T L  PAY A+  A     + L R      FD C++ +  +      V 
Sbjct: 388 GGAILDSGTSLTVLVSPAYRAVVAALNKKLAGLPRV-TMDPFDYCYNWTSPSTGEDLTVA 446

Query: 410 VPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDL 467
           +P + +HF G A +  PA +Y+I   + G  C     G   G+S+IGNI QQ     +DL
Sbjct: 447 MPELAVHFAGSARLQPPAKSYVIDA-APGVKCIGLQEGEWPGVSVIGNILQQEHLWEFDL 505

Query: 468 AASRIGFAPRGC 479
              R+ F    C
Sbjct: 506 KNRRLRFKRSRC 517


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 114/329 (34%), Positives = 164/329 (49%), Gaps = 14/329 (4%)

Query: 155 LDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLY 214
           +DT SDV WI   PC  C   +  +F+   S ++ ++ C++  C+++    C     C +
Sbjct: 1   MDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCG-GGVCSF 56

Query: 215 QVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ 274
            ++YG GS    + S +T+T     V   + GC     G  + A GLLGLGRG LS  +Q
Sbjct: 57  NLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQ 115

Query: 275 TGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISV 334
           T   +   FSYCL    +     S+  G     +  ++TPLL NP+  + Y+V L+ + V
Sbjct: 116 TQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRV 175

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
           G   V     S F  +P+   G I DSGT  TRL  PAYIA+RDAFR             
Sbjct: 176 GRRVVDVPPGS-FTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLG 234

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAG----TMSGL 450
            FDTC+ +     +  PT+   F G +V+LP  N LI   +  T C A A       S L
Sbjct: 235 GFDTCYTV----PIAAPTITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVL 290

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           ++I N+QQQ  R++YD+  SR+G A   C
Sbjct: 291 NVIANLQQQNHRLLYDVPNSRLGVARELC 319


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 126/411 (30%), Positives = 172/411 (41%), Gaps = 57/411 (13%)

Query: 116 RGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQ 175
           + R N   S +      +  G Y   L +GTPP+    VLDTGS +VW  C     C   
Sbjct: 66  KHRNNNSPSVATTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHC 125

Query: 176 TDPVFDPAKSRSF-----------------------ATVPCRSPLCRKLDSSGCNRRNTC 212
             P  DP K  +F                         V  R P C+K  S  C+   TC
Sbjct: 126 NFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSL--TC 183

Query: 213 L-YQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
             Y + YG G+ T G    + L F G  V +  +GC   +       +G+ G GRG+ S 
Sbjct: 184 PSYIIQYGLGA-TAGFLLLDNLNFPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESL 239

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSS--MVFGDSAVSRTA----RFTPLLANPKLDT-- 323
           P+Q      ++FSYCLV       P S  +V   S+   T      +TP  +NP  ++  
Sbjct: 240 PSQMNL---KRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVF 296

Query: 324 --FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY-IALRDAF 380
             +YYV L  + VGG  V+ I     +    GNGG I+DSG++ T + RP Y +  ++  
Sbjct: 297 REYYYVTLRKLIVGGVDVK-IPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFL 355

Query: 381 RAGASSLKRAPDF---SLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSS 436
           R       R  +    S    CF++SG   +  P     F+ GA +S P  NY   V  +
Sbjct: 356 RQLGKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDA 415

Query: 437 GTFCFAFAG--------TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              CF            T     I+GN QQQ F V YDL   R GF PR C
Sbjct: 416 EVLCFTVVSDGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 127/411 (30%), Positives = 174/411 (42%), Gaps = 57/411 (13%)

Query: 116 RGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC 172
           + R N   S +      +  G Y   L +GTPP+    VLDTGS +VW  C     C  C
Sbjct: 70  KHRNNNSPSVATTPAYPKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHC 129

Query: 173 -YSQTD----PVFDPAKSRSFATVPCRSPLCR--------------KLDSSGCNRRNTC- 212
            +   D    P F P  S +   + CR+P C               K +S  C+   TC 
Sbjct: 130 NFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSL--TCP 187

Query: 213 LYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFP 272
            Y + YG GS T G    + L F G  V +  +GC   +       +G+ G GRG+ S P
Sbjct: 188 AYIIQYGLGS-TAGFLLLDNLNFPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESLP 243

Query: 273 TQTGRRFNRKFSYCLVDRSTSAKPSS--MVFGDSAVSRTA---------RFTPLLANPKL 321
           +Q      ++FSYCLV       P S  +V   S+   T          R  P   NP  
Sbjct: 244 SQMNL---KRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAF 300

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF- 380
             +YY+ L  + VGG  V+ I  +  +    GNGG I+DSG++ T + RP Y  +   F 
Sbjct: 301 KEYYYLTLRKVIVGGKDVK-IPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFV 359

Query: 381 RAGASSLKRAPDF---SLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSS 436
           +    +  RA D    S    CF++SG   V  P +   F+ GA ++ P  NY   V  +
Sbjct: 360 KQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDA 419

Query: 437 GTFCFAFAG--------TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              C             T     I+GN QQQ F + YDL   R GF PR C
Sbjct: 420 EVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 129/397 (32%), Positives = 175/397 (44%), Gaps = 53/397 (13%)

Query: 128 ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDPV---- 179
           +S   +  G Y   L  GTPP+ +  + DTGS +VW  C     C +C +   DP     
Sbjct: 122 VSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISK 181

Query: 180 FDPAKSRSFATVPCRSPLCR-------KLDSSGCNRR-----NTCL-YQVSYGDGSITVG 226
           F P  S S   V CR+P C        K     CN +     ++C  Y + YG G+ T G
Sbjct: 182 FVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAG 240

Query: 227 DFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
              +ETL     RV    +GC   +       AG+ G GRG  S P+Q   R  R FS+C
Sbjct: 241 ILLSETLDLENKRVPDFLVGCSVMSVH---QPAGIAGFGRGPESLPSQM--RLKR-FSHC 294

Query: 287 LVDRSTSAKP--SSMVF-----GDSAVSRTARFTPLLANPKLDT-----FYYVELVGISV 334
           LV R     P  S +V       D + +++  + P   NP +       +YY+ L  I +
Sbjct: 295 LVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILI 354

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF- 393
           GG  V+     L   D  GNGG IIDSG++ T L +P + A+ D          RA D  
Sbjct: 355 GGKPVKFPYKYLVP-DSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVE 413

Query: 394 --SLFDTCFDLSGKTE-VKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSG 449
             S    CF++  + E  + P VVL F+ G  +SL A NYL  V   G  C       + 
Sbjct: 414 AQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAV 473

Query: 450 LS-------IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +        I+G  QQQ   V YDLA  RIGF  + C
Sbjct: 474 VGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 123/388 (31%), Positives = 174/388 (44%), Gaps = 52/388 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC----YSQTD-PVFDPAKSRS 187
           G Y   L  GTPP+    V+DTGS +VW  C     C +C      +T  P F P  S S
Sbjct: 81  GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSS 140

Query: 188 FATVPCRSP------------LCRKLDSSGCNRRNTC-LYQVSYGDGSITVGDFSTETLT 234
              + C++P             C++ DS+  N   TC  Y + YG GS T G   +ETL 
Sbjct: 141 SKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLD 199

Query: 235 FRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS 293
           F   + +    +GC   +        G+ G GR   S P+Q G    +KFSYCLV  +  
Sbjct: 200 FPNKKTIPDFLVGCSIFS---IKQPEGIAGFGRSPESLPSQLGL---KKFSYCLVSHAFD 253

Query: 294 AKPSS--MVF---GDSAVSRTA--RFTPLLANPK--LDTFYYVELVGISVGGAHVRGITA 344
             P+S  +V      S V++TA    TP L NP      +YYV L  I +G  HV+ +  
Sbjct: 254 DTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVK-VPY 312

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD---TCFD 401
                   GNGG I+DSGT+ T +  P Y  +   F    +    A +         C++
Sbjct: 313 KFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYN 372

Query: 402 LSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAG--------TMSGLSI 452
           +SG+  + VP ++  F+ GA ++LP +NY   VD SG  C                   I
Sbjct: 373 ISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVD-SGVICLTIVSDNVAGPGLGGGPAII 431

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +GN QQ+ F V +DL   + GF  + CA
Sbjct: 432 LGNYQQRNFYVEFDLENEKFGFKQQSCA 459


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 132/399 (33%), Positives = 178/399 (44%), Gaps = 79/399 (19%)

Query: 151 VYMVLDTGSDVVWIQCAP--CKKCYSQ-----TDPVFDPAKSRSFATVPCRSPLC----- 198
           V + LDTGSD+VW  CAP  C  C  +     + P+  P  SR    +PC SPLC     
Sbjct: 105 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRR---IPCASPLCSAAHA 161

Query: 199 ---------------RKLDSSGCNRRNTC--LYQVSYGDGSITVGDFSTETLTFRGTR-- 239
                            +++  C   + C  LY  +YGDGS+             G R  
Sbjct: 162 SAPPSDLCAAARCPLEDIETGSCGASHACPPLY-YAYGDGSLVAHLRRGRVALGAGARAS 220

Query: 240 ----VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA- 294
               V      C H   G  V   G+ G GRG LS P Q   + + +FSYCLV  S  A 
Sbjct: 221 VAVAVDNFTFACAHTALGEPV---GVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRAD 277

Query: 295 ---KPSSMVFGDSAVSRTAR--------FTPLLANPKLDTFYYVELVGISVGGAHVRGIT 343
              +PS ++ G S     A         +TPLL NPK   FY V L  +SVG A ++   
Sbjct: 278 RLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQA-R 336

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL-----KRAPDFSLFDT 398
             L ++D AGNGG+++DSGT+ T L    Y  + +AF    ++      +RA + +    
Sbjct: 337 PELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTP 396

Query: 399 CFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPV-----------DSSGTFCFAFAGT 446
           C+  +  ++  VP + LHFRG A V+LP  NY +             D  G       G 
Sbjct: 397 CYRYA-ASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGD 455

Query: 447 MSG------LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            SG         +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 456 ASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 168/361 (46%), Gaps = 13/361 (3%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           ++Q    Y  ++ +G+P   +Y+V DTGS + W QC PC + + Q  P+F+   SR++  
Sbjct: 84  ISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRD 143

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           +PC+   C    +    R + C+Y+++Y  GS T G  + + L            GC  D
Sbjct: 144 LPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRIPFYFGCSRD 203

Query: 251 NEGL-----FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC--LVDRSTSAKPSSMV-FG 302
           N+            G++GL    +S   Q       +FSYC  L D S+ +  +S++ FG
Sbjct: 204 NQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRFG 263

Query: 303 -DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
            D   SR    +    +P+    Y++ L+ +SV G  ++ I    F L P G GG IIDS
Sbjct: 264 NDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQ-IPPGTFALKPDGTGGTIIDS 322

Query: 362 GTSVTRLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           GT+VT +++ AY  +  AF+        +R         C+   G T    P++  HF+G
Sbjct: 323 GTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQQGHTFHNYPSMAFHFQG 382

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFAG-TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           AD  +      + V   G FC A    +    +IIG + Q   + +YD A  ++ F P  
Sbjct: 383 ADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFTPEN 442

Query: 479 C 479
           C
Sbjct: 443 C 443


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/364 (31%), Positives = 167/364 (45%), Gaps = 44/364 (12%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS-- 195
           +   + +G PP    +++DTGSD+ WI C PC KCY QT P F P++S ++    C S  
Sbjct: 78  FLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAP 136

Query: 196 ---PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVALGC 247
              P   + + +G      C Y + Y D S T G  + E LTF  +         +  GC
Sbjct: 137 HAMPQIFRDEKTG-----NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGC 191

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
           G DN G F   +G+LGLG G  S  T   R F  KFSYC    +    P +++   +   
Sbjct: 192 GQDNSG-FTKYSGVLGLGPGTFSIVT---RNFGSKFSYCFGSLTNPTYPHNILILGNGAK 247

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA------GNGGVIIDS 361
                TPL         YY++L  IS G          L  ++P         GG +ID+
Sbjct: 248 IEGDPTPLQI---FQDRYYLDLQAISFG--------EKLLDIEPGTFQRYRSQGGTVIDT 296

Query: 362 GTSVTRLTRPAYIALRDA--FRAGASSLKRAPDFSLFDT-CFDLSGKTEVK-VPTVVLHF 417
           G S T L R AY  L +   F  G   L+R  D+  + T C++ + K ++   P V  HF
Sbjct: 297 GCSPTILAREAYETLSEEIDFLLG-EVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHF 355

Query: 418 R-GADVSLPATNYLIPVDSSGTFCFAFA-GTMSGLSIIGNIQQQGFRVVYDLAASRIGFA 475
             GA+++L   +  +  +S  +FC A    T   +S+IG + QQ + V Y+L   ++ F 
Sbjct: 356 AGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQ 415

Query: 476 PRGC 479
              C
Sbjct: 416 RTDC 419


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 133/408 (32%), Positives = 181/408 (44%), Gaps = 71/408 (17%)

Query: 131 LAQGSGEYFTRLGVGT-PPRYVYMVLDTGSDVVWIQCAP-----CKKCYSQTDPVFDPAK 184
           ++    +Y     +G+ P + + + +DTGSD+VW  CAP     C+  ++ T P+     
Sbjct: 12  ISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRS 71

Query: 185 SRSFATVPCRSPLCR--------------------KLDSSGCNRRNTCLYQVSYGDGSIT 224
            R    V C+SP C                      +++S C+      +  +YGDGS  
Sbjct: 72  HR----VSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSF- 126

Query: 225 VGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR---RFNR 281
           +     +TL+     +     GC H          G+ G GRG LS P Q          
Sbjct: 127 IAHLHRDTLSMSQLFLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGN 183

Query: 282 KFSYCLV----DRSTSAKPSSMVFG--DSAVSRTARF--TPLLANPKLDTFYYVELVGIS 333
           +FSYCLV    D+    KPS ++ G  D   S    F  T +L NPK   FY V L GIS
Sbjct: 184 RFSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGIS 243

Query: 334 VGGAHVRGITAS--LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRA 390
           VG    R I A   L ++D  G+GGV++DSGT+ T L    Y ++   F R      KRA
Sbjct: 244 VGK---RTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRA 300

Query: 391 PDFSL---FDTCFDLSGKTEVKVPTVVLHFRG--ADVSLPATNYLIP-VDSS-------G 437
            +         C+ L G   V+VPTV  HF G  ++V LP  NY    +D         G
Sbjct: 301 SEVEEKTGLGPCYFLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVG 358

Query: 438 TFCFAFAGTMSGLS-----IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                  G  + LS     I+GN QQQGF VVYDL   R+GFA R CA
Sbjct: 359 CLMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCA 406


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/370 (33%), Positives = 181/370 (48%), Gaps = 43/370 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC--R 199
           L VGTPP+ V MV+DTGS++ W+ C      Y  T   FDP +S S+ T+PC SP C  R
Sbjct: 35  LTVGTPPQNVSMVIDTGSELSWLHCNKTLS-YPTT---FDPTRSTSYQTIPCSSPTCTNR 90

Query: 200 KLD---SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD----NE 252
             D    + C+  N C   +SY D S + G+ +++      + ++ +  GC       N 
Sbjct: 91  TQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISGLVFGCMDSVFSSNS 150

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-- 310
                + GL+G+ RG LSF +Q G     KFSYC+     S     ++ G+S ++ +   
Sbjct: 151 DEDSKSTGLMGMNRGSLSFVSQLGFP---KFSYCISGTDFSGL---LLLGESNLTWSVPL 204

Query: 311 RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D   Y V+L GI V    +  I  S F+ D  G G  ++DSGT  
Sbjct: 205 NYTPLIQISTPLPYFDRVAYTVQLEGIKVLDK-LLPIPKSTFEPDHTGAGQTMVDSGTQF 263

Query: 366 TRLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCF--DLSGKTEVKVPTVVLHF 417
           T L  P Y ALR AF    SS+ R    PDF      D C+   LS +    +PTV L F
Sbjct: 264 TFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLVF 323

Query: 418 RGADVSLPATN--YLIPVDSSGT---FCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAA 469
           RGA++++      Y +P +  G     C +F  + + G+   +IG+  QQ   + +DL  
Sbjct: 324 RGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWMEFDLEK 383

Query: 470 SRIGFAPRGC 479
           SRIG A   C
Sbjct: 384 SRIGLAQVRC 393


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 124/372 (33%), Positives = 177/372 (47%), Gaps = 50/372 (13%)

Query: 147 PPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCRK---- 200
           PP+ + MV+DTGS++ W++C       S  +PV  FDP +S S++ +PC SP CR     
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 201 -LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGC-----GHDNEG 253
            L  + C+    C   +SY D S + G+ + E   F   T  + +  GC     G D E 
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--R 311
                 GLLG+ RG LSF +Q G     KFSYC+    T   P  ++ GDS  +      
Sbjct: 198 -DTKTTGLLGMNRGSLSFISQMGF---PKFSYCI--SGTDDFPGFLLLGDSNFTWLTPLN 251

Query: 312 FTPLL----ANPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +TPL+      P  D   Y V+L GI V G  +  I  S+   D  G G  ++DSGT  T
Sbjct: 252 YTPLIRISTPLPYFDRVAYTVQLTGIKVNGK-LLPIPKSVLVPDHTGAGQTMVDSGTQFT 310

Query: 367 RLTRPAYIALRDAF---RAGASSLKRAPDFSL---FDTCFDLS-----GKTEVKVPTVVL 415
            L  P Y ALR  F     G  ++   PDF      D C+ +S          ++PTV L
Sbjct: 311 FLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSL 370

Query: 416 HFRGADVSLPATN--YLIP---VDSSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDL 467
            F GA++++      Y +P   V +   +CF F  + + G+   +IG+  QQ   + +DL
Sbjct: 371 VFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 430

Query: 468 AASRIGFAPRGC 479
             SRIG AP  C
Sbjct: 431 QRSRIGLAPVEC 442


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 136/425 (32%), Positives = 186/425 (43%), Gaps = 73/425 (17%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC---------------- 166
           F+  + SG   G+G+YF R  VGTP R   +V DTGSD+ W++C                
Sbjct: 40  FAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGY 99

Query: 167 -----APCKKCYSQTDP-------VFDPAKSRSFATVPCRSPLCRK---LDSSGCNRRNT 211
                AP     S           VF P +SR++A +PC S  C        + C    +
Sbjct: 100 NYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGS 159

Query: 212 -CLYQVSYGDGSITVGDFSTE--TLTFRGTRVAR---------VALGCGHDNEGL-FVAA 258
            C Y+  Y DGS   G   T+  T+   G R  +         V LGC     G  F+A+
Sbjct: 160 PCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLAS 219

Query: 259 AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-----------------STSAKPSSMVF 301
            G+L LG   +SF ++   RF  +FSYCLVD                   +SA  S    
Sbjct: 220 DGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTAC 279

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
             SA +  AR TPLL + ++  FY V + G+SV G  +R         D    GG I+DS
Sbjct: 280 AGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLR---IPRLVWDVQKGGGAILDS 336

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD----LSGKT-EVKVPTVVLH 416
           GTS+T L  PAY A+  A       L R      FD C++    L+G+   V VP + +H
Sbjct: 337 GTSLTVLVSPAYRAVVAALGKKLVGLPRV-AMDPFDYCYNWTSPLTGEDLAVAVPALAVH 395

Query: 417 FRG-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRIGF 474
           F G A +  P  +Y+I   + G  C     G   G+S+IGNI QQ     +DL   R+ F
Sbjct: 396 FAGSARLQPPPKSYVIDA-APGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRF 454

Query: 475 APRGC 479
               C
Sbjct: 455 KRSRC 459


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 126/441 (28%), Positives = 192/441 (43%), Gaps = 65/441 (14%)

Query: 101 AFAESAVRVP-PRNRSRGRANGGFSSS--VISGLAQGSGEYFTRLGVGTPPRYVYMVLDT 157
              +S+V +P P+++++ R     SS   V+  L +    Y   L +GTPP+ V + LDT
Sbjct: 43  TLTKSSVSLPTPKSQTQERIKKPLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDT 102

Query: 158 GSDVVWIQCA----PCKKCYS------QTDPVFDPAKSRSFATVPCRSPLCRKLDSS--- 204
           GSD+ W+ C      C +CY       ++  VF P  S +     C S  C ++ SS   
Sbjct: 103 GSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNP 162

Query: 205 -------GCN----RRNTCL-----YQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
                  GC+     ++TC+     +  +YG+G +  G  + + L  R   V R + GC 
Sbjct: 163 FDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCV 222

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP---SSMVFGDSA 305
                 +    G+ G GRG LS P+Q G    + FS+C +       P   S ++ G SA
Sbjct: 223 TST---YREPIGIAGFGRGLLSLPSQLG-FLEKGFSHCFLPFKFVNNPNISSPLILGASA 278

Query: 306 VS----RTARFTPLLANPKLDTFYYVELVGISVG-GAHVRGITASLFKLDPAGNGGVIID 360
           +S     + +FTP+L  P     YY+ L  I++G       +  +L + D  GNGG+++D
Sbjct: 279 LSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVD 338

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDTCFD----------LSGKTEV 408
           SGT+ T L  P Y  L    ++  +  +     S   FD C+           L     +
Sbjct: 339 SGTTYTHLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMM 398

Query: 409 KVPTVVLHF-RGADVSLPATNYLI----PVDSSGTFCFAFAGTMSG----LSIIGNIQQQ 459
             P++  HF   A + LP  N       P D S   C  F     G      + G+ QQQ
Sbjct: 399 IFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQ 458

Query: 460 GFRVVYDLAASRIGFAPRGCA 480
             +VVYDL   RIGF    C 
Sbjct: 459 NVKVVYDLEKERIGFQAMDCV 479


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 127/413 (30%), Positives = 181/413 (43%), Gaps = 84/413 (20%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA---------------------------- 167
           GEYFT + VG+P +  ++  DTGS+  W  C                             
Sbjct: 109 GEYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKR 168

Query: 168 -----------------PCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG----- 205
                            PCK        VF P +S+SF  V C S  C K+D S      
Sbjct: 169 NRTRTTRRTKKKKAKSNPCKG-------VFCPHRSKSFQAVTCASQKC-KIDLSQLFSLS 220

Query: 206 -CNR-RNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARVALGCGHDNEG---LF 255
            C +  + CLY +SY DGS   G F T+T+T      +  ++  + +GC    E      
Sbjct: 221 LCPKPSDPCLYDISYADGSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFN 280

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV----FGDSAVSRTAR 311
               G+LGLG  + SF  +    +  KFSYCLVD  +    SS +      ++ +    +
Sbjct: 281 EDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIK 340

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRP 371
            T L+  P    FY V +VGIS+GG  ++ I   ++  +    GG +IDSGT++T L  P
Sbjct: 341 RTELILFPP---FYGVNVVGISIGGQMLK-IPPQVWDFNS--QGGTLIDSGTTLTALLVP 394

Query: 372 AYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATN 428
           AY  + +A     + +KR    DF   D CFD  G  +  VP +V HF G A    P  +
Sbjct: 395 AYEPVFEALIKSLTKVKRVTGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKS 454

Query: 429 YLIPVDSSGTFCFAFA--GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           Y+I V +    C        + G S+IGNI QQ     +DL+ + IGFAP  C
Sbjct: 455 YIIDV-APLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 165/353 (46%), Gaps = 22/353 (6%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +   + +G PP    +++DTGSD+ WIQC PC KCY QT P F P++S ++    C S  
Sbjct: 88  FLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTYRNASCESAP 146

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-----RVALGCGHDNE 252
                     +   C Y + Y D S T G  + E LTF+ +         +  GCG DN 
Sbjct: 147 HAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNS 206

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
           G F   +G+LGLG G  S  T   R F  KFSYC         P + +   +        
Sbjct: 207 G-FTQYSGVLGLGPGTFSIVT---RNFGSKFSYCFGSLIDPTYPHNFLILGNGARIEGDP 262

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           TPL         YY++L  IS+ G  +  I   +F+      GG +ID+G S T L R A
Sbjct: 263 TPLQI---FQDRYYLDLQAISL-GEKLLDIEPGIFQ-RYRSKGGTVIDTGCSPTILAREA 317

Query: 373 YIALRDA--FRAGASSLKRAPDFSLF-DTCFDLSGKTEVK-VPTVVLHFR-GADVSLPAT 427
           Y  L +   F  G   L+R  D+  + + C++ + K ++   P V  HF  GA+++L   
Sbjct: 318 YETLSEEIDFLLG-EVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVE 376

Query: 428 NYLIPVDSSGTFCFAFA-GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +  +  +S  +FC A    T   +S+IG + QQ + V Y+L   ++ F    C
Sbjct: 377 SLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 172/370 (46%), Gaps = 43/370 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L +GTPP+ + MVLDTGS++ W++C    K       +F+P  S+++  +PC S  C+  
Sbjct: 71  LTIGTPPQNITMVLDTGSELSWLRC----KKEPNFTSIFNPLASKTYTKIPCSSQTCKTR 126

Query: 202 DSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC----GHDNE 252
            S       C+    C + +SY D S   G  + ET  F          GC       N 
Sbjct: 127 TSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSSNT 186

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS--RTA 310
                  GL+G+ RG LSF  Q G    RKFSYC+    ++     ++ G++  S  +  
Sbjct: 187 EEDAKTTGLMGMNRGSLSFVNQMGF---RKFSYCISGLDSTG---FLLLGEARYSWLKPL 240

Query: 311 RFTPLLAN----PKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D   Y V+L GI V    V  +  S+F  D  G G  ++DSGT  
Sbjct: 241 NYTPLVQISTPLPYFDRVAYSVQLEGIKVNNK-VLPLPKSVFVPDHTGAGQTMVDSGTQF 299

Query: 366 TRLTRPAYIALRDAF---RAGASSLKRAPDFSL---FDTCF--DLSGKTEVKVPTVVLHF 417
           T L  P Y ALR  F    AG   +   P +      D C+  D +  T   +P V L F
Sbjct: 300 TFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLMF 359

Query: 418 RGADVSLPATN--YLIPVDSSG---TFCFAFAGTMS-GLS--IIGNIQQQGFRVVYDLAA 469
           RGA++S+      Y +P +  G    +CF F  +   G+S  +IG+ QQQ   + YDL  
Sbjct: 360 RGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNVWMEYDLEN 419

Query: 470 SRIGFAPRGC 479
           SRIGFA   C
Sbjct: 420 SRIGFAELRC 429


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 130/388 (33%), Positives = 181/388 (46%), Gaps = 53/388 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDP---VFDPAKSRSF 188
           G Y   L  GTPP+ + +++DTGSD+VW  C     C+ C +S ++P   +F P  S S 
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 189 ATVPCRSPL------------CRKLDSSGCNRRNTC-LYQVSYGDGSITVGDFSTETLTF 235
             + C +P             CR  + +  N    C  Y V YG G IT G   +ETL  
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDL 206

Query: 236 RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR--STS 293
            G  V    +GC   +       AG+ G GRG  S P+Q G    +KFSYCL+ R    +
Sbjct: 207 PGKGVPNFIVGCSVLSTS---QPAGISGFGRGPPSLPSQLGL---KKFSYCLLSRRYDDT 260

Query: 294 AKPSSMVFGDSAVS--RTA--RFTPLLANPKL------DTFYYVELVGISVGGAHVRGIT 343
            + SS+V    + S  +TA   +TP + NPK+        +YY+ L  I+VGG HV+ I 
Sbjct: 261 TESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVK-IP 319

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD---FSLFDTCF 400
                    G+GG IIDSGT+ T +    +  +   F     S KRA +    +    CF
Sbjct: 320 YKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQS-KRATEVEGITGLRPCF 378

Query: 401 DLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF------AFAGTMSG--LS 451
           ++SG      P + L FR GA++ LP  NY+  +      C       A     SG    
Sbjct: 379 NISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAI 438

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           I+GN QQQ F V YDL   R+GF  + C
Sbjct: 439 ILGNFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 132/395 (33%), Positives = 187/395 (47%), Gaps = 44/395 (11%)

Query: 112 RNRSRGRANGGFSSSV---ISGLAQG--SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           +   RGR     SS+V   + G+A    +G YFT++ +GTPPR   + +DTGSD++W+ C
Sbjct: 5   KAHDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNC 64

Query: 167 APCKKCYSQTD---PV--FDPAKSRSFATVPCRSPLC---RKLDSSGCNRRNTCLYQVSY 218
            PC  C + +D   P+  +D   S S + VPC  P C    ++  SGCN +N C Y   Y
Sbjct: 65  HPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY 124

Query: 219 GDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQ 274
           GDGS T+G    + L +     A V  GCG    G       A  G++G G   LSF +Q
Sbjct: 125 GDGSGTLGYLVEDVLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQ 184

Query: 275 TGRRFN--RKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGI 332
             ++      F++CL       +   ++   + +    ++TPL+  P + + Y V L  I
Sbjct: 185 LAKQGKTPNVFAHCL---DGGERGGGILVLGNVIEPDIQYTPLV--PYM-SHYNVVLQSI 238

Query: 333 SVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD 392
           SV  A++  I   LF  D     G I DSGT++  L   AY A   A      SL  AP 
Sbjct: 239 SVNNANLT-IDPKLFSNDVM--QGTIFDSGTTLAYLPDEAYQAFTQAV-----SLVVAP- 289

Query: 393 FSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGT---FCFAFAGTMSG 449
           F L DT   LS       P VVL+F GA ++L    YLI   S+     +C  +    S 
Sbjct: 290 FLLCDT--RLSRFIYKLFPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSA 347

Query: 450 LS-----IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            S     I G++  +   VVYDL   RIG+ P  C
Sbjct: 348 ESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 155/453 (34%), Positives = 204/453 (45%), Gaps = 65/453 (14%)

Query: 65  LSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFS 124
            S+   H DS    R+P H  +L     VL      A   S VR    +RS  R +   +
Sbjct: 35  FSVEFIHRDS---ARSPFHDPSLTAPARVLE-----AARRSTVRAAALSRSYVRVDAPSA 86

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC---------APCKKCYSQ 175
              +S L     EY   + +GTPP  +  + DTGSD++W+ C         A  +   +Q
Sbjct: 87  DGFVSELTSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQ 146

Query: 176 TDPV-FDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
              V FDP+KS +F  V C S  C +L  + C   + C Y  SYGDGS T G  STET T
Sbjct: 147 PPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFT 206

Query: 235 F------RG----TRVARVALGCGHDNEGLFVAAA---GLLGLGRGRLSFPTQTG--RRF 279
           F      RG    TRVA V  GC       FV ++   GL+GLG G LS  +Q G     
Sbjct: 207 FADAPGARGDGTTTRVANVNFGCST----TFVGSSVGDGLVGLGGGDLSLVSQLGADTSL 262

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGD-SAVSRTARFTPLLANPKLDTFYYVELVGISVGGAH 338
            R+FSYCLV  S  A  S++ FG  +AV+     T  L   ++  +Y VEL  + VG   
Sbjct: 263 GRRFSYCLVPYSVKAS-SALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNK- 320

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFS--- 394
                      +      +I+DSGT++T L      AL D   +     +K  P  S   
Sbjct: 321 ---------TFEAPDRSPLIVDSGTTLTFLPE----ALVDPLVKELTGRIKLPPAQSPER 367

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFR-----GADVSLPATNYLIPVDSSGTFCFAFAGTMSG 449
           L   CFD+SG  E +V  ++         GA V+L A N  + V   GT C A +     
Sbjct: 368 LLPLCFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQ-EGTLCLAVSAMSEQ 426

Query: 450 L--SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              SIIGNI QQ   V YDL    + FAP  CA
Sbjct: 427 FPASIIGNIAQQNMHVGYDLDKGTVTFAPAACA 459


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 117/370 (31%), Positives = 167/370 (45%), Gaps = 49/370 (13%)

Query: 155 LDTGSDVVWIQCA---PCKKC--YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGC--- 206
           +DTGSD+VW+ C     C  C   S ++ VF P  S S   V C    C+ L  +     
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60

Query: 207 ---------NRRNTCL-YQVSYGDGSITVGDFSTETLTF-----RGTR-VARVALGCGHD 250
                    N   TC  Y + YG GS T G   TETL        G R +   A+GC   
Sbjct: 61  CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGCSIV 119

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNR-KFSYCLVDR--STSAKPSSMVFGDSAVS 307
           +       +G+ G GRG LS P+Q G    + +F+YCL         K S MV GD A+ 
Sbjct: 120 SS---QQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALP 176

Query: 308 RTA--RFTPLLANPK------LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
                 +TP L N +         +YY+ L G+S+GG  ++ + + L + D  GNGG II
Sbjct: 177 NNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTII 236

Query: 360 DSGTSVTRLTRP--AYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           DSGT+ T  +     +IA   A + G        D +    C+D++G   + +P    HF
Sbjct: 237 DSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDVTGLENIVLPEFAFHF 296

Query: 418 R-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLS-------IIGNIQQQGFRVVYDLAA 469
           + G+D+ LP  NY     S  + C     +   L        I+GN QQQ F ++YD   
Sbjct: 297 KGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYLLYDREK 356

Query: 470 SRIGFAPRGC 479
           +R+GF  + C
Sbjct: 357 NRLGFTQQTC 366


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 122/369 (33%), Positives = 178/369 (48%), Gaps = 42/369 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VG+PP+ V MVLDTGS++ W+    CKK  + T  VF+P  S S++ +PC SP+CR  
Sbjct: 44  LTVGSPPQQVTMVLDTGSELSWLH---CKKSPNLTS-VFNPLSSSSYSPIPCSSPVCRTR 99

Query: 202 -----DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
                +   C+ +  C   VSY D S   G+ +++      + +     GC       N 
Sbjct: 100 TRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSSNS 159

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR- 311
                  GL+G+ RG LSF TQ G     KFSYC+  R +S     ++FGDS +S     
Sbjct: 160 EEDAKTTGLMGMNRGSLSFVTQLGL---PKFSYCISGRDSSG---VLLFGDSHLSWLGNL 213

Query: 312 -FTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D   Y V+L GI VG   +  +  S+F  D  G G  ++DSGT  
Sbjct: 214 TYTPLVQISTPLPYFDRVAYTVQLDGIRVGNK-ILPLPKSIFAPDHTGAGQTMVDSGTQF 272

Query: 366 TRLTRPAYIALRDAF---RAGASSLKRAPDFSL---FDTCFDL-SGKTEVKVPTVVLHFR 418
           T L  P Y ALR+ F     G  +    P+F      D C+ + +G    ++P V L FR
Sbjct: 273 TFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMFR 332

Query: 419 GADVSL--PATNYLIPVDSSG---TFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAAS 470
           GA++ +      Y +P    G    +C  F  + + G+   +IG+  QQ   + +DL  S
Sbjct: 333 GAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKS 392

Query: 471 RIGFAPRGC 479
           R+GF    C
Sbjct: 393 RVGFVETRC 401


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 124/372 (33%), Positives = 181/372 (48%), Gaps = 50/372 (13%)

Query: 147 PPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCRK---- 200
           PP+ + MV+DTGS++ W++C       S  +PV  FDP +S S++ +PC SP CR     
Sbjct: 82  PPQNISMVIDTGSELSWLRC----NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 201 -LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGC-----GHDNEG 253
            L  + C+    C   +SY D S + G+ + E   F   T  + +  GC     G D E 
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--R 311
                 GLLG+ RG LSF +Q G     KFSYC+    T   P  ++ GDS  +      
Sbjct: 198 D-TKTTGLLGMNRGSLSFISQMGF---PKFSYCI--SGTDDFPGFLLLGDSNFTWLTPLN 251

Query: 312 FTPLL----ANPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           +TPL+      P  D   Y V+L GI V G  +  I  S+   D  G G  ++DSGT  T
Sbjct: 252 YTPLIRISTPLPYFDRVAYTVQLTGIKVNGK-LLPIPKSVLLPDHTGAGQTMVDSGTQFT 310

Query: 367 RLTRPAYIALRDAF---RAGASSLKRAPDFSL---FDTCFDLSG---KTEV--KVPTVVL 415
            L  P Y ALR  F     G  ++   P+F      D C+ +S    +T +  ++PTV L
Sbjct: 311 FLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSL 370

Query: 416 HFRGADVSLPATN--YLIPVDSSGT---FCFAFAGT-MSGLS--IIGNIQQQGFRVVYDL 467
            F GA++++      Y +P  ++G    +CF F  + + G+   +IG+  QQ   + +DL
Sbjct: 371 VFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 430

Query: 468 AASRIGFAPRGC 479
             SRIG AP  C
Sbjct: 431 QRSRIGLAPVQC 442


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 135/381 (35%), Positives = 188/381 (49%), Gaps = 46/381 (12%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------F 180
           V+S +   S EY   + +G+PPR +  + DTGSD+VW++C   KK  + T         F
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKC---KKGNNDTSSAAAPTTQF 146

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF----- 235
           DP++S ++  V C++  C  L  + C+  + C Y  +YGDGS T G  STET TF     
Sbjct: 147 DPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGS 206

Query: 236 ----RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTG--RRFNRKFSYCLVD 289
               R  RV  V  GC     G F  A GL+GLG G +S  TQ G      R+FSYCLV 
Sbjct: 207 GRSPRQVRVGGVKFGCSTATAGSF-PADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVP 265

Query: 290 RSTSAKPSSMVFGDSA--VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
            S +A  S++ FG  A      A  TPL+A   +DT+Y V L  + VG   V    +S  
Sbjct: 266 HSVNAS-SALNFGALADVTEPGAASTPLVAG-DVDTYYTVVLDSVKVGNKTVASAASSR- 322

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDTCFDLSGKT 406
                    +I+DSGT++T L       + D   R       ++PD  L   C++++G+ 
Sbjct: 323 ---------IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPD-GLLQLCYNVAGR- 371

Query: 407 EVK----VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQ 459
           EV+    +P + L F  GA V+L   N  + V   GT C A   T     +SI+GN+ QQ
Sbjct: 372 EVEAGESIPDLTLEFGGGAAVALKPENAFVAV-QEGTLCLAIVATTEQQPVSILGNLAQQ 430

Query: 460 GFRVVYDLAASRIGFAPRGCA 480
              V YDL A  + FA   CA
Sbjct: 431 NIHVGYDLDAGTVTFAGADCA 451


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 136/483 (28%), Positives = 200/483 (41%), Gaps = 89/483 (18%)

Query: 58  APDAESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRG 117
           +P +++ L    H +    FN T  HL      R   R        ++ V +P       
Sbjct: 17  SPSSQTILLPLTHSISKTKFNST-HHLLKSTSTRSKARFHHQHHKHQTQVSLP------- 68

Query: 118 RANGGFSSSVISGLAQGSGEYFTRLGVGT-PPRYVYMVLDTGSDVVWIQCAP--CKKCYS 174
                        LA GS +Y     +G+ PP+ + + +DTGSD+VW  C+P  C  C  
Sbjct: 69  -------------LAPGS-DYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEG 114

Query: 175 QTDPVFDPAKSRSFATVPCRSP-------------LC-------RKLDSSGCNRRNTCLY 214
           +         ++   +V C+SP             LC         +++S C+  +   +
Sbjct: 115 KPQTTKPANITKQTHSVSCQSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPF 174

Query: 215 QVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ 274
             +YGDGS  V +   +TL+     +     GC H          G+ G GRG LS P Q
Sbjct: 175 YYAYGDGSF-VANLYQQTLSLSSLHLQNFTFGCAHT---ALAEPTGVAGFGRGILSLPAQ 230

Query: 275 TGR---RFNRKFSYCLVDRSTSA----KPSSMVFGDSAVSRTAR---------FTPLLAN 318
                     +FSYCLV  S       +PS ++ G    + T           +T +L+N
Sbjct: 231 LSTLSPHLGNRFSYCLVSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSN 290

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
           PK   +Y V L GISVG   V      L ++D  GNGG+++DSGT+ T L    Y A+ +
Sbjct: 291 PKHPYYYCVGLAGISVGKRTVPA-PEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVN 349

Query: 379 AFRAGASSL-KRAPDFSL---FDTCFDLSGKTEVKVPTVVLHFRG--ADVSLPATNYLIP 432
            F    +   KRA +         C+ L+G +++  P + LHF G  +DV LP  NY   
Sbjct: 350 EFDKRVNRFHKRASEIETKTGLGPCYYLNGLSQI--PVLKLHFVGNNSDVVLPRKNYFYE 407

Query: 433 VDSSG--------TFCFAFAGTMSGLSI-------IGNIQQQGFRVVYDLAASRIGFAPR 477
               G          C           +       +GN QQQGF VVYDL   R+GFA +
Sbjct: 408 FMDGGDGIRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKK 467

Query: 478 GCA 480
            CA
Sbjct: 468 ECA 470


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 119/365 (32%), Positives = 171/365 (46%), Gaps = 39/365 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK- 200
           L +GTPP+   MVLDTGS + WIQC   KK   +    FDP+ S SF+T+PC  PLC+  
Sbjct: 76  LPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPR 134

Query: 201 ----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-RVALGCGHDNEGLF 255
                  + C+    C Y   Y DG+   G+   E +TF  T +   + LGC  ++    
Sbjct: 135 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSD-- 192

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS---AKPSSMVFGDSAVSRTARF 312
               G+LG+ RGRLSF +Q       KFSYC+  +S         S   GD+  S   ++
Sbjct: 193 --DRGILGMNRGRLSFVSQAKI---SKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKY 247

Query: 313 TPLLA------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
             LL        P LD   Y V ++GI  G   +  I+ S+F+ D  G+G  ++DSG+  
Sbjct: 248 VSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKL-NISGSVFRPDAGGSGQTMVDSGSEF 306

Query: 366 TRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV-----LHFR 418
           T L   AY  +R     R G    K        D CFD  G   + +P ++     +  R
Sbjct: 307 THLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFD--GNVAM-IPRLIGDLVFVFTR 363

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFA-GTMSGL--SIIGNIQQQGFRVVYDLAASRIGFA 475
           G ++ +P    L+ V   G  C      +M G   +IIGN+ QQ   V +D+   R+GFA
Sbjct: 364 GVEILVPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFA 422

Query: 476 PRGCA 480
              C+
Sbjct: 423 KADCS 427


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 129/405 (31%), Positives = 189/405 (46%), Gaps = 42/405 (10%)

Query: 93  VLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVY 152
           VL   S+TA A +A RV  R  +     GG   +V+      +  Y     +GTPP+   
Sbjct: 10  VLCFISVTARA-AAFRVHGRLLADAATEGG---AVVPIHWTQAMNYVANFTIGTPPQPAS 65

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNT 211
            V+D   ++VW QC  C +C+ Q  P+FDP  S ++   PC +PLC  + S   N   N 
Sbjct: 66  AVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPLCESIPSDVRNCSGNV 125

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC----GHDNEGLFVAAAGLLGLGRG 267
           C Y+ S   G  T G   T+T    GT  A +A GC      D  G     +G++GLGR 
Sbjct: 126 CAYEASTNAGD-TGGKVGTDTFAV-GTAKASLAFGCVVASDIDTMG---GPSGIVGLGRT 180

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV---SRTARFTPLL----ANPK 320
             S  TQTG      FSYCL     + K S++  G SA       A  TP +        
Sbjct: 181 PWSLVTQTGV---AAFSYCLAPHD-AGKNSALFLGSSAKLAGGGKAASTPFVNISGNGND 236

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
           L  +Y V+L G+  G A        +  L P+G+  V++D+ + ++ L   AY A++ A 
Sbjct: 237 LSNYYKVQLEGLKAGDA--------MIPLPPSGS-TVLLDTFSPISFLVDGAYQAVKKAV 287

Query: 381 RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTF 439
                +   A     FD CF  SG +    P +V  FR GA +++PATNYL+    +GT 
Sbjct: 288 TVAVGAPPMATPVEPFDLCFPKSGASGA-APDLVFTFRGGAAMTVPATNYLLDY-KNGTV 345

Query: 440 CFAF-----AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           C A        + + LS++G++QQ+    ++DL    + F P  C
Sbjct: 346 CLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 119/365 (32%), Positives = 171/365 (46%), Gaps = 39/365 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK- 200
           L +GTPP+   MVLDTGS + WIQC   KK   +    FDP+ S SF+T+PC  PLC+  
Sbjct: 76  LPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPR 134

Query: 201 ----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-RVALGCGHDNEGLF 255
                  + C+    C Y   Y DG+   G+   E +TF  T +   + LGC  ++    
Sbjct: 135 IPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESSD-- 192

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS---AKPSSMVFGDSAVSRTARF 312
               G+LG+ RGRLSF +Q       KFSYC+  +S         S   GD+  S   ++
Sbjct: 193 --DRGILGMNRGRLSFVSQAKI---SKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKY 247

Query: 313 TPLLA------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
             LL        P LD   Y V ++GI  G   +  I+ S+F+ D  G+G  ++DSG+  
Sbjct: 248 VSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKL-NISGSVFRPDAGGSGQTMVDSGSEF 306

Query: 366 TRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV-----LHFR 418
           T L   AY  +R     R G    K        D CFD  G   + +P ++     +  R
Sbjct: 307 THLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFD--GNVAM-IPRLIGDLVFVFTR 363

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFA-GTMSGL--SIIGNIQQQGFRVVYDLAASRIGFA 475
           G ++ +P    L+ V   G  C      +M G   +IIGN+ QQ   V +D+   R+GFA
Sbjct: 364 GVEIFVPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFA 422

Query: 476 PRGCA 480
              C+
Sbjct: 423 KADCS 427


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  151 bits (381), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 127/389 (32%), Positives = 177/389 (45%), Gaps = 52/389 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDPV----FDPAKSRS 187
           G Y   L  GTP + +  V DTGS +VW  C     C  C +S  DP     F P  S S
Sbjct: 88  GGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSS 147

Query: 188 FATVPCRSPLCR-----KLDSSGC--NRRNTCL----YQVSYGDGSITVGDFSTETLTFR 236
              + C++P C+      +   GC  N RN  +    Y + YG GS T G   +E L F 
Sbjct: 148 SRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKLDFP 206

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS----- 291
              V    +GC   +       AG+ G GRG  S P+Q      + FS+CLV R      
Sbjct: 207 DLTVPDFVVGCSVIST---RTPAGIAGFGRGPESLPSQMKL---KSFSHCLVSRRFDDTN 260

Query: 292 -TSAKPSSMVFGDSAVSRTA--RFTPLLANPKLDT-----FYYVELVGISVGGAHVRGIT 343
            T+        G  + S+T    +TP   NP +       +YY+ L  I VG  HV+ I 
Sbjct: 261 VTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVK-IP 319

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF---SLFDTCF 400
                    GNGG I+DSG++ T + RP +  + + F    S+  R  D    S    CF
Sbjct: 320 YKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSGIAPCF 379

Query: 401 DLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFA-------GTMSGLSI 452
           ++SGK +V VP ++  F+ GA + LP +NY   V ++ T C           G  +G +I
Sbjct: 380 NISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGGGTGPAI 439

Query: 453 I-GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           I G+ QQQ + V YDL   R GFA + C+
Sbjct: 440 ILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  151 bits (381), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 129/412 (31%), Positives = 184/412 (44%), Gaps = 71/412 (17%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK----------KCYSQTDPVFDPA 183
           G  +Y    G+G PP+    V+DTGSD+VW QC+ C+           C+ Q  P ++ +
Sbjct: 74  GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133

Query: 184 KSRSFATVPCRS---PLCR-KLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLT 234
            SR+   VPC      LC    +++GC R      + C+   SYG G + +G   T+  T
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192

Query: 235 FRGTRVARVALGCGHDNE---GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR- 290
           F  +    +A GC        G    A+G++GLGRG LS  +Q       +FSYCL    
Sbjct: 193 FPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNA---TEFSYCLTPYF 249

Query: 291 STSAKPSSMVFGDSAVSRTARF-------------TPLLANPK---LDTFYYVELVGISV 334
             +  PS +  GD  ++  +                P   NPK     TFYY+ LVG++ 
Sbjct: 250 RDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAA 309

Query: 335 GGAHVRGITASLFKLDPAG----NGGVIIDSGTSVTRLTRPAYIALRDAFR---AGASSL 387
           G A V  + A  F L  A      GG +IDSG+  TRL  PA+ AL         G+ SL
Sbjct: 310 GNATV-ALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSL 368

Query: 388 KRAPDF--SLFDTCFDLSGKTE----VKVPTVVLHFR-----GADVSLPATNYLIPVDSS 436
              P       + C +     +      VP +VL F      G ++ +PA  Y   V++S
Sbjct: 369 VPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEAS 428

Query: 437 GTFCFAFAGTMSG--------LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            T+C A   + SG         +IIGN  QQ  RV+YDLA   + F P  C+
Sbjct: 429 -TWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 132/395 (33%), Positives = 186/395 (47%), Gaps = 44/395 (11%)

Query: 112 RNRSRGRANGGFSSSV---ISGLAQG--SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           +   RGR     SS+V   + G+A    +G YFT++ +GTPPR   + +DTGSD++W+ C
Sbjct: 5   KAHDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNC 64

Query: 167 APCKKCYSQTD---PV--FDPAKSRSFATVPCRSPLC---RKLDSSGCNRRNTCLYQVSY 218
            PC  C + +D   P+  +D   S S + VPC  P C    ++  SGCN +N C Y   Y
Sbjct: 65  HPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY 124

Query: 219 GDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQ 274
           GDGS T+G    + L +     A V  GCG    G       A  G++G G   LSF +Q
Sbjct: 125 GDGSGTLGYLVEDVLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQ 184

Query: 275 TGRRFN--RKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGI 332
             ++      F++CL       +   ++   + +    ++TPL+  P +   Y V L  I
Sbjct: 185 LAKQGKTPNVFAHCL---DGGERGGGILVLGNVIEPDIQYTPLV--PYM-YHYNVVLQSI 238

Query: 333 SVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD 392
           SV  A++  I   LF  D     G I DSGT++  L   AY A   A      SL  AP 
Sbjct: 239 SVNNANLT-IDPKLFSNDVM--QGTIFDSGTTLAYLPDEAYQAFTQAV-----SLVVAP- 289

Query: 393 FSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGT---FCFAFAGTMSG 449
           F L DT   LS       P VVL+F GA ++L    YLI   S+     +C  +    S 
Sbjct: 290 FLLCDT--RLSRFIYKLFPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSA 347

Query: 450 LS-----IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            S     I G++  +   VVYDL   RIG+ P  C
Sbjct: 348 ESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 128/406 (31%), Positives = 191/406 (47%), Gaps = 44/406 (10%)

Query: 93  VLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVY 152
           VL   S+TA A +A RV  R  +     GG   +V+      +  Y     +GTPP+   
Sbjct: 10  VLCFISVTARA-AAFRVHGRLLADAATEGG---AVVPIHWTQAMNYVANFTIGTPPQPAS 65

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL--DSSGCNRRN 210
            V+D   ++VW QC  C +C+ Q  P+FDP  S ++   PC +PLC  +  DS  C+  N
Sbjct: 66  AVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPSDSRNCS-GN 124

Query: 211 TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC----GHDNEGLFVAAAGLLGLGR 266
            C YQ S   G  T G   T+T    GT  A +A GC      D  G     +G++GLGR
Sbjct: 125 VCAYQASTNAGD-TGGKVGTDTFAV-GTAKASLAFGCVVASDIDTMG---GPSGIVGLGR 179

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA------VSRTARFTPLLANPK 320
              S  TQTG      FSYCL     + K S++  G SA       + +  F  +  N  
Sbjct: 180 TPWSLVTQTGV---AAFSYCLAPHD-AGKNSALFLGSSAKLAGGGKAASTPFVNISGNGN 235

Query: 321 -LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
            L  +Y V+L G+  G A        +  L P+G+  V++D+ + ++ L   AY A++ A
Sbjct: 236 DLSNYYKVQLEGLKAGDA--------MIPLPPSGS-TVLLDTFSPISFLVDGAYQAVKKA 286

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGT 438
                 +   A     FD CF  SG +    P +V  FR GA +++ A+NYL+    +GT
Sbjct: 287 VTVAVGAPPMATPVEPFDLCFPKSGASGA-APDLVFTFRGGAAMTVAASNYLLDY-KNGT 344

Query: 439 FCFAF-----AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            C A        + + LS++G++QQ+    ++DL    + F P  C
Sbjct: 345 VCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 173/366 (47%), Gaps = 40/366 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP--VFDPAKSRSFATVPCRSPLCR 199
           L +GTPP+   M+LDTGS + WIQC   KK   +  P  VFDP+ S SF+ +PC  PLC+
Sbjct: 86  LPIGTPPQTQQMILDTGSQLSWIQCH--KKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCK 143

Query: 200 ------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNE 252
                  L +S C++   C Y   Y DG++  G+   E +TF R      + LGC  ++ 
Sbjct: 144 PRIPDFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESS 202

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS---AKPSSMVFGDSAVSRT 309
                A G+LG+  GRLSF +Q       KFSYC+  R          S   G++  S  
Sbjct: 203 D----AKGILGMNLGRLSFASQAKL---TKFSYCVPTRQVRPGFTPTGSFYLGENPNSGG 255

Query: 310 ARFTPLLA------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            R+  LL        P LD   Y V + GI +G   +  I  S F+ DP+G G  +IDSG
Sbjct: 256 FRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLN-IPISAFRPDPSGAGQTMIDSG 314

Query: 363 TSVTRLTRPAYIALR-DAFRAGASSLKRAPDF-SLFDTCFDLSGKTEVK--VPTVVLHF- 417
           +  T L   AY  +R +  R   + LK+   +  + D CF+     E+   +  +V  F 
Sbjct: 315 SEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFN-GNAIEIGRLIGNMVFEFD 373

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFA-GTMSGL--SIIGNIQQQGFRVVYDLAASRIGF 474
           +G ++ +     L  V   G  C       M G   +IIGN  QQ   V +DLA  R+GF
Sbjct: 374 KGVEIVVEKERVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVGF 432

Query: 475 APRGCA 480
               C+
Sbjct: 433 GKADCS 438


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 120/351 (34%), Positives = 168/351 (47%), Gaps = 21/351 (5%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATV 191
           A  +G Y    G+GTPP+ V   LD  SD+VW  C         T P F+P +S + A V
Sbjct: 94  ATNAGMYVFSYGIGTPPQQVSGALDISSDLVWTACG-------ATAP-FNPVRSTTVADV 145

Query: 192 PCRSPLCRKLDSSGCNR-RNTCLYQVSYGDGSI-TVGDFSTETLTFRGTRVARVALGCGH 249
           PC    C++     C    + C Y   YG G+  T G   TE  TF  TR+  V  GCG 
Sbjct: 146 PCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCGL 205

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            N G F   +G++GLGRG LS  +Q   + +R FSY      +    S ++FGD A  +T
Sbjct: 206 KNVGDFSGVSGVIGLGRGNLSLVSQL--QVDR-FSYHFAPDDSVDTQSFILFGDDATPQT 262

Query: 310 ARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL-DPAGNGGVIIDSGTSVT 366
           +    T LLA+    + YYVEL GI V G  +  I +  F L +  G+GGV +     VT
Sbjct: 263 SHTLSTRLLASDANPSLYYVELAGIQVDGKDL-AIPSGTFDLRNKDGSGGVFLSITDLVT 321

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGADV-SL 424
            L   AY  LR A  A    L      +L  D C+      + KVP++ L F G  V  L
Sbjct: 322 VLEEAAYKPLRQAV-ASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMEL 380

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGF 474
              NY     ++G  C     + +G  S++G++ Q G  ++YD+  S++ F
Sbjct: 381 ELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 127/404 (31%), Positives = 188/404 (46%), Gaps = 53/404 (13%)

Query: 123 FSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC----APCKKCYSQTDP 178
           F+  + SG   G+G+YF R  VGTP +   ++ DTGSD+ W++C    +P     + +  
Sbjct: 95  FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPA 154

Query: 179 -----------VFDPAKSRSFATVPCRSPLCRK---LDSSGCNRRN-TCLYQVSYGDGSI 223
                      VF P  S++++ +PC S  C+       + C+     C Y   Y D S 
Sbjct: 155 AAPSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSA 214

Query: 224 TVGDFSTETLTF-------------RGTRVARVALGC--GHDNEGLFVAAAGLLGLGRGR 268
             G   T++ T              R  ++  V LGC   H  +G F A+ G+L LG   
Sbjct: 215 ARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQG-FEASDGVLSLGYSN 273

Query: 269 LSFPTQTGRRFNRKFSYCLVDRSTSAKPSS-MVFG---DSAVSRT---ARFTPLLANPKL 321
           +SF ++   RF  +FSYCLVD       +S + FG   D+A S        TPLL + ++
Sbjct: 274 ISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARV 333

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
             FY V +  +SV G  +  I A ++  D   NGG IIDSGTS+T L  PAY A+  A  
Sbjct: 334 RPFYAVAVDSVSVDGVALD-IPAEVW--DVGSNGGTIIDSGTSLTVLATPAYKAVVAALS 390

Query: 382 AGASSLKRAPDFSLFDTCFDLS----GKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSS 436
              + L R      FD C++ +    G  ++ VP + + F G A +  PA +Y+I   + 
Sbjct: 391 EQLAGLPRVA-MDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDA-AP 448

Query: 437 GTFCFAF-AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           G  C     G   G+S+IGNI QQ     +DL    + F    C
Sbjct: 449 GVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 114/374 (30%), Positives = 171/374 (45%), Gaps = 40/374 (10%)

Query: 138 YFTRLGVG--------TPPRYVYMVLDTGSDVVWIQCAPCKK----CYSQTDPVFDPAKS 185
           +  ++GVG        T  +  Y  +DTG+++ WIQC  C+     C+   DP +  ++S
Sbjct: 80  FLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQS 139

Query: 186 RSFATVPC-RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-----RGTR 239
           +S+  V C +   C   + + C +   C Y V+YG GS T G+ + ET TF     + T 
Sbjct: 140 KSYKPVSCNQHSFC---EPNQC-KEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTA 195

Query: 240 VARVALGCGHDNEGLFVA-------AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
           +  ++ GC  D+  +  A        +G+LG+G G  SF  Q G   + KFSYC+   +T
Sbjct: 196 LKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNT 255

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
               + + FG   V      T  +   K    Y+V L+GISV G  +  IT +   +   
Sbjct: 256 HN--TYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLN-ITKTDLAVRKD 312

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS---LKRAPDFSLF-DTCFD-LSGKTE 407
           G+ G IID+GT  T L +P +  L  A     SS   LKR     L  D C++ LS    
Sbjct: 313 GSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGR 372

Query: 408 VKVPTVVLHFRGADVSL-PATNYLI-PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVY 465
             +P V  H   AD+ + P   +L    +    FC +     S  +IIG  QQ   + VY
Sbjct: 373 KNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSK-TIIGAYQQMKQKFVY 431

Query: 466 DLAASRIGFAPRGC 479
           D  A  + F P  C
Sbjct: 432 DTKARVLSFGPEDC 445


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 179/399 (44%), Gaps = 61/399 (15%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCA----PCKKC--YSQTDPVFDPAKSRSFATV 191
           Y   L +GTPP+ + + +DTGSD+ W+ C      C  C  Y     +   + S S +++
Sbjct: 29  YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 88

Query: 192 P--CRSPLCRKLDSS----------GCNR----RNTC-----LYQVSYGDGSITVGDFST 230
              C SPLC  + SS          GC+     + TC      +  +YG G + +G  + 
Sbjct: 89  RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 148

Query: 231 ETLTFRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           +TLT  G+       V     GC       +    G+ G GRG LS P+Q G    + FS
Sbjct: 149 DTLTTHGSSPSFTREVPNFCFGCVGST---YREPIGIAGFGRGVLSLPSQLGF-LQKGFS 204

Query: 285 YCLVDRSTSAKP---SSMVFGDSAVSRT--ARFTPLLANPKLDTFYYVELVGISVGGAHV 339
           +C +    +  P   S +V GD A+S     +FT LL NP    +YY+ L  I+VG A  
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 264

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---F 396
             + +SL + D  GNGG+IIDSGT+ T L  P Y  L    ++   +  RA +      F
Sbjct: 265 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQS-IITYPRAQEQEARTGF 323

Query: 397 DTCFDLSGKTEVK------VPTVVLHF-RGADVSLPATNYLI----PVDSSGTFCFAFAG 445
           D C+ +     V       +P++  HF     + LP  N+      P +S+   C     
Sbjct: 324 DLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQN 383

Query: 446 TMSGLS----IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                S    + G+ QQQ  +VVYDL   RIGF P  CA
Sbjct: 384 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 422


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 125/393 (31%), Positives = 180/393 (45%), Gaps = 52/393 (13%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDPV----FDPA 183
           A+  G Y   L  GTP + +  V DTGS +V + C     C  C +S  DP     F P 
Sbjct: 84  AKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPK 143

Query: 184 KSRSFATVPCRSPLCR-----KLDSSGC--NRRNTCL----YQVSYGDGSITVGDFSTET 232
            S S   + C+SP C+      +   GC  N RN  +    Y + YG GS T G   TE 
Sbjct: 144 NSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEK 202

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS- 291
           L F    V    +GC   +       AG+ G GRG +S P+Q   +   +FS+CLV R  
Sbjct: 203 LDFPDLTVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRF 256

Query: 292 -----TSAKPSSMVFGDSAVSRTA--RFTPLLANPKLDT-----FYYVELVGISVGGAHV 339
                T+        G ++ S+T    +TP   NP +       +YY+ L  I VG  HV
Sbjct: 257 DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---F 396
           + I          G+GG I+DSG++ T + RP +  + + F +  S+  R  D       
Sbjct: 317 K-IPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGL 375

Query: 397 DTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF--------AGTM 447
             CF++SGK +V VP ++  F+G A + LP +NY   V ++ T C           +G  
Sbjct: 376 GPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGT 435

Query: 448 SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
               I+G+ QQQ + V YDL   R GFA + C+
Sbjct: 436 GPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/390 (31%), Positives = 175/390 (44%), Gaps = 56/390 (14%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTD----PVFDPAKSRS 187
           G Y   L  GTPP+    V+DTGS +VW  C     C +C +   +    P F P +S S
Sbjct: 90  GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSS 149

Query: 188 FATVPCRS------------PLCRKLDSSGCNRRNTC-LYQVSYGDGSITVGDFSTETLT 234
              + C++              C++ D +  N   +C  Y + YG GS T G   +ETL 
Sbjct: 150 SNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETLD 208

Query: 235 FRGTR-VARVALGCGHDNEGLFV--AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
           F   + +    +GC      LF      G+ G GR   S P+Q G    +KFSYCLV  +
Sbjct: 209 FPHKKTIPGFLVGC-----SLFSIRQPEGIAGFGRSPESLPSQLGL---KKFSYCLVSHA 260

Query: 292 TSAKPSS--MVFGDSAVSRTAR-----FTPLLANPK--LDTFYYVELVGISVGGAHVRGI 342
               P+S  +V    + S   +     +TP   NP      +YYV L  I +G  HV+ +
Sbjct: 261 FDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVK-V 319

Query: 343 TASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF---SLFDTC 399
                     GNGG I+DSGT+ T + +P Y  +   F    +    A +    +    C
Sbjct: 320 PYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPC 379

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLS------ 451
           F++SG+  V VP  + HF+ GA ++LP  NY   VD SG  C    +  MSG        
Sbjct: 380 FNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVD-SGVICLTIVSDNMSGSGIGGGPA 438

Query: 452 -IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            I+GN QQ+ F V +DL   R GF  + C 
Sbjct: 439 IILGNYQQRNFHVEFDLKNERFGFKQQNCV 468


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 179/399 (44%), Gaps = 61/399 (15%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCA----PCKKC--YSQTDPVFDPAKSRSFATV 191
           Y   L +GTPP+ + + +DTGSD+ W+ C      C  C  Y     +   + S S +++
Sbjct: 12  YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 71

Query: 192 P--CRSPLCRKLDSS----------GCNR----RNTC-----LYQVSYGDGSITVGDFST 230
              C SPLC  + SS          GC+     + TC      +  +YG G + +G  + 
Sbjct: 72  RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 131

Query: 231 ETLTFRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           +TLT  G+       V     GC       +    G+ G GRG LS P+Q G    + FS
Sbjct: 132 DTLTTHGSSPSFTREVPNFCFGCVGST---YREPIGIAGFGRGVLSLPSQLGF-LQKGFS 187

Query: 285 YCLVDRSTSAKP---SSMVFGDSAVSRT--ARFTPLLANPKLDTFYYVELVGISVGGAHV 339
           +C +    +  P   S +V GD A+S     +FT LL NP    +YY+ L  I+VG A  
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 247

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---F 396
             + +SL + D  GNGG+IIDSGT+ T L  P Y  L    ++   +  RA +      F
Sbjct: 248 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQS-IITYPRAQEQEARTGF 306

Query: 397 DTCFDLSGKTEVK------VPTVVLHF-RGADVSLPATNYLI----PVDSSGTFCFAFAG 445
           D C+ +     V       +P++  HF     + LP  N+      P +S+   C     
Sbjct: 307 DLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQN 366

Query: 446 TMSGLS----IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                S    + G+ QQQ  +VVYDL   RIGF P  CA
Sbjct: 367 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 405


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 172/371 (46%), Gaps = 45/371 (12%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC--R 199
           L  GTP + + MVLDTGS++ W+ C    K     + +F+P  S+++  +PC SP C  R
Sbjct: 71  LTAGTPLQNITMVLDTGSELSWLHC----KKEPNFNSIFNPLASKTYTKIPCSSPTCETR 126

Query: 200 KLD---SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
             D      C+    C + +SY D S   G+ + ET             GC       N 
Sbjct: 127 TRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSSNS 186

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS--RTA 310
                  GL+G+ RG LSF  Q G    RKFSYC+ DR +S     ++ G+++ S  +  
Sbjct: 187 EEDAKTTGLMGMNRGSLSFVNQMGF---RKFSYCISDRDSSG---VLLLGEASFSWLKPL 240

Query: 311 RFTPLLAN----PKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D   Y V+L GI V    V  +  S+F  D  G G  ++DSGT  
Sbjct: 241 NYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDK-VLSLPKSVFVPDHTGAGQTMVDSGTQF 299

Query: 366 TRLTRPAYIALRDAFRAGASSLKRA---PDFSLFDTCFDLSGKTE------VKVPTVVLH 416
           T L  P Y AL+  F      + R    P + +F    DL    E        +P V L 
Sbjct: 300 TFLLGPVYSALKQEFLLQTKGVLRVLNEPRY-VFQGAMDLCYLIEPTRAALPNLPVVNLM 358

Query: 417 FRGADVSLPATN--YLIPVDSSG---TFCFAFAGTMS-GLS--IIGNIQQQGFRVVYDLA 468
           FRGA++S+      Y +P +  G    +CF F  + S G+   +IG+ QQQ   + YDL 
Sbjct: 359 FRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWMEYDLE 418

Query: 469 ASRIGFAPRGC 479
            SRIGFA   C
Sbjct: 419 KSRIGFAEVRC 429


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 168/355 (47%), Gaps = 25/355 (7%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATV 191
           A  +G Y    G+GTPP+ V   LD  SD+VW  C         T P F+P +S + A V
Sbjct: 94  ATNAGMYVFSYGIGTPPQQVSGALDISSDLVWTACG-------ATAP-FNPVRSTTVADV 145

Query: 192 PCRSPLCRKLDSSGCNR-----RNTCLYQVSYGDGSI-TVGDFSTETLTFRGTRVARVAL 245
           PC    C++     C        + C Y   YG G+  T G   TE  TF  TR+  V  
Sbjct: 146 PCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVF 205

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA 305
           GCG  N G F   +G++GLGRG LS  +Q   + +R FSY      +    S ++FGD A
Sbjct: 206 GCGLQNVGDFSGVSGVIGLGRGNLSLVSQL--QVDR-FSYHFAPDDSVDTQSFILFGDDA 262

Query: 306 VSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL-DPAGNGGVIIDSG 362
             +T+    T LLA+    + YYVEL GI V G  +  I +  F L +  G+GGV +   
Sbjct: 263 TPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDL-AIPSGTFDLRNKDGSGGVFLSIT 321

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGAD 421
             VT L   AY  LR A  A    L      +L  D C+      + KVP++ L F G  
Sbjct: 322 DLVTVLEEAAYKPLRQAV-ASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGA 380

Query: 422 V-SLPATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGF 474
           V  L   NY     ++G  C     + +G  S++G++ Q G  ++YD+  S++ F
Sbjct: 381 VMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 435


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 132/396 (33%), Positives = 176/396 (44%), Gaps = 73/396 (18%)

Query: 151 VYMVLDTGSDVVWIQCAP--CKKCYSQTDPVFDPAKSRSFAT--------VPCRSPLCRK 200
           V + LDTGSD+VW  CAP  C  C  +  P    + S             VPC SPLC  
Sbjct: 109 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLCSA 168

Query: 201 LDSS----------GCNRRN----TC---------LYQVSYGDGSITVGDFSTETLTFRG 237
             +S          GC   +    +C         LY  +YGDGS+              
Sbjct: 169 AHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLY-YAYGDGSLVAHLRRGRVGLGAS 227

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA--- 294
             V      C H   G  V   G+ G GRG LS P Q   + + +FSYCLV  S  A   
Sbjct: 228 VAVDNFTFACAHTALGEPV---GVAGFGRGPLSLPGQLAPQLSGRFSYCLVSHSFRADRL 284

Query: 295 -KPSSMVFGDS--AVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
            +PS ++ G S  A + T  F  TPLL NPK   FY V L  +SVG   ++     L ++
Sbjct: 285 IRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQA-RPELARV 343

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL-----KRAPDFSLFDTCFDLSG 404
           D AGNGG+++DSGT+ T L    Y  + +AF    ++      +RA + +    C+  + 
Sbjct: 344 DRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTGLTPCYHYA- 402

Query: 405 KTEVKVPTVVLHFRG-ADVSLPATNYLIPV------------DSSGTFCFAFAGTMSG-- 449
            ++  VP + LHFRG A V+LP  NY +              D  G       G +SG  
Sbjct: 403 ASDRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDVSGED 462

Query: 450 ------LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                    +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 463 GGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 123/371 (33%), Positives = 181/371 (48%), Gaps = 42/371 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VGTPP+ V MV+DTGS++ W+ C   +   S +   F+P  S S++ +PC S  C   
Sbjct: 77  LTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS-TFNPVWSSSYSPIPCSSSTCTDQ 135

Query: 202 DSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD----NE 252
                    C+    C   +SY D S + G+ +T+T     + +  V  GC       N 
Sbjct: 136 TRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFSSNS 195

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-- 310
                  GL+G+ RG LSF +Q G     KFSYC+ +   S     ++ GD+  S  A  
Sbjct: 196 EEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCISEYDFSGL---LLLGDANFSWLAPL 249

Query: 311 RFTPLLAN----PKLDTF-YYVELVGISVGGAH-VRGITASLFKLDPAGNGGVIIDSGTS 364
            +TPL+      P  D   Y V+L GI V  AH +  I  S+F+ D  G G  ++DSGT 
Sbjct: 250 NYTPLIEMSTPLPYFDRVAYTVQLEGIKV--AHKLLPIPESVFEPDHTGAGQTMVDSGTQ 307

Query: 365 VTRLTRPAYIALRDAF-RAGASSLKRAPDFSL-----FDTCFDL-SGKTEV-KVPTVVLH 416
            T L  PAY ALRD F    A SL+   D +       D C+ + + +T +  +P+V L 
Sbjct: 308 FTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLV 367

Query: 417 FRGADVSLPATN--YLIPVDSSGT---FCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLA 468
           FRGA++++      Y +P +  G     CF F  + + G+   +IG++ QQ   + +DL 
Sbjct: 368 FRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLK 427

Query: 469 ASRIGFAPRGC 479
            SRIG A   C
Sbjct: 428 KSRIGLAEIRC 438


>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 91/274 (33%), Positives = 132/274 (48%), Gaps = 54/274 (19%)

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
           C Y ++YGDGS T G+   E L F    V     GCG +N+GLF   +GL+GLGR  LS 
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSL 192

Query: 272 PTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVG 331
            +QT                                          NP+L  FY++ L G
Sbjct: 193 ISQTSE----------------------------------------NPQLYNFYFINLTG 212

Query: 332 ISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP 391
           IS+GG  ++  +         G   +++DSGT +TRL    Y AL+  F    +    AP
Sbjct: 213 ISIGGVALQAPSV--------GPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAP 264

Query: 392 DFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATN--YLIPVDSSGTFCFAFAGT-- 446
            FS+ DTCF+LS   EV +PT+ +HF G A++++  T   Y +  D+S   C A A    
Sbjct: 265 AFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDAS-QVCLALASLEY 323

Query: 447 MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              ++I+GN QQ+  RV+YD   +++GFA   C+
Sbjct: 324 QDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 115/365 (31%), Positives = 171/365 (46%), Gaps = 42/365 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCR 199
           L +GTPP+   MVLDTGS + WIQ      C+ +  P   FDP+ S +F+ +PC  PLC+
Sbjct: 79  LPIGTPPQTQPMVLDTGSQLSWIQ------CHKKQPPTASFDPSLSSTFSILPCTHPLCK 132

Query: 200 ------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGCGHDNE 252
                  L +S C++   C Y   Y DG+   G+   E  TF R      + LGC  ++ 
Sbjct: 133 PRIPDFTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLILGCATES- 190

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS---AKPSSMVFGDSAVSRT 309
                  G+LG+  GRLSF  Q+      KFSYC+  R T        S   G++  S+ 
Sbjct: 191 ---TDPRGILGMNLGRLSFAKQSKI---TKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKG 244

Query: 310 ARFTPLLAN-----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
            ++  ++ +     P  D   Y + +VGI + G  +  I+ ++F+ D  G+G  +IDSG+
Sbjct: 245 FKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLN-ISPAVFRADAGGSGQTMIDSGS 303

Query: 364 SVTRLTRPAYIALR-DAFRAGASSLKRAPDF-SLFDTCFDLSGKTEVK--VPTVVLHF-R 418
             T L   AY  +R    RA    LK+   +  + D CFD     E+   +  +V  F R
Sbjct: 304 EFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEFER 363

Query: 419 GADVSLPATNYLIPVDSSGTFCFAFAGTM---SGLSIIGNIQQQGFRVVYDLAASRIGFA 475
           G +V +P    L  V   G  C     +    +  +IIGN  QQ   V +DL   R+GF 
Sbjct: 364 GVEVVIPKERVLADV-GGGVHCVGIGSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVGFG 422

Query: 476 PRGCA 480
              C+
Sbjct: 423 KADCS 427


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 115/371 (30%), Positives = 180/371 (48%), Gaps = 44/371 (11%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFA 189
           +G Y+TR+ +GTPP+  Y+ +DTGSDV W+ C PC  C   ++      +FDP KS S  
Sbjct: 45  TGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKT 104

Query: 190 TVPCRSPLCRKLDSSGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRG---------TR 239
           ++ C    C    +S C+  + +C Y   YGDGS T G    + L+F           + 
Sbjct: 105 SISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSG 164

Query: 240 VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF--NRKFSYCLVDRSTSAKPS 297
            AR+  GCG +  G ++   GL+G G+  +S P+Q  ++      F++CL  +  +    
Sbjct: 165 TARLTFGCGSNQTGTWL-TDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCL--QGDNKGSG 221

Query: 298 SMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
           ++V G         +TP++  PK  + Y VEL+ I V G +V   TA     D + +GGV
Sbjct: 222 TLVIGH-IREPGLVYTPIV--PK-QSHYNVELLNIGVSGTNVTTPTA----FDLSNSGGV 273

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           I+DSGT++T L +PAY    D F+A      R+    +    F      E   P V L+F
Sbjct: 274 IMDSGTTLTYLVQPAY----DQFQAKVRDCMRS---GVLPVAFQFFCTIEGYFPNVTLYF 326

Query: 418 R-GADVSLPATNYL---IPVDSSGTFCFAFAGTMS-----GLSIIGNIQQQGFRVVYDLA 468
             GA + L  ++YL   +       +CF++  + S       +I G+   +   VVYD  
Sbjct: 327 AGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYDNV 386

Query: 469 ASRIGFAPRGC 479
            +RIG+    C
Sbjct: 387 NNRIGWKNFDC 397


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 169/361 (46%), Gaps = 36/361 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y   L +GTPP+    ++    + VW QC+PC++C+ Q  P+F+ + S ++   PC + L
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87

Query: 198 CRKLDSSGCNRRNTCLYQVS--YGDGSITVGDFSTETLTFRGTRVARVALGCGHD-NEGL 254
           C  + +S C+    C Y+V   +GD   T G   T+T    GT  A +A GC  D N   
Sbjct: 88  CESVPASTCSGDGVCSYEVETMFGD---TSGIGGTDTFAI-GTATASLAFGCAMDSNIKQ 143

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV---SRTAR 311
            + A+G++GLGR   S     G+     FSYCL     + K S+++ G SA     ++A 
Sbjct: 144 LLGASGVVGLGRTPWSL---VGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAGGKSAA 200

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI-IDSGTSVTRLTR 370
            TPL+      + Y + L GI  G   +           P  NG V+ +D+   V+ L  
Sbjct: 201 TTPLVNTSDDSSDYMIHLEGIKFGDVIIA----------PPPNGSVVLVDTIFGVSFLVD 250

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTCF-----DLSGKTEVKVPTVVLHFRG-ADVSL 424
            A+ A++ A      +   A     FD CF          + + +P VVL F+G A +++
Sbjct: 251 AAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTV 310

Query: 425 PATNYLIPVDSSGTFCFAFAGT-----MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           P + Y+     +GT C A   +      + LSI+G + Q+    ++DL    + F P  C
Sbjct: 311 PPSKYMYDA-GNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADC 369

Query: 480 A 480
           +
Sbjct: 370 S 370


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 123/404 (30%), Positives = 190/404 (47%), Gaps = 45/404 (11%)

Query: 112 RNRSR-GRANGGFSSSVISGLAQGS------GEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
           R+R+R GR   G    V+    QG+      G YFT++ +G+P +  Y+ +DTGSD++WI
Sbjct: 50  RDRARHGRILQGVVGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKDFYVQIDTGSDILWI 109

Query: 165 QCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCR---KLDSSGCNRR-NTCLYQ 215
            C  C  C   +        FD A S + A V C  P+C    +  +SGC+ + N C Y 
Sbjct: 110 NCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTATSGCSSQANQCSYT 169

Query: 216 VSYGDGSITVGDFSTETLTFRGTRVAR---------VALGCGHDNEGLFV----AAAGLL 262
             YGDGS T G + ++T+ F    + +         +  GC     G       A  G+ 
Sbjct: 170 FQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIF 229

Query: 263 GLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK 320
           G G G LS  +Q   R    + FS+CL  +        +V G+  +  +  ++PL+  P 
Sbjct: 230 GFGPGALSVISQLSSRGVTPKVFSHCL--KGGENGGGVLVLGE-ILEPSIVYSPLV--PS 284

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
           L   Y + L  I+V G  +  I +++F      N G I+DSGT++  L + AY    DA 
Sbjct: 285 L-PHYNLNLQSIAVNG-QLLPIDSNVFA--TTNNQGTIVDSGTTLAYLVQEAYNPFVDAI 340

Query: 381 RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIP---VDSS 436
            A  S   + P  S  + C+ +S       P V L+F  GA + L   +YL+    +DS+
Sbjct: 341 TAAVSQFSK-PIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDSA 399

Query: 437 GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             +C  F     G +I+G++  +    VYDLA  RIG+A   C+
Sbjct: 400 AMWCIGFQKVERGFTILGDLVLKDKIFVYDLANQRIGWADYNCS 443


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 172/370 (46%), Gaps = 36/370 (9%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           +A G+ EY    G G P +   +  DT   V  ++C PC    +  DP F+P++S SFA 
Sbjct: 81  VAPGALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAA 139

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT------FRGTRVARVA 244
           +PC SP C  ++ +G     +C + + +G+ ++  G    +TLT      F G     + 
Sbjct: 140 IPCGSPEC-AVECTGA----SCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIE 194

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQT----GRRFNRKFSYCLVDRSTSAKPSSMV 300
           +G   D    F  A GL+ L R   S  ++            FSYCL   S ++    + 
Sbjct: 195 VGADADT---FDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLS 251

Query: 301 FGDSAVSRTA---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            G S    +    ++ P+ +NP     Y+VELVGISVGG  +  +  ++F        G 
Sbjct: 252 IGASRPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVGGEDLP-VPPAVFAAH-----GT 305

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           ++++ T  T L   AY ALRDAFR   +    AP F + DTC++L+G   + VPTV L F
Sbjct: 306 LLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTVALRF 365

Query: 418 RGA-DVSLPATNYLIPVDSSGTFC-------FAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
            G  ++ L     +   D S  F         A       +S+IG + Q+   VVYDL  
Sbjct: 366 AGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRG 425

Query: 470 SRIGFAPRGC 479
            R+GF P  C
Sbjct: 426 GRVGFIPGRC 435


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 131/386 (33%), Positives = 184/386 (47%), Gaps = 31/386 (8%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP-PRYVYMVLDTGSDVVWIQCAPCK 170
           RNR   +          SG A         + VGTP  + V  ++D  S  VW QCAPC 
Sbjct: 65  RNRGNKQQQQQLGGEAASGAAP---PLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCA 121

Query: 171 KCYSQTDP---VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCL----------YQVS 217
                  P    F P  S +F+ +PC S +C  +    C R               Y ++
Sbjct: 122 AAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLT 181

Query: 218 YGDGSI-TVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTG 276
           YG  +  T G  +T+T TF  T V  V  GC   + G F  A+G++G+GRG LS  +Q  
Sbjct: 182 YGGSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL- 240

Query: 277 RRFNRKFSYCLV--DRSTSAKPSSMV-FGDSAVSRT--ARFTPLLANPKLDTFYYVELVG 331
            +F  KFSY L+  + +      S++ FGD AV +T   R TPLL++     FYYV L G
Sbjct: 241 -QFG-KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTG 298

Query: 332 ISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASSLKR 389
           + V G  +  I A  F L   G GGVI+ S T VT L + AY  +R A   R G  ++  
Sbjct: 299 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNG 358

Query: 390 APDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMS 448
           +    L D C++ S   +VKVP + L F  GAD+ L A NY    + +G  C     +  
Sbjct: 359 SAALEL-DLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQG 417

Query: 449 GLSIIGNIQQQGFRVVYDLAASRIGF 474
           G S++G + Q G  ++YD+ A R+ F
Sbjct: 418 G-SVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
          Length = 503

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 129/395 (32%), Positives = 174/395 (44%), Gaps = 72/395 (18%)

Query: 151 VYMVLDTGSDVVWIQCAP--CKKCYSQTDPVFDPAKSRSFAT--VPCRSPLC-------- 198
           V + LDTGSD+VW  CAP  C  C  +  P           +  +PC SPLC        
Sbjct: 105 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRRIPCASPLCSAAHASAP 164

Query: 199 ------------RKLDSSGCNRRNTC--LYQVSYGDGSITVGDFSTETLTFRGTR----- 239
                         +++  C   + C  LY  +YGDGS+             G R     
Sbjct: 165 PSDLCAVARCPLEDIETGSCGASHACPPLY-YAYGDGSLVAHLRRGRVALGAGARASVAV 223

Query: 240 -VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---- 294
            V      C H   G  V   G+ G GRG LS P Q   + + +FSYCLV  S  A    
Sbjct: 224 AVDNFTFACAHTALGEPV---GVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLI 280

Query: 295 KPSSMVFGDSAVSRTAR-------FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
           +PS ++ G S     A        +TPLL NPK   FY V L  +SVG A ++     L 
Sbjct: 281 RPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQA-RPELA 339

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL-----KRAPDFSLFDTCFDL 402
           ++D AGNGG+++DSGT+ T L    Y  + +AF    ++      +RA + +    C+  
Sbjct: 340 RVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRY 399

Query: 403 SGKTEVKVPTVVLHFRG-ADVSLPATNYLIPV-----------DSSGTFCFAFAGTMSG- 449
           +  ++  VP + LHFRG A V+LP  NY +             D  G       G  SG 
Sbjct: 400 A-ASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGE 458

Query: 450 -----LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                   +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 459 EGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 116/365 (31%), Positives = 171/365 (46%), Gaps = 31/365 (8%)

Query: 133 QGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVP 192
           +GSG +   L +G+PP    +V+DTGS ++W+QC PC  C+ Q+   FDP KS SF T+ 
Sbjct: 100 RGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLG 158

Query: 193 CRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-----GTRVARVALGC 247
           C  P    ++   CNR N   Y++ Y  G  + G  + E+L F        + + +  GC
Sbjct: 159 CGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGC 218

Query: 248 GH-----DNEGLFVAAAGLLGLG-RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
           GH     +N+    A  G+ GLG    ++  TQ G     KFSYC+ D +      + + 
Sbjct: 219 GHMNIKTNNDD---AYNGVFGLGAYPHITMATQLG----NKFSYCIGDINNPLYTHNHLV 271

Query: 302 GDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
                      TPL  +      YYV L  ISVG   ++ I  + FK+   G+GGV+IDS
Sbjct: 272 LGQGSYIEGDSTPLQIHFG---HYYVTLQSISVGSKTLK-IDPNAFKISSDGSGGVLIDS 327

Query: 362 GTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFD-TCFD-LSGKTEVKVPTVVLHFR 418
           G + T+L    +  L D         L+R P    F+  CF  +  +  V  P V  HF 
Sbjct: 328 GMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFA 387

Query: 419 -GADVSLPATNYLIPVDSSGTFCFAFAGTMS---GLSIIGNIQQQGFRVVYDLAASRIGF 474
            GAD+ L + + L        FC A   + S    LS+IG + QQ + V +DL   ++ F
Sbjct: 388 GGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFF 446

Query: 475 APRGC 479
               C
Sbjct: 447 RRIDC 451


>gi|147866052|emb|CAN80962.1| hypothetical protein VITISV_022007 [Vitis vinifera]
          Length = 150

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)

Query: 334 VGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF 393
           VGG  V  I+  +F+L   G+GGV++D+GT+VTRL   AY A RDAF A  ++L RA   
Sbjct: 5   VGGIRVP-ISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGV 63

Query: 394 SLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSI 452
           ++FDTC+DL G   V+VPTV  +F G  + +LPA N+LIP+D +GTFCFAFA + SGLSI
Sbjct: 64  AIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSI 123

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +GNIQQ+G ++ +D A   +GF P  C
Sbjct: 124 LGNIQQEGIQISFDGANGYVGFGPNIC 150


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 131/386 (33%), Positives = 184/386 (47%), Gaps = 31/386 (8%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP-PRYVYMVLDTGSDVVWIQCAPCK 170
           RNR   +          SG A         + VGTP  + V  ++D  S  VW QCAPC 
Sbjct: 65  RNRGNKQQQQQLGGEAASGAAP---PLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCA 121

Query: 171 KCYSQTDP---VFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCL----------YQVS 217
                  P    F P  S +F+ +PC S +C  +    C R               Y ++
Sbjct: 122 AAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLT 181

Query: 218 YGDGSI-TVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTG 276
           YG  +  T G  +T+T TF  T V  V  GC   + G F  A+G++G+GRG LS  +Q  
Sbjct: 182 YGGSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL- 240

Query: 277 RRFNRKFSYCLV--DRSTSAKPSSMV-FGDSAVSRTAR--FTPLLANPKLDTFYYVELVG 331
            +F  KFSY L+  + +      S++ FGD AV +T R   TPLL++     FYYV L G
Sbjct: 241 -QFG-KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTG 298

Query: 332 ISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASSLKR 389
           + V G  +  I A  F L   G GGVI+ S T VT L + AY  +R A   R G  ++  
Sbjct: 299 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNG 358

Query: 390 APDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMS 448
           +    L D C++ S   +VKVP + L F  GAD+ L A NY    + +G  C     +  
Sbjct: 359 SAALEL-DLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQG 417

Query: 449 GLSIIGNIQQQGFRVVYDLAASRIGF 474
           G S++G + Q G  ++YD+ A R+ F
Sbjct: 418 G-SVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 134/442 (30%), Positives = 202/442 (45%), Gaps = 45/442 (10%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
           ++++ L+L H D+L  N              + R++ +    +    +  R R   +  G
Sbjct: 28  DTAVRLKLAHRDTLWPN-------------PLSRIEDIIGADQKRHSLISRKR---KFKG 71

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC--APCKKCYSQTDPV 179
           G    + SG+  G+ +YFT + VGTP +   +V+DTGS++ W+ C      K   +   V
Sbjct: 72  GVKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRV 131

Query: 180 FDPAKSRSFATVPCRSPLCRK-----LDSSGCNRRNT-CLYQVSYGDGSITVGDFSTETL 233
           F   +S+SF TV C +  C+         S C   +T C Y   Y DGS   G F+ ET+
Sbjct: 132 FRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETI 191

Query: 234 TF-----RGTRVARVALGCGHDNEGLFVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
           T      R  R+  + +GC     G     A G+LGL     SF +     F  K SYCL
Sbjct: 192 TVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCL 251

Query: 288 VDRSTSAKPSS-MVFGDSAVSRTARFTPLLANPKLDT-----FYYVELVGISVGGAHVRG 341
           VD  ++   S+ ++FG S+ S + +  P    P LD      FY + ++GIS+G   +  
Sbjct: 252 VDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTP-LDLTLIPPFYAINIIGISIGDDML-D 309

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA-PDFSLFDTCF 400
           I   ++  D    GG I+DSGTS+T L   AY  +          LKR  P+    + CF
Sbjct: 310 IPTQVW--DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCF 367

Query: 401 -DLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF--AGTMSGLSIIGNIQ 457
              SG  E K+P +  H +G     P     +   + G  C  F  AGT    +++GNI 
Sbjct: 368 SSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGT-PATNVVGNIM 426

Query: 458 QQGFRVVYDLAASRIGFAPRGC 479
           QQ +   +DL AS + FAP  C
Sbjct: 427 QQNYLWEFDLMASTLSFAPSTC 448


>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 598

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 93/246 (37%), Positives = 131/246 (53%), Gaps = 12/246 (4%)

Query: 240 VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
           VA    GC     G  V   GL+G G G LSFP+Q    +   FSYCL    +S   S++
Sbjct: 357 VAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSSTL 416

Query: 300 VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
             G +   +  + TPLL+NP   + YYV +VGI VGG  +  + AS    DPA   G I+
Sbjct: 417 RLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPML-VPASALAFDPASGRGTIV 475

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           D+GT  TRL+ P Y A+RD FR+   +    P    FDTC++++    + VPTV   F G
Sbjct: 476 DAGTMFTRLSAPVYAAVRDVFRSRVRAPVTGP-LGGFDTCYNVT----ISVPTVTFSFDG 530

Query: 420 -ADVSLPATNYLIPVDSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
              V+LP  N +I   S G  C A A     G  + L+++ ++QQQ  RV++D+A  R+G
Sbjct: 531 RVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVG 590

Query: 474 FAPRGC 479
           F+   C
Sbjct: 591 FSRELC 596


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 130/404 (32%), Positives = 185/404 (45%), Gaps = 50/404 (12%)

Query: 114 RSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCY 173
           RS G  + G  + V++ +     EY   + VGTPP  V  + DTGSD+VW++C       
Sbjct: 86  RSSGAPSPGTGAGVVAEVVSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDN 145

Query: 174 SQTDP---VFDPAKSRSFATVPCRSPLCRKLDSSG-CNRRNTCLYQVSYGDGSITVGDFS 229
           + T P    F P+ S ++  V C +  CR L S+  C+   +C Y  SYGDGS   G  S
Sbjct: 146 NSTAPPSVYFVPSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLS 205

Query: 230 TETLTFR----------------------GTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
           TET TF                          +A++  GC     G F A   L+GLG G
Sbjct: 206 TETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADG-LVGLGGG 264

Query: 268 RLSFPTQTG--RRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT--ARFTPLLANPKLDT 323
            +S  +Q G      RKFSYCL   + +   S++ FG  AV     A  TPL+   +++T
Sbjct: 265 PVSLASQLGATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITG-EVET 323

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL-RDAFRA 382
           +Y + L  I+V G   R  TA+           +I+DSGT++T L       L +D  R 
Sbjct: 324 YYTIALDSINVAGTK-RPTTAA--------QAHIIVDSGTTLTYLDSALLTPLVKDLTRR 374

Query: 383 GASSLKRAPDFSLFDTCFDLS---GKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGT 438
                  +P+  + D C+D+S   G+  + +P V L    G +V+L   N  + V   G 
Sbjct: 375 IKLPRAESPE-KILDLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVV-QEGV 432

Query: 439 FCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            C A   T     +SI+GNI QQ   V YDL    + FA   CA
Sbjct: 433 LCLALVATSERQSVSILGNIAQQNLHVGYDLEKGTVTFAAADCA 476


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 119/368 (32%), Positives = 177/368 (48%), Gaps = 54/368 (14%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC-- 193
           G Y++ + +G+PP+   +V+DTGSD+ W++C PC    S T   FD   S ++  + C  
Sbjct: 122 GVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSST---FDRLASNTYKALTCAD 178

Query: 194 --RSP----LCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
             R P    L R+L  SG + R+T              G  S E   F G        GC
Sbjct: 179 DLRLPVLLRLWRRLFHSGRSLRDTL----------KMAGAASDELEEFPG-----FVFGC 223

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS--TSAKPSSMVFGDSA 305
           G   +GL     G+L L  G LSFP+Q G ++  KFSYCL+ ++   S K S MVFG++A
Sbjct: 224 GSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAA 283

Query: 306 VS---------RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
           V          +  ++TP+  +     +Y V L GISVG   +  ++ S F      +  
Sbjct: 284 VELKEPGSGKPQELQYTPIGES---SIYYTVRLDGISVGNQRL-DLSPSTFL--NGQDKP 337

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---FDTCFDLSGKTEVKVPTV 413
            I DSGT++T L  P+ +   D+ +   +S+    +F      D CF +   +   +P +
Sbjct: 338 TIFDSGTTLTML--PSGVC--DSIKQSLASMVSGAEFVAIKGLDACFRVPPSSGQGLPDI 393

Query: 414 VLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRI 472
             HF  GAD     +NY+I  D     C  F  T + +SI GN+QQQ F V++D+   RI
Sbjct: 394 TFHFNGGADFVTRPSNYVI--DLGSLQCLIFVPT-NEVSIFGNLQQQDFFVLHDMDNRRI 450

Query: 473 GFAPRGCA 480
           GF    C 
Sbjct: 451 GFKETDCG 458


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 167/357 (46%), Gaps = 44/357 (12%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L   +G Y   L +GTPP    ++ DTGS ++W QCAPC +C ++  P F PA S +F+ 
Sbjct: 83  LDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSK 142

Query: 191 VPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           +PC S LC+ L S    CN    C+Y   YG G  T G  +TETL   G     V  GC 
Sbjct: 143 LPCASSLCQFLTSPYRTCNATG-CVYYYPYGMG-FTAGYLATETLHVGGASFPGVTFGCS 200

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA--V 306
            +N G+  +++G++GLGR  LS  +Q G     +FSYCL   +  A  S ++FG  A   
Sbjct: 201 TEN-GVGNSSSGIVGLGRSPLSLVSQVGV---ARFSYCLRSNA-DAGDSPILFGSLAKVT 255

Query: 307 SRTARFTPLLANPKL--DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
               + TPLL NP++   ++YYV L GI+VG   +    A+L  ++    G  +    T+
Sbjct: 256 GGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMAMANLTTVNGTRFGFDLCFDATA 315

Query: 365 VTRLTRPAYIALRDAFRAGAS-SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 423
                      L   F  GA  +++R   F + +   D  G+  V+   V          
Sbjct: 316 AGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEV--DSQGRAAVECLLV---------- 363

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LPA+  L                   +SIIGN+ Q    V+YDL      FAP  CA
Sbjct: 364 LPASEKL------------------SISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 402


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 172/380 (45%), Gaps = 48/380 (12%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSR 186
           A   G YFT++ +G+PP+  Y+ +DTGSD++W+ CAPC KC  +TD      ++D   S 
Sbjct: 71  ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASS 130

Query: 187 SFATVPCRSPLCR-KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR----GTRVA 241
           +   V C    C   + S  C  +  C Y V YGDGS + GDF  + +T        R A
Sbjct: 131 TSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTA 190

Query: 242 ----RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRS 291
                V  GCG +  G       A  G++G G+   S  +Q   G    R FS+CL + +
Sbjct: 191 PLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMN 250

Query: 292 TSAKPSSMVFGDSAV-SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
                   +F    V S   + TPL+ N      Y V L G+ V G  +  +  SL   +
Sbjct: 251 GGG-----IFAIGEVESPVVKTTPLVPN---QVHYNVILKGMDVDGEPID-LPPSLASTN 301

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD---TCFDLSGKTE 407
             G+GG IIDSGT++  L +  Y +L +   A     K+     +      CF  +  T+
Sbjct: 302 --GDGGTIIDSGTTLAYLPQNLYNSLIEKITA-----KQQVKLHMVQETFACFSFTSNTD 354

Query: 408 VKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFAG----TMSGLSII--GNIQQQG 460
              P V LHF  +  +S+   +YL  +     +CF +      T  G  +I  G++    
Sbjct: 355 KAFPVVNLHFEDSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSN 413

Query: 461 FRVVYDLAASRIGFAPRGCA 480
             VVYDL    IG+A   C+
Sbjct: 414 KLVVYDLENEVIGWADHNCS 433


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 122/363 (33%), Positives = 170/363 (46%), Gaps = 34/363 (9%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV---FDPAKSRSFATVPCRSPLC 198
           L +GTPP+   MVLDTGS + WIQC   K    +  P    FDP+ S SF  +PC  PLC
Sbjct: 86  LPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLC 145

Query: 199 --RKLDSS---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-RVALGCGHDNE 252
             R  D S    C+  + C Y   Y DG+   G+   E + F  ++    + LGC   ++
Sbjct: 146 KPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCATQSD 205

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
                A G+LG+  GRL FP+Q       KFSYC+  +       S   G++  S + R+
Sbjct: 206 D----ARGILGMNLGRLGFPSQAKI---TKFSYCVPTKQAQPASGSFYLGNNPASSSFRY 258

Query: 313 TPLLA------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
             LL        P LD   Y + L GIS+GG  +  I  S+FK +  G+G  +IDSG+  
Sbjct: 259 VNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLN-IPPSVFKPNAGGSGQTMIDSGSEF 317

Query: 366 TRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVK--VPTVVLHF-RGA 420
           T L   AY  +R+    + G    K      + D CFD     E+   V  +V  F +G 
Sbjct: 318 TYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFD-GDAIEIGRLVGDMVFEFEKGV 376

Query: 421 DVSLPATNYLIPVDSSGTFCFAFAGTM---SGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
            + +P    L  VD  G  C     +    +G +IIGN  QQ   V +DLA  R+GF   
Sbjct: 377 QIVIPKERVLATVD-GGVHCLGMGRSERLGAGGNIIGNFHQQNLWVEFDLANRRVGFGEA 435

Query: 478 GCA 480
            C+
Sbjct: 436 DCS 438


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 131/471 (27%), Positives = 206/471 (43%), Gaps = 73/471 (15%)

Query: 66  SLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSS 125
           SL  + + SL+  R P  L           +  LT  + +++   P+  +  R       
Sbjct: 19  SLLFYSIQSLARPRNPNSL-----------ILGLTPASRASLPTHPKASTSSRKKLTDVL 67

Query: 126 SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA----PCKKCYSQTD---- 177
            ++  L +    Y   L +GTPP+ + + +DTGSD+ W  C      C +C +  +    
Sbjct: 68  DMMEPLREVRDGYLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMM 127

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSS----------GCNR----RNTCLYQV-----SY 218
             F P+ S S     C SP C  + SS          GC+     + TC +       +Y
Sbjct: 128 ASFSPSHSSSSHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTY 187

Query: 219 GDGSITVGDFSTETLTFRG------TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFP 272
           G G +  G  + +TL   G        + R   GC   +   +    G+ G GRG LS P
Sbjct: 188 GAGGVVTGTLTRDTLRVHGRNLGVTQEIPRFCFGCVASS---YREPIGIAGFGRGALSLP 244

Query: 273 TQTGRRFNRK-FSYCLVDRSTSAKP---SSMVFGDSAVSRT--ARFTPLLANPKLDTFYY 326
           +Q G  F RK FS+C +    +  P   S ++ GD A++     +FTP+L +P    +YY
Sbjct: 245 SQLG--FLRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYY 302

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           V L  I+VG      + +SL + D  GNGG+++DSGT+ T L  P Y  +    ++   +
Sbjct: 303 VGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQS-IIN 361

Query: 387 LKRAPDFSL---FDTCFDLSGK-----TEVKVPTVVLHF-RGADVSLPATNYLI----PV 433
             RA D  +   FD C+ +  +     T   +P++  HF   A + L   ++      P 
Sbjct: 362 YPRATDMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPS 421

Query: 434 DSSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +S+   C  F     G      ++G+ QQQ   VVYD+   RIGF P  CA
Sbjct: 422 NSTVVKCLLFQSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDCA 472


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 164/359 (45%), Gaps = 42/359 (11%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           Y  RL +GTPP  +   +DTGSD++W QC PC  CY+Q  P+FDP+KS +F    C    
Sbjct: 61  YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEKRCHG-- 118

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDNE 252
                       N+C Y++ Y D S + G  +TET+T + T      +A  ++GCG +N 
Sbjct: 119 ------------NSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNS 166

Query: 253 GLFV-----AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA---KPSSMVFGDS 304
            L       +++G++GL  G  S  +Q         SYC   + TS      +++V GD 
Sbjct: 167 NLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINFGTNAVVAGDG 226

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
            V+            K   FYY+ L  +SVG   +  +         A +G + IDSGT+
Sbjct: 227 TVAADMFIK------KDQPFYYLNLDAVSVGDKRIETLGTPFH----AQDGNIFIDSGTT 276

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD-TCFDLSGKTEVKVPTVVLHFR-GADV 422
            T L       +R+A  A   +  + PD S  +  C++    T    P + LHF  GAD+
Sbjct: 277 YTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNW--DTMEIFPVITLHFAGGADL 334

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            L   N  +   + GTFC A       + +I GN       V YD +   I F+P  C+
Sbjct: 335 VLDKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393


>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 537

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 93/246 (37%), Positives = 131/246 (53%), Gaps = 12/246 (4%)

Query: 240 VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
           VA    GC     G  V   GL+G G G LSFP+Q    +   FSYCL    +S   S++
Sbjct: 296 VAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSSTL 355

Query: 300 VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
             G +   +  + TPLL+NP   + YYV +VGI VGG  +  + AS    DPA   G I+
Sbjct: 356 RLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPML-VPASALAFDPASGRGTIV 414

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           D+GT  TRL+ P Y A+RD FR+   +    P    FDTC++++    + VPTV   F G
Sbjct: 415 DAGTMFTRLSAPVYAAVRDVFRSRVRAPVTGP-LGGFDTCYNVT----ISVPTVTFSFDG 469

Query: 420 -ADVSLPATNYLIPVDSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
              V+LP  N +I   S G  C A A     G  + L+++ ++QQQ  RV++D+A  R+G
Sbjct: 470 RVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVG 529

Query: 474 FAPRGC 479
           F+   C
Sbjct: 530 FSRELC 535


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 171/366 (46%), Gaps = 40/366 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP--VFDPAKSRSFATVPCRSPLCR 199
           L +GTPP+   M+LDTGS + WIQC   KK   +  P  VFDP+ S SF+ +PC  PLC+
Sbjct: 81  LPIGTPPQSQQMILDTGSQLSWIQCH--KKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCK 138

Query: 200 ------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNE 252
                  L +S C+    C Y   Y DG++  G+   E +TF  ++    + LGC  D  
Sbjct: 139 PRIPDFTLPTS-CDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDAS 197

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS---AKPSSMVFGDSAVSRT 309
                  G+LG+  GRLSF +Q       KFSYC+  R          S   G++  S  
Sbjct: 198 D----DKGILGMNLGRLSFASQAKI---TKFSYCVPTRQVRPGFTPTGSFYLGENPNSAG 250

Query: 310 ARFTPLLA------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            ++  LL        P LD   + V L GI +G   +  I  S F+ DP+G G  +IDSG
Sbjct: 251 FQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLN-IPVSAFRADPSGAGQSMIDSG 309

Query: 363 TSVTRLTRPAYIALR-DAFRAGASSLKRAPDFS-LFDTCFDLSGKTEVK--VPTVVLHF- 417
           +  T L   AY  +R +  R     LK+   +S + D CFD     E+   +  +V  F 
Sbjct: 310 SEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDMCFD-GNAMEIGRLIGNMVFEFD 368

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFA-GTMSGLS--IIGNIQQQGFRVVYDLAASRIGF 474
           +G ++ +     L  V   G  C       M G +  IIGN  QQ   V +D+A  R+GF
Sbjct: 369 KGVEIVIEKGRVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDIANRRVGF 427

Query: 475 APRGCA 480
               C+
Sbjct: 428 GKADCS 433


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 122/411 (29%), Positives = 178/411 (43%), Gaps = 69/411 (16%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP--CKKCYSQTDPVFDPAKSRSF 188
           L+ GS +Y     +G   + + + +DTGSD+VW  C P  C  C  +     DP+   + 
Sbjct: 69  LSPGS-DYTLSFNLGPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNI 127

Query: 189 AT---VPCRSPLCR--------------------KLDSSGCNRRNTCLYQVSYGDGSITV 225
           +    + C S  C                      +++  C   +   +  +YGDGS+ +
Sbjct: 128 SHSTPISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSL-I 186

Query: 226 GDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR---RFNRK 282
                +TL+    ++     GC H     F    G+ G GRG LS P Q      +   +
Sbjct: 187 ASLYRDTLSLSTLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATHSPQLGNR 243

Query: 283 FSYCLVDRSTSA----KPSSMVFGDSAVSRTAR--------FTPLLANPKLDTFYYVELV 330
           FSYCLV  S  +    KPS ++ G     + +         +T +L NPK   FY V L 
Sbjct: 244 FSYCLVSHSFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLK 303

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKR 389
           GISVG   V      L +++  G+GGV++DSGT+ T L    Y ++ + F R    S +R
Sbjct: 304 GISVGKKTVPAPKI-LRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRR 362

Query: 390 APDFSL---FDTCFDLSGKTEVKVPTVVLHFRGAD--VSLPATNYLIPV----------D 434
           AP+         C+ L+  T   VP V L F G +  V LP  NY              +
Sbjct: 363 APEIEQKTGLSPCYYLN--TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKE 420

Query: 435 SSGTFCFAFAGTMSGLS-----IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             G   F   G  + +S     ++GN QQQGF V YDL   R+GFA R CA
Sbjct: 421 RVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKCA 471


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 120/364 (32%), Positives = 166/364 (45%), Gaps = 35/364 (9%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV-FDPAKSRSFATVPCRSPLCRK 200
           L +GTPP+   MVLDTGS + WIQC              FDP+ S SF+ +PC  PLC+ 
Sbjct: 84  LPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLCKP 143

Query: 201 -----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-VARVALGCGHDNEGL 254
                   + C++   C Y   Y DG+   G    E +TF  ++    + LGC   +   
Sbjct: 144 RIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAEAS--- 200

Query: 255 FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS---SMVFGDSAVS---- 307
                G+LG+  GR SF +Q       KFSYC+  R   A  S   S   G++  S    
Sbjct: 201 -TDEKGILGMNLGRRSFASQAKI---SKFSYCVPTRQARAGLSSTGSFYLGNNPNSGRFQ 256

Query: 308 --RTARFTPLLANPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
                 FTP   +P LD   Y + + GI +G A +  I+A+LF+ DP+G G  IIDSG+ 
Sbjct: 257 YINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLN-ISATLFRPDPSGAGQTIIDSGSE 315

Query: 365 VTRLTRPAYIALR-DAFRAGASSLKRAPDF-SLFDTCFDLSGKTEVK--VPTVVLHF-RG 419
            T L   AY  +R +  R     LK+   +  + D CFD     E+   +  +V  F +G
Sbjct: 316 FTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCFD-GNPMEIGRLIGNMVFEFEKG 374

Query: 420 ADVSLPATNYLIPVDSSGTFCFAFA-GTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAP 476
            ++ +     L  V   G  C       M G   +IIGN  QQ   V YDLA  RIG   
Sbjct: 375 VEIVIDKWRVLADV-GGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGK 433

Query: 477 RGCA 480
             C+
Sbjct: 434 ADCS 437


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/237 (36%), Positives = 126/237 (53%), Gaps = 35/237 (14%)

Query: 66  SLRLHHVD----SLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAV----------RVPP 111
           SLR+ H+      LS N+      +  ++RD  RV+S+ +     +          ++P 
Sbjct: 64  SLRVVHMHGACSHLSSNKDARLDHDEILRRDEARVESIHSKLSKNIADEVSKAKSTKLPA 123

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-K 170
           +N                G+  GS  Y   +G+GTP   + ++ DTGSD+ W QC PC  
Sbjct: 124 KN----------------GIILGSPNYIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLG 167

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFST 230
            CYSQ +P F+P+ S S+  V C SP+C   +S  C+  N CLY + YGDGS+TVG  + 
Sbjct: 168 SCYSQKEPKFNPSSSSSYHNVSCSSPMCGNPES--CSASN-CLYGIGYGDGSVTVGFLAK 224

Query: 231 ETLTFRGTRVAR-VALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
           E  T   + V   +  GCG +N+G+F+ +AG+LGLG G+ SFP QT   +N  FSYC
Sbjct: 225 EKFTLTNSDVLDDIYFGCGENNKGVFIGSAGILGLGPGKFSFPLQTTTTYNNIFSYC 281


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 125/415 (30%), Positives = 183/415 (44%), Gaps = 75/415 (18%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCA-----PCKKCYS--QTDPVFDPAKSRSFAT 190
           Y   L +GTPP+   + LDTGSD+ W+ C       C  C S  +  P F P++S S   
Sbjct: 25  YLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPSESTSNTR 84

Query: 191 VPCRSPLCRKLDSSGCNRRNTCL--------------------YQVSYGDGSITVGDFST 230
             C S  C  + SS  NR + C                     +  +YG G++ +G  S 
Sbjct: 85  DLCGSRFCVDVHSSD-NRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSR 143

Query: 231 ETLTFRGTR-----------VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
           +++T  G+            VA    G G     +     G+ G GRG LS P+Q G   
Sbjct: 144 DSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSI-REPLGIAGFGRGALSLPSQLGF-L 201

Query: 280 NRKFSYCLVDRSTSAKP---SSMVFGDSAVSRTAR-----FTPLLANPKLDTFYYVELVG 331
            + FS+C +    +  P   S +V GD A+S  +      FTP+L +     FYYV L G
Sbjct: 202 GKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSATYPNFYYVGLEG 261

Query: 332 ISVG---GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           + +G   G        SL  +D  GNGGV++D+GT+ T+L  P Y ++  +  + A   +
Sbjct: 262 VVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYASVLASLISAAPPYE 321

Query: 389 RAPDFSL---FDTCFDLSGK----TEVKVPTVVLHFRG-ADVSLPATNYLIPV----DSS 436
           R+ D      FD CF +        + ++P + LH  G A ++LP  +   PV    DS 
Sbjct: 322 RSRDLEARTGFDLCFKVPCARAPCADDELPPITLHLAGGARLALPKLSSYYPVTAIRDSV 381

Query: 437 GTFCFAF-----------AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
              C  F                  +++G+ Q Q   VVYDLAA R+GF PR CA
Sbjct: 382 VVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRVGFRPRDCA 436


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 179/366 (48%), Gaps = 37/366 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCR 199
           L +GTP +   +VLDTGS + WIQC P K       P   FDP+ S SF+ +PC  PLC+
Sbjct: 85  LPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCK 144

Query: 200 ------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-RVALGCGHDNE 252
                  L +S C+    C Y   Y DG+   G+   E  TF  ++    + LGC  ++ 
Sbjct: 145 PRIPDFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKEST 203

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTS---AKPSSMVFGDSAVSRT 309
            +     G+LG+  GRLSF +Q       KFSYC+  RS     A   S   G++  SR 
Sbjct: 204 DV----KGILGMNLGRLSFISQAKI---SKFSYCIPTRSNRPGLASTGSFYLGENPNSRG 256

Query: 310 ARFTPLLANPK------LDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            ++  LL  P+      LD   Y V L+GI +G   +  I +S+F+ D  G+G  ++DSG
Sbjct: 257 FKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRL-NIPSSVFRPDAGGSGQTMVDSG 315

Query: 363 TSVTRLTRPAYIALRDAF-RAGASSLKRAPDF-SLFDTCFDLSGKTEVK--VPTVVLHF- 417
           +  T L   AY  +++   R   S LK+   + S  D CFD + +  +   +  +V  F 
Sbjct: 316 SEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFG 375

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFA-GTMSGL--SIIGNIQQQGFRVVYDLAASRIGF 474
           RG ++ +     L+ V   G  C      +M G   +IIGN+ QQ   V +D+A  R+GF
Sbjct: 376 RGVEILVEKQRLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGF 434

Query: 475 APRGCA 480
           +   C+
Sbjct: 435 SKAECS 440


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 119/226 (52%), Gaps = 21/226 (9%)

Query: 145 GTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRK---- 200
           G+P   + +++DTGSD+ W+QC PC  CY+Q DP+FDPA S ++A V C +  C      
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 162

Query: 201 -------LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
                    S+G      C Y ++YGDGS + G  +T+T+   G  +     GCG  N G
Sbjct: 163 ATGTPGSCGSTGAGSEK-CYYALAYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRG 221

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF--GDSAVSRTAR 311
           LF   AGL+GLGR  LS  +QT  R+   FSYCL   ++     S+    GD A S    
Sbjct: 222 LFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRN 281

Query: 312 FTP-----LLANPKLDTFYYVELVGISVGGAHV--RGITASLFKLD 350
            TP     ++A+P    FY++ + G +VGG  +  +G+ AS   +D
Sbjct: 282 TTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLID 327


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 127/397 (31%), Positives = 186/397 (46%), Gaps = 48/397 (12%)

Query: 110 PPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
           PP +    R+N  +S ++I  L            +GTP +   +VLDTGS + WIQC P 
Sbjct: 63  PPSSPYTFRSNIKYSMALILSLP-----------IGTPSQSQELVLDTGSQLSWIQCHPK 111

Query: 170 KKCYSQTDPV--FDPAKSRSFATVPCRSPLCR------KLDSSGCNRRNTCLYQVSYGDG 221
           K       P   FDP+ S SF+ +PC  PLC+       L +S C+    C Y   Y DG
Sbjct: 112 KIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTS-CDSNRLCHYSYFYADG 170

Query: 222 SITVGDFSTETLTFRGTRVA-RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN 280
           +   G+   E  TF  ++    + LGC  ++        G+LG+  GRLSF +Q      
Sbjct: 171 TFAEGNLVKEKFTFSNSQTTPPLILGCAKES----TDEKGILGMNLGRLSFISQAKI--- 223

Query: 281 RKFSYCLVDRSTS---AKPSSMVFGDSAVSRTARFTPLLANPK------LDTF-YYVELV 330
            KFSYC+  RS     A   S   GD+  SR  ++  LL  P+      LD   Y V L 
Sbjct: 224 SKFSYCIPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQ 283

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKR 389
           GI +G   +  I  S+F+ D  G+G  ++DSG+  T L   AY  +++   R   S LK+
Sbjct: 284 GIRIGQKRLN-IPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKK 342

Query: 390 APDF-SLFDTCFDLSGKTEVK--VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFA- 444
              + S  D CFD +   E+   +  +V  F RG ++ +   + L+ V   G  C     
Sbjct: 343 GYVYGSTADMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQSLLVNV-GGGIHCVGIGR 401

Query: 445 GTMSGL--SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            +M G   +IIGN+ QQ   V +D+   R+GF+   C
Sbjct: 402 SSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAEC 438


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 116/377 (30%), Positives = 172/377 (45%), Gaps = 42/377 (11%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSR 186
           A   G YFT++ +G+PP+  Y+ +DTGSD++W+ CAPC KC  +TD      ++D   S 
Sbjct: 68  ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSS 127

Query: 187 SFATVPCRSPLCR-KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR----GTRVA 241
           +   V C    C   + S  C  +  C Y V YGDGS + GDF  + +T        R A
Sbjct: 128 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 187

Query: 242 ----RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRS 291
                V  GCG +  G       A  G++G G+   S  +Q   G    R FS+CL + +
Sbjct: 188 PLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN 247

Query: 292 TSAKPSSMVFGDSAV-SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
                   +F    V S   + TP++ N      Y V L G+ V G  +  +  SL   +
Sbjct: 248 GGG-----IFAVGEVESPVVKTTPIVPN---QVHYNVILKGMDVDGDPID-LPPSLASTN 298

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
             G+GG IIDSGT++  L +  Y +L +   A    +K       F  CF  +  T+   
Sbjct: 299 --GDGGTIIDSGTTLAYLPQNLYNSLIEKITA-KQQVKLHMVQETF-ACFSFTSNTDKAF 354

Query: 411 PTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFAG----TMSGLSII--GNIQQQGFRV 463
           P V LHF  +  +S+   +YL  +     +CF +      T  G  +I  G++      V
Sbjct: 355 PVVNLHFEDSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLV 413

Query: 464 VYDLAASRIGFAPRGCA 480
           VYDL    IG+A   C+
Sbjct: 414 VYDLENEVIGWADHNCS 430


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 176/370 (47%), Gaps = 43/370 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC--R 199
           L VGTPP+ V MVLDTGS++ W++C   K    QT   FDP +S S++ VPC S  C  R
Sbjct: 89  LTVGTPPQNVSMVLDTGSELSWLRCN--KTQTFQT--TFDPNRSSSYSPVPCSSLTCTDR 144

Query: 200 KLD---SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD----NE 252
             D    + C+    C   +SY D S + G+ +++T     + +     GC       N 
Sbjct: 145 TRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFSTNT 204

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-- 310
                  GL+G+ RG LSF +Q       KFSYC+ D   S     ++ GD+  S     
Sbjct: 205 EEDSKNTGLMGMNRGSLSFVSQMDF---PKFSYCISDSDFSGV---LLLGDANFSWLMPL 258

Query: 311 RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D   Y V+L GI V  + +  +  S+F  D  G G  ++DSGT  
Sbjct: 259 NYTPLIQISTPLPYFDRVAYTVQLEGIKV-SSKLLPLPKSVFVPDHTGAGQTMVDSGTQF 317

Query: 366 TRLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCF--DLSGKTEVKVPTVVLHF 417
           T L  P Y ALR+ F    S + R    P++      D C+   LS  +   +PTV L F
Sbjct: 318 TFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMF 377

Query: 418 RGADVSLPATN--YLIPVDSSGT---FCFAFAGT---MSGLSIIGNIQQQGFRVVYDLAA 469
           RGA++ +      Y +P +  G+   +CF F  +        +IG+  QQ   + +DL  
Sbjct: 378 RGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFDLEK 437

Query: 470 SRIGFAPRGC 479
           SRIGFA   C
Sbjct: 438 SRIGFAQVQC 447


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 116/377 (30%), Positives = 172/377 (45%), Gaps = 42/377 (11%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSR 186
           A   G YFT++ +G+PP+  Y+ +DTGSD++W+ CAPC KC  +TD      ++D   S 
Sbjct: 72  ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSS 131

Query: 187 SFATVPCRSPLCR-KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR----GTRVA 241
           +   V C    C   + S  C  +  C Y V YGDGS + GDF  + +T        R A
Sbjct: 132 TSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 191

Query: 242 ----RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRS 291
                V  GCG +  G       A  G++G G+   S  +Q   G    R FS+CL + +
Sbjct: 192 PLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN 251

Query: 292 TSAKPSSMVFGDSAV-SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
                   +F    V S   + TP++ N      Y V L G+ V G  +  +  SL   +
Sbjct: 252 GGG-----IFAVGEVESPVVKTTPIVPN---QVHYNVILKGMDVDGDPID-LPPSLASTN 302

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
             G+GG IIDSGT++  L +  Y +L +   A    +K       F  CF  +  T+   
Sbjct: 303 --GDGGTIIDSGTTLAYLPQNLYNSLIEKITA-KQQVKLHMVQETF-ACFSFTSNTDKAF 358

Query: 411 PTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFAG----TMSGLSII--GNIQQQGFRV 463
           P V LHF  +  +S+   +YL  +     +CF +      T  G  +I  G++      V
Sbjct: 359 PVVNLHFEDSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLV 417

Query: 464 VYDLAASRIGFAPRGCA 480
           VYDL    IG+A   C+
Sbjct: 418 VYDLENEVIGWADHNCS 434


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 130/389 (33%), Positives = 173/389 (44%), Gaps = 55/389 (14%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKCYSQTD-PVFDPAKSRSFATV 191
           G Y   L  GTP +    VLDTGS +VW+ C+    C KC S ++ P F P  S S   V
Sbjct: 84  GGYSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFV 143

Query: 192 PCRSP-------------LCRKLDSSGCNRRNTC-LYQVSYGDGSITVGDFSTETLTFRG 237
            C +P              CR+  ++  N   TC  Y V YG GS T G   +E L F  
Sbjct: 144 GCTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFPT 202

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS--TSAK 295
            + +   LGC   +       AG+ G GRG  S P+Q       +FSYCL+      SA 
Sbjct: 203 KKYSDFLLGCSVVS---VYQPAGIAGFGRGEESLPSQMNLT---RFSYCLLSHQFDDSAT 256

Query: 296 PSSMVFGDSAVSRTAR-----FTPLLANPK------LDTFYYVELVGISVGGAHVRGITA 344
            +S +  ++A SR  +     +TP L NP          +YY+ L  I VG   VR +  
Sbjct: 257 ITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVR-VPR 315

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD----FSLFDTCF 400
            L + +  G+GG I+DSG++ T + RP +  +   F A   S  RA +    F L   CF
Sbjct: 316 RLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEF-AKQVSYTRAREAEKQFGL-SPCF 373

Query: 401 DLSGKTEV-KVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAFA--------GTMSGL 450
            L+G  E    P +   FRG A + LP  NY   V      C            GT+   
Sbjct: 374 VLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPA 433

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            I+GN QQQ F V YDL   R GF  + C
Sbjct: 434 VILGNYQQQNFYVEYDLENERFGFRSQSC 462


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 112/370 (30%), Positives = 171/370 (46%), Gaps = 36/370 (9%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           +A G+ EY    G G P +   +  DT   V  ++C PC    +  DP F+P++S SFA 
Sbjct: 81  VAPGALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAA 139

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT------FRGTRVARVA 244
           +PC SP C  ++ +G     +C + + +G+ ++  G    +TLT      F G     + 
Sbjct: 140 IPCGSPEC-AVECTGA----SCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIE 194

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQT----GRRFNRKFSYCLVDRSTSAKPSSMV 300
           +G   D    F  A GL+ L R   S  ++            FSYCL   S ++    + 
Sbjct: 195 VGADADT---FDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLS 251

Query: 301 FGDSAVSRTA---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            G S    +    ++ P+ +NP     Y+V+LVGISVGG  +  +  ++F        G 
Sbjct: 252 IGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLP-VPPAVFAAH-----GT 305

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           ++++ T  T L   AY ALRDAFR   +    AP F + DTC++L+G   + VP V L F
Sbjct: 306 LLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRF 365

Query: 418 RGA-DVSLPATNYLIPVDSSGTFC-------FAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
            G  ++ L     +   D S  F         A       +S+IG + Q+   VVYDL  
Sbjct: 366 AGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRG 425

Query: 470 SRIGFAPRGC 479
            R+GF P  C
Sbjct: 426 GRVGFIPGRC 435


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 145/498 (29%), Positives = 214/498 (42%), Gaps = 105/498 (21%)

Query: 35  TPSTLSWPESVSVSESESSLPLPAPDAESSLSL-RLHHVDSLSFNRTPEHLFNLRIQRDV 93
           +PST++ P S ++++  SS P    +  ++ S+ R HH+      ++P+  F+L      
Sbjct: 24  SPSTITIPLSPTITKRPSSDPWEYLNHLATTSISRAHHL------KSPKTNFSL------ 71

Query: 94  LRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYM 153
                        ++ P  +RS G    G+S S               L +GTP + V +
Sbjct: 72  -------------IKTPLFSRSYG----GYSMS---------------LSLGTPSQTVKL 99

Query: 154 VLDTGSDVVWIQCAP---CKKC-YSQTD----PVFDPAKSRSFATVPCRSPLCRKLDSS- 204
           ++DTGS +VW  C     C  C +  TD    P F P  S S   + C++P C  +  S 
Sbjct: 100 IMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSS 159

Query: 205 ------GCN-RRNTCL-----YQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE 252
                  CN +   C      Y + YG GS T G   +ET+ F    ++    GC   + 
Sbjct: 160 VQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFPNKTISDFLAGCSLLST 218

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP-SSMVFGD----SAVS 307
                  G+ G GR + S P Q G    +KFSYCLV R     P SS +  D    ++ S
Sbjct: 219 R---QPEGIAGFGRSQESLPLQLGL---KKFSYCLVSRRFDDSPVSSDLILDMGPSTSDS 272

Query: 308 RTA--RFTPL------LANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
           +T    +TP        +NP    +YYV L  I VG  HV+ +  S       GNGG I+
Sbjct: 273 KTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVK-VPYSFLVPGSDGNGGTIV 331

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD---TCFDLSGKTEVKVPTVVLH 416
           DSG++ T +    +  L   F    ++   A +         CFD+SG+  V +P +   
Sbjct: 332 DSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFDISGEKSVVIPDLTFQ 391

Query: 417 FR-GADVSLPATNYLIPVDSSGTFCFAF----AGTMSG---------LSIIGNIQQQGFR 462
           F+ GA + LP +NY   VD  G  C       A  + G           I+GN QQQ F 
Sbjct: 392 FKGGAKMQLPLSNYFAFVD-MGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQQNFY 450

Query: 463 VVYDLAASRIGFAPRGCA 480
           + YDL   R GF  + CA
Sbjct: 451 IEYDLENDRFGFKEQSCA 468


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 118/410 (28%), Positives = 182/410 (44%), Gaps = 61/410 (14%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA----PCKKC--YSQTDPVF 180
           ++  L +    Y   L +GTPP+ + + +DTGSD+ W+ C      C  C  Y  +  + 
Sbjct: 1   MVEQLREVRDGYLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMS 60

Query: 181 DPAKSRSFATV--PCRSPLCRKLDSS----------GCNR----RNTCL-----YQVSYG 219
             + S S ++    C SP C  + SS          GC+     + TC      +  +YG
Sbjct: 61  AFSPSHSSSSYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYG 120

Query: 220 DGSITVGDFSTETLTF-----RGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPT 273
            G +  G  + +TL       R T+ + +   GC       +    G+ G  RG LSFP+
Sbjct: 121 AGGVVTGTLTRDTLRVHEGPARVTKDIPKFCFGCVGST---YHEPIGIAGFVRGTLSFPS 177

Query: 274 QTGRRFNRKFSYCLVDRSTSAKP---SSMVFGDSAVSR--TARFTPLLANPKLDTFYYVE 328
           Q G    + FS+C +    +  P   S +V GD+A+S     +FTP+L +P    +YY+ 
Sbjct: 178 QLGL-LKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIG 236

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           L  I+VG      +  +L + D  GNGG++IDSGT+ T L  P Y  L   F+A   +  
Sbjct: 237 LEAITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKA-IITYP 295

Query: 389 RAPDFSL---FDTCFDLS------GKTEVKVPTVVLHF-RGADVSLPATNYLI----PVD 434
           RA +  +   FD C+ +          +   P++  HF       LP  N+      P +
Sbjct: 296 RATEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSN 355

Query: 435 SSGTFCFAFAGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           S+   C  F            + G+ QQQ  ++VYDL   RIGF P  CA
Sbjct: 356 STVVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCA 405


>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
          Length = 492

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 114/365 (31%), Positives = 171/365 (46%), Gaps = 36/365 (9%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPC 193
           G+ +Y   +G GTP +   M LDT   V  + C PC    +  DP FD ++S +F  VPC
Sbjct: 145 GALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQSTTFTHVPC 204

Query: 194 RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR----VALGCGH 249
            SP C    ++ C+  + C + + + +G+     FS + LT   +   +    V L  G 
Sbjct: 205 DSPDCPS--TANCSAGSVCPFNLFFVEGT-----FSQDVLTVAPSVAVQDFTFVCLDAGA 257

Query: 250 DNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            ++G+     G L L R R S P++     +  FSYC+     S  P  +  GD A  R 
Sbjct: 258 -SDGM--PEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPDS--PGFLSLGDDATVRG 312

Query: 310 ARFT---PLLA--NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
              T   PLL+  +P L   Y++++VG+S+G   +  I +  F      N   I+++GT+
Sbjct: 313 DNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLP-IPSGTF----GNNASTIVEAGTT 367

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRA-PDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVS 423
            T L   AY  LRDAFR   +   R+ P F  FDTC++ +G  E+ VP V   F   D  
Sbjct: 368 FTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQELTVPLVEFKFGNGDSL 427

Query: 424 LPATNYLIPVD--SSGTF---CFAFAGTMSGL----SIIGNIQQQGFRVVYDLAASRIGF 474
           L   + ++  D  S G F   C AF+          ++IG        VVYD+A   +GF
Sbjct: 428 LIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTVGF 487

Query: 475 APRGC 479
            P  C
Sbjct: 488 IPESC 492


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 112/370 (30%), Positives = 171/370 (46%), Gaps = 36/370 (9%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           +A G+ EY    G G P +   +  DT   V  ++C PC    +  DP F+P++S SFA 
Sbjct: 169 VAPGALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAA 227

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT------FRGTRVARVA 244
           +PC SP C  ++ +G     +C + + +G+ ++  G    +TLT      F G     + 
Sbjct: 228 IPCGSPEC-AVECTGA----SCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIE 282

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQT----GRRFNRKFSYCLVDRSTSAKPSSMV 300
           +G   D    F  A GL+ L R   S  ++            FSYCL   S ++    + 
Sbjct: 283 VGADADT---FDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLS 339

Query: 301 FGDSAVSRTA---RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            G S    +    ++ P+ +NP     Y+V+LVGISVGG  +  +  ++F        G 
Sbjct: 340 IGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLP-VPPAVFAAH-----GT 393

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           ++++ T  T L   AY ALRDAFR   +    AP F + DTC++L+G   + VP V L F
Sbjct: 394 LLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRF 453

Query: 418 RGA-DVSLPATNYLIPVDSSGTFC-------FAFAGTMSGLSIIGNIQQQGFRVVYDLAA 469
            G  ++ L     +   D S  F         A       +S+IG + Q+   VVYDL  
Sbjct: 454 AGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRG 513

Query: 470 SRIGFAPRGC 479
            R+GF P  C
Sbjct: 514 GRVGFIPGRC 523


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 130/417 (31%), Positives = 180/417 (43%), Gaps = 74/417 (17%)

Query: 131 LAQGSGEYFTRLGVGT-PPRYVYMVLDTGSDVVWIQCAP--CKKCYSQTDPV----FDPA 183
           L+ GS +Y     +G+ PP+ + + +DTGSD+VW  CAP  C  C  + D        P 
Sbjct: 67  LSPGS-DYTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPP 125

Query: 184 KSRSFATVPCRSPLC--------------------RKLDSSGCNRRNTCLYQVSYGDGSI 223
              S A+V C+SP C                      +++S C+  +   +  +YGDGS+
Sbjct: 126 NITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSL 185

Query: 224 TVGDFSTETLTFRGTR---VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR--- 277
            V     ++L+   +    +     GC H   G      G+ G GRG LS P Q      
Sbjct: 186 -VARLYRDSLSMPASSPLVLHNFTFGCAHTALG---EPVGVAGFGRGVLSLPAQLASFSP 241

Query: 278 RFNRKFSYCLVDRSTSA----KPSSMVFG-----DSAVSRTAR------FTPLLANPKLD 322
               +FSYCLV  S  A    +PS ++ G     D    R         +T +L NPK  
Sbjct: 242 HLGNQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHP 301

Query: 323 TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA 382
            FY V L GI+VG   +  +   L ++D  GNGG+++DSGT+ T L    Y +L   F  
Sbjct: 302 YFYCVGLEGITVGNRKIP-VPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNH 360

Query: 383 GASSL-KRAPDFSL---FDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPV---- 433
               + KRA           C+  S  +  KVP V LHF G + V LP  NY        
Sbjct: 361 RMGRVYKRATQIEERTGLGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGR 419

Query: 434 ----DSSGTFCFAF------AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                     C         A +    + +GN QQQGF VVYDL   R+GFA R CA
Sbjct: 420 DGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCA 476


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 167/372 (44%), Gaps = 47/372 (12%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQ--TDPVFDPAKSRSFATVPCRS 195
           +F    VG PP   + ++DTGS ++WIQC PCK C S     PVF+PA S +F    C  
Sbjct: 68  FFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDD 127

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVAR--VALGCGHD 250
             CR   +  C+  N C+Y+  Y  G+ + G  + E LTF    G  V    +A GCGH+
Sbjct: 128 RFCRYAPNGHCS-SNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHE 186

Query: 251 N-EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST-SAKPSSMVFGDSAVSR 308
           N E L     G+LGLG    S   Q G     KFSYC+ D +  +   + +V G+ A   
Sbjct: 187 NGEQLESEFTGILGLGAKPTSLAVQLG----SKFSYCIGDLANKNYGYNQLVLGEDA--- 239

Query: 309 TARFTPLLANPKLDTF------YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
                 +L +P    F      YY+ L GISVG   +  I   +FK       GVI+D+G
Sbjct: 240 -----DILGDPTPIEFETENGIYYMNLEGISVGDKQLN-IEPVVFKRR-GSRTGVILDTG 292

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV---PTVVLHFR- 418
           T  T L   A IA R+ +    S L    +   F       G+   ++   P V  HF  
Sbjct: 293 TLYTWL---ADIAYRELYNEIKSILDPKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAG 349

Query: 419 GADVSLPATNYLIPVDSSGT----FCFA------FAGTMSGLSIIGNIQQQGFRVVYDLA 468
           GA++++ AT+   P+  S T    FC +        G     + IG + QQ + + YDL 
Sbjct: 350 GAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLK 409

Query: 469 ASRIGFAPRGCA 480
              I      C 
Sbjct: 410 ERNIYLQRIDCV 421


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 119/401 (29%), Positives = 179/401 (44%), Gaps = 65/401 (16%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCA----PCKKC----YSQTDPVFDPAKSRSFA 189
           Y   L +GTPP+ + +++DTGSD+ W+ C      C +C     ++    F P+ S S  
Sbjct: 82  YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141

Query: 190 TVPCRSPLCRKLDSS----------GCNR----RNTCL-----YQVSYGDGSITVGDFST 230
              C SP C  + SS          GC+     + TC      +  +YG G +  G  + 
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTR 201

Query: 231 ETLTFRGT------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           +TL   G+       + +   GC       +    G+ G GRG LS  +Q G    + FS
Sbjct: 202 DTLRVNGSSPGVAKEIPKFCFGCVG---SAYREPIGIAGFGRGTLSMVSQLGF-LQKGFS 257

Query: 285 YCLVDRSTSAKP---SSMVFGDSAVSRT--ARFTPLLANPKLDTFYYVELVGISVGGAHV 339
           +C +    +  P   S +V GD A++     +FTP+L +P    FYYV L  I+VG    
Sbjct: 258 HCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNVSA 317

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---- 395
             + +SL + D  GNGG+ IDSGT+ T L  P Y  +    +   S++    D  +    
Sbjct: 318 TEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQ---STINYPRDTGMEMQT 374

Query: 396 -FDTCFDL------SGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGT----FCFAF 443
            FD C+ +      +  ++  +P++  HF     + LP  N+  PV + G      C  F
Sbjct: 375 GFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMF 434

Query: 444 AGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             T  G      + G+ QQQ   VVYDL   RIGF P  CA
Sbjct: 435 QSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 475


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 117/428 (27%), Positives = 177/428 (41%), Gaps = 77/428 (17%)

Query: 126 SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA----PCKKCYSQTDPV-- 179
           +VI  L +    Y   L +GTPP+ V + +DTGSD+ W+ C      C+ C    + +  
Sbjct: 9   NVIEPLREIRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISG 68

Query: 180 -----FDPAKSRSFATVPCRSPLCRKLDSS----------GCNR----RNTC-----LYQ 215
                F P  S +     C S  C  + SS          GC+     + TC      + 
Sbjct: 69  PRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFA 128

Query: 216 VSYGDGSITVGDFSTETLTFRGT---------RVARVALGCGHDNEGLFVAAAGLLGLGR 266
            +YG   +  G  + + L   G          ++ R   GC       +    G+ G GR
Sbjct: 129 YTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCV---GATYREPIGIAGFGR 185

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRSTSAKP---SSMVFGDSAVS---RTARFTPLLANPK 320
           G LS P Q G   ++ FS+C +    S  P   S ++ G+ A+S      +FTPLL +P 
Sbjct: 186 GLLSLPFQLGFS-HKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPM 244

Query: 321 LDTFYYVELVGISVGGAHVR---GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALR 377
              +YY+ L  I++G        G++  L ++D  GNGG++IDSGT+ T L  P Y  L 
Sbjct: 245 YPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLI 304

Query: 378 DAFR--AGASSLKRAPDFSLFDTCFDLSGKT-------EVKVPTVVLHF-RGADVSLPAT 427
                  G    K+    + FD C+ +  K        + ++P++  HF     V LP  
Sbjct: 305 SNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQG 364

Query: 428 NYLI----PVDSSGTFCFAFAGTMSGLS-----------IIGNIQQQGFRVVYDLAASRI 472
           N       P++S+   C  +                   I G+ QQQ   VVYDL   R+
Sbjct: 365 NNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERL 424

Query: 473 GFAPRGCA 480
           GF P  C 
Sbjct: 425 GFQPMDCV 432


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 169/361 (46%), Gaps = 86/361 (23%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VG+PP+ V MVLDTGS++ W+ C      +S    VFDP +S S++ +PC SP CR  
Sbjct: 379 LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTR 434

Query: 202 DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGL 261
             S                                                       GL
Sbjct: 435 THS----------------------------------------------------KTTGL 442

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS--RTARFTPLLAN- 318
           +G+ RG LSF TQ G +   KFSYC+  + +S     ++FG+S+ S  +  ++TPL+   
Sbjct: 443 IGMNRGSLSFVTQMGLQ---KFSYCISGQDSSG---ILLFGESSFSWLKALKYTPLVQIS 496

Query: 319 ---PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
              P  D   Y V+L GI V  + ++ +  S++  D  G G  ++DSGT  T L  P Y 
Sbjct: 497 TPLPYFDRVAYTVQLEGIKVANSMLQ-LPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYT 555

Query: 375 ALRDAF-RAGASSLK--RAPDFSL---FDTCF--DLSGKTEVKVPTVVLHFRGADVSLPA 426
           AL++ F R   +SLK    P+F      D C+   L+ +T   +PTV L FRGA++S+ A
Sbjct: 556 ALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSA 615

Query: 427 TNYLIPVD-----SSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAASRIGFAPRG 478
              +  V      S   +CF F  + + G+   IIG+  QQ   + +DLA SR+GFA   
Sbjct: 616 ERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVR 675

Query: 479 C 479
           C
Sbjct: 676 C 676


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 118/336 (35%), Positives = 170/336 (50%), Gaps = 34/336 (10%)

Query: 170 KKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSS--GCNRRNTCLYQVSYGDGSITVGD 227
            +C ++  P F PA S +F+ +PC S LC+ L S    CN    C+Y   YG G  T G 
Sbjct: 86  HECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG-CVYYYPYGMG-FTAGY 143

Query: 228 FSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
            +TETL   G     VA GC  +N G+  +++G++GLGR  LS  +Q G     +FSYCL
Sbjct: 144 LATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVG---RFSYCL 199

Query: 288 VDRSTSAKPSSMVFGDSAVSRTARFTP-LLANPKL--DTFYYVELVGISVGGAHVRGITA 344
                 A  S ++FG  A     + +P +L NP++   ++YYV L GI+VG   +  +T+
Sbjct: 200 -RSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLP-VTS 257

Query: 345 SLFKLDPAGN----GGVIIDSGTSVTRLTRPAYIALRDAF-----RAGASSLKRAPDFSL 395
           + F           GG I+DSGT++T L +  Y  ++ AF      A  ++      F  
Sbjct: 258 TTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG- 316

Query: 396 FDTCFDLS---GKTEVKVPTVVLHFRG-ADVSLPATNYL--IPVDSSG---TFCFAF--A 444
           FD CFD +   G + V VPT+VL F G A+ ++   +Y+  + VDS G     C     A
Sbjct: 317 FDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPA 376

Query: 445 GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                +SIIGN+ Q    V+YDL      FAP  CA
Sbjct: 377 SEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 122/419 (29%), Positives = 184/419 (43%), Gaps = 45/419 (10%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGE 137
           N T +    L IQ    R   + A  E ++     N  + R +   +   I         
Sbjct: 53  NETAKDRMELDIQHSAARFAYIQARIEGSLV--SNNEYKARVSPSLTGRTI--------- 101

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
               + +G PP    +V+DTGSD++W+ C PC  C +    +FDP+ S +F      SPL
Sbjct: 102 -MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTF------SPL 154

Query: 198 CRK-LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVALGCGHD- 250
           C+   D  GC+R +   + V+Y D S   G F  +T+ F  T     R+  V  GCGH+ 
Sbjct: 155 CKTPCDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNI 214

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
            +       G+LGL  G  S  T+ G    +KFSYC+ D +        +          
Sbjct: 215 GQDTDPGHNGILGLNNGPDSLATKIG----QKFSYCIGDLADPYYNYHQLILGEGADLEG 270

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
             TP   +   + FYYV + GISVG   +  I    F++     GGVIID+G+++T L  
Sbjct: 271 YSTPFEVH---NGFYYVTMEGISVGEKRLD-IAPETFEMKKNRTGGVIIDTGSTITFLVD 326

Query: 371 PAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLS-GKTEVKVPTVVLHF-RGADVSLPA 426
             +  L    R   G S  +   + S +  CF  S  +  V  P V  HF  GAD++L +
Sbjct: 327 SVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDS 386

Query: 427 TNYLIPVDSSGTFCFAFAGTMSGL------SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            ++   ++ +  FC    G +S L      S+IG + QQ + V YDL    + F    C
Sbjct: 387 GSFFNQLNDN-VFCMT-VGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQRIDC 443


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 117/378 (30%), Positives = 174/378 (46%), Gaps = 44/378 (11%)

Query: 133 QGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVP 192
           +GSG +   L +G+PP    +V+DTGS ++W+QC PC  C+ Q+   FDP KS SF T+ 
Sbjct: 100 RGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLG 158

Query: 193 CRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR-------------GTR 239
           C  P    ++   CNR N   Y++ Y  G  + G  + E+L F               T+
Sbjct: 159 CGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQ 218

Query: 240 VAR-----VALGCGH-----DNEGLFVAAAGLLGLG-RGRLSFPTQTGRRFNRKFSYCLV 288
           +++     +  GCGH     +N+    A  G+ GLG    ++  TQ G     KFSYC+ 
Sbjct: 219 ISKIKKSNITFGCGHMNIKTNNDD---AYNGVFGLGAYPHITMATQLG----NKFSYCIG 271

Query: 289 DRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
           D +      + +            TPL  +      YYV L  ISVG   ++ I  + FK
Sbjct: 272 DINNPLYTHNHLVLGQGSYIEGDSTPLQIHFG---HYYVTLQSISVGSKTLK-IDPNAFK 327

Query: 349 LDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFD-TCFD-LSGK 405
           +   G+GGV+IDSG + T+L    +  L D         L+R P    F+  CF  +  +
Sbjct: 328 ISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSR 387

Query: 406 TEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMS---GLSIIGNIQQQGF 461
             V  P V  HF  GAD+ L + + L        FC A   + S    LS+IG + QQ +
Sbjct: 388 DLVGFPAVTFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNY 446

Query: 462 RVVYDLAASRIGFAPRGC 479
            V +DL   ++ F    C
Sbjct: 447 NVGFDLEQMKVFFRRIDC 464


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 164/360 (45%), Gaps = 41/360 (11%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS 203
           +GTPP+    ++D   ++VW QC+ C +C+ Q  P+F P  S +F   PC +  C+ + +
Sbjct: 73  IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPT 132

Query: 204 SGCNRRNTCLYQ--VSYGDGSITVGDFSTETLTFRGTRVARVALGC----GHDNEGLFVA 257
           S C+  N C Y+  ++   G  T+G  +T+T    GT  A +  GC    G D  G    
Sbjct: 133 SNCS-SNMCTYEGTINSKLGGHTLGIVATDTFAI-GTATASLGFGCVVASGIDTMG---G 187

Query: 258 AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV------SRTAR 311
            +GL+GLGR   S  +Q       KFSYCL     S K S ++ G SA       S T  
Sbjct: 188 PSGLIGLGRAPSSLVSQMNI---TKFSYCLTPHD-SGKNSRLLLGSSAKLAGGGNSTTTP 243

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRP 371
           F        +  +Y ++L GI  G A +         L P+GN  V++ +   ++ L   
Sbjct: 244 FVKTSPGDDMSQYYPIQLDGIKAGDAAI--------ALPPSGN-TVLVQTLAPMSFLVDS 294

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR--GADVSLPATNY 429
           AY AL+        +   A     FD CF  +G +    P +V  F+   A +++P   Y
Sbjct: 295 AYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKY 354

Query: 430 LIPV-DSSGTFCFAFAGTM--------SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           LI V +  GT C A   T           L+I+G++QQ+    + DL    + F P  C+
Sbjct: 355 LIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCS 414


>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
          Length = 144

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 71/139 (51%), Positives = 95/139 (68%), Gaps = 1/139 (0%)

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD 401
           I+  +F+L+  G GGV++D+GT+VTRL   AY A RDAF    ++L R+ D S+FDTC+D
Sbjct: 6   ISEDVFRLNELGEGGVVMDTGTAVTRLPTAAYDAFRDAFIGQTTNLPRSSDVSIFDTCYD 65

Query: 402 LSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQG 460
           L G   V+VPT+  +F G  + +LPA N+LIPV+  GTFCFAFA + SGLSIIGNIQQ+G
Sbjct: 66  LYGFVSVRVPTISFYFLGGPILTLPARNFLIPVNDVGTFCFAFAPSPSGLSIIGNIQQEG 125

Query: 461 FRVVYDLAASRIGFAPRGC 479
             +  D     +GF P  C
Sbjct: 126 IEISVDGVNGFVGFGPNIC 144


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 116/378 (30%), Positives = 179/378 (47%), Gaps = 48/378 (12%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD------PVFDPAKSRSF 188
           +G Y+T++ +GTPP   Y+ +DTGSDV W+ CAPC  C ++T         +DP++S + 
Sbjct: 34  TGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTD 93

Query: 189 ATVPCRSPLCRKL---DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR----GTRV- 240
             + CR   C      +   C     C Y  +YGDGS T G F  + +TF+     T+V 
Sbjct: 94  GALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVN 153

Query: 241 --ARVALGCGHDNEGLFVAAA----GLLGLGRGRLSFPTQTGR--RFNRKFSYCLVDRST 292
             A V  GCG    G  + ++    GL+G G+  +S P+Q     +   +F++CL  +  
Sbjct: 154 GTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCL--QGD 211

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           +    ++V G S       +TP+++       Y V +  I+V G +V   T + F     
Sbjct: 212 NQGGGTIVIG-SVSEPNISYTPIVSR----NHYAVGMQNIAVNGRNV--TTPASFDTTST 264

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSG-KTEVKVP 411
             GGVI+DSGT++  L  PAY    +A     SS+     FS    C  L+    +   P
Sbjct: 265 SAGGVIMDSGTTLAYLVDPAYTQFVNAVSTFESSM-----FSSHSQCLQLAWCSLQADFP 319

Query: 412 TVVLHFR-GADVSLPATNYLI--PV-DSSGTFCFAF------AGTMSGLSIIGNIQQQGF 461
           TV L F  GA ++L   NYL   P+ +    +C  +      AG +S  SI+G+I  +  
Sbjct: 320 TVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLS-YSILGDIVLKDH 378

Query: 462 RVVYDLAASRIGFAPRGC 479
            VVYD     +G+    C
Sbjct: 379 LVVYDNDNRVVGWKSFDC 396


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 138/415 (33%), Positives = 177/415 (42%), Gaps = 75/415 (18%)

Query: 131 LAQGSGEYFTRLGVG--TPPRYVYMVLDTGSDVVWIQCAP--CKKCYSQ---------TD 177
           LA GS +Y   L VG  +    V + LDTGSD+VW  CAP  C  C  +         ++
Sbjct: 77  LAPGS-DYTLSLSVGPLSTANPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSN 135

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSG--------------------CNRRNTC--LYQ 215
           P+  P  SR    +PC SP C    SS                     C   + C  LY 
Sbjct: 136 PLPPPTDSRR---IPCASPFCSAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLY- 191

Query: 216 VSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQT 275
            +YGDGS+                V      C H   G  V   G+ G GRG LS P Q 
Sbjct: 192 YAYGDGSLVARLRRGRVGIAASVAVENFTFACAHTALGEPV---GVAGFGRGPLSLPAQL 248

Query: 276 G-RRFNRKFSYCLVDRSTSA----KPSSMVFG-----DSAVSRTARFTPLLANPKLDTFY 325
                + +FSYCLV  S  A    +PS ++ G     D A      +TPLL NPK   FY
Sbjct: 249 APAALSGRFSYCLVAHSFRADRPIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFY 308

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF----- 380
            V L  +SVGG  +      L ++  AG+GG+++DSGT+ T L    Y  + + F     
Sbjct: 309 SVALEAVSVGGTRIPA-RPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMA 367

Query: 381 RAGASSLKRAPDFSLFDTCF----DLSGKTE---VKVPTVVLHFRG-ADVSLPATNYLIP 432
            A     + A D +    C+    D S   E     VP + +HFRG A V LP  NY + 
Sbjct: 368 AARFERAEAAEDQTGLAPCYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMG 427

Query: 433 VDSS-----GTFCFAFAGTMSG---LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             S      G       G   G      +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 428 FRSEERRRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/200 (41%), Positives = 112/200 (56%), Gaps = 5/200 (2%)

Query: 282 KFSYCLVDRSTSAKPSSMVFGDSA-VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVR 340
           KFSYCL     S K S ++ G  A  ++ A  TPLL NP   +FYY+ L GI VGG  + 
Sbjct: 5   KFSYCLTSMDDS-KASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGTQLS 63

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF 400
            I  S+F +   G+GGVIIDSGT++T L +  +  L+  F + ++        +  D CF
Sbjct: 64  -IEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLDVCF 122

Query: 401 DL-SGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQ 459
            L S  T+V+VP +V HF+G D+ LPA +Y+I     G  C A  G  +G+SI GN+QQQ
Sbjct: 123 SLPSETTQVEVPKLVFHFKGGDLELPAESYMIADSKLGVACLAM-GASNGMSIFGNVQQQ 181

Query: 460 GFRVVYDLAASRIGFAPRGC 479
              V +DL    I F P  C
Sbjct: 182 NILVNHDLEKETISFVPTQC 201


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 166/356 (46%), Gaps = 21/356 (5%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC-----YSQTDPVFDPAKSR 186
           A  +G Y     VGTPP+ V  VLD  SD VW+QC+ C  C      + + P F    S 
Sbjct: 91  ATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSS 150

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNT-CLYQVSYGDGS--ITVGDFSTETLTFRGTRVARV 243
           +   V C +  C++L    C+  ++ C Y   YG G+   T G  + +   F   R   V
Sbjct: 151 TIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGV 210

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
             GC    EG      G++GLGRG LS  +Q   +  R FSY L         S ++F D
Sbjct: 211 IFGCAVATEGDI---GGVIGLGRGELSLVSQL--QIGR-FSYYLAPDDAVDVGSFILFLD 264

Query: 304 SAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
            A  RT+R   TPL+AN    + YYVEL GI V G  +  I    F L   G+GGV++  
Sbjct: 265 DAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDL-AIPRGTFDLQADGSGGVVLSI 323

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL-FDTCFDLSGKTEVKVPTVVLHFRGA 420
              VT L   AY  +R A  A    L+ A    L  D C+        KVP++ L F G 
Sbjct: 324 TIPVTFLDAGAYKVVRQAM-ASKIGLRAADGSELGLDLCYTSESLATAKVPSMALVFAGG 382

Query: 421 DV-SLPATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGF 474
            V  L   NY     ++G  C     + +G  S++G++ Q G  ++YD++ SR+ F
Sbjct: 383 AVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVF 438


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 115/385 (29%), Positives = 175/385 (45%), Gaps = 50/385 (12%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----------PVFDPAK 184
           G +   L  GTPP+ +  ++DTGS VVW   APC   Y+ T+           P+F+P  
Sbjct: 85  GGHSIPLSFGTPPQKLSFLVDTGSHVVW---APCTTHYTCTNCSFSDAEPKKVPIFNPKL 141

Query: 185 SRSFATVPCRSPLCRKLDSS----GC-----NRRNTCL----YQVSYGDGSITVGDFSTE 231
           S S   + CR+P C    S     GC     N +N       Y + YG G+ + GDF  E
Sbjct: 142 SSSSKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGTGA-SSGDFLLE 200

Query: 232 TLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL--VD 289
            L F G  +    +GC     G  V +A L G GR   S P Q G    +KF+YCL   D
Sbjct: 201 NLNFPGKTIHEFLVGCTTSAVGE-VTSAALAGFGRSMFSLPMQMGV---KKFAYCLNSHD 256

Query: 290 RSTSAKPSSMVFGDS-AVSRTARFTPLLAN-PKLDTFYYVELVGISVGGAHVRGITASLF 347
              +   S ++   S   ++   + P L N P    +YY+ +  I +G   +R I +   
Sbjct: 257 YDDTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIKIGNKLLR-IPSKYL 315

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---FDTCFDLSG 404
                G GG++IDSG +   +T P +  + +  +   S  +R+ +         C++ +G
Sbjct: 316 APGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCYNFTG 375

Query: 405 KTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF---AGTMS-----GLSII-G 454
           +  +K+P ++  FR GA + +P  NY + +      CF     AGT +     G SII G
Sbjct: 376 QKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPSIILG 435

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGC 479
           N Q   + V +DL   R+GF  + C
Sbjct: 436 NSQHVDYYVEFDLKNERLGFRQQTC 460


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 116/366 (31%), Positives = 172/366 (46%), Gaps = 43/366 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSRSFATVPCRSPLCR 199
           L +GTPP+   MVLDTGS + WIQ      C+++T P   FDP+ S SF  +PC  PLC+
Sbjct: 92  LPIGTPPQPQQMVLDTGSQLSWIQ------CHNKTPPTASFDPSLSSSFYVLPCTHPLCK 145

Query: 200 K-----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-RVALGCGHDNEG 253
                    + C++   C Y   Y DG+   G+   E L F  ++    + LGC  ++  
Sbjct: 146 PRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGCSSESRD 205

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS----SMVFGDSAVSRT 309
               A G+LG+  GRLSFP Q       KFSYC+  R  +   +    S   G++  S  
Sbjct: 206 ----ARGILGMNLGRLSFPFQAKV---TKFSYCVPTRQPANNNNFPTGSFYLGNNPNSAR 258

Query: 310 ARFTPLLANPK------LDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            R+  +L  P+      LD   Y V + GI +GG  +  I  S+F+ +  G+G  ++DSG
Sbjct: 259 FRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKL-NIPPSVFRPNAGGSGQTMVDSG 317

Query: 363 TSVTRLTRPAYIALRDA-FRAGASSLKRAPDF-SLFDTCFDLSGKTEVK--VPTVVLHF- 417
           +  T L   AY  +R+   R     +K+   +  + D CFD     E+   +  V   F 
Sbjct: 318 SEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFD-GNAMEIGRLLGDVAFEFE 376

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFAGTM---SGLSIIGNIQQQGFRVVYDLAASRIGF 474
           +G ++ +P    L  V   G  C     +    +  +IIGN  QQ   V +DLA  RIGF
Sbjct: 377 KGVEIVVPKERVLADV-GGGVHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGF 435

Query: 475 APRGCA 480
               C+
Sbjct: 436 GVADCS 441


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 125/239 (52%), Gaps = 13/239 (5%)

Query: 65  LSLRLHHVDS--LSFNRTPEHLFNLRIQRDVLRVKSLTA-FAESAVRVPPRNRSRG--RA 119
           + + +HHV     S    P   F+  +  D  RVK+L +       R P    ++   R 
Sbjct: 40  VQMTIHHVHGPGSSLAPQPPVSFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRF 99

Query: 120 NGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKCYSQTDP 178
               S  +  G + GSG Y+ ++G G+P RY  M++DTGS + W+QC PC   C+ Q DP
Sbjct: 100 PKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADP 159

Query: 179 VFDPAKSRSFATVPCRSPLCRKLDSSGCNR------RNTCLYQVSYGDGSITVGDFSTET 232
           +FDP+ S+++ ++ C S  C  L  +  N        N C+Y  SYGD S ++G  S + 
Sbjct: 160 LFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDL 219

Query: 233 LTFRGTR-VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
           LT   ++ +     GCG D++GLF  AAG+LGLGR +LS   Q   +F   FSYCL  R
Sbjct: 220 LTLAPSQTLPGFVYGCGQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR 278


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  142 bits (357), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 173/394 (43%), Gaps = 51/394 (12%)

Query: 116 RGRANGGFSSSV---ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKK 171
           +G A+ G +S+V   I G     G+Y+T + VG PPR  ++ +DTGSD+ WIQC APC  
Sbjct: 166 KGAASAGTNSTVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTN 225

Query: 172 CYSQTDPVFDPAKSRSFATVPCRSPLCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFS 229
           C     P++ PAK +    VP R  LC++L  D + C     C Y++ Y D S ++G  +
Sbjct: 226 CAKGPHPLYKPAKEK---IVPPRDSLCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLA 282

Query: 230 TETLTFRGTRVARVAL----GCGHDNEGLFVAAA----GLLGLGRGRLSFPTQTGRR--F 279
            + +    T   R  L    GC +D +G  +++     G+LGL    +S P+Q   +   
Sbjct: 283 KDDMHLIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGII 342

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR-FTPLLANPKLDTFYYVELVGISVGGAH 338
           +  F +C+   +       M  GD  V R    + P+   P  D  Y+ E   ++ G   
Sbjct: 343 SNVFGHCITRETNGG--GYMFLGDDYVPRWGMTWAPIRGGP--DNLYHTEAQKVNYGDQE 398

Query: 339 VRGITASLFKLDPAGNG-GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD 397
           +            AGN   VI DSG+S T L    Y  L DA +  + S  +    +   
Sbjct: 399 LH-----------AGNSVQVIFDSGSSYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLP 447

Query: 398 TCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVD-----SSGTFCFAFAGTMSGLS- 451
            C+            + LHF      +P T  ++P D       G  C    G ++G   
Sbjct: 448 LCWKADFSVRSFFKPLNLHFGRRWFVVPKTFTIVPDDYLIISDKGNVCL---GLLNGTEI 504

Query: 452 ------IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                 I+G++  +G  VVYD    +IG+A   C
Sbjct: 505 NHGSTIIVGDVSLRGKLVVYDNERRQIGWANSEC 538


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 170/359 (47%), Gaps = 42/359 (11%)

Query: 142  LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
            L VG+PP+ V MVLDTGS++ W+ C       S    VF+P  S S++ +PC SP+CR  
Sbjct: 1004 LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPICRTR 1059

Query: 202  -----DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
                 +   C+ +  C   VSY D S   G+ +++      + +     GC       N 
Sbjct: 1060 TRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSSNS 1119

Query: 253  GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR- 311
                   GL+G+ RG LSF TQ G     KFSYC+  R +S     ++FGD  +S     
Sbjct: 1120 EEDAKTTGLMGMNRGSLSFVTQLGL---PKFSYCISGRDSSG---VLLFGDLHLSWLGNL 1173

Query: 312  -FTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
             +TPL+      P  D   Y V+L GI VG   +  +  S+F  D  G G  ++DSGT  
Sbjct: 1174 TYTPLVQISTPLPYFDRVAYTVQLDGIRVGNK-ILPLPKSIFAPDHTGAGQTMVDSGTQF 1232

Query: 366  TRLTRPAYIALRDAF---RAGASSLKRAPDFSL---FDTCFDLSGKTEV-KVPTVVLHFR 418
            T L  P Y ALR+ F     G  +    P+F      D C+ ++   ++  +P+V L FR
Sbjct: 1233 TFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFR 1292

Query: 419  GADVSL--PATNYLIPVDSSGT---FCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAA 469
            GA++ +      Y +P    G    +C  F  + + G+   +IG+  QQ   + +DL A
Sbjct: 1293 GAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVA 1351


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 121/408 (29%), Positives = 178/408 (43%), Gaps = 35/408 (8%)

Query: 95  RVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
            +K LT  + +  +    +  +   +  F   V   +   +  +     VG PP     +
Sbjct: 55  HIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIK--TSLFLVNFSVGQPPVPQLTI 112

Query: 155 LDTGSDVVWIQCAPCKKCYS--QTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTC 212
           +DTGS ++WIQC PCK C S     PVF+PA S +F    C    CR   +  C   N C
Sbjct: 113 MDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKC 172

Query: 213 LYQVSYGDGSITVGDFSTETLTF---RGTRVAR--VALGCGHDN-EGLFVAAAGLLGLGR 266
           +Y+  Y  G+ + G  + E LTF    G  V    +A GCG++N E L     G+LGLG 
Sbjct: 173 VYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGA 232

Query: 267 GRLSFPTQTGRRFNRKFSYCLVDRST-SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
              S   Q G     KFSYC+ D +  +   + +V G+ A       TP+    + ++ Y
Sbjct: 233 KPTSLAVQLG----SKFSYCIGDLANKNYGYNQLVLGEDA-DILGDPTPIEFETE-NSIY 286

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           Y+ L GISVG   +  I   +FK       GVI+DSGT  T L   A IA R+ +    S
Sbjct: 287 YMNLEGISVGDTQLN-IEPVVFKRR-GPRTGVILDSGTLYTWL---ADIAYRELYNEIKS 341

Query: 386 SLKRAPDFSLFDTCFDLSGKTEVKV---PTVVLHFR-GADVSLPATNYLIPVDSSGT--- 438
            L    +   F       G+   ++   P V  HF  GA++++ AT+   P+    T   
Sbjct: 342 ILDPKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNV 401

Query: 439 FCFA------FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           FC +        G     + IG + QQ + + YDL    I      C 
Sbjct: 402 FCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCV 449


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 114/401 (28%), Positives = 171/401 (42%), Gaps = 62/401 (15%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCA----PCKKCYS------QTDPVFDPAKSRS 187
           Y   L +GTPP+ V + +DTGSD+ W+ C      C  C        ++  +F P  S S
Sbjct: 11  YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70

Query: 188 FATVPCRSPLCRKLDSS----------GCN----RRNTCL-----YQVSYGDGSITVGDF 228
                C S  C ++ SS          GC+     ++TC+     +  +YG+G +  G  
Sbjct: 71  SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130

Query: 229 STETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV 288
           + + L  R   V R + GC       +    G+ G GRG LS P+Q G    + FS+C +
Sbjct: 131 TRDILKARTRDVPRFSFGCVTST---YHEPIGIAGFGRGLLSLPSQLG-FLEKGFSHCFL 186

Query: 289 DRSTSAKP---SSMVFGDSAVS----RTARFTPLLANPKLDTFYYVELVGISVG-GAHVR 340
                  P   S ++ G SA+S     + +FTP+L  P     YY+ L  I++G      
Sbjct: 187 PFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIGTNITPT 246

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL--FDT 398
            +  +L + D  GNGG+++DSGT+ T L  P Y  L    ++  +  +     S   FD 
Sbjct: 247 QVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRATETESRTGFDL 306

Query: 399 CFD----------LSGKTEVKVPTVVLHF-RGADVSLPATNYLI----PVDSSGTFCFAF 443
           C+           L     +  P++  +F   A + LP  N       P D S   C  F
Sbjct: 307 CYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLF 366

Query: 444 AGTMSG----LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                G      + G+ QQQ  +VVYDL   RIGF    C 
Sbjct: 367 QNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 407


>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
          Length = 289

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 99/271 (36%), Positives = 139/271 (51%), Gaps = 21/271 (7%)

Query: 212 CLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
           C + +SY DG+ TVG +S + LT   G  V     GCGH    +     G+LGLGR R S
Sbjct: 37  CGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRES 96

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELV 330
                G R+   FSYCL   S S+KP  +  G         FTP+   P   TF  V L 
Sbjct: 97  L----GARYGGVFSYCL--PSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLA 150

Query: 331 GISVGGAHVRGITASLFKLDPAG-NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR 389
           GI+VGG  +         L P+  +GG+I+DSGT +T L   AY ALR AFR    + + 
Sbjct: 151 GINVGGKKL--------DLRPSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRL 202

Query: 390 APDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMS 448
            P+  L DTC++L+G   V VP + L F  GA ++L   N ++    +G   FA +G   
Sbjct: 203 LPNGDL-DTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGIL---VNGCLAFAESGPDG 258

Query: 449 GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              ++GN+ Q+ F V++D + S+ GF  + C
Sbjct: 259 SAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 122/395 (30%), Positives = 182/395 (46%), Gaps = 64/395 (16%)

Query: 128 ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSR 186
           I G     G Y+  + +G P +  Y+ +DTGSD+ W+QC APC+ C      ++DP ++R
Sbjct: 21  IGGNIYPDGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRAR 80

Query: 187 SFATVPCRSPLCRKLDSSG---CNRR-NTCLYQVSYGDGSITVGDFSTETLTF---RGTR 239
               V CR P C ++   G   C+     C Y+V Y DGS T+G    +T+T     GTR
Sbjct: 81  ---VVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTR 137

Query: 240 V-ARVALGCGHDNEGLFVAAA----GLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRST 292
              R  +GCG+D +G    A     G++GL   ++S P+Q   +   N    +CL   S 
Sbjct: 138 FQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSN 197

Query: 293 SAKPSSMVFGDSAV-SRTARFTPLLANPKLDTFYYVELVGISVGGA--HVRGITASLFKL 349
                 + FGD+ V +    +TP++  P ++  Y   L  I  GG    + G T  +   
Sbjct: 198 GG--GYLFFGDTLVPALGMTWTPMIGRPLVEG-YQARLRSIKYGGEVLELEGTTDDV--- 251

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIA-----LRDAFRAGASSLK---------RAPDFSL 395
                GG + DSGTS T L   AY A     +R A R+G   +K         R P  S 
Sbjct: 252 -----GGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGP--SP 304

Query: 396 FDTCFDLSGKTEVKVPTVVLHFRGAD-------VSLPATNYLIPVDSSGTFCF----AFA 444
           F++  D+S   +    TV L F G+        + L    YLI V + G  C     A  
Sbjct: 305 FESVADVSAYFK----TVTLDFGGSTWWSSGKLLELSPEGYLI-VSTQGNVCLGVLDASV 359

Query: 445 GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            ++   +I+G+I  +G+ VVYD    +IG+  R C
Sbjct: 360 ASLEVTNILGDISMRGYLVVYDNMREQIGWVRRNC 394


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 107/316 (33%), Positives = 146/316 (46%), Gaps = 19/316 (6%)

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTET 232
           P FD + S +     C S LC+ L  + C         TC+Y   Y D S+T G    + 
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDK 234

Query: 233 LTF-RGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
            TF  G  V  VA GCG  N G+F +   G+ G GRG LS P+Q        FS+C    
Sbjct: 235 FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVG---NFSHCFTAV 291

Query: 291 STSAKPSSMVFGDSAVSRTAR----FTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
           +   + + ++   + + +  R     TPL+ N    T YY+ L GI+VG   +  +  S 
Sbjct: 292 NGLKQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLP-VPESA 350

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT 406
           F L   G GG IIDSGTS+T L    Y  +RD F A         + +   TCF    + 
Sbjct: 351 FALT-NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQA 409

Query: 407 EVKVPTVVLHFRGADVSLPATNYLIPV-DSSGT--FCFAFAGTMSGLSIIGNIQQQGFRV 463
           +  VP +VLHF GA + LP  NY+  V D +G    C A        + IGN QQQ   V
Sbjct: 410 KPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHV 469

Query: 464 VYDLAASRIGFAPRGC 479
           +YDL  + + F    C
Sbjct: 470 LYDLQNNMLSFVAAQC 485



 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 65/136 (47%), Gaps = 4/136 (2%)

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRA 390
           GI+VG   +  +  S F L   G GG IIDSGTS+T L    Y  +RD F A        
Sbjct: 41  GITVGSTRLP-VPESAFALT-NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVP 98

Query: 391 PDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYL--IPVDSSGTFCFAFAGTMS 448
            + +   TCF    + +  VP +VLHF GA + LP  NY+  +P D+  +          
Sbjct: 99  GNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD 158

Query: 449 GLSIIGNIQQQGFRVV 464
             +IIGN QQQ    +
Sbjct: 159 ETTIIGNFQQQNMHAL 174


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 117/349 (33%), Positives = 168/349 (48%), Gaps = 53/349 (15%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTC 212
           +++DTGSD++W QC    K  S T      A +R  +      PL R   +       TC
Sbjct: 55  LIVDTGSDLIWTQC----KLSSST-----AAAARHGS-----PPLSRTAPARTGAFTRTC 100

Query: 213 LYQVSYGDGSITVGDFSTETLTFRGTRVA--RVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
               +       VG  ++ET TF   R    R+  GCG  + G  + A G+LGL    LS
Sbjct: 101 TASAA------AVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLS 154

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA------VSRTARFTPLLANPKLDTF 324
             TQ      ++FSYCL   +   K S ++FG  A       +R  + T +++NP    +
Sbjct: 155 LITQLK---IQRFSYCLTPFA-DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVY 210

Query: 325 YYVELVGISVGGAHVR-GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           YYV LVGIS+G  H R  + A+   + P G GG I+DSG++V  L   A+ A+++A    
Sbjct: 211 YYVPLVGISLG--HKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVM-- 266

Query: 384 ASSLKRAP----DFSLFDTCFDLSGKTE------VKVPTVVLHFRG-ADVSLPATNYLIP 432
              + R P        ++ CF L  +T       V+VP +VLHF G A + LP  NY   
Sbjct: 267 --DVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQE 324

Query: 433 VDSSGTFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +G  C A   T   SG+SIIGN+QQQ   V++D+   +  FAP  C
Sbjct: 325 -PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 118/390 (30%), Positives = 169/390 (43%), Gaps = 68/390 (17%)

Query: 151 VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPA--KSRSFATVPCRSPLC---------- 198
           + + +DTGSD+VW  CAP K    +  P   P    +RS A V C+SP C          
Sbjct: 63  ITLYMDTGSDLVWFPCAPFKCILCEGKPNASPPVNTTRSVA-VSCKSPACSAAHNLASPS 121

Query: 199 ----------RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
                       +++S C       +  +YGDGS+ +     +TL+     +     GC 
Sbjct: 122 DLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL-IARLYRDTLSLSSLFLRNFTFGCA 180

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQTGR---RFNRKFSYCLVDRSTSA----KPSSMVF 301
           +          G+ G GRG LS P Q      +   +FSYCLV  S  +    KPS ++ 
Sbjct: 181 YTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVRKPSPLIL 237

Query: 302 GDSAVSRTAR----------FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           G                   +TP+L NPK   FY V L+GISVG   +      L +++ 
Sbjct: 238 GRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVG-KRIVPAPEMLRRVNN 296

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL----KRAPDFSLFDTCFDLSGKTE 407
            G+GGV++DSGT+ T L    Y ++ D F  G   +    ++  + +    C+ L+   E
Sbjct: 297 RGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLAPCYYLNSVAE 356

Query: 408 VKVPTVVLHFRGAD--VSLPATNYLIPV----DSS------GTFCFAFAGTMSGLS---- 451
           V  P + L F G +  V LP  NY        D++      G       G  + LS    
Sbjct: 357 V--PVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDEAELSGGPG 414

Query: 452 -IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             +GN QQQGF V YDL   R+GFA R CA
Sbjct: 415 ATLGNYQQQGFEVEYDLEEKRVGFARRQCA 444


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 127/405 (31%), Positives = 181/405 (44%), Gaps = 68/405 (16%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP--CKKCYSQ---TDPVFDPAKSRSF-- 188
           G+Y     +G+    + + +DTGSD+VW  C+P  C  C  +     P+   A ++S   
Sbjct: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSC 133

Query: 189 --------------ATVPC---RSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTE 231
                         A+  C   R PL   ++ S C+  +   +  +YGDGS+ V     +
Sbjct: 134 SAAACSAAHGGSLSASHLCAISRCPL-ESIEISECSSFSCPPFYYAYGDGSL-VARLYRD 191

Query: 232 TLTFRG------TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR---RFNRK 282
           +L+           V     GC H   G      G+ G GRG LS P+Q      +   +
Sbjct: 192 SLSLPTPAPSPPINVRNFTFGCAHTTLG---EPVGVAGFGRGVLSMPSQLATFSPQLGNR 248

Query: 283 FSYCLVDRSTSA----KPSSMVFGDSAVSRTA-RFTPLLANPKLDTFYYVELVGISVGGA 337
           FSYCLV  S +A    +PS ++ G      T   +T LL NPK   FY V L GISVG  
Sbjct: 249 FSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNI 308

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAG--ASSLKRAPDF 393
            +      L K+D  G+GGV++DSGT+ T L    Y ++   F  R G  A+  +R  + 
Sbjct: 309 RIPA-PEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEEN 367

Query: 394 SLFDTCFDLSGKTEVKVPTVVLHFRG--ADVSLPATNYLIPV-----------DSSGTFC 440
           +    C+    +  V VP VVLHF G  ++V LP  NY                  G   
Sbjct: 368 TGLSPCYYY--ENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLM 425

Query: 441 FAFAGTMSGLS-----IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
               G  + L+      +GN QQQGF VVYDL  +R+GFA R C+
Sbjct: 426 LMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 470


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 171/382 (44%), Gaps = 41/382 (10%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           +GL   +G YFT++G+G+P +  Y+ +DTGSD++W+ C  C +C  ++D      ++DP 
Sbjct: 60  NGLPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPK 119

Query: 184 KSRSFATVPCRSPLCRKLDSS---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFR---- 236
           +S++   V C    C         GC   N C Y +SYGDGS T G +  + LTF     
Sbjct: 120 RSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG 179

Query: 237 ----GTRVARVALGCGHDNEGLFVAAA-----GLLGLGRGRLSFPTQTGR--RFNRKFSY 285
                T+ + +  GCG    G F +++     G++G G+   S  +Q     +  + FS+
Sbjct: 180 NPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 239

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    T+        G+  V    + TPL+ N      Y V L  I V G  +  + + 
Sbjct: 240 CL---DTNVGGGIFSIGE-VVEPKVKTTPLVPNM---AHYNVILKNIEVDG-DILQLPSD 291

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
            F  D     G +IDSGT++  L R  Y  L     A    LK       + +CF  +G 
Sbjct: 292 TF--DSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQY-SCFQYTGN 348

Query: 406 TEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFAGTMS------GLSIIGNIQQ 458
            +   P V LHF  +  +++   +YL        +C  +  + S       ++++G+   
Sbjct: 349 VDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVL 408

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
               VVYDL    IG+    C+
Sbjct: 409 SNKLVVYDLENMTIGWTDYNCS 430


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 122/392 (31%), Positives = 161/392 (41%), Gaps = 57/392 (14%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKCYS---QTDPVFDPAKSRSFA 189
           G Y   L  GTPP+    VLDTGS +VW+ C     C KC S      P F P  S S  
Sbjct: 214 GGYSIDLKFGTPPQTFPFVLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSK 273

Query: 190 TVPCRSPLCR------------KLDSSGCNRRNTC-----LYQVSYGDGSITVGDFSTET 232
            V CR+P C             KL  +  +  N C      Y V YG GS T G   +E 
Sbjct: 274 FVGCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGS-TAGFLLSEN 332

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRST 292
           L F    V+   +GC   +        G+ G GRG  S P Q       +FSYCL+    
Sbjct: 333 LNFPAKNVSDFLVGCSVVS---VYQPGGIAGFGRGEESLPAQMNL---TRFSYCLLSHQF 386

Query: 293 SAKP--SSMVF-----GDSAVSRTARFTPLLANPK-----LDTFYYVELVGISVGGAHVR 340
              P  S +V      G+   +    +T  L NP         +YY+ L  I VG   VR
Sbjct: 387 DESPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVR 446

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---FD 397
            +   + + D  G+GG I+DSG+++T + RP +  + + F     +  RA +        
Sbjct: 447 -VPRRMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEF-VKQVNYTRARELEKQFGLS 504

Query: 398 TCFDLSGKTEV-KVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAF--------AGTM 447
            CF L+G  E    P +   FR GA + LP  NY   V      C            G +
Sbjct: 505 PCFVLAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAV 564

Query: 448 SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               I+GN QQQ F V  DL   R GF  + C
Sbjct: 565 GPAVILGNYQQQNFYVECDLENERFGFRSQSC 596


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 131/410 (31%), Positives = 171/410 (41%), Gaps = 70/410 (17%)

Query: 131 LAQGSGEYFTRLGVGTP--PRYVYMVLDTGSDVVWIQCAP--CKKCY-------SQTDPV 179
           LA GS +Y   L VG P     V + LDTGSD+VW  CAP  C  C        + + P+
Sbjct: 82  LAPGS-DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPL 140

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSG--------------------CNRRNTCLYQVSYG 219
             P  SR  +   C SPLC    SS                     C          +YG
Sbjct: 141 PPPIDSRRIS---CASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYG 197

Query: 220 DGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
           DGS+                V      C H          G+ G GRG LS P Q     
Sbjct: 198 DGSLVANLRRGRVGLAASMAVENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSL 254

Query: 280 NRKFSYCLVDRSTSA----KPSSMVFG---DSAVSRTAR----FTPLLANPKLDTFYYVE 328
           + +FSYCLV  S  A    + S ++ G   D+A    +     +TPLL NPK   FY V 
Sbjct: 255 SGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVA 314

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD-----AFRAG 383
           L  +SVGG  ++     L  +D  GNGG+++DSGT+ T L    +  + D        A 
Sbjct: 315 LEAVSVGGKRIQA-QPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAAR 373

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSS-----G 437
            +  + A   +    C+  S  ++  VP V LHFRG A V+LP  NY +   S      G
Sbjct: 374 FTRAEGAEAQTGLAPCYHYS-PSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432

Query: 438 TFCFAFAGTMSG--------LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                  G  +            +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
          Length = 565

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 90/246 (36%), Positives = 128/246 (52%), Gaps = 12/246 (4%)

Query: 240 VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSM 299
           +A    GC     G  V + GL+G  RG LSFP+Q    +   FSYCL    +S    ++
Sbjct: 324 IAAYTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFSGTL 383

Query: 300 VFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
             G +   +  + TPLL+NP   + YYV +VGI VGG  V  + AS    DPA   G I+
Sbjct: 384 RLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPV-AVPASALAFDPASGHGTIV 442

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG 419
           D+GT  TRL+ P Y A+ D FR+   +    P    FDTC++++    + VPTV   F G
Sbjct: 443 DAGTMFTRLSAPVYAAVCDVFRSRVRAPVAGP-LGGFDTCYNVT----ISVPTVTFLFDG 497

Query: 420 -ADVSLPATNYLIPVDSSGTFCFAFAGTMSG-----LSIIGNIQQQGFRVVYDLAASRIG 473
              V+LP  N +I     G  C A A   S      L+++ ++QQQ  RV++D+A  R+G
Sbjct: 498 RVSVTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVLFDVANGRVG 557

Query: 474 FAPRGC 479
           F+   C
Sbjct: 558 FSRELC 563


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 131/410 (31%), Positives = 171/410 (41%), Gaps = 70/410 (17%)

Query: 131 LAQGSGEYFTRLGVGTP--PRYVYMVLDTGSDVVWIQCAP--CKKCY-------SQTDPV 179
           LA GS +Y   L VG P     V + LDTGSD+VW  CAP  C  C        + + P+
Sbjct: 82  LAPGS-DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPL 140

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSG--------------------CNRRNTCLYQVSYG 219
             P  SR  +   C SPLC    SS                     C          +YG
Sbjct: 141 PPPIDSRRIS---CASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYG 197

Query: 220 DGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
           DGS+                V      C H          G+ G GRG LS P Q     
Sbjct: 198 DGSLVANLRRGRVGLAASMAVENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSL 254

Query: 280 NRKFSYCLVDRSTSA----KPSSMVFG---DSAVSRTAR----FTPLLANPKLDTFYYVE 328
           + +FSYCLV  S  A    + S ++ G   D+A    +     +TPLL NPK   FY V 
Sbjct: 255 SGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVA 314

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD-----AFRAG 383
           L  +SVGG  ++     L  +D  GNGG+++DSGT+ T L    +  + D        A 
Sbjct: 315 LEAVSVGGKRIQA-QPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAAR 373

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSS-----G 437
            +  + A   +    C+  S  ++  VP V LHFRG A V+LP  NY +   S      G
Sbjct: 374 FTRAEGAEAQTGLAPCYHYS-PSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432

Query: 438 TFCFAFAGTMSG--------LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                  G  +            +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 126/423 (29%), Positives = 189/423 (44%), Gaps = 54/423 (12%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGE 137
           N T +    L IQ    R+ ++ A  E ++     N  + R +   +   I         
Sbjct: 53  NETAKDRMELDIQHSAARLANIQARIEGSLV--SNNDYKARVSPSLTGRTI--------- 101

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
               + +G PP    +V+DTGSD++W+ C PC  C +    +FDP+KS +F      SPL
Sbjct: 102 -MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTF------SPL 154

Query: 198 CRK-LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVALGCGHD- 250
           C+   D  GC R +   + V+Y D S   G F  +T+ F  T     R++ V  GCGH+ 
Sbjct: 155 CKTPCDFEGC-RCDPIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGHNI 213

Query: 251 NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKP----SSMVFGDSAV 306
                    G+LGL  G  S  T+ G    +KFSYC+      A P      ++ G+ A 
Sbjct: 214 GHDTDPGHNGILGLNNGPDSLVTKLG----QKFSYCI---GNLADPYYNYHQLILGEGA- 265

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
                 TP       + FYYV + GISVG   +  I    F++     GGVIID+G+++T
Sbjct: 266 DLEGYSTPFEV---YNGFYYVTMEGISVGEKRLD-IAPETFEMKENRAGGVIIDTGSTIT 321

Query: 367 RLTRPAYIALRDAFRA--GASSLKRAPDFSLFDTCFDLS-GKTEVKVPTVVLHFR-GADV 422
            L    +  L    R   G S  +   + S +  CF  S  +  V  P V  HF  GAD+
Sbjct: 322 FLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADL 381

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSI------IGNIQQQGFRVVYDLAASRIGFAP 476
           +L + ++   ++ +  FC    G +S L+I      IG + QQ + V YDL    + F  
Sbjct: 382 ALDSGSFFNQLNDN-VFCMT-VGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQR 439

Query: 477 RGC 479
             C
Sbjct: 440 IDC 442


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 120/375 (32%), Positives = 177/375 (47%), Gaps = 42/375 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------FDPAKSRSFA 189
           G Y+T+L +GTPPR  Y+ +DTGSDV+W+ CA C  C  QT  +      FDP  S + +
Sbjct: 79  GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGC-PQTSGLQIQLNFFDPGSSVTAS 137

Query: 190 TVPCRSPLCR---KLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRG-------- 237
            + C    C    +   SGC+ + N C Y   YGDGS T G + ++ L F          
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 238 TRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRS 291
              A V  GC     G  V    A  G+ G G+  +S  +Q   +    R FS+CL  + 
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL--KG 255

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            +     +V G+  V     FTPL+ +      Y V L+ ISV G  +  I  S+F    
Sbjct: 256 ENGGGGILVLGE-IVEPNMVFTPLVPS---QPHYNVNLLSISVNGQALP-INPSVFS--- 307

Query: 352 AGNG-GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
             NG G IID+GT++  L+  AY+   +A     S   R P  S  + C+ ++       
Sbjct: 308 TSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR-PVVSKGNQCYVITTSVGDIF 366

Query: 411 PTVVLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTMS-GLSIIGNIQQQGFRVVY 465
           P V L+F  GA + L   +YLI  ++ G    +C  F    + G++I+G++  +    VY
Sbjct: 367 PPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVY 426

Query: 466 DLAASRIGFAPRGCA 480
           DL   RIG+A   C+
Sbjct: 427 DLVGQRIGWANYDCS 441


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 143/303 (47%), Gaps = 18/303 (5%)

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNR-----RNTCLYQVSYGDGSITVGDFSTET 232
           P FD + S +     C S LC+ L  + C         TC+Y   Y D S+T G    + 
Sbjct: 23  PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82

Query: 233 LTF-RGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
            TF  G  V  VA GCG  N G+F +   G+ G GRG LS P+Q        FS+C    
Sbjct: 83  FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVG---NFSHCFTAV 139

Query: 291 STSAKPSSMVFGDSAVSRTAR----FTPLLANPKLDTFYYVELVGISVGGAHVRGITASL 346
           +   + + ++   + + +  R     TPL+ N    TFYY+ L GI+VG   +  +  S 
Sbjct: 140 NGLKQSTVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLP-VPESA 198

Query: 347 FKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT 406
           F L   G GG IIDSGTS+T L    Y  +RD F A         + +   TCF    + 
Sbjct: 199 FALT-NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQA 257

Query: 407 EVKVPTVVLHFRGADVSLPATNYL--IPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVV 464
           +  VP +VLHF GA + LP  NY+  +P D+  +            +IIGN QQQ   V+
Sbjct: 258 KPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVL 317

Query: 465 YDL 467
           YDL
Sbjct: 318 YDL 320


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 119/405 (29%), Positives = 187/405 (46%), Gaps = 47/405 (11%)

Query: 112 RNRSR-GRANGGFSSSVISGLAQGS------GEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
           R+R+R GR   G    V+    QG+      G YFT++ +G+P +  Y+ +DTGSD++WI
Sbjct: 50  RDRARHGRILQGVVGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKEFYVQIDTGSDILWI 109

Query: 165 QCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCR---KLDSSGCNRR-NTCLYQ 215
            C  C  C   +        FD A S + A V C  P+C    +  +S C+ + N C Y 
Sbjct: 110 NCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTATSECSSQANQCSYT 169

Query: 216 VSYGDGSITVGDFSTETLTFRGTRVAR---------VALGCGHDNEGLFV----AAAGLL 262
             YGDGS T G + ++T+ F    + +         +  GC     G       A  G+ 
Sbjct: 170 FQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIF 229

Query: 263 GLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL-ANP 319
           G G G LS  +Q   R    + FS+CL  +        +V G+  +  +  ++PL+ + P
Sbjct: 230 GFGPGALSVISQLSSRGVTPKVFSHCL--KGGENGGGVLVLGE-ILEPSIVYSPLVPSQP 286

Query: 320 KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
                Y + L  I+V G  +  I +++F      N G I+DSGT++  L + AY     A
Sbjct: 287 H----YNLNLQSIAVNG-QLLPIDSNVFA--TTNNQGTIVDSGTTLAYLVQEAYNPFVKA 339

Query: 380 FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIP---VDS 435
             A  S   + P  S  + C+ +S       P V L+F  GA + L   +YL+    +D 
Sbjct: 340 ITAAVSQFSK-PIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDG 398

Query: 436 SGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +  +C  F     G +I+G++  +    VYDLA  RIG+A   C+
Sbjct: 399 AAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLANQRIGWADYDCS 443


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 122/417 (29%), Positives = 178/417 (42%), Gaps = 53/417 (12%)

Query: 88  RIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTP 147
           R+  D   V + T+F       PP   + G A       V  G  +G   YF        
Sbjct: 46  RVDADGFMVVNATSFHHRPPLTPPLEYTYGVA-------VTIGTGRGKSTYF-------- 90

Query: 148 PRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN 207
                +VLDT S + W++CA C     Q  PVFDP+ S S+  +   SPLCR  +     
Sbjct: 91  -----LVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPV-LP 144

Query: 208 RRNTCLYQVSYGDGSITVGDFSTETLTFRGTR--VARVALGCGHDNEGLFVAA--AGLLG 263
             + C + +  G+    VG   T+T+        +  VA GC    EG       AG LG
Sbjct: 145 AGDKCSFHLP-GEAHGYVG---TDTIILGNPTLPIHSVAFGCAQSTEGFDTKGTFAGTLG 200

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV-FG----DSAVSRTARFTPLLAN 318
           +G+   S   Q   R   +FSYCL+    S   +  + FG    D  +    R   L   
Sbjct: 201 MGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILPTP 260

Query: 319 PKL-----DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAY 373
           P L     D+ YYV+L+GIS+ G  + GI  ++F+    G+GG  +D+GT VT L   AY
Sbjct: 261 PHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQVTHLVPAAY 320

Query: 374 IALRDA----FRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG------ADVS 423
             + +A     +       R P+FSL   CF         +P + L F G      A + 
Sbjct: 321 AVVEEAVAHMVQQWGYKRVRDPNFSL---CFREHPGIWSHIPKLTLDFEGPASRTVAHLE 377

Query: 424 LPATNYLIPVDSSGTFCFAFAGTMSG-LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + + N  + VD+    CF    T  G  +++G +QQ   R ++DL A+ I F    C
Sbjct: 378 IVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHRESC 434


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 120/403 (29%), Positives = 177/403 (43%), Gaps = 55/403 (13%)

Query: 116 RGRANGGFSSSVISGLAQGS------GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC 169
             R   GF   V+    QGS      G YFTR+ +GTPPR   + +DTGSDV+W+ C+ C
Sbjct: 53  HARLLQGFVGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSC 112

Query: 170 KKCYSQTDPV------FDPAKSRSFATVPCRSPLCR---KLDSSGC-NRRNTCLYQVSYG 219
             C  QT  +      FD   S +   VPC  P+C    +  ++ C  + N C Y   YG
Sbjct: 113 SNC-PQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYG 171

Query: 220 DGSITVGDFSTETLTFR--------GTRVARVALGCGHDNEGLFV----AAAGLLGLGRG 267
           DGS T G + ++T  F             A +  GC     G       A  G+ G G+G
Sbjct: 172 DGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQG 231

Query: 268 RLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFY 325
            LS  +Q        R FS+CL  +   +    +V G+  +     ++PL+ +      Y
Sbjct: 232 ELSVISQLSSHGITPRVFSHCL--KGEDSGGGILVLGE-ILEPGIVYSPLVPS---QPHY 285

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAG-----NGGVIIDSGTSVTRLTRPAYIALRDAF 380
            ++L  I+V G         L  +DPA      N G IID+GT++  L   AY     A 
Sbjct: 286 NLDLQSIAVSG--------QLLPIDPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAI 337

Query: 381 RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDS---S 436
            A  S L   P  +  + C+ +S       P V  +F  GA + L    YL+ + +   +
Sbjct: 338 TAAVSQLA-TPTINKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGA 396

Query: 437 GTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +C  F     G++I+G++  +    VYDLA  RIG+A   C
Sbjct: 397 ALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 114/379 (30%), Positives = 173/379 (45%), Gaps = 45/379 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G YFT++G+G P ++  + +DTGSDV+W+ C PC  C  ++       ++DP +S + + 
Sbjct: 27  GLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSL 86

Query: 191 VPCRSPLC---RKLDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFR-------GTR 239
           V C  PLC   R+   + C++  N C Y  SYGDGS + G +  + + +           
Sbjct: 87  VSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146

Query: 240 VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQTGRRFN--RKFSYCLVDRSTS 293
            ++V  GC     G       A  G++G G+  LS P Q   + N  R FS+CL      
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL---EGE 203

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
            +   ++           +TPL+ +      Y V L GISV    +  I A  F      
Sbjct: 204 KRGGGILVIGGIAEPGMTYTPLVPD---SVHYNVVLRGISVNSNRLP-IDAEDFS--STN 257

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
           + GVI+DSGT++      AY     A R  A+S        +   CF +SG+     P V
Sbjct: 258 DTGVIMDSGTTLAYFPSGAYNVFVQAIRE-ATSATPVRVQGMDTQCFLVSGRLSDLFPNV 316

Query: 414 VLHFRGADVSLPATNYLI-----PVDSSGTFCFAF------AGTMSG--LSIIGNIQQQG 460
            L+F G  + L   NYL+     P  ++  +C  +      AG   G  L+I+G+I  + 
Sbjct: 317 TLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKD 376

Query: 461 FRVVYDLAASRIGFAPRGC 479
             VVYDL  SRIG+    C
Sbjct: 377 KLVVYDLDNSRIGWMSYNC 395


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 173/374 (46%), Gaps = 43/374 (11%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP--CKKCYSQTDPVFDPAKSRSFATVPCRS 195
           Y  +  +G+PP   Y + DTGS++VWIQC    C  CY Q  P+F+P KS ++A   C  
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167

Query: 196 PLCRKL-----DSSGCNRR-NTCLYQVSYGDGSITVGDFSTETLTFR------GTRVARV 243
             C++      +  GC      C Y +SY D S + G  ST+ +TF       G    R+
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227

Query: 244 ALGCGHDNEGL------FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS 297
             GCG++N            A G++GLG    S     G+    +FSYC +      KP+
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASL---VGQLTLGQFSYC-ISTPDVQKPN 283

Query: 298 SMV---FGDSAVSRTARFTPLLANPKLDTFYYVELV-GISVGGAHVRGITASLFKLDPAG 353
             +   FG +A S +   T L  N  L+ +Y  + V GI V    V+G    +F+    G
Sbjct: 284 GTIEIRFGLAA-SISGHSTALANN--LEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGG 340

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF-----SLFDTCFDLSGKTEV 408
            GG+I+DSGT+ T L   A  AL    +     ++ APD      S +  C++ +     
Sbjct: 341 IGGLIMDSGTTYTELYFSALDALIGELK---EQIELAPDTQDHSNSNYSLCYNAANFLLT 397

Query: 409 KVPTVVLHF-RGADVSLPATNYLIPVDS-SGTFCFAFAGTMSGLSIIGNIQQQGFRVVYD 466
            VP + L F    +   P T     +D+ +  +C A  GT SG+SIIG  Q +  ++ YD
Sbjct: 398 YVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMFGT-SGISIIGIYQHRDIKIGYD 456

Query: 467 LAASRIGFAPR-GC 479
           L  + + F    GC
Sbjct: 457 LKYNLVSFTEMFGC 470


>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
 gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
          Length = 442

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 114/339 (33%), Positives = 149/339 (43%), Gaps = 74/339 (21%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           M +DT  D+ WIQCAPC   +CY Q + +FDP +SR+ A VPC S  C +L    +GC+ 
Sbjct: 166 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCS- 224

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
            N C Y V YGDG  T G +  + LT    T V     GC H   G F A          
Sbjct: 225 NNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSA---------- 274

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK-LDTFYY 326
                + +G  F R                               TPL+ NP  + T Y 
Sbjct: 275 -----STSGTMFAR-------------------------------TPLVRNPSIIPTLYL 298

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           V L GI VGG  +  +   +F       GG ++DS   +T+L   AY ALR AFR+  ++
Sbjct: 299 VRLRGIEVGGRRLN-VPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAA 351

Query: 387 LKR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---CFA 442
             R A   +  DTC+D    T V VP V L F G  V        + +D+ G     C A
Sbjct: 352 YPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV--------VRLDAMGVMVEGCLA 403

Query: 443 FAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           F  T     L  IGN+QQQ   V+YD+    +GF    C
Sbjct: 404 FVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 120/375 (32%), Positives = 177/375 (47%), Gaps = 42/375 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------FDPAKSRSFA 189
           G Y+T+L +GTPPR  Y+ +DTGSDV+W+ CA C  C  QT  +      FDP  S + +
Sbjct: 79  GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGC-PQTSGLQIQLNFFDPGSSVTAS 137

Query: 190 TVPCRSPLCR---KLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRG-------- 237
            + C    C    +   SGC+ + N C Y   YGDGS T G + ++ L F          
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 238 TRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRS 291
              A V  GC     G  V    A  G+ G G+  +S  +Q   +    R FS+CL  + 
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL--KG 255

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            +     +V G+  V     FTPL+ +      Y V L+ ISV G  +  I  S+F    
Sbjct: 256 ENGGGGILVLGE-IVEPNMVFTPLVPS---QPHYNVNLLSISVNGQALP-INPSVFS--- 307

Query: 352 AGNG-GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
             NG G IID+GT++  L+  AY+   +A     S   R P  S  + C+ ++       
Sbjct: 308 TSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR-PVVSKGNQCYVITTSVGDIF 366

Query: 411 PTVVLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTMS-GLSIIGNIQQQGFRVVY 465
           P V L+F  GA + L   +YLI  ++ G    +C  F    + G++I+G++  +    VY
Sbjct: 367 PPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVY 426

Query: 466 DLAASRIGFAPRGCA 480
           DL   RIG+A   C+
Sbjct: 427 DLVGQRIGWANYDCS 441


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 121/360 (33%), Positives = 168/360 (46%), Gaps = 29/360 (8%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC-----YSQTDPVFDPAKSR 186
           A  +G Y     VGTPP+ V  VLD  SD VW+QC+ C  C      + + P F    S 
Sbjct: 91  ATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSS 150

Query: 187 SFATVPCRSPLCRKLDSSGCNRRNT-CLYQVSYGDGS--ITVGDFSTETLTFRGTRVARV 243
           +   V C +  C++L    C+  ++ C Y   YG G+   T G  + +   F   R   V
Sbjct: 151 TIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGV 210

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLS--FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVF 301
             GC    EG      G++GLGRG LS     Q GR     FSY L         S ++F
Sbjct: 211 IFGCAVATEGDI---GGVIGLGRGELSPVSQLQIGR-----FSYYLAPDDAVDVGSFILF 262

Query: 302 GDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
            D A  RT+R   TPL+A+    + YYVEL GI V G  +  I    F L   G+GGV++
Sbjct: 263 LDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDL-AIPRGTFDLQADGSGGVVL 321

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLK-RAPDFSL--FDTCFDLSGKTEVKVPTVVLH 416
                VT L   AY  +R A    AS ++ RA D S    D C+        KVP++ L 
Sbjct: 322 SITIPVTFLDAGAYKVVRQAM---ASKIELRAADGSELGLDLCYTSESLATAKVPSMALV 378

Query: 417 FRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGF 474
           F G  V  L   NY     ++G  C     + +G  S++G++ Q G  ++YD++ SR+ F
Sbjct: 379 FAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVF 438


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 113/377 (29%), Positives = 172/377 (45%), Gaps = 45/377 (11%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVP 192
           YFT++G+G P ++  + +DTGSDV+W+ C PC  C  ++       ++DP +S + + V 
Sbjct: 2   YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61

Query: 193 CRSPLC---RKLDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTFR-------GTRVA 241
           C  PLC   R+   + C++  N C Y  SYGDGS + G +  + + +            +
Sbjct: 62  CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121

Query: 242 RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQTGRRFN--RKFSYCLVDRSTSAK 295
           +V  GC     G       A  G++G G+  LS P Q   + N  R FS+CL       +
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL---EGEKR 178

Query: 296 PSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
              ++           +TPL+ +      Y V L GISV    +  I A  F      + 
Sbjct: 179 GGGILVIGGIAEPGMTYTPLVPD---SVHYNVVLRGISVNSNRLP-IDAEDFS--STNDT 232

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVL 415
           GVI+DSGT++      AY     A R  A+S        +   CF +SG+     P V L
Sbjct: 233 GVIMDSGTTLAYFPSGAYNVFVQAIRE-ATSATPVRVQGMDTQCFLVSGRLSDLFPNVTL 291

Query: 416 HFRGADVSLPATNYLI-----PVDSSGTFCFAF------AGTMSG--LSIIGNIQQQGFR 462
           +F G  + L   NYL+     P  ++  +C  +      AG   G  L+I+G+I  +   
Sbjct: 292 NFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKL 351

Query: 463 VVYDLAASRIGFAPRGC 479
           VVYDL  SRIG+    C
Sbjct: 352 VVYDLDNSRIGWMSYNC 368


>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
          Length = 424

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 114/339 (33%), Positives = 149/339 (43%), Gaps = 74/339 (21%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           M +DT  D+ WIQCAPC   +CY Q + +FDP +SR+ A VPC S  C +L    +GC+ 
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCS- 206

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
            N C Y V YGDG  T G +  + LT    T V     GC H   G F A          
Sbjct: 207 NNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSA---------- 256

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK-LDTFYY 326
                + +G  F R                               TPL+ NP  + T Y 
Sbjct: 257 -----STSGTMFAR-------------------------------TPLVRNPSIIPTLYL 280

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           V L GI VGG  +  +   +F       GG ++DS   +T+L   AY ALR AFR+  ++
Sbjct: 281 VRLRGIEVGGRRLN-VPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAA 333

Query: 387 LKR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---CFA 442
             R A   +  DTC+D    T V VP V L F G  V        + +D+ G     C A
Sbjct: 334 YPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV--------VRLDAMGVMVEGCLA 385

Query: 443 FAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           F  T     L  IGN+QQQ   V+YD+    +GF    C
Sbjct: 386 FVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424


>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
          Length = 424

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 114/339 (33%), Positives = 149/339 (43%), Gaps = 74/339 (21%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLD--SSGCNR 208
           M +DT  D+ WIQCAPC   +CY Q + +FDP +SR+ A VPC S  C +L    +GC+ 
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCS- 206

Query: 209 RNTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
            N C Y V YGDG  T G +  + LT    T V     GC H   G F A          
Sbjct: 207 NNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSA---------- 256

Query: 268 RLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPK-LDTFYY 326
                + +G  F R                               TPL+ NP  + T Y 
Sbjct: 257 -----STSGTMFAR-------------------------------TPLVRNPSIIPTLYL 280

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           V L GI VGG  +  +   +F       GG ++DS   +T+L   AY ALR AFR+  ++
Sbjct: 281 VRLRGIEVGGRRLN-VPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAA 333

Query: 387 LKR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---CFA 442
             R A   +  DTC+D    T V VP V L F G  V        + +D+ G     C A
Sbjct: 334 YPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV--------VRLDAMGVMVEGCLA 385

Query: 443 FAGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           F  T     L  IGN+QQQ   V+YD+    +GF    C
Sbjct: 386 FVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 133/394 (33%), Positives = 180/394 (45%), Gaps = 72/394 (18%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT-DPVFDPAKSRSFATVPCRSPLC-- 198
           + VGTPP+ V MVLDTGS++ W+    C   Y+    P F+ + S S+  VPC S  C  
Sbjct: 59  VAVGTPPQNVTMVLDTGSELSWLL---CNGSYAPPLTPAFNASGSSSYGAVPCPSTACEW 115

Query: 199 --RKLDSSG-CNR--RNTCLYQVSYGDGSITVGDFSTETLTFRG---------------- 237
             R L     C+    N C   +SY D S   G  +T+T    G                
Sbjct: 116 RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFGCITS 175

Query: 238 --TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAK 295
             +  A  + G G D   +  AA GLLG+ RG LSF TQTG    R+F+YC+   +    
Sbjct: 176 YSSTTATNSNGTGTD---VSEAATGLLGMNRGTLSFVTQTG---TRRFAYCI---APGEG 226

Query: 296 PSSMVFGDS-AVSRTARFTPLLAN----PKLDTFYY-VELVGISVGGAHVRGITASLFKL 349
           P  ++ GD   V+    +TPL+      P  D   Y V+L GI VG A +  I  S+   
Sbjct: 227 PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLP-IPKSVLTP 285

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR---APDFSL---FDTCFDLS 403
           D  G G  ++DSGT  T L   AY AL+  F + A  L      P F     FD CF   
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFR-- 343

Query: 404 GKTEVKV-------PTVVLHFRGADVSLPATN--YLIPVDSSG------TFCFAFAGT-M 447
              E +V       P V L  RGA+V++      Y++P +  G       +C  F  + M
Sbjct: 344 -GPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDM 402

Query: 448 SGLS--IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +G+S  +IG+  QQ   V YDL   R+GFAP  C
Sbjct: 403 AGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 133/394 (33%), Positives = 180/394 (45%), Gaps = 72/394 (18%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT-DPVFDPAKSRSFATVPCRSPLC-- 198
           + VGTPP+ V MVLDTGS++ W+    C   Y+    P F+ + S S+  VPC S  C  
Sbjct: 59  VAVGTPPQNVTMVLDTGSELSWLL---CNGSYAPPLTPAFNASGSSSYGAVPCPSTACEW 115

Query: 199 --RKLDSSG-CNR--RNTCLYQVSYGDGSITVGDFSTETLTFRG---------------- 237
             R L     C+    N C   +SY D S   G  +T+T    G                
Sbjct: 116 RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFGCITS 175

Query: 238 --TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAK 295
             +  A  + G G D   +  AA GLLG+ RG LSF TQTG    R+F+YC+   +    
Sbjct: 176 YSSTTATNSNGTGTD---VSEAATGLLGMNRGTLSFVTQTG---TRRFAYCI---APGEG 226

Query: 296 PSSMVFGDS-AVSRTARFTPLLAN----PKLDTFYY-VELVGISVGGAHVRGITASLFKL 349
           P  ++ GD   V+    +TPL+      P  D   Y V+L GI VG A +  I  S+   
Sbjct: 227 PGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLP-IPKSVLTP 285

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR---APDFSL---FDTCFDLS 403
           D  G G  ++DSGT  T L   AY AL+  F + A  L      P F     FD CF   
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFR-- 343

Query: 404 GKTEVKV-------PTVVLHFRGADVSLPATN--YLIPVDSSG------TFCFAFAGT-M 447
              E +V       P V L  RGA+V++      Y++P +  G       +C  F  + M
Sbjct: 344 -GPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDM 402

Query: 448 SGLS--IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +G+S  +IG+  QQ   V YDL   R+GFAP  C
Sbjct: 403 AGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
          Length = 216

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 114/216 (52%), Gaps = 6/216 (2%)

Query: 269 LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVE 328
           +S  +QTG R+N  FSYCL    +     S+  G +   R  R+TPLL NP   + YYV 
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVN 60

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           + G+SVG   V+ + A  F  DPA   G +IDSGT +TR T P Y ALR+ FR   ++  
Sbjct: 61  VTGLSVGRTWVK-VPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS 119

Query: 389 RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA--- 444
                  FDTCF+         P V LH  G  D++LP  N LI   ++   C A A   
Sbjct: 120 GYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAP 179

Query: 445 -GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               + ++++ N+QQQ  RVV D+A SR+GFA   C
Sbjct: 180 QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 128/391 (32%), Positives = 172/391 (43%), Gaps = 68/391 (17%)

Query: 151 VYMVLDTGSDVVWIQCAP--CKKCYSQTDP-VFDP----------AKSRSFATV---PCR 194
           VYM  DTGSD+VW  C+P  C  C  + +P    P           KSR+ +T    P  
Sbjct: 107 VYM--DTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCKSRACSTAHNSPST 164

Query: 195 SPLC-------RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL-- 245
           S LC        ++++S C+  +   +  +YGDGS+ +       L    T     +L  
Sbjct: 165 SDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSL-IAKLHKHNLIMPSTSNKPFSLKD 223

Query: 246 ---GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR---RFNRKFSYCLV----DRSTSAK 295
              GC H   G      G+ G G G LS P Q          +FSYCLV    D +    
Sbjct: 224 FTFGCAHSALG---EPIGVAGFGFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTKLHH 280

Query: 296 PSSMVFG---DSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
           PS ++ G   +       +F  TP+L NPK   FY V +  ISVG + VR   A L ++D
Sbjct: 281 PSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNA-LIRID 339

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSL---FDTCFDLSG-- 404
             GNGGV++DSGT+ T L    Y ++     R      KRA +         C+ L G  
Sbjct: 340 RDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLEGNG 399

Query: 405 --KTEVKVPTVVLHFRGA-DVSLPATNYLIP-VDSS--------GTFCFAFAGTMSGL-- 450
             +  + VP +  HF G   V LP  NY    +D          G       G  S    
Sbjct: 400 VERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESEGGP 459

Query: 451 -SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            + +GN QQQGF+VVYDL   R+GFAPR CA
Sbjct: 460 GATLGNYQQQGFQVVYDLEERRVGFAPRKCA 490


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 119/375 (31%), Positives = 176/375 (46%), Gaps = 42/375 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------FDPAKSRSFA 189
           G Y+T++ +G+PPR  Y+ +DTGSDV+W+ CA C  C  QT  +      FDP  S +  
Sbjct: 79  GLYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGC-PQTSGLQIQLNFFDPGSSVTAT 137

Query: 190 TVPCRSPLCR---KLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRG-------- 237
            V C    C    +   SGC+ + N C Y   YGDGS T G + ++ L F          
Sbjct: 138 PVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 238 TRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRS 291
              A V  GC     G  V    A  G+ G G+  +S  +Q   +    R FS+CL  + 
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL--KG 255

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            +     +V G+  V     FTPL+ +      Y V L+ ISV G  +  I  S+F    
Sbjct: 256 ENGGGGILVLGE-IVEPNMVFTPLVPS---QPHYNVNLLSISVNGQALP-INPSVFS--- 307

Query: 352 AGNG-GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
             NG G IID+GT++  L+  AY+   +A     S   R P  S  + C+ ++       
Sbjct: 308 TSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR-PVVSKGNQCYVIATSVADIF 366

Query: 411 PTVVLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTMS-GLSIIGNIQQQGFRVVY 465
           P V L+F  GA + L   +YLI  ++ G    +C  F    + G++I+G++  +    VY
Sbjct: 367 PPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVY 426

Query: 466 DLAASRIGFAPRGCA 480
           DL   RIG+A   C+
Sbjct: 427 DLVGQRIGWANYDCS 441


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 164/371 (44%), Gaps = 49/371 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y     +GTPP+ V  V+D   ++VW QC PC+ C+ Q  P+FDP KS +F  +PC S
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 196 PLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GHDN 251
            LC  +  S  N   + C+Y+     G  T G   T+T    G     +  GC       
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAI-GAAKETLGFGCVVMTDKR 172

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA-------------KPSS 298
                  +G++GLGR   S  TQ        FSYCL  +S+ A             K SS
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNV---TAFSYCLAGKSSGALFLGATAKQLAGGKNSS 229

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
             F    +  +A  +   +NP    +Y V+L GI  GGA ++  ++S           V+
Sbjct: 230 TPF---VIKTSAGSSDNGSNP----YYMVKLAGIKAGGAPLQAASSS--------GSTVL 274

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           +D+ +  + L   AY AL+ A  A       A     +D CF  S       P +V  F 
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCF--SKAVAGDAPELVFTFD 332

Query: 419 -GADVSLPATNYLIPVDSSGTFCFAFA--------GTMSGLSIIGNIQQQGFRVVYDLAA 469
            GA +++P  NYL+    +GT C            G + G SI+G++QQ+   V++DL  
Sbjct: 333 GGAALTVPPANYLL-ASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391

Query: 470 SRIGFAPRGCA 480
             + F P  C+
Sbjct: 392 ETLSFKPADCS 402


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 115/371 (30%), Positives = 161/371 (43%), Gaps = 51/371 (13%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           +     +G PP     V+DTGS + W+ C PC  C  Q+ P+FDP+KS +++ + C    
Sbjct: 93  FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCSE-- 150

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG-----TRVARVALGCGHD-- 250
           C K D         C Y V Y     + G ++ E LT         +V  +  GCG    
Sbjct: 151 CNKCDVV----NGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFS 206

Query: 251 ---NEGLFVAAAGLLGLGRGRLS-FPTQTGRRFNRKFSYCLVD-RSTSAKPSSMVFGDSA 305
              N   +    G+ GLG GR S  P+     F +KFSYC+ + R+T+ K + +V GD A
Sbjct: 207 ISSNGYPYQGINGVFGLGSGRFSLLPS-----FGKKFSYCIGNLRNTNYKFNRLVLGDKA 261

Query: 306 VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA-GNGGVIIDSGTS 364
             +    T  + N      YYV L  IS+GG  +  I  +LF+      N GVIIDSG  
Sbjct: 262 NMQGDSTTLNVIN----GLYYVNLEAISIGGRKLD-IDPTLFERSITDNNSGVIIDSGAD 316

Query: 365 VTRLTRPAYIALR---DAFRAGASSLKRAPDFSLFDTCF------DLSGKTEVKVPTVVL 415
            T LT+  +  L    +    G   L +    + +  C+      DLSG      P V  
Sbjct: 317 HTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSG-----FPLVTF 371

Query: 416 HF-RGADVSLPATNYLIPVDSSGTFCFA------FAGTMSGLSIIGNIQQQGFRVVYDLA 468
           HF  GA + L  T+  I   +   FC A      F       S IG + QQ + V YDL 
Sbjct: 372 HFAEGAVLDLDVTSMFIQT-TENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLN 430

Query: 469 ASRIGFAPRGC 479
             R+ F    C
Sbjct: 431 RMRVYFQRIDC 441


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 94/272 (34%), Positives = 133/272 (48%), Gaps = 17/272 (6%)

Query: 10  LLLFSFFFTAAASLQYQTFVLNSLPTPSTLSWPESVSVSESESSLPLP-APDAESSLSLR 68
            LL+S   ++   L +Q     +L TPSTL      S+  S    P P   D  +SL + 
Sbjct: 13  FLLYSALLSSKRGLAFQG-RKTALSTPSTLHNVHITSLMPSSVCSPSPKGDDKRASLEVI 71

Query: 69  LHH--VDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGG-FSS 125
             H     LS ++         + +D  RV S+ +      R+       G+  G   + 
Sbjct: 72  HKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRS------RLAKNPADGGKLKGSKVTL 125

Query: 126 SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK-CYSQTDPVFDPAK 184
              SG   G+G Y   +G+GTP R +  + DTGSD+ W QC PC + CY Q +P+F+P+K
Sbjct: 126 PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSK 185

Query: 185 SRSFATVPCRSPLCRKLDSSGCN----RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV 240
           S S+  + C SP C +L S   N      +TC+Y + YGD S +VG F+ + L    T V
Sbjct: 186 STSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDV 245

Query: 241 -ARVALGCGHDNEGLFVAAAGLLGLGRGRLSF 271
                 GCG +N GLFV  AGL+GLGR  LS 
Sbjct: 246 FNNFLFGCGQNNRGLFVGVAGLIGLGRNALSL 277



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/106 (41%), Positives = 62/106 (58%), Gaps = 5/106 (4%)

Query: 377 RDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDS 435
           R+A    +   K AP  S+ DTC+D S    V VP + L+F  GA++ L  +     ++ 
Sbjct: 272 RNALSLMSKYPKAAP-ASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNI 330

Query: 436 SGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           S   C AFAG    + ++I+GN+QQ+ F VVYD+A  RIGFAP GC
Sbjct: 331 S-QVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/385 (31%), Positives = 178/385 (46%), Gaps = 50/385 (12%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           SGLA  +G YFTR+G+GTP +  Y+ +DTGSD++W+ C  C  C  +++      ++DP 
Sbjct: 81  SGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPR 140

Query: 184 KSRSFATVPCRSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR 239
            S+S   V C    C   +  G    C   + C Y +SYGDGS T G F T+ L +    
Sbjct: 141 GSQSGELVTCDQQFCVA-NYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVS 199

Query: 240 --------VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                    A V+ GCG    G      +A  G+LG G+   S  +Q     +  + F++
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    T         G+  V    + TPL+++      Y V L GI VGG  + G+  +
Sbjct: 260 CL---DTVNGGGIFAIGN-VVQPKVKTTPLVSDMP---HYNVILKGIDVGGTAL-GLPTN 311

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLS 403
           +F  D   + G IIDSGT++  +    Y AL      +    S++   DFS    CF  S
Sbjct: 312 IF--DSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS----CFQYS 365

Query: 404 GKTEVKVPTVVLHFRGADVSLPAT--NYLIPVDSSGTFCFAFAG----TMSG--LSIIGN 455
           G  +   P V  HF G DVSL  +  +YL   +    +C  F      T  G  + ++G+
Sbjct: 366 GSVDDGFPEVTFHFEG-DVSLIVSPHDYLFQ-NGKNLYCMGFQNGGVQTKDGKDMVLLGD 423

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           +      V+YDL    IG+A   C+
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCS 448


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 110/371 (29%), Positives = 165/371 (44%), Gaps = 49/371 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y     +GTPP+ V  V+D   ++VW QC PC+ C+ Q  P+FDP KS +F  +PC S
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 196 PLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---GHDN 251
            LC  +  S  N   + C+Y+     G  T G   T+T    G     +  GC       
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAI-GAAKETLGFGCVVMTDKR 172

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSA-------------KPSS 298
                  +G++GLGR   S  TQ        FSYCL  +S+ A             K SS
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNV---TAFSYCLAGKSSGALFLGATAKQLAGGKNSS 229

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
             F    +  +A  +   +NP    +Y V+L GI  GGA ++  ++S           V+
Sbjct: 230 TPF---VIKTSAGSSDNGSNP----YYMVKLAGIKTGGAPLQAASSS--------GSTVL 274

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR 418
           +D+ +  + L   AY AL+ A  A       A     +D CF  +   +   P +V  F 
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGD--APELVFTFD 332

Query: 419 -GADVSLPATNYLIPVDSSGTFCFAFA--------GTMSGLSIIGNIQQQGFRVVYDLAA 469
            GA +++P  NYL+    +GT C            G + G SI+G++QQ+   V++DL  
Sbjct: 333 GGAALTVPPANYLL-ASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391

Query: 470 SRIGFAPRGCA 480
             + F P  C+
Sbjct: 392 ETLSFKPADCS 402


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 137/424 (32%), Positives = 185/424 (43%), Gaps = 63/424 (14%)

Query: 111 PRNRSRGRANGGFSSSVISGLAQGS-GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA-- 167
           P + S+  + G  S    + L   S G Y     +GTPP+ + ++LDTGS + W+ C   
Sbjct: 71  PNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSS 130

Query: 168 -PCKKCYSQTD---PVFDPAKSRSFATVPCRSPLCRKLDSSG-----CNR---------- 208
             C+ C S +    PVF P  S S   V CR+P C+ + S+      C R          
Sbjct: 131 YECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANC 190

Query: 209 ----RNTC-LYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLG 263
                N C  Y V YG GS T G    +TL   G  V    LGC      +    +GL G
Sbjct: 191 PAAASNVCPPYAVVYGSGS-TAGLLIADTLRAPGRAVPGFVLGC--SLVSVHQPPSGLAG 247

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDR---STSAKPSSMVFGDSAVSRTARFTPLLANPK 320
            GRG  S P Q G     KFSYCL+ R     +A   S+V G +      ++ PL+ +  
Sbjct: 248 FGRGAPSVPAQLGL---PKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAA 304

Query: 321 LD-----TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT----RP 371
            D      +YY+ L G++VGG  VR + A  F  + AG+GG I+DSGT+ T L     +P
Sbjct: 305 GDKLPYGVYYYLALRGVTVGGKAVR-LPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQP 363

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADV-SLPATNY 429
              A+  A        K A D      CF L  G   + +P +  HF G  V  LP  NY
Sbjct: 364 VADAVVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENY 423

Query: 430 LIPVDSSG---TFCFAFAGTMSGLS-----------IIGNIQQQGFRVVYDLAASRIGFA 475
            + V   G     C A      G S           I+G+ QQQ + V YDL   R+GF 
Sbjct: 424 FV-VAGRGAVEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFR 482

Query: 476 PRGC 479
            + C
Sbjct: 483 RQSC 486


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 116/404 (28%), Positives = 181/404 (44%), Gaps = 35/404 (8%)

Query: 96  VKSLTAFAESAVRVPPRNRSRGRANGGFSSSV--ISGLAQGSGEYFTRLGVGTPPRYVYM 153
           V  LT    +A R+P  +  RG  +G   ++   +      +G Y TRL +GTP +   +
Sbjct: 47  VLPLTLAYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFAL 106

Query: 154 VLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCL 213
           ++D+GS V ++ CA C++C +  DP F P  S +++ V C       +D +  N R+ C 
Sbjct: 107 IVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKC------NVDCTCDNERSQCT 160

Query: 214 YQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDNEG-LFVAAA-GLLGLGRGR 268
           Y+  Y + S + G    + ++F      +  R   GC +   G LF   A G++GLGRG+
Sbjct: 161 YERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQ 220

Query: 269 LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVE 328
           LS   Q   +     S+ L          +MV G         F+   +NP    +Y +E
Sbjct: 221 LSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAPPDMVFS--HSNPVRSPYYNIE 278

Query: 329 LVGISVGGAHVRGITASLFKLDPA---GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           L  I V G  +R        LDP       G ++DSGT+   L   A++A +DA     +
Sbjct: 279 LKEIHVAGKALR--------LDPKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVN 330

Query: 386 SLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADVSLPATNYLIPVDS-SG 437
           SLK  R PD +  D CF  +G+   ++    P V + F  G  +SL   NYL       G
Sbjct: 331 SLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEG 390

Query: 438 TFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            +C   F       +++G I  +   V YD    +IGF    C+
Sbjct: 391 AYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 434


>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
          Length = 216

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 113/216 (52%), Gaps = 6/216 (2%)

Query: 269 LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVE 328
           +S  +QTG R+N  FSYCL    +     S+  G +   R  R TPLL NP   + YYV 
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRHTPLLTNPHRPSLYYVN 60

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLK 388
           + G+SVG   V+ + A  F  DPA   G +IDSGT +TR T P Y ALR+ FR   ++  
Sbjct: 61  VTGLSVGRTWVK-VPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS 119

Query: 389 RAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAFA--- 444
                  FDTCF+         P V LH  G  D++LP  N LI   ++   C A A   
Sbjct: 120 GYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAP 179

Query: 445 -GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
               + ++++ N+QQQ  RVV D+A SR+GFA   C
Sbjct: 180 QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 123/392 (31%), Positives = 174/392 (44%), Gaps = 71/392 (18%)

Query: 154 VLDTGSDVVWIQCAPCK----------KCYSQTDPVFDPAKSRSFATVPCRS---PLCRK 200
           V+DTGSD+VW QC+ C+           C+ Q  P ++ + SR+   VPC      LC  
Sbjct: 77  VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136

Query: 201 L-DSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE-- 252
             +++GC R      + C+   SYG G + +G   T+  TF  +    +A GC       
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVTLAFGCVSQTRIS 195

Query: 253 -GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR-STSAKPSSMVFGDSAVSRTA 310
            G    A+G++GLGRG LS  +Q       +FSYCL      +  PS +  GD  ++   
Sbjct: 196 PGALNGASGIIGLGRGALSLVSQLNAT---EFSYCLTPYFRDTVSPSHLFVGDGELAGLR 252

Query: 311 RF-------------TPLLANPK---LDTFYYVELVGISVGGAHVRGITASLFKLDPAG- 353
                           P   NPK     TFYY+ LVG++ G A V  + A  F L  A  
Sbjct: 253 AAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATV-ALPAGAFDLREAAP 311

Query: 354 ---NGGVIIDSGTSVTRLTRPAYIALRDAFRA---GASSLKRAPDF--SLFDTCFDLSGK 405
               GG +IDSG+  TRL  PA+ AL         G+ SL   P       + C +    
Sbjct: 312 KVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDD 371

Query: 406 TE----VKVPTVVLHFR-----GADVSLPATNYLIPVDSSGTFCFAFAGTMSG------- 449
            +      VP +VL F      G ++ +PA  Y   V++S T+C A   + SG       
Sbjct: 372 GDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEAS-TWCMAVVSSASGNATLPTN 430

Query: 450 -LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             +IIGN  QQ  RV+YDLA   + F P  C+
Sbjct: 431 ETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 462


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 169/370 (45%), Gaps = 43/370 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VG+PP+ V MVLDTGS++ W+ C   +   S    VF+P  S++++ VPC SP C+  
Sbjct: 73  LTVGSPPQNVTMVLDTGSELSWLHCKKTQFLNS----VFNPLSSKTYSKVPCLSPTCKTR 128

Query: 202 DSS-----GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
                    C+    C   VSY D +   G+ + ET             GC       N 
Sbjct: 129 TRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNS 188

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS--RTA 310
                  GL+G+ RG LSF  Q G     KFSYC+    ++     ++ G+++    +  
Sbjct: 189 EEDSKTTGLIGMNRGSLSFVNQMGY---PKFSYCISGFDSAG---VLLLGNASFPWLKPL 242

Query: 311 RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D   Y V+L GI V    V  +  S+F  D  G G  ++DSGT  
Sbjct: 243 SYTPLVQISTPLPYFDRVAYTVQLEGIKVKNK-VLSLPKSVFVPDHTGAGQTMVDSGTQF 301

Query: 366 TRLTRPAYIALRDAFRAGASSLKRAPDFSLF------DTCF--DLSGKTEVKVPTVVLHF 417
           T L  P Y AL++ F +    + +  +   F      D C+  D S      +P V L F
Sbjct: 302 TFLLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMF 361

Query: 418 RGADVSLPATN--YLIPVDSSG---TFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAA 469
           +GA++S+      Y +P +  G    +CF F  + + G+   +IG+  QQ   + +DL  
Sbjct: 362 QGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIGHHHQQNVWMEFDLEK 421

Query: 470 SRIGFAPRGC 479
           SRIG A   C
Sbjct: 422 SRIGLADVRC 431


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 178/385 (46%), Gaps = 50/385 (12%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           SGLA  +G YFTR+G+GTP +  Y+ +DTGSD++W+ C  C  C  +++      ++DP 
Sbjct: 81  SGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPR 140

Query: 184 KSRSFATVPCRSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR 239
            S+S   V C    C   +  G    C   + C Y +SYGDGS T G F T+ L +    
Sbjct: 141 GSQSGELVTCDQQFCVA-NYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVS 199

Query: 240 --------VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                    A V+ GCG    G      +A  G+LG G+   S  +Q     +  + F++
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    T         G+  V    + TPL+  P +   Y V L GI VGG  + G+  +
Sbjct: 260 CL---DTVNGGGIFAIGN-VVQPKVKTTPLV--PDM-PHYNVILKGIDVGGTAL-GLPTN 311

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLS 403
           +F  D   + G IIDSGT++  +    Y AL      +    S++   DFS    CF  S
Sbjct: 312 IF--DSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS----CFQYS 365

Query: 404 GKTEVKVPTVVLHFRGADVSLPAT--NYLIPVDSSGTFCFAFAG----TMSG--LSIIGN 455
           G  +   P V  HF G DVSL  +  +YL   +    +C  F      T  G  + ++G+
Sbjct: 366 GSVDDGFPEVTFHFEG-DVSLIVSPHDYLFQ-NGKNLYCMGFQNGGVQTKDGKDMVLLGD 423

Query: 456 IQQQGFRVVYDLAASRIGFAPRGCA 480
           +      V+YDL    IG+A   C+
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCS 448


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 114/371 (30%), Positives = 170/371 (45%), Gaps = 36/371 (9%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           SS   G+ +   E    LG+GTP   V +V DT SD++W QC PC  C +Q   ++DP K
Sbjct: 75  SSTPGGVQEKHVEPHVFLGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNK 134

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           + ++A +   S                  Y  +Y   S T G F+TET       VA + 
Sbjct: 135 TETYANLTSSS------------------YNYTYSKQSFTSGYFATETFALGNVTVANIT 176

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GCG  N+G +   AG+ G+GRG     +   +    +FSYC          +  + G  
Sbjct: 177 FGCGTRNQGYYDNVAGVFGVGRGGRGGVSLLNQLGIDRFSYCFSSSGAPGSSAVFLGGSP 236

Query: 305 AV-----SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVII 359
            +     +  A  TP++A+P L + Y+V+LVG++VG   V    AS  +    G   ++I
Sbjct: 237 ELATNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATLVDVAGASSAE---GGGRALVI 293

Query: 360 DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL----FDTCFDLSGKTEVKVP---T 412
           DS + VT L    Y  +R A  A  + LK A   +      D CF+L+       P   T
Sbjct: 294 DSTSPVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVT 353

Query: 413 VVLHFRG--ADVSLPATNYLIPVDSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAA 469
           + LHF G  AD+ LP  +YL    + G  C     + S G+ ++G+       V+YDLA 
Sbjct: 354 MTLHFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAK 413

Query: 470 SRIGFAPRGCA 480
           + + F P  CA
Sbjct: 414 NVVSFQPLDCA 424


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 128/394 (32%), Positives = 172/394 (43%), Gaps = 53/394 (13%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTD----PVFDP 182
            A   G Y   L  GTP + +  V+DTGS +VW  C     C +C +   D    P F P
Sbjct: 83  FAHSYGGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIP 142

Query: 183 AKSRSFATVPCRSPLCR-KLDSS------GCNRRN-TC-----LYQVSYGDGSITVGDFS 229
             S S   V C +P C   +DS       GC++ +  C      Y + YG G+       
Sbjct: 143 KLSSSAKIVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLL-L 201

Query: 230 TETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV- 288
            E+L F         +GC   +       +G+ G GRG  S P Q G    +KFSYCL+ 
Sbjct: 202 LESLVFAERTEPDFVVGCSILSSR---QPSGIAGFGRGPSSLPKQMGL---KKFSYCLLS 255

Query: 289 ----DRSTSAKPSSMVFGDSAVSRTA--RFTPLLANP-----KLDTFYYVELVGISVGGA 337
               D   S+K +  V  DS   +T    +TP   NP         +YYV L  I VG  
Sbjct: 256 HRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDK 315

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD---FS 394
            V+ +  S       GNGG I+DSG++ T + +P + A+   F    ++  RA D    S
Sbjct: 316 RVK-VPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALS 374

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF------AFAGTM 447
               CF+LSG   V +P++V  F+ GA + LP  NY   V      C       A   T+
Sbjct: 375 GLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTL 434

Query: 448 -SGLSII-GNIQQQGFRVVYDLAASRIGFAPRGC 479
            SG SII GN Q Q F   YDL   R GF  + C
Sbjct: 435 SSGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 123/402 (30%), Positives = 172/402 (42%), Gaps = 75/402 (18%)

Query: 146 TPPRYVYMVLDTGSDVVWIQCAP--CKKCYSQTDPVF----DPAKSRSFATVPCRSPLC- 198
            PP++V + LDTGSD+VW  C P  C  C  + +        P  S +  +V C+S  C 
Sbjct: 91  NPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACS 150

Query: 199 -------------------RKLDSSGCNRRNTCLYQVSYGDGSITV---GDFSTETLTFR 236
                                +++S C+  +   +  +YGDGS+      D     L   
Sbjct: 151 AAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATP 210

Query: 237 GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR---RFNRKFSYCLVDRSTS 293
              +     GC H          G+ G GRG LS P Q      +   +FSYCLV  S +
Sbjct: 211 SLSLHNFTFGCAHT---ALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFN 267

Query: 294 AK----PSSMVFG--DSAVSRTAR------FTPLLANPKLDTFYYVELVGISVGGAHVRG 341
           +     PS ++ G  D    R  +      +T +L NPK   FY V L GIS+G    + 
Sbjct: 268 SDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGK---KK 324

Query: 342 ITASLF--KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAG--ASSLKRAPDFSL 395
           I A  F  ++D  G+GGV++DSGT+ T L    Y ++   F  R G      K   D + 
Sbjct: 325 IPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTG 384

Query: 396 FDTCFDLSGKTEVKVPTVVLHFRGAD--VSLPATNYLIP-VDSS---------GTFCFAF 443
              C+     T V +P++VLHF G +  V LP  NY    +D           G      
Sbjct: 385 LGPCYYY--DTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMN 442

Query: 444 AGTMSGLS-----IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            G  + L+      +GN QQ GF VVYDL   R+GFA R CA
Sbjct: 443 GGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCA 484


>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
          Length = 468

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 117/338 (34%), Positives = 153/338 (45%), Gaps = 44/338 (13%)

Query: 153 MVLDTGSDVVWIQCAPCK--KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN 210
           M +DT  D+ WIQCAPC   +CY Q + +FDP +SR+ A VPC S  C +L   G     
Sbjct: 164 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYG----R 219

Query: 211 TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAA-AGLLGLGRGRL 269
             L Q       +          T    R             G F A+ +G + LG GR 
Sbjct: 220 WLLQQPVPVLRRLRRRQGQPRGRTCHAVR-------------GNFSASTSGTMSLGGGRQ 266

Query: 270 SFPTQTGRRFNRKFSYCLVDRSTSAKPS-SMVFGDSAVSRTARFTPLLANPK-LDTFYYV 327
           S  +QT   F   FSYC+ D S+S   S           R AR TPL+ NP  + T Y V
Sbjct: 267 SLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFAR-TPLVRNPSIIPTLYLV 325

Query: 328 ELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL 387
            L GI VGG  +  +   +F       GG ++DS   +T+L   AY ALR AFR+  ++ 
Sbjct: 326 RLRGIEVGGRRLN-VPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY 378

Query: 388 KR-APDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTF---CFAF 443
            R A   +  DTC+D    T V VP V L F G  V        + +D+ G     C AF
Sbjct: 379 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV--------VRLDAMGVMVEGCLAF 430

Query: 444 AGTMS--GLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             T     L  IGN+QQQ   V+YD+    +GF    C
Sbjct: 431 VPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 107/375 (28%), Positives = 165/375 (44%), Gaps = 41/375 (10%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFA 189
           +G YF ++G+G PP+  Y+ +DTGSD++W+ CA C KC +++D      ++DP  S S  
Sbjct: 79  AGLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSAT 138

Query: 190 TVPCRSPLCRKLDS---SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GT 238
            + C    C    +    GC +   C Y V YGDGS T G F  + L F          +
Sbjct: 139 RIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSS 198

Query: 239 RVARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRST 292
               V  GCG    G       A  G+LG G+   S  +Q     +  R F++CL     
Sbjct: 199 ANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL----D 254

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
           + K   +      VS     TP++ N      Y V +  I VGG +V  +   +F  D  
Sbjct: 255 NVKGGGIFAIGEVVSPKVNTTPMVPNQP---HYNVVMKEIEVGG-NVLELPTDIF--DTG 308

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
              G IIDSGT++  L    Y ++     +    LK       F TCF  +G      P 
Sbjct: 309 DRRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF-TCFQYTGNVNEGFPV 367

Query: 413 VVLHFRGA-DVSLPATNYLIPVDSSGTFCFAF--AGTMS----GLSIIGNIQQQGFRVVY 465
           V  HF G+  +++   +YL  +     +CF +  +G  S     ++++G++      V+Y
Sbjct: 368 VKFHFNGSLSLTVNPHDYLFQIHEE-VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLY 426

Query: 466 DLAASRIGFAPRGCA 480
           DL    IG+    C+
Sbjct: 427 DLENQAIGWTDYNCS 441


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 118/325 (36%), Positives = 152/325 (46%), Gaps = 32/325 (9%)

Query: 178 PVFDPAKSRSFATVPCRSPLCRKLDSSGCNR-------RNTCLYQVSYGDG----SITVG 226
           P+  P  S S A V C    C +L    C+           C Y  +YG+       T G
Sbjct: 13  PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72

Query: 227 DFSTETLTFRGTRVA--RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
              TET TF     A   +A GC   +EG F   +GL+GLGRG+LS  TQ        F 
Sbjct: 73  ILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFG 129

Query: 285 YCLVDRSTSAKPSSMVFG---DSAVSRTARF--TPLLANPKLDT--FYYVELVGISVGGA 337
           Y L   S  + PS + FG   D        F  TPLL NP +    FYYV L GISVGG 
Sbjct: 130 YRL--SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGK 187

Query: 338 HVRGITASLFKLD-PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF 396
            V+ I +  F  D   G GGVI DSGT++T L  PAY  +RD   +     K  P  +  
Sbjct: 188 LVQ-IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDD 246

Query: 397 DTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPV---DSSGTFCFAFAGTMSGLSI 452
           D      G +    P++VLHF  GAD+ L   NYL  +   +     C++   +   L+I
Sbjct: 247 DLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTI 306

Query: 453 IGNIQQQGFRVVYDLAA-SRIGFAP 476
           IGNI Q  F VV+DL+  +R+ F P
Sbjct: 307 IGNIMQMDFHVVFDLSGNARMLFQP 331


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 169/356 (47%), Gaps = 72/356 (20%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G +   +  GTPP+   ++LDTGS + W QC  C  C   +   F+ + S ++++  C  
Sbjct: 126 GNFLVDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSSGSCI- 184

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEGL 254
                    G    N   Y ++YGD S +VG++  +T+T   + V  +   GCG +N+G 
Sbjct: 185 --------PGTVENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGD 233

Query: 255 FVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA--R 311
           F +   G+LGLG+G+LS  +QT  +FN+ FSYCL +  +     S++FG+ A S+++  +
Sbjct: 234 FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIG---SLLFGEKATSQSSSLK 290

Query: 312 FTPLLANP---KLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
           FT L+  P   +   +Y+V L  ISVG   +  I +S+F      + G IIDS T +TRL
Sbjct: 291 FTSLVNGPGTLQESGYYFVNLSDISVGNERLN-IPSSVF-----ASPGTIIDSRTVITRL 344

Query: 369 TRPAYIALRDAFRAGAS----SLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSL 424
            + AY AL+ AF+   +    S  R     + DTC++       +               
Sbjct: 345 PQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNXXXXXXPE--------------- 389

Query: 425 PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                                    L+IIGN QQ    V+YD+   RIGF   GC+
Sbjct: 390 -------------------------LTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 420


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 118/403 (29%), Positives = 168/403 (41%), Gaps = 73/403 (18%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP----VFDPAKSRSFATVPCRSPL 197
           LG     + + + +DTGSD+VW  CAP K    +  P       P        V C+SP 
Sbjct: 76  LGPQAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVAVSCKSPA 135

Query: 198 C--------------------RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG 237
           C                      +++S C       +  +YGDGS+ +     +TL+   
Sbjct: 136 CSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL-IARLYRDTLSLSS 194

Query: 238 TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGR---RFNRKFSYCLVDRSTSA 294
             +     GC H          G+ G GRG LS P Q      +   +FSYCLV  S  +
Sbjct: 195 LFLRNFTFGCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDS 251

Query: 295 ----KPSSMVFGDSAVSRTAR---------FTPLLANPKLDTFYYVELVGISVGGAHVRG 341
               KPS ++ G        +         +T +L NPK   FY V L+GI+VG    R 
Sbjct: 252 ERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAVGK---RT 308

Query: 342 ITAS--LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASS--LKRAPDFSL 395
           I A   L +++  G+GGV++DSGT+ T L    Y ++ D F  R G  +   ++  + + 
Sbjct: 309 IPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIEEKTG 368

Query: 396 FDTCFDLSGKTEVKVPTVVLHFRG---ADVSLPATNYLIPVDSS----------GTFCFA 442
              C+ L+   +V  P + L F G   + V LP  NY                 G     
Sbjct: 369 LAPCYYLNSVADV--PALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGCLMLM 426

Query: 443 FAGTMSGLS-----IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             G  + LS      +GN QQQGF V YDL   R+GFA R CA
Sbjct: 427 NGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 173/381 (45%), Gaps = 52/381 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCR 194
           G Y+  L +G+PP+  ++ +DTGSD+ W QC APC+ C      +++P K++    V C 
Sbjct: 38  GLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKAK---VVDCH 94

Query: 195 SPLCRKLDSSG---CNRR-NTCLYQVSYGDGSITVGDFSTETLTFR---GTRV-ARVALG 246
            P+C ++   G   CN     C Y+V Y DGS T+G    +TLT R   GT +  +  +G
Sbjct: 95  LPVCAQIQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKAIIG 154

Query: 247 CGHDNEGLF----VAAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMV 300
           CG+D +G       +  G++GL   +++ P Q   +        +CL D S       + 
Sbjct: 155 CGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLADGSNGG--GYLF 212

Query: 301 FGDSAV-SRTARFTPLLANPKLDTFYYVELVGISVGG-AHVRGITASLFKLDPAGNGGVI 358
           FGD  V S    +TP++  P++   Y   L  I  GG + V      L +        V+
Sbjct: 213 FGDELVPSWGMTWTPMMGKPEM-LGYQARLQSIRYGGDSLVLNNDEDLTR----STSSVM 267

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---------FDTCFDLSGKTEVK 409
            DSGTS T L   AY ++  A    +  L+   D +L         F +  D+       
Sbjct: 268 FDSGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTLPYCWRGPSPFQSITDV----HQY 323

Query: 410 VPTVVLHFRGAD-------VSLPATNYLIPVDSSGTFCF----AFAGTMSGLSIIGNIQQ 458
             T+ L F G +       + L    YLI V + G  C     A   ++   +IIG++  
Sbjct: 324 FKTLTLDFGGRNWFATDSTLDLSPQGYLI-VSTQGNVCLGILDASGASLEVTNIIGDVSM 382

Query: 459 QGFRVVYDLAASRIGFAPRGC 479
           +G+ VVYD    RIG+  R C
Sbjct: 383 RGYLVVYDNVRDRIGWIRRNC 403


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 148/357 (41%), Gaps = 69/357 (19%)

Query: 125 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAK 184
           + + S +  G G Y   + +GTPP  +  + DTGSD++W QC PC  CY Q +P+FDP K
Sbjct: 16  NDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKK 75

Query: 185 SRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
           S+++ T+   S                          + T+G    +  +F G     +A
Sbjct: 76  SKTYKTLGYLS------------------------SETFTIGSTEGDPASFPG-----LA 106

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPT-QTGRRFNRKFSYCLVDRSTSAKPSSMV-FG 302
            GCGH N G F      L    G       Q   +   +FSYCLV  S+ +  SS + FG
Sbjct: 107 FGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFG 166

Query: 303 DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
            SAV                           V G+      A       A    +IIDSG
Sbjct: 167 KSAV---------------------------VSGSGTSSPAA-------AEESNIIIDSG 192

Query: 363 TSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV 422
           T++T L R  Y  +  A                F  C+  SG  ++++PT+  HF GADV
Sbjct: 193 TTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHFIGADV 250

Query: 423 SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            LP  N  +        CF+   + S L+I GN+ Q  F V YDL  +++ F P  C
Sbjct: 251 QLPPLNTFVQAQED-LVCFSMIPS-SNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 305


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 116/366 (31%), Positives = 167/366 (45%), Gaps = 43/366 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV-FDPAKSRSFATVPCRSPLCR- 199
           L +GTPP+   MVLDTGS + WIQC    K   +T P  FDP  S SF+ +PC   LC+ 
Sbjct: 82  LPIGTPPQTQQMVLDTGSQLSWIQC----KVPPKTPPTAFDPLLSSSFSVLPCNHSLCKP 137

Query: 200 -----KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA-RVALGCGHDNEG 253
                 L +S C++   C Y   Y DG+   G+   E  TF  ++    + LGC  D+  
Sbjct: 138 RVPDYTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLILGCATDSSD 196

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR---STSAKPSSMVFGDSAVSRTA 310
                 G+LG+  GRLSF +        KFSYC+  R   S S+   S   G +  S   
Sbjct: 197 ----TQGILGMNLGRLSFSSLAKI---SKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAGF 249

Query: 311 RFTPLLA------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
           ++  L+        P LD   Y + ++GI + G  +  I+ S F+ DP+G G  +IDSGT
Sbjct: 250 KYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLN-ISTSAFRADPSGAGQTLIDSGT 308

Query: 364 SVTRLTRPAYIALRDAF-RAGASSLKRAPDF-SLFDTCFDLSGKTEV---KVPTVVLHFR 418
             T L   AY  +++   +     LK+   +    D CFD  G   V    +  +   F 
Sbjct: 309 WFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGSLDMCFD--GDAMVIGRMIGNMAFEFE 366

Query: 419 -GADVSLPATNYLIPVDSSGTFCFAFA-GTMSGLS--IIGNIQQQGFRVVYDLAASRIGF 474
            G ++ +     L  V   G  C       + G++  IIGN  QQ   V +DL   R+GF
Sbjct: 367 NGVEIVVEREKMLADV-GGGVQCLGIGRSDLLGVASNIIGNFHQQDLWVEFDLVGRRVGF 425

Query: 475 APRGCA 480
               C+
Sbjct: 426 GRTDCS 431


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 164/367 (44%), Gaps = 42/367 (11%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ CK+C    DP F P  S S+  + C 
Sbjct: 73  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC- 131

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHDN 251
           +P C   D  G      C+Y+  Y + S + G  S + ++F         R   GC ++ 
Sbjct: 132 NPDC-NCDDEG----KLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEE 186

Query: 252 EG-LFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G LF   A G++GLGRG+LS   Q            LVD+       S+ +G   V   
Sbjct: 187 TGDLFSQRADGIMGLGRGKLSVVDQ------------LVDKGVIEDVFSLCYGGMEVGGG 234

Query: 310 ARFTPLLANPKLDTFYYVE-----LVGISVGGAHVRGITASLFKLDPA---GNGGVIIDS 361
           A     ++ P    F + +        I +   HV G +    KL+P    G  G ++DS
Sbjct: 235 AMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKS---LKLNPKVFNGKHGTVLDS 291

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV----PTVVL 415
           GT+     + A+IA++DA      SLKR   PD +  D CF  +G+   ++    P + +
Sbjct: 292 GTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAM 351

Query: 416 HF-RGADVSLPATNYLI-PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
            F  G  + L   NYL       G +C          +++G I  +   V YD    ++G
Sbjct: 352 EFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLG 411

Query: 474 FAPRGCA 480
           F    C+
Sbjct: 412 FLKTNCS 418


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 138/428 (32%), Positives = 182/428 (42%), Gaps = 87/428 (20%)

Query: 106 AVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQ 165
           A   PP NR R R N   +  V                VGTPP+ V MVLDTGS++ W+ 
Sbjct: 46  AASPPPANRLRFRHNVSLTVPV---------------AVGTPPQNVTMVLDTGSELSWLL 90

Query: 166 CAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC----RKLDSSGCNRRNTCLYQVSYGDG 221
           C       S+ D  FD + S S+A VPC SP C    R L        + C   +SY D 
Sbjct: 91  CN-----GSRHDAPFDASASSSYAPVPCSSPACTWLGRDLPVRPFCDSSACRVSLSYADA 145

Query: 222 SITVGDFSTETLTFRGTRVARVALGC----GHDNEGLFVAAAGLLGLGRGRLSFPTQTGR 277
           S   G  + +T    G+       GC        +       GLLG+ RG LSF TQT  
Sbjct: 146 SSADGLLAADTFLL-GSSPMPALFGCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTA- 203

Query: 278 RFNRKFSYCLVDRSTSAKPSSMVFG--------DSAVSRTARFTPLLAN----PKLD-TF 324
              R+F+YC+   +    P  ++ G         S   +   +TPL+      P  D   
Sbjct: 204 --TRRFAYCI---AAGQGPGILLLGGNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAA 258

Query: 325 YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGA 384
           Y V+L GI VG A +  I   L   D  G G  ++DSGT  T L   AY AL+  F   A
Sbjct: 259 YTVQLEGIRVGSA-LLAIPKHLLTPDHTGAGQTMVDSGTRFTFLLPDAYAALKAEF---A 314

Query: 385 SSLKRAPDFSL-------------FDTCFDLSGKTEVKV---------PTVVLHFRGADV 422
           + L R+ D  L             FD CF     TE +V         P V L  RGA+V
Sbjct: 315 NQLTRSLDGGLAPLGEPGFVFQGAFDACFR---GTEARVSAAAAGGLLPEVGLVLRGAEV 371

Query: 423 SLPATN---YLIP----VDSSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAASRI 472
            +       Y +P     +  G +C  F  + M+G+S  +IG+  QQ   V YDL  +R+
Sbjct: 372 VVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMAGVSAYVIGHHHQQDVWVEYDLRNARL 431

Query: 473 GFAPRGCA 480
           GFA   CA
Sbjct: 432 GFAAARCA 439


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 130/407 (31%), Positives = 174/407 (42%), Gaps = 82/407 (20%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQC----APCKKCYSQTDPVFDPAKSRSFATVPCRS-- 195
           + VG PP+ V MVLDTGS++ W+ C     P      Q    F+ + S ++A   C S  
Sbjct: 63  VAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSSSP 122

Query: 196 ------------PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARV 243
                       P C    S      N+C   +SY D S   G  + +T    G    R 
Sbjct: 123 ECQWRGRDLPVPPFCAGPPS------NSCRVSLSYADASSADGVLAADTFLLGGAPPVRA 176

Query: 244 ALGC-------------GHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
             GC             G+ N+        AA GLLG+ RG LSF TQTG     +F+YC
Sbjct: 177 LFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTG---TLRFAYC 233

Query: 287 LVDRSTSAKPSSMVF---GDSAVSRTA---RFTPLLAN----PKLDTFYY-VELVGISVG 335
           +   +    P  +V    GD A    A    +TPL+      P  D   Y V+L GI VG
Sbjct: 234 I---APGDGPGLLVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRVAYSVQLEGIRVG 290

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR---APD 392
            A +  I  S+   D  G G  ++DSGT  T L   AY  L+  F    S+L      PD
Sbjct: 291 AALLP-IPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGEPD 349

Query: 393 F---SLFDTCFDLS------GKTEVKVPTVVLHFRGADVSLPATN--YLIPVD------S 435
           F     FD CF  S            +P V L  RGA+V++      Y++P +      S
Sbjct: 350 FVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRGEGGS 409

Query: 436 SGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +C  F  + M+G+S  +IG+  QQ   V YDL  SR+GFAP  C
Sbjct: 410 EAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNSRVGFAPARC 456


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 164/367 (44%), Gaps = 42/367 (11%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ CK+C    DP F P  S S+  + C 
Sbjct: 73  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC- 131

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHDN 251
           +P C   D  G      C+Y+  Y + S + G  S + ++F         R   GC ++ 
Sbjct: 132 NPDC-NCDDEG----KLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEE 186

Query: 252 EG-LFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G LF   A G++GLGRG+LS   Q            LVD+       S+ +G   V   
Sbjct: 187 TGDLFSQRADGIMGLGRGKLSVVDQ------------LVDKGVIEDVFSLCYGGMEVGGG 234

Query: 310 ARFTPLLANPKLDTFYYVE-----LVGISVGGAHVRGITASLFKLDPA---GNGGVIIDS 361
           A     ++ P    F + +        I +   HV G +    KL+P    G  G ++DS
Sbjct: 235 AMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKS---LKLNPKVFNGKHGTVLDS 291

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV----PTVVL 415
           GT+     + A+IA++DA      SLKR   PD +  D CF  +G+   ++    P + +
Sbjct: 292 GTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAM 351

Query: 416 HF-RGADVSLPATNYLI-PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
            F  G  + L   NYL       G +C          +++G I  +   V YD    ++G
Sbjct: 352 EFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLG 411

Query: 474 FAPRGCA 480
           F    C+
Sbjct: 412 FLKTNCS 418


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 107/362 (29%), Positives = 168/362 (46%), Gaps = 31/362 (8%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++D+GS V ++ CA C++C +  DP F P  S S++ V C 
Sbjct: 86  NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC- 144

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDN 251
                 +D +  + +  C Y+  Y + S + G    + ++F      +  R   GC +  
Sbjct: 145 -----NVDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSE 199

Query: 252 EG-LFVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G LF   A G++GLGRG+LS   Q   +     S+ L          +MV G       
Sbjct: 200 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAPSD 259

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
             F+   ++P    +Y +EL  I V G  +R + + +F        G ++DSGT+   L 
Sbjct: 260 MVFS--HSDPLRSPYYNIELKEIHVAGKALR-VDSRVFN----SKHGTVLDSGTTYAYLP 312

Query: 370 RPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADV 422
             A++A +DA  +   SLK  R PD +  D CF  +G+   K+    P V + F  G  +
Sbjct: 313 EQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKL 372

Query: 423 SLPATNYLI---PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           SL   NYL     VD  G +C   F       +++G I  +   V YD    +IGF    
Sbjct: 373 SLTPENYLFRHSKVD--GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTN 430

Query: 479 CA 480
           C+
Sbjct: 431 CS 432


>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 524

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 129/370 (34%), Positives = 172/370 (46%), Gaps = 55/370 (14%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQT--DPVFDPAKSRSFATVPCRSPLCRKLDS--SGCNR 208
           M +DT  D+ WIQC PC         + +FDP KS S A VPC S  CR L +  +GC+ 
Sbjct: 167 MAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRALGNYGNGCSN 226

Query: 209 R----------------NTCLYQVSYGDGSITVGDFSTETLTFR-GTRVARVALGCGHDN 251
                              C Y+V+Y DG ++ G + T+ LT   GT       GC H  
Sbjct: 227 NSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFLNFRFGCSHGV 286

Query: 252 EGLFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPS---SMVFGDSAVS 307
            G F    +G + LG GR S  +QT R +   FSYC+   S S   S   ++  GDS   
Sbjct: 287 RGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYCVPKPSASGFLSLGGAINDGDSDSD 346

Query: 308 RTARF--TPLLANPKL--DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
             + F  TPL+ N ++   T+Y V L GI V G  +  +   +F      +GG ++DS  
Sbjct: 347 SPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAGRRLN-VPPVVF------SGGTLMDSSA 399

Query: 364 SVTRLTRPAYIALRDAF-----------RAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
            VT+L   AY ALR AF           R G++S   A    + DTC+D  G   V VPT
Sbjct: 400 VVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTSSTPAGGEMILDTCYDFEGLDNVTVPT 459

Query: 413 VVL-HFRGADVSLPATNYLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLAA 469
           V L  F GA V L  T  ++        C AF  T +   L  IGN+QQQ   V+YD+ A
Sbjct: 460 VSLVFFGGAVVDLDPTTAVMMEG-----CLAFVPTPADFDLGFIGNVQQQTHEVLYDVGA 514

Query: 470 SRIGFAPRGC 479
             +GF    C
Sbjct: 515 RNVGFRRGAC 524


>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 441

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 168/366 (45%), Gaps = 38/366 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQC----APCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
           L +GTPP+   MVLDTGS V WI C     P KK    T        S  FA +PC  PL
Sbjct: 73  LPIGTPPQLQQMVLDTGSQVSWIHCDNKKGPQKKQPPTTSSFDPSLSSSFFA-LPCNHPL 131

Query: 198 CRKLD-----SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDN 251
           C+         + C+    C Y  SY DG++  G+   E +    +     + LGC + +
Sbjct: 132 CKPQVPDISLPTDCDANRLCHYSFSYTDGTVVEGNLVRENIALSPSLTTPPIILGCANQS 191

Query: 252 EGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR 311
           +     A G+LG+  GRLSFP Q       KFSY +  + T     S+  G++  S   R
Sbjct: 192 DD----ARGILGMNLGRLSFPNQAKI---TKFSYFVPVKQTQPGSGSLYLGNNPNSSCFR 244

Query: 312 FTPLLA--------NPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSG 362
           +  LL          P LD   + + + GIS+GG  +  I  S+FK D  G G  IIDSG
Sbjct: 245 YVKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGGKKLN-IPPSVFKPDTTGFGQTIIDSG 303

Query: 363 TSVTRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLSGKTEVK--VPTVVLHF- 417
           +  + +   AY  +R+    + G+   K      + D CFD    TE+   V  +V  F 
Sbjct: 304 SEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVADICFD-GDATEIGRLVGDMVFEFE 362

Query: 418 RGADVSLPATNYLIPVDSSGTFCFAFA---GTMSGLSIIGNIQQQGFRVVYDLAASRIGF 474
           +G ++ +P    LI VD  G  CF      G   G +IIGN  QQ   V +DLA  R+GF
Sbjct: 363 KGVEIVIPKERVLIEVD-GGVHCFGIGRAEGLGGGGNIIGNFYQQNLWVEFDLAKHRVGF 421

Query: 475 APRGCA 480
               C+
Sbjct: 422 RGANCS 427


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 138/424 (32%), Positives = 187/424 (44%), Gaps = 63/424 (14%)

Query: 111 PRNRSRGRANGGFSSSVISGLAQGS-GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA-- 167
           P + S+  + G  S    + L   S G Y     +GTPP+ + ++LDTGS + W+ C   
Sbjct: 39  PNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSS 98

Query: 168 -PCKKCYSQTD---PVFDPAKSRSFATVPCRSPLCRKLDSSG-----CNR---------- 208
             C+ C S +    PVF P  S S   V CR+P C+ + S+      C R          
Sbjct: 99  YECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANC 158

Query: 209 ----RNTC-LYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLG 263
                N C  Y V YG GS T G    +TL   G  V    LGC   +  +    +GL G
Sbjct: 159 PAAASNVCPPYAVVYGSGS-TAGLLIADTLRAPGRAVPGFVLGCSLVS--VHQPPSGLAG 215

Query: 264 LGRGRLSFPTQTGRRFNRKFSYCLVDR---STSAKPSSMVFGDSAVSRTARFTPLLANPK 320
            GRG  S P Q G     KFSYCL+ R     +A   S+V G +      ++ PL+ +  
Sbjct: 216 FGRGAPSVPAQLGL---PKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAA 272

Query: 321 LD-----TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT----RP 371
            D      +YY+ L G++VGG  VR + A  F  + AG+GG I+DSGT+ T L     +P
Sbjct: 273 GDKLPYGVYYYLALRGVTVGGKAVR-LPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQP 331

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADV-SLPATNY 429
              A+  A        K A D      CF L  G   + +P +  HF G  V  LP  NY
Sbjct: 332 VADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENY 391

Query: 430 LIPVDSSG---TFCFAFAGTMSGLS-----------IIGNIQQQGFRVVYDLAASRIGFA 475
            + V   G     C A     SG S           I+G+ QQQ + V YDL   R+GF 
Sbjct: 392 FV-VAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFR 450

Query: 476 PRGC 479
            + C
Sbjct: 451 RQSC 454


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 98/264 (37%), Positives = 139/264 (52%), Gaps = 13/264 (4%)

Query: 219 GDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRR 278
           G  + T G  +T+T TF  T V  V  GC   + G F  A+G++G+GRG LS  +Q   +
Sbjct: 124 GSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL--Q 181

Query: 279 FNRKFSYCLV--DRSTSAKPSSMV-FGDSAVSRT--ARFTPLLANPKLDTFYYVELVGIS 333
           F  KFSY L+  + +      S++ FGD AV +T   R TPLL++     FYYV L G+ 
Sbjct: 182 FG-KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVR 240

Query: 334 VGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASSLKRAP 391
           V G  +  I A  F L   G GGVI+ S T VT L + AY  +R A   R G  ++  + 
Sbjct: 241 VDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSA 300

Query: 392 DFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGL 450
              L D C++ S   +VKVP + L F  GAD+ L A NY    + +G  C     +  G 
Sbjct: 301 ALEL-DLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG- 358

Query: 451 SIIGNIQQQGFRVVYDLAASRIGF 474
           S++G + Q G  ++YD+ A R+ F
Sbjct: 359 SVLGTLLQTGTNMIYDVDAGRLTF 382


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 114/390 (29%), Positives = 185/390 (47%), Gaps = 32/390 (8%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
            N  RGR   G S   + G     G Y+T +G+G P + + +++DTGSD++W++C+PC+ 
Sbjct: 58  HNDRRGRFLQGISFP-LKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRS 116

Query: 172 CYSQTD-----PVFDPAKSRSFATVPCRSPLCRKLDSSGCNR---RNTCLYQVSYGDGSI 223
           C S+ D      +++ + S + +   C  PLC   + + C+R    + C Y +SY D S 
Sbjct: 117 CLSKQDIIPPLSIYNLSASSTSSVSSCSDPLCTG-EQAVCSRSGSNSACAYGISYQDKST 175

Query: 224 TVGDFSTETLTFR----GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ--TGR 277
           ++G +  + + +         + +  GC  +  G +  A G++G G+   + P Q  T R
Sbjct: 176 SIGAYVKDDMHYVLQGGNATTSHIFFGCAINITGSW-PADGIMGFGQISKTVPNQIATQR 234

Query: 278 RFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGA 337
             +R FS+CL           + FG+   +    FTPLL    + T Y V+L+ ISV  +
Sbjct: 235 NMSRVFSHCLGGEKHGG--GILEFGEEPNTTEMVFTPLL---NVTTHYNVDLLSISV-NS 288

Query: 338 HVRGITASLFKL--DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
            V  I +  F    +     GVIIDSGTS   L   A   L    +   ++ K  P    
Sbjct: 289 KVLPIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRILFSEIK-NLTTAKLGPKLEG 347

Query: 396 FDTCFDLSGKT-EVKVPTVVLHFR-GADVSLPATNYLIPVD---SSGTFCFAFAGTMSGL 450
               +  SG T E   P V L F  G+ + L   NYL+ V+       +C+A++ +  GL
Sbjct: 348 LQCFYLKSGLTVETSFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWS-SADGL 406

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +I G I  +   V YD+   RIG+  + C+
Sbjct: 407 TIFGEIVLKDKLVFYDVENRRIGWKGQNCS 436


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 126/384 (32%), Positives = 173/384 (45%), Gaps = 54/384 (14%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYS---QTDPVFDPAKSRSFATVPCRSPLC 198
           + VG PP+ V MVLDTGS++ W++C   +   +   Q    F+ + S ++A   C SP C
Sbjct: 66  VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPEC 125

Query: 199 ----RKLDSS---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC---- 247
               R L           N+C   +SY D S   G  + +T    G    R   GC    
Sbjct: 126 QWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRALFGCVTSY 185

Query: 248 ---GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG-- 302
                 N     AA GLLG+ RG LSF TQT      +F+YC+   +    P  +V G  
Sbjct: 186 SSATATNSSDSEAATGLLGMNRGSLSFVTQTA---TLRFAYCI---APGDGPGLLVLGGD 239

Query: 303 DSAVSRTARFTPLLAN----PKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            +A++    +TPL+      P  D   Y V+L GI VG A +  I  S+   D  G G  
Sbjct: 240 GAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLP-IPKSVLAPDHTGAGQT 298

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAP----DFSL---FDTCFDLS----GKT 406
           ++DSGT  T L   AY  L+  F    S+L  AP    DF     FD CF  S       
Sbjct: 299 MVDSGTQFTFLLADAYAPLKGEFLNQTSALL-APLGESDFVFQGAFDACFRASEARVAAA 357

Query: 407 EVKVPTVVLHFRGADVSLPATN--YLIPVDSSG------TFCFAFAGT-MSGLS--IIGN 455
              +P V L  RGA+V++      Y +P +  G       +C  F  + M+G+S  +IG+
Sbjct: 358 SQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGH 417

Query: 456 IQQQGFRVVYDLAASRIGFAPRGC 479
             QQ   V YDL   R+GFAP  C
Sbjct: 418 HHQQNVWVEYDLQNGRVGFAPARC 441


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 128/394 (32%), Positives = 171/394 (43%), Gaps = 53/394 (13%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTD----PVFDP 182
            A   G Y   L  GTP + +  V+DTGS +VW  C     C +C +   D    P F P
Sbjct: 83  FAHSYGGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIP 142

Query: 183 AKSRSFATVPCRSPLCR-KLDSS------GCNRRN-TC-----LYQVSYGDGSITVGDFS 229
             S S   V C +P C   +DS       GC++ +  C      Y + YG G+       
Sbjct: 143 KLSSSAKIVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLL-L 201

Query: 230 TETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV- 288
            E+L F         +GC   +       +G+ G GRG  S P Q G    +KFSYCL+ 
Sbjct: 202 LESLVFAERTEPDFVVGCSILSSR---QPSGIAGFGRGPSSLPKQMGL---KKFSYCLLS 255

Query: 289 ----DRSTSAKPSSMVFGDSAVSRTA--RFTPLLANP-----KLDTFYYVELVGISVGGA 337
               D   S+K +  V  DS   +T    +TP   NP         +YYV L  I VG  
Sbjct: 256 HRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDK 315

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD---FS 394
            V+    S       GNGG I+DSG++ T + +P + A+   F    ++  RA D    S
Sbjct: 316 RVK-XPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALS 374

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCF------AFAGTM 447
               CF+LSG   V +P++V  F+ GA + LP  NY   V      C       A   T+
Sbjct: 375 GLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTL 434

Query: 448 -SGLSII-GNIQQQGFRVVYDLAASRIGFAPRGC 479
            SG SII GN Q Q F   YDL   R GF  + C
Sbjct: 435 SSGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 158/354 (44%), Gaps = 34/354 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQT-DPVFDPAKSRSFATVPCRSP 196
           +     +G PP     ++DTGS ++WIQCAPCK C  Q   P+FDP+ S ++ ++ C++ 
Sbjct: 102 FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNI 161

Query: 197 LCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVAR-----VALGCGHDN 251
           +CR   S  C+  + C+Y  +Y +G  +VG  +TE L F  +   R     V  GC H N
Sbjct: 162 ICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRN 221

Query: 252 EGLFVAA--AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G +      G+ GLG G  S   Q G     KFSYC+ + +      + +     V+  
Sbjct: 222 -GNYKDRRFTGVFGLGSGITSVVNQMG----SKFSYCIGNIADPDYSYNQLVLSEGVNME 276

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
              TPL     +D  Y V L GISVG   +  I  S FK        VIIDSGT+ T L 
Sbjct: 277 GYSTPL---DVVDGHYQVILEGISVGETRLV-IDPSAFK-RTEKQRRVIIDSGTAPTWLA 331

Query: 370 RPAYIALRDAFRAGASSLKR--APDFSLFDTCFD-LSGKTEVKVPTVVLHF-RGADVSLP 425
              Y AL    R   + L R   P       C+    G+  V  P V  HF  GAD    
Sbjct: 332 ENEYRALEREVR---NLLDRFLTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEGAD---- 384

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                + VD+       +       S+IG + QQ + V YDL   ++ F    C
Sbjct: 385 -----LVVDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 176/382 (46%), Gaps = 39/382 (10%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------- 179
           +++G +     Y+ ++GVG P +++  ++DTGSD++W +C  C+ C S+ + +       
Sbjct: 77  MLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIM 136

Query: 180 ------FDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET- 232
                 +DP  S + +   C  PLC +  S   N  N+C Y +SY D S + G +  +  
Sbjct: 137 QGPITLYDPELSITASPATCSDPLCSEGGSCRGN-NNSCAYDISYEDTSSSTGIYFRDVV 195

Query: 233 -LTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFN--RKFSYCLVD 289
            L  + +    + LGC     GL+    G++G GR ++S P Q   +      F +CL  
Sbjct: 196 HLGHKASLNTTMFLGCATSISGLW-PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSG 254

Query: 290 RSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
                    +V G +       +TP+LAN   D  Y V+LV +SV    +  I AS F+ 
Sbjct: 255 EKEGG--GILVLGKNDEFPEMVYTPMLAN---DIVYNVKLVSLSVNSKALP-IEASEFEY 308

Query: 350 DP-AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCF-DLSGKTE 407
           +   GNGG IIDSGTS       A      A     +++  AP  S    CF  +S +  
Sbjct: 309 NATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAPLESSGSPCFISISDRNS 368

Query: 408 VKV--PTVVLHFR-GADVSLPATNYLIPVDS---------SGTFCFAFAGTMSGLSIIGN 455
           V+V  P V L F  GA + L A NYL  V S          G      + ++   +I+G+
Sbjct: 369 VEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVCISWSVGNSTILGD 428

Query: 456 IQQQGFRVVYDLAASRIGFAPR 477
              +   VVYD+  SRIG+  +
Sbjct: 429 AILKDKVVVYDMEKSRIGWVKQ 450


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 164/363 (45%), Gaps = 33/363 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP FDP  S ++  + C 
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 195 -SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHD 250
              +C   DS G      C+Y+  Y + S + G    + ++F         R   GC + 
Sbjct: 140 IDCIC---DSDGVQ----CVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENM 192

Query: 251 NEGLFVA--AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
             G   +  A G++GLG G LS   Q   +   N  FS C           +MV G  + 
Sbjct: 193 ETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGG--GAMVLGGISP 250

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
                FT   ++P    +Y V+L  I V G  +  +++ +F     G  G ++DSGT+  
Sbjct: 251 PSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLP-LSSGIFD----GRYGAVLDSGTTYA 303

Query: 367 RLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEV----KVPTVVLHFR-G 419
            L   A+ A +DA      SLK+   PD +  D CF  +G        K PTV + F  G
Sbjct: 304 YLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENG 363

Query: 420 ADVSLPATNYLI-PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             +SL   NY        G +C   F       +++G I  +   V+YD A S+IGF   
Sbjct: 364 QKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKT 423

Query: 478 GCA 480
            C+
Sbjct: 424 NCS 426


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 164/363 (45%), Gaps = 33/363 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP FDP  S ++  + C 
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 195 -SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHD 250
              +C   DS G      C+Y+  Y + S + G    + ++F         R   GC + 
Sbjct: 140 IDCIC---DSDGVQ----CVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENM 192

Query: 251 NEGLFVA--AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
             G   +  A G++GLG G LS   Q   +   N  FS C           +MV G  + 
Sbjct: 193 ETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGG--GAMVLGGISP 250

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
                FT   ++P    +Y V+L  I V G  +  +++ +F     G  G ++DSGT+  
Sbjct: 251 PSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLP-LSSGIFD----GRYGAVLDSGTTYA 303

Query: 367 RLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEV----KVPTVVLHFR-G 419
            L   A+ A +DA      SLK+   PD +  D CF  +G        K PTV + F  G
Sbjct: 304 YLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENG 363

Query: 420 ADVSLPATNYLI-PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             +SL   NY        G +C   F       +++G I  +   V+YD A S+IGF   
Sbjct: 364 QKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKT 423

Query: 478 GCA 480
            C+
Sbjct: 424 NCS 426


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 122/383 (31%), Positives = 180/383 (46%), Gaps = 37/383 (9%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-PVFDPAKSRS 187
           SG   G+G+YF R  VGTP +   +V DTGSD+ W++C+            VF  A SRS
Sbjct: 103 SGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRS 162

Query: 188 FATVPCRSPLCRK---LDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTF-------- 235
           +A + C S  C        + C+   + C Y   Y DGS   G   T++ T         
Sbjct: 163 WAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESR 222

Query: 236 ----RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
               R  ++  V LGC    +G  F ++ G+L LG   +SF ++   RF  +FSYCLVD 
Sbjct: 223 DGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDH 282

Query: 291 STSAKPSS-MVFG----------DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
                 +S + FG           S+ S  A  TPLL + ++  FY V +  + V G  +
Sbjct: 283 LAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEAL 342

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC 399
             I A ++  D A  GG I+DSGTS+T L  PAY A+  A     + L R      F+ C
Sbjct: 343 -DIPADVW--DVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRV-SMDPFEYC 398

Query: 400 FDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQ 457
           ++ +    +++P + + F G A +  PA +Y++   + G  C     G   G+S+IGNI 
Sbjct: 399 YNWTAAA-LEIPGLEVRFAGSARLQPPAKSYVVDA-APGVKCIGVQEGAWPGVSVIGNIL 456

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
           QQ     +DL    + F    CA
Sbjct: 457 QQDHLWEFDLRDRWLRFKHTRCA 479


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 184/389 (47%), Gaps = 30/389 (7%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
            N  RGR   G S   + G     G Y+T +G+G P + + +++DTGSD++W++C+PC+ 
Sbjct: 58  HNDRRGRFLQGISFP-LKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRS 116

Query: 172 CYSQTD-----PVFDPAKSRSFATVPCRSPLC--RKLDSSGCNRRNTCLYQVSYGDGSIT 224
           C S+ D      +++ + S + +   C  PLC   ++  S     + C Y  SY D S +
Sbjct: 117 CLSKQDIIPPLSIYNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSAS 176

Query: 225 VGDFSTETLTFR----GTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ--TGRR 278
           VG +  + + +         +R+  GC  +  G +    G++G G    + P Q  T R 
Sbjct: 177 VGAYVRDDMHYVLHGGNATTSRIFFGCATNITGSW-PVDGIMGFGLISKTVPNQIATQRN 235

Query: 279 FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAH 338
            +R FS+CL           + FG++  +    FTPLL    + T Y V+L+ ISV  + 
Sbjct: 236 MSRVFSHCLGGEKHGG--GILEFGEAPNTTEMVFTPLL---NVTTHYNVDLLSISV-NSK 289

Query: 339 VRGITASLFKL--DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF 396
           V  I    F    +   N GVIIDSGT+   LT  A   L    ++  ++ K  P     
Sbjct: 290 VLPIDPKEFSYVRNSTNNTGVIIDSGTTFVLLTTKANRMLFQEIKS-LTTAKLGPKLEGL 348

Query: 397 DTCFDLSGKT-EVKVPTVVLHFR-GADVSLPATNYLIPVD---SSGTFCFAFAGTMSGLS 451
           +  +  SG T E   P V L F  G+ + L   NYL+  +       +C+A++ +  GL+
Sbjct: 349 ECFYLKSGLTMETSFPNVTLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWS-SADGLT 407

Query: 452 IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           I G I  +   V YD+   RIG+  + C+
Sbjct: 408 IFGEIVLKDKLVFYDVENRRIGWKGQNCS 436


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 101/352 (28%), Positives = 158/352 (44%), Gaps = 31/352 (8%)

Query: 153 MVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL-CRKLDSSGCNRRNT 211
           + LD G  + W+QC PC+ C  Q  PVFDP KS +F+ +P  + + CR       N    
Sbjct: 113 LALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLAN--GA 170

Query: 212 CLYQVSYGDGSITVGDFSTETLTFRGTR-----VARVALGCGHDNEGLFV--AAAGLLGL 264
           C + ++Y D +   G  + +T +F         ++ +  GC H  E      A AG+LGL
Sbjct: 171 CGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGILGL 230

Query: 265 GRGR-----LSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-----RFTP 314
           G G       +F  Q       +FSYC      S   S + FG    S        + TP
Sbjct: 231 GMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMY-SYLRFGSDIPSHPPPNVHRQSTP 289

Query: 315 LLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYI 374
           +LA       Y+V+L G+SVG   + G+T ++F+ +  G GG ++D GT +T     AY+
Sbjct: 290 VLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSAYV 349

Query: 375 ALRDAFRAGASSLKRAPDFSLF--DTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLI 431
            +  A R      +R     +   +TC          +P++ LHF  GA + +   +  +
Sbjct: 350 HIDHAVRQHLQ--RRGAHIVVVRGNTCVQQPAPHHDVLPSMTLHFENGAWLRVMPEHVFM 407

Query: 432 PVDSSGTF--CFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS--RIGFAPRGC 479
           P    G    CF F  + + L++IG  QQ   R ++DL  +   + F P  C
Sbjct: 408 PFVVGGHHYQCFGFVSS-TDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 166/366 (45%), Gaps = 38/366 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L +GTPP+ +   L   S   W+ C+        T  +F P  S S   +PC SP C   
Sbjct: 3   LSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCSAF 62

Query: 202 D--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA----LGCGHDNEGL- 254
              S+ C   ++C Y  SYG    + GD  ++  T    R  +VA    LGCG D+ GL 
Sbjct: 63  SAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDSGGLL 122

Query: 255 -FVAAAGLLGLGRGRLSFPTQ-TGRRFNRKFSYCLVDRSTSAKPSSMVFG-----DSAVS 307
             +  +G +G  +G +SF  Q +   +  KF YCL   +   K   +V G     ++++S
Sbjct: 123 ELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTFRGK---LVIGNYKLRNASIS 179

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAH----VRGITASLFKLDPAGNGGVIIDSGT 363
            +  +TP++ NP+    Y++ L  IS+        ++G  ++       G GG +ID+ T
Sbjct: 180 SSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSN-------GTGGTVIDTTT 232

Query: 364 SVTRLTRPAYIALRDAFRAGASSL----KRAPDFSLFDTCFDLSGKTEVKVP-TVVLHFR 418
            ++ LT   Y  L  A +   ++L        D    + C+++S  ++   P T+  HF 
Sbjct: 233 FLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLTYHFL 292

Query: 419 -GADVSLPATNYLIPVDS-SGTFCFAFAGTMS---GLSIIGNIQQQGFRVVYDLAASRIG 473
            GA V +     L   DS + T C A   + S    L++IG  QQ    V YDL   R G
Sbjct: 293 GGAGVEVSTWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDLEQMRYG 352

Query: 474 FAPRGC 479
           F  +GC
Sbjct: 353 FGAQGC 358


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 171/370 (46%), Gaps = 40/370 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL 201
           L VGTPP+ V MV+DTGS++ W+ C       S     F+  +S S+  +PC S  C   
Sbjct: 35  LTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPCSSSTCTNQ 93

Query: 202 D-----SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGH----DNE 252
                  + C+  + C   +SY D S + G+ +++T     + +  +  GC       N 
Sbjct: 94  TRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIPGMVFGCMDSVFSSNS 153

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-- 310
                  GL+G+ RG LSF +Q G     KFSYC+     S     ++ G+S  +     
Sbjct: 154 DEDSKNTGLMGMNRGSLSFVSQMGF---PKFSYCISGTDFSGM---LLLGESNFTWAVPL 207

Query: 311 RFTPLLAN----PKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D   Y V+L GI V    +  I  S+F+ D  G G  ++DSGT  
Sbjct: 208 NYTPLVQISTPLPYFDRIAYTVQLEGIKVSD-RLLPIPKSVFEPDHTGAGQTMVDSGTQF 266

Query: 366 TRLTRPAYIALRDAFRAGASSLKRA---PDFSL---FDTCF--DLSGKTEVKVPTVVLHF 417
           T L  PAY ALR  F    +   R    PDF      D C+   +S +   ++PTV L F
Sbjct: 267 TFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSLVF 326

Query: 418 RGADVSLPATN--YLIPVDSSGT---FCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAA 469
            GA++++      Y +P +  G     C +F  + + G+   +IG+  QQ   + +DL  
Sbjct: 327 NGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWMEFDLER 386

Query: 470 SRIGFAPRGC 479
           SRIG A   C
Sbjct: 387 SRIGLAQVRC 396


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 170/387 (43%), Gaps = 56/387 (14%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAK 184
           GL   +G Y+T + +GTPP+  Y+ +DTGSD++W+ C  C +C  ++       ++DP  
Sbjct: 80  GLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKA 139

Query: 185 SRSFATVPCRSPLCRKLDSSG-----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR 239
           S + +TV C    C   D+ G     C+    C Y V+YGDGS TVG F  + L F    
Sbjct: 140 SSTGSTVMCDQGFCA--DTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVT 197

Query: 240 --------VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                    A V  GCG    G       A  G+LG G    S  +Q  T  +  + F++
Sbjct: 198 GDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAH 257

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    T         GD  V    + TPL+A+      Y V L  I VGG  +  + A 
Sbjct: 258 CL---DTIKGGGIFAIGD-VVQPKVKTTPLVADKP---HYNVNLKTIDVGGTTLE-LPAD 309

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT----CFD 401
           +FK  P    G IIDSGT++T L    +  +  A         +  D +  D     CF+
Sbjct: 310 IFK--PGEKRGTIIDSGTTLTYLPELVFKKVMLAV------FNKHQDITFHDVQDFLCFE 361

Query: 402 LSGKTEVKVPTVVLHFRGADVSLPA--TNYLIPVDSSGTFCFAFA-GTMSG-----LSII 453
            SG  +   PT+  HF   D++L      Y  P + +  +C  F  G +       + ++
Sbjct: 362 YSGSVDDGFPTLTFHFE-DDLALHVYPHEYFFP-NGNDVYCVGFQNGALQSKDGKDIVLM 419

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G++      VVYDL    IG+    C+
Sbjct: 420 GDLVLSNKLVVYDLENRVIGWTDYNCS 446


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 166/360 (46%), Gaps = 27/360 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP F P  S ++  V C 
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC- 167

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA--RVALGCGHDN 251
                 +D +    R  C+Y+  Y + S + G    + ++F   + +A  R   GC +  
Sbjct: 168 -----TIDCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVE 222

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G   +  A G++GLGRG LS   Q   +     S+ L          +MV G   +S  
Sbjct: 223 TGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLG--GISPP 280

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
           +  T   ++P    +Y ++L  + V G  +  + A++F     G  G ++DSGT+   L 
Sbjct: 281 SDMTFAYSDPDRSPYYNIDLKEMHVAGKRLP-LNANVFD----GKHGTVLDSGTTYAYLP 335

Query: 370 RPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADV 422
             A++A +DA      SLK+   PD +  D CF  +G    ++    P V + F  G   
Sbjct: 336 EAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKY 395

Query: 423 SLPATNYLI-PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           SL   NY+       G +C   F       +++G I  +   V+YD   ++IGF    CA
Sbjct: 396 SLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCA 455


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 164/364 (45%), Gaps = 36/364 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ CK+C    DP F P  S S+  + C 
Sbjct: 77  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKC- 135

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHDN 251
           +P C   D  G      C+Y+  Y + S + G  S + ++F         R   GC +  
Sbjct: 136 NPDC-NCDDEG----KLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVE 190

Query: 252 EG-LFVA-AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
            G LF   A G++GLGRG+LS   Q   +      FS C           +MV G   +S
Sbjct: 191 TGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG--GAMVLG--KIS 246

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA---GNGGVIIDSGTS 364
             A      ++P    +Y ++L  + V G  +        KL+P    G  G ++DSGT+
Sbjct: 247 PPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSL--------KLNPKVFNGKHGTVLDSGTT 298

Query: 365 VTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV----PTVVLHF- 417
                + A+IA++DA      SLKR   PD +  D CF  +G+   ++    P + + F 
Sbjct: 299 YAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFG 358

Query: 418 RGADVSLPATNYLI-PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAP 476
            G  + L   NYL       G +C          +++G I  +   V YD    ++GF  
Sbjct: 359 NGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLK 418

Query: 477 RGCA 480
             C+
Sbjct: 419 TNCS 422


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 167/363 (46%), Gaps = 33/363 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++D+GS V ++ CA C++C +  DP F P  S +++ V C 
Sbjct: 85  NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC- 143

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT----RVARVALGCGHD 250
                 +D +  + +N C Y+  Y + S + G    + ++F GT    +  R   GC + 
Sbjct: 144 -----NVDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSF-GTESELKPQRAVFGCENS 197

Query: 251 NEG-LFVAAA-GLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
             G LF   A G++GLGRG+LS   Q   +      FS C           +MV G    
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG--GAMVLGAMPA 255

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
                +T   +N     +Y +EL  + V G  +R +   +F     G  G ++DSGT+  
Sbjct: 256 PPGMIYT--HSNAVRSPYYNIELKEMHVAGKALR-VDPRIFD----GKHGTVLDSGTTYA 308

Query: 367 RLTRPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RG 419
            L   A++A +DA  +    LK  R PD +  D CF  +G+   ++    P V + F  G
Sbjct: 309 YLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNG 368

Query: 420 ADVSLPATNYLIPVDS-SGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             +SL   NYL       G +C   F       +++G I  +   V YD    +IGF   
Sbjct: 369 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 428

Query: 478 GCA 480
            C+
Sbjct: 429 NCS 431


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 123/374 (32%), Positives = 176/374 (47%), Gaps = 63/374 (16%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------F 180
           V+S +   S EY   + +G+PPR +  + DTGSD+VW++C   KK  + T         F
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKC---KKGNNDTSSAAAPTTQF 146

Query: 181 DPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF----- 235
           DP++S ++  V C++  C  L  + C+  + C Y  +YGDGS T G  STET TF     
Sbjct: 147 DPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGA 206

Query: 236 ----RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTG--RRFNRKFSYCLVD 289
               R  R+  V  GC     G F  A GL+GLG G +S  TQ G      R+FSYCLV 
Sbjct: 207 GRSPRQVRIGGVKFGCSTATAGSF-PADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVP 265

Query: 290 RSTSAKPSSMVFGDSA--VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
            S +A  S++ FG  A      A  TPL+ N  + +                        
Sbjct: 266 HSVNAS-SALNFGALADVTEPGAASTPLVGNKTVAS------------------------ 300

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDTCFDLSGKT 406
               A +  +I+DSGT++T L       + D   R       ++PD  L   C++++G+ 
Sbjct: 301 ----AASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPD-GLLQLCYNVAGR- 354

Query: 407 EVK----VPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQ 459
           EV+    +P + L F  GA V+L   N  + V   GT C A   T     +SI+GN+ QQ
Sbjct: 355 EVEAGESIPDLTLEFGGGAAVALKPENAFVAV-QEGTLCLAIVATTEQQPVSILGNLAQQ 413

Query: 460 GFRVVYDLAASRIG 473
              V YDL A  +G
Sbjct: 414 NIHVGYDLDAGTVG 427



 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 66/132 (50%), Gaps = 11/132 (8%)

Query: 357 VIIDSGTSVTRLTRPAYIALRDAF-RAGASSLKRAPDFSLFDTCFDLSGKTEVK----VP 411
           +I+DSGT++T L       + D   R       ++PD  L   C++++G+ EV+    +P
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPD-GLLQLCYNVAGR-EVEAGESIP 496

Query: 412 TVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVVYDLA 468
            + L F G A V+L   N  + V   GT C A   T     +SI+GN+ QQ   V YDL 
Sbjct: 497 DLTLEFGGGAAVALKPENAFVAV-QEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLD 555

Query: 469 ASRIGFAPRGCA 480
           A  + FA   CA
Sbjct: 556 AGTVTFAVADCA 567


>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
          Length = 424

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 118/398 (29%), Positives = 167/398 (41%), Gaps = 100/398 (25%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCK----------KCYSQTDPVFDPA 183
           G  +Y    G+G PP+    V+DTGSD+VW QC+ C+           C+ Q  P ++ +
Sbjct: 74  GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133

Query: 184 KSRSFATVPCRS---PLCRKL-DSSGCNR-----RNTCLYQVSYGDGSITVGDFSTETLT 234
            SR+   VPC      LC    +++GC R      + C+   SYG G + +G   T+  T
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192

Query: 235 FRGTRVARVALGCGHDNE---GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS 291
           F  +    +A GC        G    A+G++GLGRG LS                     
Sbjct: 193 FPSSSSVTLAFGCVSQTRISPGALTGASGIIGLGRGALSL-------------------- 232

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPK---LDTFYYVELVGISVGGAHVRGITASLFK 348
                                     NPK     TFYY+ LVG++ G A V  + A  F 
Sbjct: 233 --------------------------NPKDSPFSTFYYLPLVGLAAGNATV-ALPAGAFD 265

Query: 349 LDPAG----NGGVIIDSGTSVTRLTRPAYIALRDAFRA---GASSLKRAPDF--SLFDTC 399
           L  A      GG +IDSG+  TRL  PA+ AL         G+ SL   P       + C
Sbjct: 266 LREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELC 325

Query: 400 FDLSGKTE----VKVPTVVLHFR-----GADVSLPATNYLIPVDSSGTFCFAFAGTMSG- 449
            +     +      VP++VL F      G ++ +PA  Y   V++S T+C A   + SG 
Sbjct: 326 VEAGDDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVEAS-TWCMAVVSSASGN 384

Query: 450 -------LSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                   +IIGN  QQ  RV+YDLA   + F P  C+
Sbjct: 385 ATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 422


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 166/382 (43%), Gaps = 43/382 (11%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           +GL   +G Y+T++G+G+P +  Y+ +DTGSD++W+ CA C  C  ++       ++DP 
Sbjct: 63  NGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPN 122

Query: 184 KSRSFATVPCRSPLCRKLDS---SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR- 239
            S++   VPC    C    S   SGC +  +C Y ++YGDGS T G F  ++LTF     
Sbjct: 123 GSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSG 182

Query: 240 -------VARVALGCGHDNEGLF-----VAAAGLLGLGRGRLSFPTQTGR--RFNRKFSY 285
                   + V  GCG    G        A  G++G G+   S  +Q     +  R FS+
Sbjct: 183 NLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSH 242

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTA-RFTPLLANPKLDTFYYVELVGISVGGAHVRGITA 344
           CL      +     +F    V       TPL+  P++   Y V L  + V G     I  
Sbjct: 243 CL-----DSHHGGGIFSIGQVMEPKFNTTPLV--PRM-AHYNVILKDMDVDG---EPILL 291

Query: 345 SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSG 404
            L+  D     G IIDSGT++  L    Y  L          LK       F TCF  S 
Sbjct: 292 PLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQF-TCFHYSD 350

Query: 405 KTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMS------GLSIIGNIQQ 458
           K +   P V  HF G  +++   +YL  +     +C  +  + +       L +IG++  
Sbjct: 351 KLDEGFPVVKFHFEGLSLTVHPHDYLF-LYKEDIYCIGWQKSSTQTKEGRDLILIGDLVL 409

Query: 459 QGFRVVYDLAASRIGFAPRGCA 480
               VVYDL    IG+    C+
Sbjct: 410 SNKLVVYDLENMVIGWTNFNCS 431


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 109/381 (28%), Positives = 168/381 (44%), Gaps = 54/381 (14%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFA 189
           +G YF ++G+GTP +  Y+ +DTGSD++W+ CA C +C +++D      ++D   S +  
Sbjct: 71  AGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 130

Query: 190 TVPCRSPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA------ 241
            V C    C   D    GC     CLY V YGDGS T G F  + + +   R++      
Sbjct: 131 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYN--RISGNFQTT 188

Query: 242 ----RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRS 291
                V  GCG+   G       A  G+LG G+   S  +Q  +  +  + FS+CL    
Sbjct: 189 PTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL---- 244

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            +     +      V      TPL+ N      Y V +  I VGG  +  + +  F  + 
Sbjct: 245 DNVDGGGIFAIGEVVEPKVNITPLVQN---QAHYNVVMKEIEVGGDPLD-VPSDAF--ES 298

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD-----TCFDLSGKT 406
               G IIDSGT++    +  Y+ L +        L + PD  L       TCFD +G  
Sbjct: 299 GDRKGTIIDSGTTLAYFPQEVYVPLIEKI------LSQQPDLRLHTVEQAFTCFDYTGNV 352

Query: 407 EVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAF----AGTMSG--LSIIGNIQQQ 459
           +   PTV LHF +   +++    YL  V     +C  +    A T  G  L+++G++   
Sbjct: 353 DDGFPTVTLHFDKSISLTVYPHEYLFQVKEF-EWCIGWQNSGAQTKDGKDLTLLGDLVLS 411

Query: 460 GFRVVYDLAASRIGFAPRGCA 480
              VVYDL    IG+    C+
Sbjct: 412 NKLVVYDLEKQGIGWVEYNCS 432


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 110/387 (28%), Positives = 170/387 (43%), Gaps = 54/387 (13%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           +G    +G YF ++G+GTP +  Y+ +DTGSD++W+ CA C +C +++D      ++D  
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205

Query: 184 KSRSFATVPCRSPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
            S +   V C    C   D    GC     CLY V YGDGS T G F  + + +   R++
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQY--NRIS 263

Query: 242 ----------RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                      V  GCG+   G       A  G+LG G+   S  +Q  +  +  + FS+
Sbjct: 264 GNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSH 323

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL     +     +      V      TPL+ N      Y V +  I VGG  +  + + 
Sbjct: 324 CL----DNVDGGGIFAIGEVVEPKVNITPLVQN---QAHYNVVMKEIEVGGDPLD-VPSD 375

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD-----TCF 400
            F  +     G IIDSGT++    +  Y+ L +        L + PD  L       TCF
Sbjct: 376 AF--ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI------LSQQPDLRLHTVEQAFTCF 427

Query: 401 DLSGKTEVKVPTVVLHF-RGADVSLPATNYLIPVDSSGTFCFAF----AGTMSG--LSII 453
           D +G  +   PTV LHF +   +++    YL  V     +C  +    A T  G  L+++
Sbjct: 428 DYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEF-EWCIGWQNSGAQTKDGKDLTLL 486

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G++      VVYDL    IG+    C+
Sbjct: 487 GDLVLSNKLVVYDLEKQGIGWVEYNCS 513


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 170/372 (45%), Gaps = 41/372 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV----FDPAKSRSFATV 191
           G YF ++G+GTP R  ++ +DTGSD++W+ CA C +C  ++D V    +D   S +  +V
Sbjct: 83  GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSV 142

Query: 192 PCRSPLCRKLDS-SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVAR 242
            C    C  ++  S C+  +TC Y + YGDGS T G    + +           G+    
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGT 202

Query: 243 VALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQTGR--RFNRKFSYCLVDRSTSAKP 296
           +  GCG    G       A  G++G G+   SF +Q     +  R F++CL + +     
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGG-- 260

Query: 297 SSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
              +F     VS   + TP+L+       Y V L  I VG + V  ++++ F  D   + 
Sbjct: 261 ---IFAIGEVVSPKVKTTPMLSKS---AHYSVNLNAIEVGNS-VLELSSNAF--DSGDDK 311

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVL 415
           GVIIDSGT++  L    Y  L +   A    L        F TCF  + K + + PTV  
Sbjct: 312 GVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYTDKLD-RFPTVTF 369

Query: 416 HF-RGADVSLPATNYLIPVDSSGTFCFAFAG----TMSG--LSIIGNIQQQGFRVVYDLA 468
            F +   +++    YL  V    T+CF +      T  G  L+I+G++      VVYD+ 
Sbjct: 370 QFDKSVSLAVYPREYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIE 428

Query: 469 ASRIGFAPRGCA 480
              IG+    C+
Sbjct: 429 NQVIGWTNHNCS 440


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 113/403 (28%), Positives = 181/403 (44%), Gaps = 43/403 (10%)

Query: 112 RNRSR-GRANGGFSSSVISGLAQGS------GEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
           R+R+R  R   G +  V+    QG+      G Y+T++ +GTPP+   + +DTGSD++W+
Sbjct: 45  RDRARHARMLRGVAGGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQIDTGSDILWV 104

Query: 165 QCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCR---KLDSSGCN-RRNTCLYQ 215
            C  C  C   +        FD   S + A +PC  P+C    +  ++ C+ R N C Y 
Sbjct: 105 NCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYT 164

Query: 216 VSYGDGSITVGDFSTETLTFR--------GTRVARVALGCGHDNEGLFV----AAAGLLG 263
             YGDGS T G + ++ + F             A +  GC     G       A  G+ G
Sbjct: 165 FQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFG 224

Query: 264 LGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKL 321
            G G LS  +Q   R    + FS+CL           ++     +  +  ++PL+ +   
Sbjct: 225 FGPGPLSVVSQLSSRGITPKVFSHCL---KGDGDGGGVLVLGEILEPSIVYSPLVPS--- 278

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
              Y + L  I+V G  +  I  ++F +     GG I+D GT++  L + AY  L  A  
Sbjct: 279 QPHYNLNLQSIAVNG-QLLPINPAVFSIS-NNRGGTIVDCGTTLAYLIQEAYDPLVTAIN 336

Query: 382 AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIP---VDSSG 437
              S   R  + S  + C+ +S       P+V L+F  GA + L    YL+    +D + 
Sbjct: 337 TAVSQSARQTN-SKGNQCYLVSTSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAE 395

Query: 438 TFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            +C  F     G SI+G++  +   VVYD+A  RIG+A   C+
Sbjct: 396 MWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCS 438


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 167/363 (46%), Gaps = 33/363 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++D+GS V ++ CA C++C +  DP F P  S +++ V C 
Sbjct: 85  NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC- 143

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT----RVARVALGCGHD 250
                 +D +  + +N C Y+  Y + S + G    + ++F GT    +  R   GC + 
Sbjct: 144 -----NVDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSF-GTESELKPQRAVFGCENS 197

Query: 251 NEG-LFVAAA-GLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
             G LF   A G++GLGRG+LS   Q   +      FS C           +MV G    
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG--GAMVLGAMPA 255

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
                +T   +N     +Y +EL  + V G  +R +   +F     G  G ++DSGT+  
Sbjct: 256 PPGMIYT--HSNAVRSPYYNIELKEMHVAGKALR-VDPRIFD----GKHGTVLDSGTTYA 308

Query: 367 RLTRPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RG 419
            L   A++A +DA  +    LK  R PD +  D CF  +G+   ++    P V + F  G
Sbjct: 309 YLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNG 368

Query: 420 ADVSLPATNYLIPVDS-SGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             +SL   NYL       G +C   F       +++G I  +   V YD    +IGF   
Sbjct: 369 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 428

Query: 478 GCA 480
            C+
Sbjct: 429 NCS 431


>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
          Length = 429

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 117/408 (28%), Positives = 185/408 (45%), Gaps = 71/408 (17%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWI--------QCAPCKKCYSQTDPVFDPAKSRSFA 189
           Y   L +G PP+   + LDTGSD+ W+        QC  C   +S + P+   + S+S +
Sbjct: 25  YLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSSS 84

Query: 190 TVP--CRSPLCRKLDSSGCNRRNTCL--------------------YQVSYGDGSITVGD 227
            +   C S  C  + SS  N  + C                     +  +YG G++ +G 
Sbjct: 85  NMKELCGSRFCVDIHSSD-NSHDPCAAVGCAIPSFMSGLCTRPCPPFSYTYGGGALVLGS 143

Query: 228 FSTETLTFRGT--------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
            + + +T  G+         V     GC   +        G+ G G+G LS P+Q G   
Sbjct: 144 LAKDIVTLHGSIFGIAILLDVPGFCFGCVGSS---IREPIGIAGFGKGILSLPSQLGF-L 199

Query: 280 NRKFSYCLVDRSTSAKP---SSMVFGDSAVSRTARF--TPLLANPKLDTFYYVELVGISV 334
           ++ FS+C +    +  P   SS++ GD A+S    F  TP+L +     FYY+ L G+S+
Sbjct: 200 DKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIGLEGVSI 259

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
           G         SL  +D  GNGG+I+D+GT+ T L  P Y A+  +  A     +R+ D  
Sbjct: 260 GDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSL-ASVILYERSYDLE 318

Query: 395 L---FDTCFDL----SGKTEVKVPTVVLHFRG-ADVSLPATN--YLI--PVDSSGTFCFA 442
           +   FD CF +    +  T+ ++P +  HF G   ++LP  +  Y +  P +S    C  
Sbjct: 319 MRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNSVVVKCLL 378

Query: 443 F---------AGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           F          G  +G  +++G+ Q Q   VVYD+ A RIGF P+ CA
Sbjct: 379 FQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDCA 426


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 165/360 (45%), Gaps = 27/360 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP F P  S ++  V C 
Sbjct: 81  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC- 139

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA--RVALGCGHDN 251
                 +D +  + R  C+Y+  Y + S + G    + ++F   + +A  R   GC +  
Sbjct: 140 -----TIDCNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENVE 194

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G   +  A G++GLGRG LS   Q   +     S+ L          +MV G   +S  
Sbjct: 195 TGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLG--GISPP 252

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
           +      ++P    +Y ++L  I V G  +  + A++F     G  G ++DSGT+   L 
Sbjct: 253 SDMAFAYSDPVRSPYYNIDLKEIHVAGKRLP-LNANVFD----GKHGTVLDSGTTYAYLP 307

Query: 370 RPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSG----KTEVKVPTVVLHFR-GADV 422
             A++A +DA      SLK+   PD +  D CF  +G    +     P V + F  G   
Sbjct: 308 EAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKY 367

Query: 423 SLPATNYLI-PVDSSGTFCF-AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +L   NY+       G +C   F       +++G I  +   VVYD   ++IGF    CA
Sbjct: 368 TLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNCA 427


>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 137/248 (55%), Gaps = 15/248 (6%)

Query: 240 VARVALGCGHDNEGLFV-AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSS 298
           + R+  GCG +N    +   AGLLGLGRG LS  +Q G    +KFSYCL     + K SS
Sbjct: 139 IPRIGFGCGVNNRATGMDQTAGLLGLGRGVLSLVSQLG---TQKFSYCLTSIHEN-KTSS 194

Query: 299 MVFGDSAVS-----RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
           ++FG  A S     +  R TPL+ NP L ++YY+ L GI+VG   +  I    F+L   G
Sbjct: 195 LLFGSLAYSNFNPGKIPR-TPLIQNPFLPSYYYLALKGITVGYT-LLPIPEFAFQLGKDG 252

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKT--EVKVP 411
           +GG+I+DSGT++T L   A+  L++AF +           +  D CF L  K   EVKVP
Sbjct: 253 SGGMILDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLCFHLPVKNAAEVKVP 312

Query: 412 TVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
            ++ HF+G D++LP  NY++     G  C A   T S LSI GNIQQQ   V++DL  S 
Sbjct: 313 KLIFHFKGLDLALPVENYMVSDPEMGLICLAIDATGS-LSIFGNIQQQNMLVLHDLKKST 371

Query: 472 IGFAPRGC 479
           +   P  C
Sbjct: 372 LSLVPTQC 379



 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 46/102 (45%), Gaps = 17/102 (16%)

Query: 62  ESSLSLRLHHVDSLSFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANG 121
           E+   + L H+D+   N T   L    I R   R++ ++  A +A R             
Sbjct: 40  ETGFQVGLRHIDA-GRNFTRLQLIQRGINRGRQRLQRMSGMATTAER------------N 86

Query: 122 GFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVW 163
           GF + V      G GE+   L +GTPP     ++DTGSD++W
Sbjct: 87  GFQAPV----HVGDGEFVVNLMIGTPPVPFPAIMDTGSDLIW 124


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 168/386 (43%), Gaps = 53/386 (13%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           +G    +G YF ++G+GTP +  Y+ +DTGSD++W+ CA C +C +++D      ++D  
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205

Query: 184 KSRSFATVPCRSPLCRKLDS--SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
            S +   V C    C   D    GC     CLY V YGDGS T G F  + + +   R++
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQY--NRIS 263

Query: 242 ----------RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                      V  GCG+   G       A  G+LG G+   S  +Q  +  +  + FS+
Sbjct: 264 GNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSH 323

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL     +     +      V      TPL+ N      Y V +  I VGG  +  + + 
Sbjct: 324 CL----DNVDGGGIFAIGEVVEPKVNITPLVQN---QAHYNVVMKEIEVGGDPLD-VPSD 375

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD-----TCF 400
            F  +     G IIDSGT++    +  Y+ L +        L + PD  L       TCF
Sbjct: 376 AF--ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI------LSQQPDLRLHTVEQAFTCF 427

Query: 401 DLSGKTEVKVPTVVLHFRGADVSLPATNYLIPVDSSGTFCFAF----AGTMSG--LSIIG 454
           D +G  +   PTV LHF  + +SL    +         +C  +    A T  G  L+++G
Sbjct: 428 DYTGNVDDGFPTVTLHFDKS-ISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLG 486

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGCA 480
           ++      VVYDL    IG+    C+
Sbjct: 487 DLVLSNKLVVYDLEKQGIGWVEYNCS 512


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 122/409 (29%), Positives = 183/409 (44%), Gaps = 57/409 (13%)

Query: 112 RNRSR-GRANGGFSSSVISGLAQGS------GEYFTRLGVGTPPRYVYMVLDTGSDVVWI 164
           R+R R  R   GF   V+    QGS      G YFT++ +G+PPR   + +DTGSDV+W+
Sbjct: 33  RDRLRHARLLQGFVGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDVLWV 92

Query: 165 QCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCR---KLDSSGCNRR-NTCLYQ 215
            C  C  C   +        FD + S +   V C  P+C    +  ++ C+ + + C Y 
Sbjct: 93  CCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYT 152

Query: 216 VSYGDGSITVGDFSTETLTFRG--------TRVARVALGCGHDNEGLFV----AAAGLLG 263
             YGDGS T G + ++TL F             A +  GC     G       A  G+ G
Sbjct: 153 FQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFG 212

Query: 264 LGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL-ANPK 320
            G+G LS  +Q   R    R FS+CL  +   +    +V G+  +     ++PL+ + P 
Sbjct: 213 FGQGELSVISQLSTRGITPRVFSHCL--KGDGSGGGILVLGE-ILEPGIVYSPLVPSQPH 269

Query: 321 LDTFYYVELVGISVGGAHVRGITASLFKLDPAG-----NGGVIIDSGTSVTRLTRPAYIA 375
               Y + L+ I+V G         L  +DPA      + G I+DSGT++  L   AY  
Sbjct: 270 ----YNLNLLSIAVNG--------QLLPIDPAAFATSNSQGTIVDSGTTLAYLVAEAYDP 317

Query: 376 LRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVD 434
              A  A  S     P  S  + C+ +S       P    +F  GA + L   +YLIP  
Sbjct: 318 FVSAVNAIVSP-SVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFG 376

Query: 435 SSG---TFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           SSG    +C  F   + G++I+G++  +    VYDL   RIG+A   C+
Sbjct: 377 SSGGSAMWCIGFQ-KVQGVTILGDLVLKDKIFVYDLVRQRIGWANYDCS 424


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 113/388 (29%), Positives = 173/388 (44%), Gaps = 58/388 (14%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAK 184
           GL   +G Y+T + +GTPP++ Y+ +DTGSD++W+ C  C++C  ++       ++DP  
Sbjct: 78  GLPTDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKA 137

Query: 185 SRSFATVPCRSPLCR-----KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGT 238
           S + + V C    C      KL   G N    C Y V+YGDGS T+G F T+ L F + T
Sbjct: 138 SSTGSMVMCDQAFCAATFGGKLPKCGANV--PCEYSVTYGDGSSTIGSFVTDALQFDQVT 195

Query: 239 R-------VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
           R        A V  GCG    G       A  G+LG G    S  +Q  T  +  + F++
Sbjct: 196 RDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAH 255

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    T         GD  V    + TPL+A+      Y V L  I VGG  ++ + A 
Sbjct: 256 CL---DTIKGGGIFSIGD-VVQPKVKTTPLVADKP---HYNVNLKTIDVGGTTLQ-LPAH 307

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT----CFD 401
           +F  +P    G IIDSGT++T L     +  ++   A      +  D +  D     CF 
Sbjct: 308 IF--EPGEKKGTIIDSGTTLTYLPE---LVFKEVMLA---VFNKHQDITFHDVQGFLCFQ 359

Query: 402 LSGKTEVKVPTVVLHFRGADVSL---PATNYLIPVDSSGTFCFAFAGTMS------GLSI 452
             G  +   PT+  HF   D++L   P   +    + +  +C  F    S       + +
Sbjct: 360 YPGSVDDGFPTITFHFE-DDLALHVYPHEYFF--ANGNDVYCVGFQNGASQSKDGKDIVL 416

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +G++      V+YDL    IG+    C+
Sbjct: 417 MGDLVLSNKLVIYDLENRVIGWTDYNCS 444


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 119/378 (31%), Positives = 171/378 (45%), Gaps = 46/378 (12%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP---CKKC-YSQTDP---VFDPAKSRSF 188
           G Y   L  GTPP+ + +++DTGSD+VW  C     C+ C +S ++P   +F P  S S 
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 189 ATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVAL 245
             + C +P C  +  S    R  C            +       L F   R ++  R  L
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSR--CRDCEPTSPNCTQICPPYLNFLRFWDHRRSQFHRRML 205

Query: 246 GCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR--STSAKPSSMVFGD 303
              H +    ++     G GRG  S P+Q G +   KFSYCL+ R    + + SS+V   
Sbjct: 206 CPLHQSTRREIS-----GFGRGPPSLPSQLGLK---KFSYCLLSRRYDDTTESSSLVLDG 257

Query: 304 SAVS--RTA--RFTPLLANPKL------DTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
            + S  +TA   +TP + NPK+        +YY+ L  I+VGG HV+ I          G
Sbjct: 258 ESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVK-IPYKYLIPGADG 316

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD---FSLFDTCFDLSGKTEVKV 410
           +GG IIDSGT+ T +    +  +   F     S KRA +    +    CF++SG      
Sbjct: 317 DGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQS-KRATEVEGITGLRPCFNISGLNTPSF 375

Query: 411 PTVVLHFRG-ADVSLPATNYLIPVDSSGTFCF------AFAGTMSG--LSIIGNIQQQGF 461
           P + L FRG A++ LP  NY+  +      C       A     SG    I+GN QQQ F
Sbjct: 376 PELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNF 435

Query: 462 RVVYDLAASRIGFAPRGC 479
            V YDL   R+GF  + C
Sbjct: 436 YVEYDLRNERLGFRQQSC 453


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 168/367 (45%), Gaps = 42/367 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC--R 199
           L +G+PP+ V MVLDTGS++ W+ C    K     +  F+P  S S+   PC S +C  R
Sbjct: 63  LTIGSPPQNVTMVLDTGSELSWLHC----KKLPNLNSTFNPLLSSSYTPTPCNSSVCMTR 118

Query: 200 KLD---SSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLF 255
             D    + C+  N  C   VSY D S   G  + ET +  G        GC  D+ G  
Sbjct: 119 TRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGC-MDSAGYT 177

Query: 256 ------VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS-AVSR 308
                     GL+G+ RG LS  TQ       KFSYC+   S       ++ GD  +   
Sbjct: 178 SDINEDAKTTGLMGMNRGSLSLVTQ---MVLPKFSYCI---SGEDAFGVLLLGDGPSAPS 231

Query: 309 TARFTPLL----ANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
             ++TPL+    ++P  D   Y V+L GI V    ++ +  S+F  D  G G  ++DSGT
Sbjct: 232 PLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQ-LPKSVFVPDHTGAGQTMVDSGT 290

Query: 364 SVTRLTRPAYIALRDAF---RAGASSLKRAPDFSLFDTCFDL---SGKTEVKVPTVVLHF 417
             T L  P Y +L+D F     G  +    P+F +F+   DL   +  +   VP V L F
Sbjct: 291 QFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNF-VFEGAMDLCYHAPASLAAVPAVTLVF 349

Query: 418 RGADVSLPATNYLIPVDS--SGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAASRI 472
            GA++ +     L  V       +CF F  + + G+   +IG+  QQ   + +DL  SR+
Sbjct: 350 SGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLVKSRV 409

Query: 473 GFAPRGC 479
           GF    C
Sbjct: 410 GFTETTC 416


>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
          Length = 193

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 72/168 (42%), Positives = 96/168 (57%), Gaps = 3/168 (1%)

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           TPL+ NP   +FYY+ L  ISVG   +  I  S F++   G+GGVIIDSGT++T +   A
Sbjct: 25  TPLITNPLQPSFYYISLEVISVGDTKLS-IEQSTFEVSDDGSGGVIIDSGTTITYIEENA 83

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDL-SGKTEVKVPTVVLHFRGADVSLPATNYLI 431
           + +L+  F +           +  D CF L SGKTEV++P +V HF+G D+ LP  NY+I
Sbjct: 84  FDSLKKEFTSQTKLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGGDLELPGENYMI 143

Query: 432 PVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              S G  C A  G  +G+SI GNIQQQ   V +DL    I F P  C
Sbjct: 144 ADSSLGVACLAM-GASNGMSIFGNIQQQNILVNHDLQKETITFIPTQC 190


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 98/280 (35%), Positives = 144/280 (51%), Gaps = 37/280 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-PV----FDPAKSRSFAT 190
           G Y+TR+ +GTPP+  Y+ +DTGS+V W++CAPC  C    D PV    FDP KS +  +
Sbjct: 39  GLYYTRISLGTPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKIS 98

Query: 191 VPCRSPLCRKLDSS-GCN-RRNTCLYQVSYGDGSITVGDFSTETLTFR---------GTR 239
           + C    C  L+    C+  R +C Y + YGDGS T G +  +  TF           + 
Sbjct: 99  ISCTDAECGVLNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSG 158

Query: 240 VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF--NRKFSYCLVDRSTSAKPS 297
            AR+  GCG    G + +  GLLG G   +S P Q  ++      F++CL  +   +   
Sbjct: 159 TARLVFGCGGTQTGSW-SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCL--QGDVSGRG 215

Query: 298 SMVFGDSAVSRTARFTPLLANPKL--DTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
           S+V G      T R   L+  P +  +  Y V+L+ I + G +V   T + F L+    G
Sbjct: 216 SLVIG------TIREPDLVYTPMVFGEDHYNVQLLNIGISGRNV--TTPASFDLEYT--G 265

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
           GVIIDSGT++T L +PAY    D FR G S  K++ D ++
Sbjct: 266 GVIIDSGTTLTYLVQPAY----DEFRRGVSVFKQSSDLAV 301


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 107/372 (28%), Positives = 168/372 (45%), Gaps = 41/372 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV----FDPAKSRSFATV 191
           G YF ++G+GTP R  ++ +DTGSD++W+ CA C +C  ++D V    +D   S +  +V
Sbjct: 83  GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSV 142

Query: 192 PCRSPLCRKLDS-SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVAR 242
            C    C  ++  S C+  +TC Y + YGDGS T G    + +           G+    
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGT 202

Query: 243 VALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQTGR--RFNRKFSYCLVDRSTSAKP 296
           +  GCG    G       A  G++G G+   SF +Q     +  R F++CL + +     
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGG-- 260

Query: 297 SSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
              +F     VS   + TP+L+       Y V L  I VG + V  +++  F  D   + 
Sbjct: 261 ---IFAIGEVVSPKVKTTPMLSKS---AHYSVNLNAIEVGNS-VLQLSSDAF--DSGDDK 311

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVL 415
           GVIIDSGT++  L    Y  L +   A    L        F TCF    + + + PTV  
Sbjct: 312 GVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSF-TCFHYIDRLD-RFPTVTF 369

Query: 416 HF-RGADVSLPATNYLIPVDSSGTFCFAFAG----TMSG--LSIIGNIQQQGFRVVYDLA 468
            F +   +++    YL  V    T+CF +      T  G  L+I+G++      VVYD+ 
Sbjct: 370 QFDKSVSLAVYPQEYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIE 428

Query: 469 ASRIGFAPRGCA 480
              IG+    C+
Sbjct: 429 NQVIGWTNHNCS 440


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 167/362 (46%), Gaps = 31/362 (8%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++D+GS V ++ C+ C++C +  DP F P  S S++ V C 
Sbjct: 85  NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKC- 143

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDN 251
                 +D +  + +  C Y+  Y + S + G    + ++F      +      GC +  
Sbjct: 144 -----NVDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENSE 198

Query: 252 EG-LFVAAA-GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G LF   A G++GLGRG+LS   Q   +     S+ L          +MV G       
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPD 258

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
             F+   ++P    +Y +EL  I V G  +R + + +F        G ++DSGT+   L 
Sbjct: 259 MIFSN--SDPLRSPYYNIELKEIHVAGKALR-VESRIFN----SKHGTVLDSGTTYAYLP 311

Query: 370 RPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADV 422
             A++A ++A  +   SLK  R PD S  D CF  +G+   K+    P V + F  G  +
Sbjct: 312 EQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKL 371

Query: 423 SLPATNYLI---PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           SL   NYL     VD  G +C   F       +++G I  +   V YD    +IGF    
Sbjct: 372 SLTPENYLFRHSKVD--GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTN 429

Query: 479 CA 480
           C+
Sbjct: 430 CS 431


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 106/199 (53%), Gaps = 13/199 (6%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS 203
           +G P   VY + DTGS+++W+QC PC  CY+QT P+FDPA+S ++ TV   SP+C  +  
Sbjct: 63  LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122

Query: 204 SGCNRRN-TCLYQVSYGDGSITVGDFSTETLTFRG-----TRVARVALGCGHDNEG-LFV 256
             C   + +C YQ +YGDG+ T G  ST+   F         V  +  GC HD +  L  
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKG 182

Query: 257 AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL 316
             AG++GL R   S  +Q      +KFSYC+V        S M FG  AV    + TPLL
Sbjct: 183 HQAGVVGLNRHPNSLVSQLKV---KKFSYCMVIPDDHGSGSRMYFGSRAVILGGK-TPLL 238

Query: 317 ANPKLDTFYYVELVGISVG 335
                 + Y+V L GISVG
Sbjct: 239 KGDY--SHYFVTLKGISVG 255



 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 55/109 (50%), Gaps = 8/109 (7%)

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYGDGSI-TVGDF 228
           +C++QT P+FDP+KS +++TVP  +P C +     C+     C Y++SYG GS  T G  
Sbjct: 333 QCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGTI 392

Query: 229 STETLTFRGTR-----VARVALGCGHDNEGLFVA-AAGLLGLGRGRLSF 271
           S +   F   R     V  +  GC     G F     G++GL +  LS 
Sbjct: 393 SIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSL 441



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 34/64 (53%), Gaps = 3/64 (4%)

Query: 411 PTVVLHFRGADVSLPATNYLIPVDSSGTFCFAFAGTMS--GLSIIGNIQQQGFRVVYDLA 468
           P +  HF GAD  L      + V+  G +C A   + S   LSI+GNIQQQ + V YDL 
Sbjct: 269 PDITFHFYGADFILTKXTTYVEVEK-GLWCLAMLSSNSTRKLSILGNIQQQNYHVGYDLE 327

Query: 469 ASRI 472
           A  +
Sbjct: 328 AQEV 331


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 121/421 (28%), Positives = 184/421 (43%), Gaps = 51/421 (12%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGE 137
           N T +    L I+    R+  + A  E ++            N  +++SV   L   +  
Sbjct: 53  NETAKDRMELDIEHSAARLAYIQARIEGSLVY----------NNDYTASVSPSLTGRT-- 100

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPL 197
               L +G P     +V+DTGSD++WI C PC  C +    +FDP+ S +F+ + C++P 
Sbjct: 101 ILVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL-CKTPC 159

Query: 198 CRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-----RVARVALGCGHDNE 252
             K    GC + +   + +SY D S   G F  + L F  T     +++ V +GCGH N 
Sbjct: 160 GFK----GC-KCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGH-NI 213

Query: 253 GLFV--AAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           G        G+LGL  G  S  TQ G    RKFSYC+ + +      + +          
Sbjct: 214 GFNSDPGYNGILGLNNGPNSLATQIG----RKFSYCIGNLADPYYNYNQLRLGEGADLEG 269

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
             TP         FYYV + GISVG   +  I    F++   G GGVI+DSGT++T L  
Sbjct: 270 YSTPFEV---YHGFYYVTMEGISVGEKRLD-IALETFEMKRNGTGGVILDSGTTITYLVD 325

Query: 371 PAYIALRDAFRAGASSLKRAPDFSLFDTC------FDLSGKTEVKVPTVVLHF-RGADVS 423
            A+  L +  R   + LK +    +F+        + +  +  V  P V  HF  GAD++
Sbjct: 326 SAHKLLYNEVR---NLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLA 382

Query: 424 LPATNYLIPVDSSGTFCFAFA-----GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
           L   ++    D    FC   +      T    S+IG + QQ + V YDL    + F    
Sbjct: 383 LDTGSFFSQRDD--IFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRID 440

Query: 479 C 479
           C
Sbjct: 441 C 441


>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
 gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
 gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
          Length = 432

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 117/411 (28%), Positives = 185/411 (45%), Gaps = 74/411 (18%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWI--------QCAPCKKCYSQTDPVFDPAKSRSFA 189
           Y   L +G PP+   + LDTGSD+ W+        QC  C   +S + P+   + S+S +
Sbjct: 25  YLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSSS 84

Query: 190 TVP--CRSPLCRKLDSSGCNRRNTCL--------------------YQVSYGDGSITVGD 227
            +   C S  C  + SS  N  + C                     +  +YG G++ +G 
Sbjct: 85  NMKELCGSRFCVDIHSSD-NSHDPCAAVGCAIPSFMSDLCTRPCPPFSYTYGGGALVLGS 143

Query: 228 FSTETLTFRGT--------RVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
            + + +T  G+         V     GC   +        G+ G G+G LS P+Q G   
Sbjct: 144 LAKDIVTLHGSIFGIAILLDVPGFCFGCVGSS---IREPIGIAGFGKGILSLPSQLGF-L 199

Query: 280 NRKFSYCLVDRSTSAKP---SSMVFGDSAVSRTARF--TPLLANPKLDTFYYVELVGISV 334
           ++ FS+C +    +  P   SS++ GD A+S    F  TP+L +     FYY+ L G+S+
Sbjct: 200 DKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIGLEGVSI 259

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFS 394
           G         SL  +D  GNGG+I+D+GT+ T L  P Y A+  +  A     +R+ D  
Sbjct: 260 GDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSL-ASVILYERSYDLE 318

Query: 395 L---FDTCFDL----SGKTEVKVPTVVLHFRG-ADVSLPATN--YLI--PVDSSGTFCFA 442
           +   FD CF +    +  T+ ++P +  HF G   ++LP  +  Y +  P +S    C  
Sbjct: 319 MRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNSVVVKCLL 378

Query: 443 F------------AGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           F             G  +G  +++G+ Q Q   VVYD+ A RIGF P+ CA
Sbjct: 379 FQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDCA 429


>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 381

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 116/379 (30%), Positives = 172/379 (45%), Gaps = 57/379 (15%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCR 194
           G Y+  + +G P +  Y+ +DTGSD+ W+QC APC+ C S    ++DP K+R    V CR
Sbjct: 21  GLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPKKAR---LVDCR 77

Query: 195 SPLCRKLDSSG---CNR-RNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVA-LG 246
            PLC  +   G   C      C Y V Y DGS T+G    +T+T     GTR    A +G
Sbjct: 78  VPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTNGTRSKTTAIIG 137

Query: 247 CGHDNEGLF----VAAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMV 300
           CG+D +G       +  G++GL   ++S P+Q  ++        +CL   S       + 
Sbjct: 138 CGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAGGSNGG--GYLF 195

Query: 301 FGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN-GGVII 359
           FGDS V       P L         +  ++G S+ G ++ G +      D  G+ GGV+ 
Sbjct: 196 FGDSLV-------PALG------MTWTPIMGKSITG-NIGGKSGD--ADDKTGDIGGVMF 239

Query: 360 DSGTSVTRLTRPAYIALRDA--FRAGASSLKRAPDFSLFDTC------FDLSGKTEVKVP 411
           DSGTS T L   AY A+  A   +   S L R    +    C      F+     +    
Sbjct: 240 DSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRGPSPFESVADVQRYFK 299

Query: 412 TVVLHFRGAD-------VSLPATNYLIPVDSSGTFCF----AFAGTMSGLSIIGNIQQQG 460
           TV L F   +       + L    YLI V + G  C     A   ++   +IIG++  +G
Sbjct: 300 TVTLDFGKRNWYSASRVLELSPEGYLI-VSTQGNVCLGILDASGASLEVTNIIGDVSMRG 358

Query: 461 FRVVYDLAASRIGFAPRGC 479
           + VVYD A ++IG+  R C
Sbjct: 359 YLVVYDNARNQIGWVRRNC 377


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 92/270 (34%), Positives = 130/270 (48%), Gaps = 20/270 (7%)

Query: 224 TVGDFSTETLTFRGTR--VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNR 281
           + G  +TET TF   +   A +  GCG    G    A+G++G+  G LS   Q       
Sbjct: 3   STGVLATETFTFGAHQNFSANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSIT--- 59

Query: 282 KFSYCLVDRSTSAKPSSMVFGDSA------VSRTARFTPLLANPKLDTFYYVELVGISVG 335
           KFSYCL    T  K S ++FG  A       +   +  PLL NP  D +YYV +VGIS+G
Sbjct: 60  KFSYCLTPF-TDHKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIG 118

Query: 336 GAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSL 395
              +  +  ++  L P G GG ++DS T++  L  PA+  L+ A   G            
Sbjct: 119 SKRLD-VPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDD 177

Query: 396 FDTCFDLS---GKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF--AGTMSG 449
           +  CF+L        V+VP +VLHF G A++SLP  +Y     S G  C A   A     
Sbjct: 178 YPVCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPFEGA 236

Query: 450 LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            ++IGN+QQQ   V+YDL   +  +AP  C
Sbjct: 237 PNVIGNVQQQNMHVLYDLGNRKFSYAPTKC 266


>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
          Length = 378

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 114/351 (32%), Positives = 155/351 (44%), Gaps = 68/351 (19%)

Query: 191 VPCRSPLC--------------------RKLDSSGCNRRNTC--LYQVSYGDGSITVGDF 228
           +PC SPLC                      +++  C   + C  LY  +YGDGS+     
Sbjct: 24  IPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLY-YAYGDGSLVAHLR 82

Query: 229 STETLTFRGTR------VARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRK 282
                   G R      V      C H   G  V   G+ G GRG LS P Q   + + +
Sbjct: 83  RGRVALGAGARASVAVAVDNFTFACAHTALGEPV---GVAGFGRGPLSLPGQLSPQLSGR 139

Query: 283 FSYCLVDRSTSA----KPSSMVFGDSAVSRTAR-------FTPLLANPKLDTFYYVELVG 331
           FSYCLV  S  A    +PS ++ G S     A        +TPLL NPK   FY V L  
Sbjct: 140 FSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEA 199

Query: 332 ISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL---- 387
           +SVG A ++     L ++D AGNGG+++DSGT+ T L    Y  + +AF    ++     
Sbjct: 200 VSVGAARIQA-RPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFAR 258

Query: 388 -KRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPV-----------D 434
            +RA + +    C+  +  ++  VP + LHFRG A V+LP  NY +             D
Sbjct: 259 AERAEEQTGLTPCYRYA-ASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKD 317

Query: 435 SSGTFCFAFAGTMSG------LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             G       G  SG         +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 318 DVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 368


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 114/386 (29%), Positives = 172/386 (44%), Gaps = 50/386 (12%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           +G+   +G YFT++G+GTP +  Y+ +DTGSD++W+ C  C  C  ++       ++DP 
Sbjct: 80  NGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPT 139

Query: 184 KSRSFATVPCRSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR 239
            S S  TV C    C    + G    C   + C Y ++YGDGS T G F  + L +    
Sbjct: 140 ASASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVS 199

Query: 240 --------VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                    A V  GCG    G      VA  G+LG G+   S  +Q  +  +  + FS+
Sbjct: 200 GDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSH 259

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    T         G+  V    + TPL+  P +   Y V L  I VGG+ ++ +  +
Sbjct: 260 CL---DTVNGGGIFAIGN-VVQPKVKTTPLV--PGM-PHYNVVLKTIDVGGSTLQ-LPTN 311

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS--SLKRAPDFSLFDTCFDLS 403
           +F +   G+ G IIDSGT++  L    Y A+  A  +     +LK   DF     CF  S
Sbjct: 312 IFDIG-GGSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDF----LCFQYS 366

Query: 404 GKTEVKVPTVVLHFRGADVSLPATNY---LIPVDSSGTFCFAF--AGTMS----GLSIIG 454
           G  +   P V  HF G    LP   Y    +  ++   +C  F   G  S     + ++G
Sbjct: 367 GSVDNGFPEVTFHFDG---DLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLG 423

Query: 455 NIQQQGFRVVYDLAASRIGFAPRGCA 480
           ++      VVYDL    IG+    C+
Sbjct: 424 DLALSNKLVVYDLENQVIGWTNYNCS 449


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 110/377 (29%), Positives = 160/377 (42%), Gaps = 61/377 (16%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ CK C S  DP F P  S ++  V C 
Sbjct: 90  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC- 148

Query: 195 SPLCRKLDSSGCN---RRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCG 248
                   +  CN    R  C Y+  Y + S + G    + ++F         R   GC 
Sbjct: 149 --------TWQCNCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCE 200

Query: 249 HDNEGLFVA--AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTS---------AK 295
           +D  G      A G++GLGRG LS   Q   +   +  FS C                + 
Sbjct: 201 NDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISP 260

Query: 296 PSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
           P+ MVF  S             +P    +Y ++L  I V G  +  +   +F     G  
Sbjct: 261 PADMVFTHS-------------DPVRSPYYNIDLKEIHVAGKRLH-LNPKVFD----GKH 302

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV--- 410
           G ++DSGT+   L   A++A + A      SLKR   PD    D CF  SG  E+ V   
Sbjct: 303 GTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICF--SG-AEINVSQL 359

Query: 411 ----PTVVLHF-RGADVSLPATNYLIPVDS-SGTFCF-AFAGTMSGLSIIGNIQQQGFRV 463
               P V + F  G  +SL   NYL       G +C   F+      +++G I  +   V
Sbjct: 360 SKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLV 419

Query: 464 VYDLAASRIGFAPRGCA 480
           +YD   S+IGF    C+
Sbjct: 420 MYDREHSKIGFWKTNCS 436


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 163/367 (44%), Gaps = 42/367 (11%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C+ C    DP F P +S ++  V C 
Sbjct: 85  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN 144

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA---RVALGCGHDN 251
             +    D  G N    C+Y+  Y + S + G    + ++F         R   GC +  
Sbjct: 145 --MDCNCDHDGVN----CVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENVE 198

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFG----- 302
            G   +  A G++GLGRG+LS   Q   +   N  FS C           +MV G     
Sbjct: 199 TGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGG--GAMVLGGIPPP 256

Query: 303 -DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
            D   SR+        +P    +Y +EL  I V G  ++ ++ S F        G ++DS
Sbjct: 257 PDMVFSRS--------DPYRSPYYNIELKEIHVAGKPLK-LSPSTFDR----KHGTVLDS 303

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV----PTVVL 415
           GT+   L   A++A RDA    + +LK+   PD +  D CF  +G+   ++    P V +
Sbjct: 304 GTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDM 363

Query: 416 HF-RGADVSLPATNYLIP-VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIG 473
            F  G  +SL   NYL       G +C          +++G I  +   V YD    +IG
Sbjct: 364 VFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIG 423

Query: 474 FAPRGCA 480
           F    C+
Sbjct: 424 FWKTNCS 430


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 166/386 (43%), Gaps = 52/386 (13%)

Query: 128 ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSR 186
           I G     G+Y+T + VG PPR  ++ +DTGSD+ WIQC APC  C     P++ PAK +
Sbjct: 184 IKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEK 243

Query: 187 SFATVPCRSPLCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVA 244
               VP R  LC++L  D + C     C Y++ Y D S ++G  + + +    T   R  
Sbjct: 244 ---IVPPRDLLCQELQGDQNYCATCKQCDYEIEYADRSSSMGVLAKDDMHMIATNGGREK 300

Query: 245 L----GCGHDNEGLFVAAA----GLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSA 294
           L    GC +D +G  + +     G+LGL    +S P+Q   +   +  F +C+       
Sbjct: 301 LDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPNGG 360

Query: 295 KPSSMVFGDSAVSRTAR-FTPLLANPKLDTFYYVELVGISVGGAHVR--GITASLFKLDP 351
               M  GD  V R    + P+   P  D  Y+ E   ++ G   +R  G   S  +   
Sbjct: 361 --GYMFLGDDYVPRWGMTWAPIRGGP--DNLYHTEAQKVNYGDQQLRMHGQAGSSIQ--- 413

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC----FDLSGKTE 407
                VI DSG+S T L    Y  L  A +    S  +    +    C    FD+    +
Sbjct: 414 -----VIFDSGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTTLPLCWKADFDVRYLED 468

Query: 408 VK--VPTVVLHFRGADVSLPATNYLIPVD-----SSGTFCFAFAGTMSGLS-------II 453
           VK     + LHF      +P T  ++P D       G  C    G ++G         I+
Sbjct: 469 VKQFFKPLNLHFGNRWFVIPRTFTILPDDYLIISDKGNVCL---GLLNGAEIDHASTLIV 525

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGC 479
           G++  +G  VVYD    +IG+A   C
Sbjct: 526 GDVSLRGKLVVYDNERRQIGWADSEC 551


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 163/362 (45%), Gaps = 31/362 (8%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP F P  S ++ +V C 
Sbjct: 10  NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKC- 68

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHDN 251
                 +D +  + +  C+Y+  Y + S + G    + ++F         R   GC +  
Sbjct: 69  -----NIDCNCDDEKQQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENME 123

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
            G   +  A G++G+GRG LS       +   N  FS C           +MV G   +S
Sbjct: 124 TGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGG--GAMVLG--GIS 179

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
             +      ++P    +Y ++L  I V G  +  +  ++F     G  G I+DSGT+   
Sbjct: 180 PPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLP-LNPTVFD----GKHGTILDSGTTYAY 234

Query: 368 LTRPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGA 420
           L   A+++ +DA      SLK  R PD +  D CF  +G    ++    P V + F  G 
Sbjct: 235 LPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQ 294

Query: 421 DVSLPATNYLI-PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            + L   NYL       G +C   F       +++G I  +   V+YD   S+IGF    
Sbjct: 295 KLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKIGFWKTN 354

Query: 479 CA 480
           C+
Sbjct: 355 CS 356


>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 500

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 117/382 (30%), Positives = 170/382 (44%), Gaps = 55/382 (14%)

Query: 132 AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK---CYSQTDPVFDPAKSRSF 188
           A G  +Y   +G GTP + + M  DTG  +  ++CA C+    C       FDP++S +F
Sbjct: 140 APGFHDYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLAS--FDPSRSSTF 197

Query: 189 ATVPCRSPLCRKLDSSGCNRRNT--C-LYQVSYGDGSITVGDFSTETLTFR-GTRVARVA 244
           A VPC SP CR    SGC+  +T  C L    +  G++     + + LT      V    
Sbjct: 198 APVPCGSPDCR----SGCSSGSTPSCPLTSFPFLSGAV-----AQDVLTLTPSASVDDFT 248

Query: 245 LGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS 304
            GC   + G  + AAGLL L R   S  ++        FSYCL   +TS+    +  G++
Sbjct: 249 FGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSH-GFLAIGEA 307

Query: 305 AV--SRTARFT---PLLANPKLDTFYYVELVGISVGGAHV----RGITASLFKLDPAGNG 355
            V  +RTAR T   PL+ +P     Y ++L G+S+GG  +       TAS          
Sbjct: 308 DVPHNRTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPIPPHAATAS---------A 358

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSG-KTEVKVPTVV 414
            +++D+    T +    Y  LRDAFR   +   RAP     DTC++ +G + EV +P V 
Sbjct: 359 AMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYNFTGVRHEVLIPLVH 418

Query: 415 LHFRGADVSLPATNYLIPVD------SSGTF----CFAFAGTMSG-------LSIIGNIQ 457
           L FRG           +  D        G F    C AFA   S          ++G + 
Sbjct: 419 LTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGDAEAPLAMVMGTLA 478

Query: 458 QQGFRVVYDLAASRIGFAPRGC 479
           Q    VV+D+   +IGF P  C
Sbjct: 479 QSSMEVVHDVPGGKIGFIPGSC 500


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 114/364 (31%), Positives = 166/364 (45%), Gaps = 39/364 (10%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+S +   + EY   L V TPP  +  + DTGS +VW++C        +      PA S 
Sbjct: 65  VVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKC--------KLPAAHTPASS- 115

Query: 187 SFATVPCRSPLCRKL-DSSGCNR----RNTCLYQVSYGDGSITVGDFSTETLTFRGTRVA 241
           S+A +PC +  C+ L D++ C       N C+Y+ ++ DGS T G  + +  TF      
Sbjct: 116 SYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFS----T 171

Query: 242 RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSM 299
           R+  GC    EGL V   GL+GL  G +S  +Q   +  F  KFSYCLV  S+S   SS 
Sbjct: 172 RLDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSS 231

Query: 300 V-FGDSAV---SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
           + FG  A+   S  A  TPL+A  +  +FY + L  I V G  V   T +          
Sbjct: 232 LNFGSHAIVSSSPGAATTPLVAG-RNKSFYTIALDSIKVAGKPVPLQTTTT--------- 281

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV----P 411
            +I+DSGT +T L +     L  A  A     +     +L+  C+D+  +    V    P
Sbjct: 282 KLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKSIP 341

Query: 412 TVVLHF-RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAAS 470
            V L    G +V LP  N  +  +   T C A   +     I+GN+ QQ   V +DL   
Sbjct: 342 DVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLERR 401

Query: 471 RIGF 474
            + F
Sbjct: 402 TVSF 405


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 162/359 (45%), Gaps = 32/359 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP-CKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           Y   L +GTPP+ V  ++D G ++VW QCA  C++C+ Q  P+FD   S +F   PC + 
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110

Query: 197 LCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE-G 253
           +C  +   S   +    C Y+ S   G  TVG   T+ +       AR+A GC   +E  
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGR-TVGRIGTDAVAIGTAATARLAFGCAVASEMD 169

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA----VSRT 309
               ++G +GLGR  LS   Q        FSYCL    T  K S++  G SA      + 
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNA---TAFSYCLAPPDT-GKSSALFLGASAKLAGAGKG 225

Query: 310 ARFTPLLA-----NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A  TP +      N  L   Y + L  I  G A +           P     + + + T 
Sbjct: 226 AGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIA---------MPQSGNTITVSTATP 276

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           VT L    Y  LR A      +    P    +D CF  +  +    P +VL F+ GA+++
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASG-GAPDLVLAFQGGAEMT 335

Query: 424 LPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +P ++YL    +  T C A  G+  + G+SI+G++QQ    +++DL    + F P  C+
Sbjct: 336 VPVSSYLFDAGND-TACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/412 (28%), Positives = 183/412 (44%), Gaps = 46/412 (11%)

Query: 105 SAVRVPPRNRSRGRANGGFSSSVISGLA----QGS------GEYFTRLGVGTPPRYVYMV 154
           S +R   R R      GG   S + G+     QGS      G YFT++ +G+PP    + 
Sbjct: 57  SELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQ 116

Query: 155 LDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCRKL---DSSGC 206
           +DTGSD++W+ C+ C  C   +        FD   S +  +V C  P+C  +    ++ C
Sbjct: 117 IDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC 176

Query: 207 NRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVARVALGCGHDNEGLFV-- 256
           +  N C Y   YGDGS T G + T+T  F             A +  GC     G     
Sbjct: 177 SENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKS 236

Query: 257 --AAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
             A  G+ G G+G+LS  +Q   R      FS+CL  +   +     V G+  V     +
Sbjct: 237 DKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVPGMV-Y 293

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           +PL+ +      Y + L+ I V G  +  + A++F  + +   G I+D+GT++T L + A
Sbjct: 294 SPLVPS---QPHYNLNLLSIGVNG-QMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEA 347

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLI 431
           Y    +A     S L   P  S  + C+ +S       P+V L+F  GA + L   +YL 
Sbjct: 348 YDLFLNAISNSVSQLV-TPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLF 406

Query: 432 P---VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                D +  +C  F       +I+G++  +    VYDLA  RIG+A   C+
Sbjct: 407 HYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 458


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 115/222 (51%), Gaps = 26/222 (11%)

Query: 89  IQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPP 148
           +Q  + RV S      S  ++P                + SG+   +  Y   +G+G+  
Sbjct: 32  MQNRIRRVASTHNVEASQTQIP----------------LSSGINLQTLNYIVTMGLGS-- 73

Query: 149 RYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKL-----DS 203
           + + +++DT SD+ W+QC PC  CY+Q  P+F P+ S S+ +V C S  C+ L     ++
Sbjct: 74  KNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNT 133

Query: 204 SGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGL 261
             C   N  TC Y V+YGDGS T GD   E L+F G  V+    GCG +N+GLF   +GL
Sbjct: 134 GACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGVSVSDFVFGCGRNNKGLFGGVSGL 193

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGD 303
           +GLGR  LS  +QT   F   FSYCL   + +    S+V G+
Sbjct: 194 MGLGRSYLSLVSQTNATFGGVFSYCL-PTTEAGSSGSLVMGN 234


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/411 (28%), Positives = 182/411 (44%), Gaps = 46/411 (11%)

Query: 105 SAVRVPPRNRSRGRANGGFSSSVISGLA----QGS------GEYFTRLGVGTPPRYVYMV 154
           S +R   R R      GG   S + G+     QGS      G YFT++ +G+PP    + 
Sbjct: 57  SELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQ 116

Query: 155 LDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCRKL---DSSGC 206
           +DTGSD++W+ C+ C  C   +        FD   S +  +V C  P+C  +    ++ C
Sbjct: 117 IDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC 176

Query: 207 NRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVARVALGCGHDNEGLFV-- 256
           +  N C Y   YGDGS T G + T+T  F             A +  GC     G     
Sbjct: 177 SENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKS 236

Query: 257 --AAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
             A  G+ G G+G+LS  +Q   R      FS+CL  +   +     V G+  V     +
Sbjct: 237 DKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVPGMV-Y 293

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           +PL+ +      Y + L+ I V G  +  + A++F  + +   G I+D+GT++T L + A
Sbjct: 294 SPLVPS---QPHYNLNLLSIGVNG-QMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEA 347

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLI 431
           Y    +A     S L   P  S  + C+ +S       P+V L+F  GA + L   +YL 
Sbjct: 348 YDLFLNAISNSVSQLV-TPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLF 406

Query: 432 P---VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                D +  +C  F       +I+G++  +    VYDLA  RIG+A   C
Sbjct: 407 HYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 111/406 (27%), Positives = 177/406 (43%), Gaps = 50/406 (12%)

Query: 114 RSRGRANGGFS-SSVISGLAQGS------GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC 166
           ++  RA  G S ++++    QG+      G Y+TR+ +GTPPR  Y+ +DTGSD++W+ C
Sbjct: 10  KAHDRARHGRSLNTIVDFTLQGTADPYVAGLYYTRIELGTPPRPFYVQIDTGSDILWVNC 69

Query: 167 APCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLC---RKLDSSGCNRRNTCLYQVSY 218
            PC  C   +        FDP  S + + + C    C    ++  S C     C Y   Y
Sbjct: 70  KPCNACPLTSGLGVALNFFDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRYCGYSFEY 129

Query: 219 GDGSITVGDFSTETLTFR--------GTRVARVALGCGHDNEGLFV----AAAGLLGLGR 266
           GDGS T+G + ++   +             A++  GC ++  G       A  G+ G G+
Sbjct: 130 GDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQ 189

Query: 267 GRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR-FTPLLANPKLDT 323
             LS  +Q   +    + FS+CL      A P   +     ++     +TP++ +     
Sbjct: 190 NDLSVVSQLNSQGLAPKIFSHCL----EGADPGGGILVLGEITEPGMVYTPIVPS---QP 242

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
            Y + L GI+V G  +  I   +F        G IID GT++  L   AY    +   A 
Sbjct: 243 HYNLNLQGIAVNGQQLS-IDPQVFAT--TNTRGTIIDCGTTLAYLAEEAYEPFVNTIIA- 298

Query: 384 ASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPATNYLIPV---DSSGTFC 440
           A S    P     + CF      +   P+V L+F GA + L   +YLI     DSS  +C
Sbjct: 299 AVSQSTQPFMLKGNPCFLTVHSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWC 358

Query: 441 FAF------AGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             +      A   S ++I+G++  +    VYDL   RIG+    C+
Sbjct: 359 IGWQKSGQQATDSSKMTILGDLVLKDKVFVYDLENQRIGWTSFDCS 404


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 86/267 (32%), Positives = 127/267 (47%), Gaps = 26/267 (9%)

Query: 78  NRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVI-SGLAQGSG 136
           N T   L    IQR   R+  +               +RG A     + V  + +    G
Sbjct: 41  NLTEHELLRRAIQRSRYRLAGI-------------GMARGEAASARKAVVAETPIMPAGG 87

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           EY  +LG+GTPP      +DT SD++W QC PC  CY Q DP+F+P  S ++A +PC S 
Sbjct: 88  EYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSD 147

Query: 197 LCRKLDSSGCNRRN--TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGL 254
            C +LD   C   +  +C Y  +Y   + T G  + + L         VA GC   + G 
Sbjct: 148 TCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGG 207

Query: 255 F--VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG---DSAVSRT 309
                A+G++GLGRG LS  +Q      R+F+YCL     S  P  +V G   D+A + T
Sbjct: 208 APPPQASGVVGLGRGPLSLVSQLS---VRRFAYCLPP-PASRIPGKLVLGADADAARNAT 263

Query: 310 ARF-TPLLANPKLDTFYYVELVGISVG 335
            R   P+  +P+  ++YY+ L G+ +G
Sbjct: 264 NRIAVPMRRDPRYPSYYYLNLDGLLIG 290


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 116/414 (28%), Positives = 181/414 (43%), Gaps = 45/414 (10%)

Query: 96  VKSLTAFAESAVRVPPRNRSRGRANGGFSSSV--ISGLAQGSGEYFTRLGVGTPPRYVYM 153
           V  LT    +A R+P  +  RG  +G   ++   +      +G Y TRL +GTP +   +
Sbjct: 48  VLPLTLAYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFAL 107

Query: 154 VLDTGSDVVWIQCAPCKKCYSQT----------DPVFDPAKSRSFATVPCRSPLCRKLDS 203
           ++D+GS V ++ CA C++C +            DP F P  S +++ V C       +D 
Sbjct: 108 IVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKC------NVDC 161

Query: 204 SGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDNEG-LFVAAA 259
           +  N R+ C Y+  Y + S + G    + ++F      +  R   GC +   G LF   A
Sbjct: 162 TCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHA 221

Query: 260 -GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLAN 318
            G++GLGRG+LS   Q   +     S+ L          +MV G         F+   +N
Sbjct: 222 DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAPPDMVFS--HSN 279

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPA---GNGGVIIDSGTSVTRLTRPAYIA 375
           P    +Y +EL  I V G  +R        LDP       G ++DSGT+   L   A++A
Sbjct: 280 PVRSPYYNIELKEIHVAGKALR--------LDPKIFNSKHGTVLDSGTTYAYLPEQAFVA 331

Query: 376 LRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADVSLPATN 428
            +DA     +SLK  R PD +  D CF  +G+   ++    P V + F  G  +SL   N
Sbjct: 332 FKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSPEN 391

Query: 429 YLIPVDS-SGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           YL       G +C   F       +++G I  +   V YD    +IGF    C+
Sbjct: 392 YLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 445


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 116/414 (28%), Positives = 181/414 (43%), Gaps = 45/414 (10%)

Query: 96  VKSLTAFAESAVRVPPRNRSRGRANGGFSSSV--ISGLAQGSGEYFTRLGVGTPPRYVYM 153
           V  LT    +A R+P  +  RG  +G   ++   +      +G Y TRL +GTP +   +
Sbjct: 47  VLPLTLAYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFAL 106

Query: 154 VLDTGSDVVWIQCAPCKKCYSQT----------DPVFDPAKSRSFATVPCRSPLCRKLDS 203
           ++D+GS V ++ CA C++C +            DP F P  S +++ V C       +D 
Sbjct: 107 IVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKC------NVDC 160

Query: 204 SGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDNEG-LFVAAA 259
           +  N R+ C Y+  Y + S + G    + ++F      +  R   GC +   G LF   A
Sbjct: 161 TCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHA 220

Query: 260 -GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLAN 318
            G++GLGRG+LS   Q   +     S+ L          +MV G         F+   +N
Sbjct: 221 DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPAPPDMVFS--HSN 278

Query: 319 PKLDTFYYVELVGISVGGAHVRGITASLFKLDPA---GNGGVIIDSGTSVTRLTRPAYIA 375
           P    +Y +EL  I V G  +R        LDP       G ++DSGT+   L   A++A
Sbjct: 279 PVRSPYYNIELKEIHVAGKALR--------LDPKIFNSKHGTVLDSGTTYAYLPEQAFVA 330

Query: 376 LRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADVSLPATN 428
            +DA     +SLK  R PD +  D CF  +G+   ++    P V + F  G  +SL   N
Sbjct: 331 FKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSPEN 390

Query: 429 YLIPVDS-SGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           YL       G +C   F       +++G I  +   V YD    +IGF    C+
Sbjct: 391 YLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 444


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 137/482 (28%), Positives = 201/482 (41%), Gaps = 69/482 (14%)

Query: 21  ASLQYQTFVLNSLPTPSTLSWPESVSVS-ESESSLPLPAPDAESS---LSLRLHH-VDSL 75
           ASL  Q   + SL     LS    V VS +S + L LP+P  E S   + L LHH V   
Sbjct: 2   ASLWTQLISMASL----LLSLARWVPVSGDSSNVLLLPSPHHEGSRPAMILPLHHSVPDS 57

Query: 76  SFNRTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGS 135
           SF+      FN R Q             ES     P  R R          +   L + +
Sbjct: 58  SFSH-----FNPRRQ-----------LKESDSEHHPNARMR----------LYDDLLR-N 90

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y  RL +GTPP+   +++DTGS V ++ C+ C+ C S  DP F P  S ++  V C  
Sbjct: 91  GYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC-- 148

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA--RVALGCGHDNE 252
                   +  N R  C Y+  Y + S + G    + ++F   T ++  R   GC +D  
Sbjct: 149 ----TWQCNCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCENDET 204

Query: 253 GLFVA--AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSR 308
           G      A G++GLGRG LS   Q   +   +  FS C            +      +S 
Sbjct: 205 GDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVL----GGISP 260

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
            A      ++P    +Y ++L  I V G  +  +   +F     G  G ++DSGT+   L
Sbjct: 261 PADMVFTRSDPVRSPYYNIDLKEIHVAGKRLH-LNPKVFD----GKHGTVLDSGTTYAYL 315

Query: 369 TRPAYIALRDAFRAGASSLKR--APDFSLFDTCF-----DLSGKTEVKVPTVVLHF-RGA 420
              A++A + A      SLKR   PD    D CF     D+S +     P V + F  G 
Sbjct: 316 PESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVS-QISKSFPVVEMVFGNGH 374

Query: 421 DVSLPATNYLIPVDS-SGTFCF-AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            +SL   NYL       G +C   F+      +++G I  +   V+YD   ++IGF    
Sbjct: 375 KLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFWKTN 434

Query: 479 CA 480
           C+
Sbjct: 435 CS 436


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 163/359 (45%), Gaps = 32/359 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP-CKKCYSQTDPVFDPAKSRSFATVPCRSP 196
           Y   L +GTPP+ V  ++D G ++VW QCA  C++C+ Q  P+FD   S +F   PC + 
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110

Query: 197 LCRKLD--SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE-G 253
           +C  +   S   +    C Y+ S   G  TVG   T+ +       AR+A GC   +E  
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGR-TVGRIGTDAVAIGTAATARLAFGCAVASEMD 169

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA----VSRT 309
               ++G +GLGR  LS   Q        FSYCL    T  K S++  G SA      + 
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNA---TAFSYCLAPPDT-GKSSALFLGASAKLAGAGKG 225

Query: 310 ARFTPLLA-----NPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A  TP +      +  L   Y + L  I  G A +           P     +++ + T 
Sbjct: 226 AGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIA---------MPQSGNTIMVSTATP 276

Query: 365 VTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVS 423
           VT L    Y  LR A      +    P    +D CF  +  +    P +VL F+ GA+++
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASG-GAPDLVLAFQGGAEMT 335

Query: 424 LPATNYLIPVDSSGTFCFAFAGT--MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +P ++YL    +  T C A  G+  + G+SI+G++QQ    +++DL    + F P  C+
Sbjct: 336 VPVSSYLFDAGND-TACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 109/372 (29%), Positives = 170/372 (45%), Gaps = 37/372 (9%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD---PV--FDPAKSRSFAT 190
           G YFTR+ +G+PP+  Y+ +DTGSDV+W+ C  C  C   +    P+  FDP  S + + 
Sbjct: 81  GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 140

Query: 191 VPCRSPLCR---KLDSSGCNRR-NTCLYQVSYGDGSITVGDFSTETLTFR---GTRV--- 240
           + C    C    +   +GC+ + N C+Y   YGDGS T G + ++ L F    G+ V   
Sbjct: 141 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 200

Query: 241 -ARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTS 293
            A +  GC     G       A  G+ G G+  +S  +Q   +    + FS+CL      
Sbjct: 201 SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGG 260

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
                +      V     ++PL+ +      Y + L  ISV G  +  I   +F    + 
Sbjct: 261 GGILVLG---EIVEEDIVYSPLVPS---QPHYNLNLQSISVNGKSL-AIDPEVFA--TST 311

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
           N G I+DSGT++  L   AY     A     S   R P  S    C+ ++   +   PTV
Sbjct: 312 NRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVR-PLLSKGTQCYLITSSVKGIFPTV 370

Query: 414 VLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTM-SGLSIIGNIQQQGFRVVYDLA 468
            L+F  G  ++L   +YL+  +S G    +C  F      G++I+G++  +    VYDLA
Sbjct: 371 SLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLA 430

Query: 469 ASRIGFAPRGCA 480
             RIG+A   C+
Sbjct: 431 GQRIGWANYDCS 442


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 134/433 (30%), Positives = 185/433 (42%), Gaps = 71/433 (16%)

Query: 93  VLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVY 152
           ++RV+ L          PP NR R R +   +  V                VG PP+ V 
Sbjct: 32  LMRVQQLVL--PPTTHSPPPNRLRFRHDVSLTVPV---------------AVGAPPQNVT 74

Query: 153 MVLDTGSDVVWIQCAPCKKCYS---QTDPVFDPAKSRSFATVPCRSPLC----RKLDSS- 204
           MVLDTGS++ W++C   +   +   Q    F+ + S ++A   C SP C    R L    
Sbjct: 75  MVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPECQWRGRDLPVPP 134

Query: 205 --GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC-------GHDNEGLF 255
                   +C   +SY D S   G  + +T    G        GC          N    
Sbjct: 135 FCAGPPSXSCRVSLSYADASSADGILAADTFLLGGAPPVXALFGCVTSYSSATATNSSDS 194

Query: 256 VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFG--DSAVSRTARFT 313
            AA GLLG+ RG LSF TQT      +F+YC+   +    P  +V G   +A++    +T
Sbjct: 195 EAATGLLGMNRGSLSFVTQTA---TLRFAYCI---APGDGPGLLVLGGDGAALAPQLNYT 248

Query: 314 PLLAN----PKLDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
           PL+      P  D   Y V+L GI VG A +  I  S+   D  G G  ++DSGT  T L
Sbjct: 249 PLIQISRPLPYFDRVAYSVQLEGIRVGAALLP-IPKSVLAPDHTGAGQTMVDSGTQFTFL 307

Query: 369 TRPAYIALRDAFRAGASSLKRAP----DFSL---FDTCFDLS----GKTEVKVPTVVLHF 417
              AY  L+  F    S+L  AP    DF     FD CF  S          +P V L  
Sbjct: 308 LADAYAPLKGEFLNQTSALL-APLGESDFVFQGAFDACFRASEARVAAASXMLPEVGLVL 366

Query: 418 RGADVSLPATN--YLIPVDSSG------TFCFAFAGT-MSGLS--IIGNIQQQGFRVVYD 466
           RGA+V++      Y +P +  G       +C  F  + M+G+S  +IG+  QQ   V YD
Sbjct: 367 RGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYD 426

Query: 467 LAASRIGFAPRGC 479
           L   R+GFAP  C
Sbjct: 427 LQNGRVGFAPARC 439


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 109/372 (29%), Positives = 170/372 (45%), Gaps = 37/372 (9%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD---PV--FDPAKSRSFAT 190
           G YFTR+ +G+PP+  Y+ +DTGSDV+W+ C  C  C   +    P+  FDP  S + + 
Sbjct: 66  GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 125

Query: 191 VPCRSPLCR---KLDSSGCNRR-NTCLYQVSYGDGSITVGDFSTETLTFR---GTRV--- 240
           + C    C    +   +GC+ + N C+Y   YGDGS T G + ++ L F    G+ V   
Sbjct: 126 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 185

Query: 241 -ARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTS 293
            A +  GC     G       A  G+ G G+  +S  +Q   +    + FS+CL      
Sbjct: 186 SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGG 245

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
                +      V     ++PL+ +      Y + L  ISV G  +  I   +F    + 
Sbjct: 246 GGILVLG---EIVEEDIVYSPLVPS---QPHYNLNLQSISVNGKSL-AIDPEVFAT--ST 296

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
           N G I+DSGT++  L   AY     A     S   R P  S    C+ ++   +   PTV
Sbjct: 297 NRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVR-PLLSKGTQCYLITSSVKGIFPTV 355

Query: 414 VLHFRGA-DVSLPATNYLIPVDSSG---TFCFAFAGTM-SGLSIIGNIQQQGFRVVYDLA 468
            L+F G   ++L   +YL+  +S G    +C  F      G++I+G++  +    VYDLA
Sbjct: 356 SLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLA 415

Query: 469 ASRIGFAPRGCA 480
             RIG+A   C+
Sbjct: 416 GQRIGWANYDCS 427


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 167/366 (45%), Gaps = 39/366 (10%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++D+GS V ++ CA C++C +  DP F P  S +++ V C 
Sbjct: 82  NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCS 141

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT----RVARVALGCGHD 250
           +      D +  + ++ C Y+  Y + S + G    + ++F GT    +  R   GC + 
Sbjct: 142 A------DCTCDSDKSQCTYERQYAEMSSSSGVLGEDIVSF-GTESELKPQRAVFGCENS 194

Query: 251 NEG-LFVAAA-GLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAV 306
             G LF   A G++GLGRG+LS   Q   +      FS C           +MV G    
Sbjct: 195 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG--GAMVLGAMPA 252

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP---AGNGGVIIDSGT 363
                F+   ++P    +Y +EL  I V G  +R        LDP       G ++DSGT
Sbjct: 253 PPDMVFS--RSDPVRSPYYNIELKEIHVAGKALR--------LDPRIFDSKHGTVLDSGT 302

Query: 364 SVTRLTRPAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF 417
           +   L   A++A +DA  +    LK  R PD +  D CF  +G+   ++    P V + F
Sbjct: 303 TYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVF 362

Query: 418 -RGADVSLPATNYLIPVDS-SGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGF 474
             G  +SL   NYL       G +C   F       +++G I  +   V YD    +IGF
Sbjct: 363 GDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGF 422

Query: 475 APRGCA 480
               C+
Sbjct: 423 WKTNCS 428


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 120/386 (31%), Positives = 173/386 (44%), Gaps = 52/386 (13%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           SGLA  +G YFTR+G+GTP +  Y+ +DTGSD++W+ C  C  C  +++      ++DP 
Sbjct: 81  SGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPR 140

Query: 184 KSRSFATVPCRSPLCRKLDSSG----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR 239
            S+S   V C    C   +  G    C   + C Y +SYGDGS T G F T+ L +    
Sbjct: 141 GSQSGELVTCDQQFCVA-NYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVS 199

Query: 240 --------VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                    A V+ GCG    G      +A  G+LG G+   S  +Q     +  + F++
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL    T         G+  V    + TPL+  P +   Y V L GI VGG  + G+  +
Sbjct: 260 CL---DTVNGGGIFAIGN-VVQPKVKTTPLV--PDM-PHYNVILKGIDVGGTAL-GLPTN 311

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF--RAGASSLKRAPDFSLFDTCFDLS 403
           +F  D   + G IIDSGT++  +    Y AL      +    S++   DFS    CF  S
Sbjct: 312 IF--DSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS----CFQYS 365

Query: 404 GKTEVKVPTVVLHFRGADVSLPAT--NYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGF 461
           G  +   P V  HF G DVSL  +  +YL   +    +C  F     G +  G       
Sbjct: 366 GSVDDGFPEVTFHFEG-DVSLIVSPHDYLFQ-NGKNLYCMGFQ-NGGGKTKDGKDLGLLG 422

Query: 462 R-------VVYDLAASRIGFAPRGCA 480
                   V+YDL    IG+A   C+
Sbjct: 423 DLVLSNKLVLYDLENQAIGWADYNCS 448


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 168/373 (45%), Gaps = 38/373 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD---PV--FDPAKSRSFAT 190
           G Y+TRL +GTPPR  Y+ +DTGSDV+W+ C  C  C   +    P+  FDP  S + + 
Sbjct: 50  GLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASL 109

Query: 191 VPCRSPLC----RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF--------RGT 238
           + C    C    +  DS    + N C Y   YGDGS T G + ++ L F           
Sbjct: 110 ISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNN 169

Query: 239 RVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRST 292
             A +  GC     G       A  G+ G G+  +S  +Q   +    R FS+CL  +  
Sbjct: 170 SSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCL--KGD 227

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
            +    +V G+  V     +TPL+ +      Y + +  ISV G     I  S+F    +
Sbjct: 228 DSGGGILVLGE-IVEPNIVYTPLVPS---QPHYNLNMQSISVNG-QTLAIDPSVFG--TS 280

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
            + G IIDSGT++  L   AY     A  +  S   R P  S  + C+ +S       P 
Sbjct: 281 SSQGTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVR-PYLSKGNHCYLISSSINDIFPQ 339

Query: 413 VVLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTM-SGLSIIGNIQQQGFRVVYDL 467
           V L+F  GA + L   +YLI   S G    +C  F      G++I+G++  +    VYD+
Sbjct: 340 VSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDI 399

Query: 468 AASRIGFAPRGCA 480
           A  RIG+A   C+
Sbjct: 400 ANQRIGWANYDCS 412


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 168/369 (45%), Gaps = 36/369 (9%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVP 192
           YFT++ +G+PP    + +DTGSD++W+ C+ C  C   +        FD   S +  +V 
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 193 CRSPLCRKL---DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVA 241
           C  P+C  +    ++ C+  N C Y   YGDGS T G + T+T  F             A
Sbjct: 165 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 224

Query: 242 RVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAK 295
            +  GC     G       A  G+ G G+G+LS  +Q   R      FS+CL  +   + 
Sbjct: 225 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSG 282

Query: 296 PSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNG 355
               V G+  V     ++PL+ +      Y + L+ I V G  +  + A++F  + +   
Sbjct: 283 GGVFVLGEILVPGMV-YSPLVPS---QPHYNLNLLSIGVNG-QMLPLDAAVF--EASNTR 335

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVL 415
           G I+D+GT++T L + AY    +A     S L   P  S  + C+ +S       P+V L
Sbjct: 336 GTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-TPIISNGEQCYLVSTSISDMFPSVSL 394

Query: 416 HFR-GADVSLPATNYLIP---VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASR 471
           +F  GA + L   +YL      D +  +C  F       +I+G++  +    VYDLA  R
Sbjct: 395 NFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQR 454

Query: 472 IGFAPRGCA 480
           IG+A   C+
Sbjct: 455 IGWASYDCS 463


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 170/368 (46%), Gaps = 39/368 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKS-------RSFATVPCR 194
           L +GTPP+   +VLDTGS + WIQC   KK   +  P+  P  +        SF+ +PC 
Sbjct: 70  LPIGTPPQPTDLVLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSSSFSLLPCN 128

Query: 195 SPLCR------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGC 247
            P+C+       L +S C++   C Y   Y DG++  G+   E  TF +      V LGC
Sbjct: 129 HPICKPRIPDFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGC 187

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
              +        G+LG+ RGRLSF +Q       KFSYC+  R+ S        GD+  S
Sbjct: 188 AQAS----TENRGILGMNRGRLSFISQAKI---SKFSYCVPSRTGSNPTGLFYLGDNPNS 240

Query: 308 RTARFTPLL------ANPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
              ++  +L      ++P LD   Y + +  I + G  +  +  + FK D  G+G  +ID
Sbjct: 241 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLN-VPPAAFKPDAGGSGQTMID 299

Query: 361 SGTSVTRLTRPAYIALR-DAFRAGASSLKRAPDFS-LFDTCFDLSGKTEV--KVPTVVLH 416
           SG+ +T L   AY  ++ +  R   + +K+   ++ + D CFD     EV  ++  +   
Sbjct: 300 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 359

Query: 417 F-RGADVSLPATNYLIPVDSSGTFCFAFAGTMS---GLSIIGNIQQQGFRVVYDLAASRI 472
           F  G ++ +     ++     G  C     +     G +IIG + QQ   V YDLA  R+
Sbjct: 360 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 419

Query: 473 GFAPRGCA 480
           GF    C+
Sbjct: 420 GFGGAECS 427


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 166/382 (43%), Gaps = 45/382 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA---PCKKC-YSQTD----PVFDPAKSRS 187
           G +   L  GTPP+ +  ++DTGSDVVW  C     C  C +S  D    P+FDP  S S
Sbjct: 76  GGHSISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSS 135

Query: 188 FATVPCRSPLCRK----LDSSGCNRRN--------TCLYQVSYGDGSITVGDFSTETLTF 235
              + CR+P C          GC R N         C Y   YG G+ + G F  E L F
Sbjct: 136 SKILDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGA-SSGYFLLENLKF 194

Query: 236 RGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRS-TSA 294
               +    LGC   +    +++  L G GR   S P Q G    +KF+YCL        
Sbjct: 195 PRKTIRNFLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGV---KKFAYCLNSHDYDDT 250

Query: 295 KPSSMVFGDSAVSRTA--RFTPLLANPKLDTFYY-VELVGISVGGAHVRGITASLFKLDP 351
           + S  +  D    +T    +TP L +P    FYY + +  I +G   +R I +       
Sbjct: 251 RNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLR-IPSKYLAPGS 309

Query: 352 AGNGGVIIDSGT-SVTRLTRPAYIALRDAFRAGASSLKRAPDFSL---FDTCFDLSGKTE 407
            G  GVIIDSG      +T P +  + +  +   S  +R+ +         C++ +G   
Sbjct: 310 DGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKS 369

Query: 408 VKVPTVVLHFR-GADVSLPATNY--LIPVDSSGTFCFAFAGTMSGLS-------IIGNIQ 457
           +K+P ++  FR GA++ +P  NY  + P +S   F     GT + L        I+GN Q
Sbjct: 370 IKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGT-NALEITPDPSIILGNSQ 428

Query: 458 QQGFRVVYDLAASRIGFAPRGC 479
              + V YDL   R GF  + C
Sbjct: 429 HVDYYVEYDLKNDRFGFRRQTC 450


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 169/373 (45%), Gaps = 48/373 (12%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCRS 195
           Y     +GTPP+ V  ++D   ++VW QCA C+   C+ Q  PVFDP+ S ++    C S
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121

Query: 196 PLCRKLDSSGCNRRNTCLYQVS--YGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEG 253
           PLC+ + +  C+    C Y+    +GD   T G  ST+ +   G    R+A GC   ++G
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAI-GNAEGRLAFGCVVASDG 177

Query: 254 LFVAA----AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA---- 305
               A    +G +GLGR   S     G+     FSYCL       K S++  G SA    
Sbjct: 178 SIDGAMDGPSGFVGLGRTPWSL---VGQSNVTAFSYCLAPHGPGKK-SALFLGASAKLAG 233

Query: 306 VSRTARFTPLL-------ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
             ++   TPLL       ++   D +Y V+L GI  G   V   +        +G G + 
Sbjct: 234 AGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAAS--------SGGGAIT 285

Query: 359 I---DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVL 415
           I   ++   ++ L   AY AL     A   S   A     FD CF  +  +   VP +V 
Sbjct: 286 ILQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVS--GVPDLVF 343

Query: 416 HFR-GADVSLPATNYLI-PVDSSGTFCFAFAGTM------SGLSIIGNIQQQGFRVVYDL 467
            F+ GA ++ P + YL+   + +GT C +   +        G+SI+G++ Q+    ++DL
Sbjct: 344 TFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDL 403

Query: 468 AASRIGFAPRGCA 480
               + F P  C+
Sbjct: 404 EKETLSFEPADCS 416


>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 508

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 111/375 (29%), Positives = 168/375 (44%), Gaps = 46/375 (12%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           +Y+T + +G P R  ++ +DTGS + WIQC APC  C     P++ PAK      VP R 
Sbjct: 128 QYYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKEN---IVPPRD 184

Query: 196 PLCRKL--DSSGCNRRNTCLYQVSYGDGSITVGDFS---TETLTFRGTRV-ARVALGCGH 249
             C++L  + + C+    C Y+++Y D S + G  +    E +T  G R    +  GC H
Sbjct: 185 SHCQELQGNQNYCDTCKQCDYEIAYADRSSSAGVLARDNMELITADGERENMDLVFGCAH 244

Query: 250 DNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCL-VDRSTSAKPSSMVFG 302
           D +G  +    ++ G+LGL  G +S PTQ  ++   +  F +C+  D S SA    M  G
Sbjct: 245 DQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSAY---MFLG 301

Query: 303 DSAVSRTAR-FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDS 361
           D  V R    + P+   P+      V+ V       +VR     L +        VI DS
Sbjct: 302 DDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQ--------VIFDS 353

Query: 362 GTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC----FDLSGKTEVKV--PTVVL 415
           G+S T      Y +L  +  A +    R         C    F +    +VK     ++L
Sbjct: 354 GSSYTYFPHEIYTSLITSLEAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLL 413

Query: 416 HFRGADVSLPAT------NYLIPVDSSGTFCFA-FAGTMSGLS---IIGNIQQQGFRVVY 465
           HF    + +P T      NYLI +   G  C     GT  G S   +IG++  +G  V Y
Sbjct: 414 HFSKTWLVIPRTFEISPENYLI-ISGKGNVCLGVLDGTEIGHSSTIVIGDVSLRGKLVAY 472

Query: 466 DLAASRIGFAPRGCA 480
           D  A++IG+A   CA
Sbjct: 473 DNDANQIGWAQSDCA 487


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 116/409 (28%), Positives = 176/409 (43%), Gaps = 55/409 (13%)

Query: 114 RSRGRANGGFSSSVISGL-AQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKC 172
           RS    +G  S  + + L     G +   L  GTPP+ +  ++DTGS VVW   APC   
Sbjct: 62  RSHHLKHGKASPLIQTSLFPHSHGGHTIPLSFGTPPQKLSFLVDTGSHVVW---APCTTH 118

Query: 173 YSQTD---------PVFDPAKSRSFATVPCRSPLCRKLDSS----GCNRRN--------T 211
           Y+ T+         P+F+P  S S   + CR P C    S     GC R N         
Sbjct: 119 YTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCNGNSKKCSHA 178

Query: 212 C-LYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC--GHDNEGLFVAAAGLLGLGRGR 268
           C  Y + YG G+ + G F  E L F G  + +  +GC    D E    ++  L G GR  
Sbjct: 179 CPQYTLQYGTGAAS-GFFLLENLDFPGKTIHKFLVGCTTSADRE---PSSDALAGFGRTM 234

Query: 269 LSFPTQTGRRFNRKFSYCL----VDRSTSAKPSSMVFGDSAVSRTARFTPLLAN-PKLDT 323
            S P Q G    +KF+YCL     D + ++    + + D   ++   + P L N P    
Sbjct: 235 FSLPMQMGV---KKFAYCLNSHDYDDTRNSGKLILDYSDGE-TQGLSYAPFLKNPPDYPF 290

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           +YY+ +  + +G   +R I            GGV+IDSG +   +T P +  + +  +  
Sbjct: 291 YYYLGVKDMKIGNKLLR-IPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQ 349

Query: 384 ASSLKR---APDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTF 439
            S  +R   A   S    C++ +G   +K+P ++  F  GA++ +P  NY +    +   
Sbjct: 350 MSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLG 409

Query: 440 CFAFAGT--------MSGLSII-GNIQQQGFRVVYDLAASRIGFAPRGC 479
           CF               G SII GN QQ    V +DL   R+GF  + C
Sbjct: 410 CFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 121/409 (29%), Positives = 188/409 (45%), Gaps = 48/409 (11%)

Query: 108 RVPPRNRSRGRANGGFSSSVISGLAQGS------GEYFTRLGVGTPPRYVYMVLDTGSDV 161
           ++  R+  R R     SS V+    QG+      G Y+T++ +GTPP    + +DTGSDV
Sbjct: 42  QLRARDELRHRRMLQSSSGVVDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDV 101

Query: 162 VWIQCAPCKKCYSQTDPV------FDPAKSRSFATVPCRSPLC----RKLDSSGCNRRNT 211
           +W+ C  C  C  QT  +      FDP  S + + + C    C    +  D++  ++ N 
Sbjct: 102 LWVSCNSCNGC-PQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQ 160

Query: 212 CLYQVSYGDGSITVGDFSTETL----TFRGTR----VARVALGCGHDNEGLFV----AAA 259
           C Y   YGDGS T G + ++ +     F G+      A V  GC +   G       A  
Sbjct: 161 CSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPVVFGCSNQQTGDLTKSDRAVD 220

Query: 260 GLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLL- 316
           G+ G G+  +S  +Q   +    R FS+CL  +  S+    +V G+  V     +T L+ 
Sbjct: 221 GIFGFGQQEMSVISQLSSQGIAPRIFSHCL--KGDSSGGGILVLGE-IVEPNIVYTSLVP 277

Query: 317 ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIAL 376
           A P     Y + L  ISV G  ++ I +S+F    + + G I+DSGT++  L   AY   
Sbjct: 278 AQPH----YNLNLQSISVNGQTLQ-IDSSVFA--TSNSRGTIVDSGTTLAYLAEEAYDPF 330

Query: 377 RDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDS 435
             A  A      R    S  + C+ ++       P V L+F  GA + L   +YLI  +S
Sbjct: 331 VSAITAAIPQSVRTV-VSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDYLIQQNS 389

Query: 436 SG---TFCFAFAGTM-SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
            G    +C  F      G++I+G++  +   VVYDLA  RIG+A   C+
Sbjct: 390 IGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 438


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 165/361 (45%), Gaps = 29/361 (8%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C+ C    DP F P  S ++  V C 
Sbjct: 86  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC- 144

Query: 195 SPLCRKLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFRG-TRVA--RVALGCGHD 250
           +P C       C+   N C+Y   Y + S + G    + ++F   + +A  R   GC +D
Sbjct: 145 TPDC------NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCEND 198

Query: 251 NEGLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSR 308
             G   +  A G++GLGRG LS   Q   +     S+ L          +M+ G  +   
Sbjct: 199 ETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGISPPE 258

Query: 309 TARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRL 368
              FT   ++P    +Y + L  + V G  ++ +   +F     G  G ++DSGT+   L
Sbjct: 259 DMVFTH--SDPDRSPYYNINLKEMHVAGKKLQ-LNPKVFD----GKHGTVLDSGTTYAYL 311

Query: 369 TRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSG----KTEVKVPTVVLHFR-GAD 421
              A++A + A     +SLK+   PD +  D CF  +G    +     P V + F  G  
Sbjct: 312 PETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHK 371

Query: 422 VSLPATNYLIPVDS-SGTFCF-AFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           +SL   NYL       G +C   F+      +++G I  +   V+YD   S+IGF    C
Sbjct: 372 LSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431

Query: 480 A 480
           +
Sbjct: 432 S 432


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 169/367 (46%), Gaps = 42/367 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC--R 199
           L VG+PP+ V MVLDTGS++ W+ C    K     +  F+P  S S+   PC S +C  R
Sbjct: 64  LTVGSPPQNVTMVLDTGSELSWLHC----KKLPNLNSTFNPLLSSSYTPTPCNSSICTTR 119

Query: 200 KLD---SSGCNRRNT-CLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLF 255
             D    + C+  N  C   VSY D S   G  + ET +  G        GC  D+ G  
Sbjct: 120 TRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGC-MDSAGYT 178

Query: 256 ------VAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
                     GL+G+ RG LS  TQ       KFSYC+   S       ++ GD   + +
Sbjct: 179 SDINEDSKTTGLMGMNRGSLSLVTQMSL---PKFSYCI---SGEDALGVLLLGDGTDAPS 232

Query: 310 A-RFTPLL----ANPKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGT 363
             ++TPL+    ++P  +   Y V+L GI V    ++ +  S+F  D  G G  ++DSGT
Sbjct: 233 PLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQ-LPKSVFVPDHTGAGQTMVDSGT 291

Query: 364 SVTRLTRPAYIALRDAF---RAGASSLKRAPDFSLFDTCFDL---SGKTEVKVPTVVLHF 417
             T L    Y +L+D F     G  +    P+F +F+   DL   +  +   VP V L F
Sbjct: 292 QFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNF-VFEGAMDLCYHAPASFAAVPAVTLVF 350

Query: 418 RGADVSLPATNYLIPVD--SSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAASRI 472
            GA++ +     L  V   S   +CF F  + + G+   +IG+  QQ   + +DL  SR+
Sbjct: 351 SGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRV 410

Query: 473 GFAPRGC 479
           GF    C
Sbjct: 411 GFTQTTC 417


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 118/412 (28%), Positives = 181/412 (43%), Gaps = 46/412 (11%)

Query: 105 SAVRVPPRNRSRGRANGGFSSSVISGLA----QGS------GEYFTRLGVGTPPRYVYMV 154
           S +R   R R      GG   S + G+     QGS      G YFT++ +G+PP    + 
Sbjct: 57  SELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQ 116

Query: 155 LDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCRKL---DSSGC 206
           +DTGSD++W+ C+ C  C   +        FD   S +  +V C  P+C  +    ++ C
Sbjct: 117 IDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVTCSDPICSSVFQTTAAQC 176

Query: 207 NRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVARVALGCGHDNEGLFV-- 256
           +  N C Y   YGDGS T G + T+T  F             A +  GC     G     
Sbjct: 177 SENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKS 236

Query: 257 --AAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAKPSSMVFGDSAVSRTARF 312
             A  G+ G G+G+LS  +Q   R      FS+CL  +   +     V G+  V     +
Sbjct: 237 DKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVPGMV-Y 293

Query: 313 TPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPA 372
           +PLL +      Y + L+ I V G  +  I A++F  + +   G I+D+GT++T L + A
Sbjct: 294 SPLLPS---QPHYNLNLLSIGVNG-QILPIDAAVF--EASNTRGTIVDTGTTLTYLVKEA 347

Query: 373 YIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLI 431
           Y    +A     S L      S  + C+ +S       P V L+F  GA + L   +YL 
Sbjct: 348 YDPFLNAISNSVSQLVTLI-ISNGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLF 406

Query: 432 P---VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
                D +  +C  F       +I+G++  +    VYDLA  RIG+A   C+
Sbjct: 407 HYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDCS 458


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 114/358 (31%), Positives = 167/358 (46%), Gaps = 30/358 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP-VFDPAKSRSFATVPCRSP 196
           Y   +G+GTP +   + +DTGS   W+ C  C  C+  T+P  F  ++S + A V C + 
Sbjct: 82  YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCH--TNPRTFLQSRSTTCAKVSCGTS 138

Query: 197 LCRKLDSS-GCNRRNT---CLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDN 251
           +C    S   C        C ++VSY DGS + G    +TLTF    ++     GC  D+
Sbjct: 139 MCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCNLDS 198

Query: 252 EGL--FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-VDRST----SAKPSSMVFGDS 304
            G   F    GLLG+G G +S   Q+  RF+  FSYCL + +S     S        G  
Sbjct: 199 FGANEFGNVDGLLGMGAGPMSVLKQSSPRFD-GFSYCLPLQKSERGFFSKTTGYFSLGKV 257

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A     R+T ++A  K    ++V+L  ISV G  + G++ S+F        GV+ DSG+ 
Sbjct: 258 ATRTDVRYTKMVARRKNTELFFVDLAAISVDGERL-GLSPSIFS-----RKGVVFDSGSE 311

Query: 365 VTRLTRPAYIALRDAFRAGASSLKR-APDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADV 422
           ++ +   A   L    R     L+R A +      C+D+    E  +P + LHF  GA  
Sbjct: 312 LSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 369

Query: 423 SLPATNYLIP--VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            L +    +   V     +C AFA T S +SIIG++ Q    VVYDL    IG  P G
Sbjct: 370 DLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPSG 426


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 181/396 (45%), Gaps = 33/396 (8%)

Query: 112 RNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
           R R+       F+  + SG   G+G+YF R  VGTP +   +V DTGSD+ W++C     
Sbjct: 79  RRRAADVGASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAG 138

Query: 172 CYSQTDPV--FDPAKSRSFATVPCRSPLCRK---LDSSGCNR-RNTCLYQVSYGDGSITV 225
             +   P   F  ++SRS+A + C S  C        + C+   + C Y   Y DGS   
Sbjct: 139 PPASDPPAREFRASESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAAR 198

Query: 226 GDFSTETLTF---------------RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRL 269
           G   T+  T                R  ++  V LGC    +G  F ++ G+L LG   +
Sbjct: 199 GVVGTDAATIALSGSGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNI 258

Query: 270 SFPTQTGRRFNRKFSYCLVDRSTSAKPSSMV---FGDSAVSRTARFTPLLANPKLDTFYY 326
           SF ++   RF  +FSYCLVD       SS +    G       A  TPL+ + ++  FY 
Sbjct: 259 SFASRAAARFGGRFSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYA 318

Query: 327 VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
           V +  + V G  +  I A ++  D    GG I+DSGTS+T L  PAY A+  A     ++
Sbjct: 319 VAVDAVYVAGEAL-DIPADVW--DVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAA 375

Query: 387 LKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-A 444
           L R      F+ C++ +     ++P + + F G A +  PA +Y+I   + G  C     
Sbjct: 376 LPRVA-MDPFEYCYNWTAGAP-EIPKLEVSFAGSARLEPPAKSYVIDA-APGVKCIGVQE 432

Query: 445 GTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G   G+S+IGNI QQ     +DL    + F    CA
Sbjct: 433 GAWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRCA 468


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 170/371 (45%), Gaps = 42/371 (11%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC--R 199
           + VGTPP+ + MV+DTGS++ W+ C       +   P F+P  S S+  + C SP C  R
Sbjct: 70  ITVGTPPQNMSMVIDTGSELSWLHCN-TNTTATIPYPFFNPNISSSYTPISCSSPTCTTR 128

Query: 200 KLD---SSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD----NE 252
             D    + C+  N C   +SY D S + G+ +++T  F  +    +  GC +     N 
Sbjct: 129 TRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPGIVFGCMNSSYSTNS 188

Query: 253 GLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSR--TA 310
                  GL+G+  G LS  +Q       KFSYC+   S S     ++ G+S  S   + 
Sbjct: 189 ESDSNTTGLMGMNLGSLSLVSQLKI---PKFSYCI---SGSDFSGILLLGESNFSWGGSL 242

Query: 311 RFTPLLAN----PKLD-TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSV 365
            +TPL+      P  D + Y V L GI +    +  I+ +LF  D  G G  + D GT  
Sbjct: 243 NYTPLVQISTPLPYFDRSAYTVRLEGIKISDK-LLNISGNLFVPDHTGAGQTMFDLGTQF 301

Query: 366 TRLTRPAYIALRDAFRAGASSLKRA---PDFSLFDTCFDLSGKTEV------KVPTVVLH 416
           + L  P Y ALRD F    +   RA   P+F +F    DL  +  V      ++P+V L 
Sbjct: 302 SYLLGPVYNALRDEFLNQTNGTLRALDDPNF-VFQIAMDLCYRVPVNQSELPELPSVSLV 360

Query: 417 FRGADVSLPATNYLIPV-----DSSGTFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLA 468
           F GA++ +     L  V      +   +CF F  + + G+   IIG+  QQ   + +DL 
Sbjct: 361 FEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIGHHHQQSMWMEFDLV 420

Query: 469 ASRIGFAPRGC 479
             R+G A   C
Sbjct: 421 EHRVGLAHARC 431


>gi|222623568|gb|EEE57700.1| hypothetical protein OsJ_08178 [Oryza sativa Japonica Group]
          Length = 441

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 142/448 (31%), Positives = 192/448 (42%), Gaps = 81/448 (18%)

Query: 65  LSLRLHHVDS-LSFNRTPEHL-FNLRIQRDVLRVKSLTA-FAESAVRVPPRNRSR---GR 118
           L L LHH  S  S    P  L F   +  D  R+ SL A  A++     P  R+      
Sbjct: 43  LHLELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKT-----PSARATSLDAD 97

Query: 119 ANGGFSSSVIS-----GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPC-KKC 172
           A+ G + S+ S     G + G G Y TR+G+GTP     MV+DTGS + W+QC+PC   C
Sbjct: 98  ADAGLAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSC 157

Query: 173 YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTET 232
           + Q+ PVF+P  S ++A+V C +  C  L S+  N     L Q  +  G +         
Sbjct: 158 HRQSGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSGLLLLQRLHLPGQLRR-QLLLRR 216

Query: 233 LTFRGTRVARVALGCGHDNEGLFVAAAGLLGL------------------GRGRLSFPTQ 274
           L  +G R+ R           L VAA  LL L                   R +LS   Q
Sbjct: 217 LPQQGHRLVR-----------LDVAAKLLLRLWPGQPRVVSPIRPGSSASSRNKLSLLYQ 265

Query: 275 TGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISV 334
                   F+YCL   S+S   S   +     S  AR+ P                  + 
Sbjct: 266 LAPSLGYSFTYCLPSSSSSGYLSLGSYNPGQYSLHARWCP------------------AR 307

Query: 335 GGAHVRGITASLFKLDPAGNGGVIIDSGTSV-TRLTRPAYIALRDAFRAGASSLKRAPDF 393
                   ++      PA    VI    TSV + L++    A++   RA A        +
Sbjct: 308 STTRSTSSSSCRRSSTPA---RVITRLPTSVYSALSKAVAAAMKGTSRASA--------Y 356

Query: 394 SLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAGTMSGLSI 452
           S+ DTCF     + V  P V + F  GA + L A N L+ VD S T C AFA   S  +I
Sbjct: 357 SILDTCFK-GQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDDS-TTCLAFAPARSA-AI 413

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           IGN QQQ F VVYD+ +SRIGFA  GC+
Sbjct: 414 IGNTQQQTFSVVYDVKSSRIGFAAGGCS 441


>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
 gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
          Length = 583

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 116/433 (26%), Positives = 188/433 (43%), Gaps = 64/433 (14%)

Query: 91  RDVLRVKSLTAFAESAVRVPPRNRSRGRANGGF--SSSV--ISGLAQGSGEYFTRLGVGT 146
           R V +   + +  +  V VP RN     +N     SSSV  + G     G YFT + VG 
Sbjct: 157 RSVYKESLVASVNDDDVIVPNRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGN 216

Query: 147 PPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG 205
           PPR  Y+ +DT SD+ WIQC APC  C    + ++ P +      V  +  LC +L  + 
Sbjct: 217 PPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDN---IVTPKDSLCVELHRNQ 273

Query: 206 ----CNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL----GCGHDNEGL--- 254
               C     C Y++ Y D S ++G  + + L       +   L    GC +D +GL   
Sbjct: 274 KAGYCETCQQCDYEIEYADHSSSMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLN 333

Query: 255 -FVAAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR 311
             V   G+LGL + ++S P+Q   R   N    +CL +         M  GD  V R   
Sbjct: 334 TLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGG--GYMFLGDDFVPRWGM 391

Query: 312 -FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG-------VIIDSGT 363
            + P+L +P +D+ Y  +++ ++ G               P   GG       ++ DSG+
Sbjct: 392 SWVPMLDSPSIDS-YQTQIMKLNYGSG-------------PLSLGGQERRVRRIVFDSGS 437

Query: 364 SVTRLTRPAYIALRDAFR--AGASSLKRAPDFSL---FDTCFDLSGKTEVK--VPTVVLH 416
           S T  T+ AY  L  + +  +G + ++   D +L   +   F +    +VK    T+ L 
Sbjct: 438 SYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFCWRAKFPIRSVIDVKQYFKTLTLQ 497

Query: 417 FRGADVSLPATNYLIP------VDSSGTFCFAF---AGTMSGLSII-GNIQQQGFRVVYD 466
           F G+   + +T + IP      + + G  C      +    G SII G+I  +G  ++YD
Sbjct: 498 F-GSKWWIISTKFRIPPEGYLIISNKGNVCLGILDGSDVHDGSSIILGDISLRGQLIIYD 556

Query: 467 LAASRIGFAPRGC 479
              ++IG+    C
Sbjct: 557 NVNNKIGWTQSDC 569


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 177/387 (45%), Gaps = 56/387 (14%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAK 184
           GL   +G Y+TR+ +G+PP+  Y+ +DTGSD++W+ C  C  C +++        +DPA 
Sbjct: 76  GLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAG 135

Query: 185 SRSFATVPCRSPLCRKLDSSGC-----NRRNTCLYQVSYGDGSITVGDFSTETLTFRG-- 237
           S +  TV C    C    + G      +  + C ++++YGDGS T G + T+ + +    
Sbjct: 136 SGT--TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVS 193

Query: 238 ------TRVARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                 T  A +  GCG    G       A  G+LG G+   S  +Q    RR  + F++
Sbjct: 194 GNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAH 253

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL     + +   +    + V    + TPL+ N    T Y V L GISVGGA ++  T++
Sbjct: 254 CL----DTVRGGGIFAIGNVVQPKVKTTPLVPNV---THYNVNLQGISVGGATLQLPTST 306

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD----TCFD 401
               D   + G IIDSGT++  L R  Y  L       A+   +  D  L +     CF 
Sbjct: 307 ---FDSGDSKGTIIDSGTTLAYLPREVYRTLL------AAVFDKYQDLPLHNYQDFVCFQ 357

Query: 402 LSGKTEVKVPTVVLHFRGADVSLPA--TNYLIPVDSSGTFCFAF----AGTMSG--LSII 453
            SG  +   P +   F+G D++L     +YL   + +  +C  F      T  G  + ++
Sbjct: 358 FSGSIDDGFPVITFSFKG-DLTLNVYPDDYLFQ-NRNDLYCMGFLDGGVQTKDGKDMLLL 415

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G++      VVYDL    IG+    C+
Sbjct: 416 GDLVLSNKLVVYDLEKEVIGWTDYNCS 442


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 169/368 (45%), Gaps = 39/368 (10%)

Query: 142 LGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKS-------RSFATVPCR 194
           L +GTPP+   +VLDTGS + WIQC   KK   +  P+  P  +        SF+ +PC 
Sbjct: 70  LPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSSSFSLLPCN 128

Query: 195 SPLCR------KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVARVALGC 247
            P+C+       L +S C++   C Y   Y DG++  G+   E  TF +      V LGC
Sbjct: 129 HPICKPRIPDFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGC 187

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
              +        G+LG+  GRLSF +Q       KFSYC+  R+ S        GD+  S
Sbjct: 188 AQAS----TENRGILGMNHGRLSFISQAKI---SKFSYCVPSRTGSNPTGLFYLGDNPNS 240

Query: 308 RTARFTPLL------ANPKLDTF-YYVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
              ++  +L      ++P LD   Y + +  I + G  +  I  + FK D  G+G  +ID
Sbjct: 241 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLN-IPPAAFKPDAGGSGQTMID 299

Query: 361 SGTSVTRLTRPAYIALR-DAFRAGASSLKRAPDFS-LFDTCFDLSGKTEV--KVPTVVLH 416
           SG+ +T L   AY  ++ +  R   + +K+   ++ + D CFD     EV  ++  +   
Sbjct: 300 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFE 359

Query: 417 F-RGADVSLPATNYLIPVDSSGTFCFAFAGTMS---GLSIIGNIQQQGFRVVYDLAASRI 472
           F  G ++ +     ++     G  C     +     G +IIG + QQ   V YDLA  R+
Sbjct: 360 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRV 419

Query: 473 GFAPRGCA 480
           GF    C+
Sbjct: 420 GFGGAECS 427


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 117/376 (31%), Positives = 178/376 (47%), Gaps = 43/376 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------FDPAKSRSFA 189
           G Y+T++ +GTPPR +Y+ +DTGSDV+W+ C  C  C  QT  +      FDP  S + +
Sbjct: 75  GLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGC-PQTSGLQIQLNYFDPGSSSTSS 133

Query: 190 TVPCRSPLCRK----LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETL----TFRGTRV- 240
            + C    CR      D+S   R N C Y   YGDGS T G + ++ +     F GT   
Sbjct: 134 LISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTT 193

Query: 241 ---ARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRS 291
              A V  GC     G       A  G+ G G+  +S  +Q   +    R FS+CL  + 
Sbjct: 194 NSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCL--KG 251

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
            ++    +V G+  V     ++PL+ +      Y + L  ISV G  VR I  S+F    
Sbjct: 252 DNSGGGVLVLGE-IVEPNIVYSPLVPS---QPHYNLNLQSISVNGQIVR-IAPSVFA--T 304

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV- 410
           + N G I+DSGT++  L   AY     A  A      R+   S  + C+ ++  + V + 
Sbjct: 305 SNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSV-LSRGNQCYLITTSSNVDIF 363

Query: 411 PTVVLHFR-GADVSLPATNYLIP---VDSSGTFCFAFAGTMSG--LSIIGNIQQQGFRVV 464
           P V L+F  GA + L   +YL+    +     +C  F   +SG  ++I+G++  +    V
Sbjct: 364 PQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQ-KISGQSITILGDLVLKDKIFV 422

Query: 465 YDLAASRIGFAPRGCA 480
           YDLA  RIG+A   C+
Sbjct: 423 YDLAGQRIGWANYDCS 438


>gi|356537173|ref|XP_003537104.1| PREDICTED: uncharacterized protein LOC100817302 [Glycine max]
          Length = 328

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 64/139 (46%), Positives = 91/139 (65%), Gaps = 1/139 (0%)

Query: 342 ITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFD 401
           I+  L+++   G+ G ++D+G +VTRL   AY A RDAF A  ++L RAP  S+F+TC+D
Sbjct: 190 ISEDLYRVTDLGDEGAVMDTGITVTRLPTVAYGAFRDAFVAQTTNLPRAPGVSIFNTCYD 249

Query: 402 LSGKTEVKVPTVVLHFRGADV-SLPATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQG 460
           L+G   V+VPTV+ +F G  + ++   N+LIP D  GTF FAFA + S LSIIGNIQQ+G
Sbjct: 250 LNGFVTVRVPTVLFYFSGGQILTILTQNFLIPADDVGTFYFAFAASPSALSIIGNIQQEG 309

Query: 461 FRVVYDLAASRIGFAPRGC 479
            ++  D A   +GF    C
Sbjct: 310 IQISVDGANGFLGFGRNVC 328


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/383 (28%), Positives = 163/383 (42%), Gaps = 48/383 (12%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAK 184
           GL   +G YFT + +GTPP+  Y+ +DTGSD++W+ C  C+KC  ++        +DP  
Sbjct: 76  GLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKA 135

Query: 185 SRSFATVPCRSPLCRKLDSS---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTR-- 239
           S S +TV C    C         GC     C Y V YGDGS T G F T+ L F      
Sbjct: 136 SSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGD 195

Query: 240 ------VARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCL 287
                  A V  GCG    G       A  G+LG G+   S  +Q     +  + F++CL
Sbjct: 196 GQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCL 255

Query: 288 VDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
                + K   +    + V    + TPL+A+      Y V L  I VGG  ++ + A +F
Sbjct: 256 ----DTIKGGGIFAIGNVVQPKVKTTPLVADMP---HYNVNLKSIDVGGTTLQ-LPAHVF 307

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL--KRAPDFSLFDTCFDLSGK 405
             +     G IIDSGT++T L    +  +  A       +      DF     CF   G 
Sbjct: 308 --ETGERKGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIVFHNVQDF----MCFQYPGS 361

Query: 406 TEVKVPTVVLHFRGADVSLPA--TNYLIPVDSSGTFCFAFAG----TMSGLSII--GNIQ 457
            +   PT+  HF   D++L      Y  P + +  +C  F      +  G  I+  G++ 
Sbjct: 362 VDDGFPTITFHFE-DDLALHVYPHEYFFP-NGNDMYCVGFQNGALQSKDGKDIVLMGDLV 419

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
                V+YDL    IG+    C+
Sbjct: 420 LSNKLVIYDLENQVIGWTDYNCS 442


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 119/377 (31%), Positives = 170/377 (45%), Gaps = 59/377 (15%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCR 194
           G Y+  L +G PP+  ++  DTGSD+ W+QC APC +C     P++ P  +     V C+
Sbjct: 65  GYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNN----LVICK 120

Query: 195 SPLCRKLDSSG--CNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVA-RVALGCG 248
            P+C  L   G  C     C Y+V Y DG  ++G    +        G R+A R+ALGCG
Sbjct: 121 DPMCASLHPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCG 180

Query: 249 HDN--EGLFVAAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDS 304
           +D      +    G+LGLG+G+ S  +Q   +        +C+  R        + FGD 
Sbjct: 181 YDQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGF----LFFGDD 236

Query: 305 AV-SRTARFTPLLANPKLDTFY---YVELVGISVGGAHVRGITASLFKLDPAGNGGVIID 360
              S    +TP+L +    T Y   Y EL+   +GG        ++FK     N  V  D
Sbjct: 237 LYDSSRVVWTPMLRDQH--THYSSGYAELI---LGG------KTTVFK-----NLLVTFD 280

Query: 361 SGTSVTRLTRPAYIALRDAFRAGASS--LKRAPDFSLFDTCFDLSGKTEVK-VPTVVLHF 417
           SG+S T L   AY AL    R   S   ++ A D      C+   GK   K V  V   F
Sbjct: 281 SGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCW--RGKRPFKSVRDVKKFF 338

Query: 418 RGADVSLPA-----TNYLIPVDS------SGTFCFA-FAGTMSGL---SIIGNIQQQGFR 462
           +   +S P      T Y IP++S       G  C     GT +GL   ++IG+I  Q   
Sbjct: 339 KPLALSFPGGGRTKTQYDIPLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQDKM 398

Query: 463 VVYDLAASRIGFAPRGC 479
           VVYD   ++IG+AP  C
Sbjct: 399 VVYDNEKNQIGWAPTNC 415


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 176/387 (45%), Gaps = 56/387 (14%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAK 184
           GL   +G Y+TR+ +G+PP+  Y+ +DTGSD++W+ C  C  C +++        +DPA 
Sbjct: 76  GLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAG 135

Query: 185 SRSFATVPCRSPLCRKLDSSGC-----NRRNTCLYQVSYGDGSITVGDFSTETLTFRG-- 237
           S +  TV C    C    + G      +  + C ++++YGDGS T G + T+ + +    
Sbjct: 136 SGT--TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVS 193

Query: 238 ------TRVARVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                 T  A +  GCG    G       A  G+LG G+   S  +Q    RR  + F++
Sbjct: 194 GNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAH 253

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL     + +   +    + V    + TPL+ N    T Y V L GISVGGA ++  T++
Sbjct: 254 CL----DTVRGGGIFAIGNVVQPKVKTTPLVPNV---THYNVNLQGISVGGATLQLPTST 306

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD----TCFD 401
               D   + G IIDSGT++  L R  Y  L       A+   +  D  L +     CF 
Sbjct: 307 ---FDSGDSKGTIIDSGTTLAYLPREVYRTLL------AAVFDKYQDLPLHNYQDFVCFQ 357

Query: 402 LSGKTEVKVPTVVLHFRGADVSLPA--TNYLIPVDSSGTFCFAF----AGTMSG--LSII 453
            SG  +   P +   F G D++L     +YL   + +  +C  F      T  G  + ++
Sbjct: 358 FSGSIDDGFPVITFSFEG-DLTLNVYPDDYLFQ-NRNDLYCMGFLDGGVQTKDGKDMLLL 415

Query: 454 GNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G++      VVYDL    IG+    C+
Sbjct: 416 GDLVLSNKLVVYDLEKEVIGWTDYNCS 442


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 124/435 (28%), Positives = 191/435 (43%), Gaps = 70/435 (16%)

Query: 86  NLRIQRDVLRVKSLTAFAESAVRVPPRNRSR-GRANGGFSSSVISGLAQGS------GEY 138
           N R++ +VLR                R+++R GR   G    V+     G+      G Y
Sbjct: 42  NQRVELEVLRA---------------RDQARHGRLLRGVVGGVVDFTVYGTSDPYLVGLY 86

Query: 139 FTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVPC 193
           FT++ +G+PPR   + +DTGSD++W+ C  C  C   +        FDP+ S + + V C
Sbjct: 87  FTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSC 146

Query: 194 RSPLCRKL---DSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVA 241
             P+C  L    ++ C+ + N C Y   YGDGS T G + ++ L F             A
Sbjct: 147 SHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSA 206

Query: 242 RVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAK 295
            +  GC     G       A  G+ G G+  LS  +Q        + FS+CL  +     
Sbjct: 207 SIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCL--KGEGDG 264

Query: 296 PSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA--- 352
              +V G+  +     ++PL+ +    + Y + L  ISV G         L  +DPA   
Sbjct: 265 GGKLVLGE-ILEPNIIYSPLVPS---QSHYNLNLQSISVNG--------QLLPIDPAVFA 312

Query: 353 --GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
              N G I+DSGT++T L   AY     A  A  SS    P  S  + C+ +S   +   
Sbjct: 313 TSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSS-STTPVLSKGNQCYLVSTSVDEIF 371

Query: 411 PTVVLHFR-GADVSLPATNYLIPV---DSSGTFCFAFAGTMS-GLSIIGNIQQQGFRVVY 465
           P V L+F  GA + L    YL+ +   D +  +C  F      G++I+G++  +    VY
Sbjct: 372 PPVSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDLVLKDKIFVY 431

Query: 466 DLAASRIGFAPRGCA 480
           DLA  RIG+A   C+
Sbjct: 432 DLAHQRIGWANYDCS 446


>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
          Length = 330

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/345 (32%), Positives = 161/345 (46%), Gaps = 39/345 (11%)

Query: 151 VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN 210
           V +V DT SD++W QC PC  C +Q   ++DP K+ ++A           L SS      
Sbjct: 3   VTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYA----------NLTSSN----- 47

Query: 211 TCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLS 270
              Y  +Y   S T G F+TET       VA +  GCG  N+G +   AG+ G+GRG +S
Sbjct: 48  ---YNYTYSKQSFTSGYFATETFALGNVTVANITFGCGTRNQGYYDNVAGVFGVGRGGVS 104

Query: 271 FPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAV-----SRTARFTPLLANPKLDTFY 325
              Q G     +FSYC          +  + G   +     +  A  TP++A+P L + Y
Sbjct: 105 LLNQLGI---DRFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGY 161

Query: 326 YVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS 385
           +V+LVG++VG   V    AS  +    G   ++IDS + VT L    Y  +R A  A  +
Sbjct: 162 FVKLVGVTVGATRVDVAGASSAE---GGGRALVIDSTSPVTVLDEATYGPVRRALVAQLA 218

Query: 386 SLKRAPDFSL----FDTCFDLSGKTEVKVP---TVVLHFRG--ADVSLPATNYLIPVDSS 436
            LK A   +      D CF+L+       P   T+ LHF G  AD+ LP  NYL    + 
Sbjct: 219 PLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAKDSAG 278

Query: 437 GTFCFAFAGTMS-GLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G  C     + S G+ ++G+       V+YDLA + + F P  CA
Sbjct: 279 GLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDCA 323


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/410 (27%), Positives = 176/410 (42%), Gaps = 55/410 (13%)

Query: 113 NRSRGRANGGFSSSVISGLAQGS-GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK 171
           +RS    +G  S  + + L   S G +   L  GTPP+ +  ++DTGS VVW   APC  
Sbjct: 61  SRSHHLKHGKASPLIQTSLFPHSYGAHTIPLSFGTPPQKLSFLMDTGSHVVW---APCTT 117

Query: 172 CYSQTD---------PVFDPAKSRSFATVPCRSPLCRKLDSSG-------CN-RRNTC-- 212
            Y+ T+         P+F+P  S S   + CR P C    S         CN     C  
Sbjct: 118 HYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCNGNSKKCSH 177

Query: 213 ---LYQVSYGDGSITVGDFSTETLTFRGTRVARVALGC--GHDNEGLFVAAAGLLGLGRG 267
               Y + YG G+ + G F  E L F G  + +  +GC    D E    ++  L G GR 
Sbjct: 178 ACPQYTLQYGTGAAS-GFFLLENLDFPGKTIHKFLVGCTTSADREP---SSDALAGFGRT 233

Query: 268 RLSFPTQTGRRFNRKFSYCL----VDRSTSAKPSSMVFGDSAVSRTARFTPLLAN-PKLD 322
             S P Q G    +KF+YCL     D + ++    + + D   ++   + P   N P   
Sbjct: 234 MFSLPMQMGV---KKFAYCLNSHDYDDTRNSGKLILDYSDGE-TQGLSYAPFXKNPPDYP 289

Query: 323 TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRA 382
            +YY+ +  + +G   +R I            GGV+IDSG + + +T P +  + +  + 
Sbjct: 290 IYYYLGVKDMKIGNKVLR-IPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKK 348

Query: 383 GASSLKRAPDFSL---FDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGT 438
             S  +R+ +         C++ +G   +K+P ++  F  GA++ +P  NY +    +  
Sbjct: 349 QMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASL 408

Query: 439 FCFAFA--GTMSGLS-------IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            CF        S L        I+GN QQ    V +DL   R+GF  + C
Sbjct: 409 GCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 162/360 (45%), Gaps = 27/360 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP F P  S ++  V C 
Sbjct: 74  NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKC- 132

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT---RVARVALGCGHDN 251
           +P C   D  G      C Y+  Y + S + G  + + ++F      +  R   GC +  
Sbjct: 133 NPSC-NCDDEG----KQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVE 187

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G   +  A G++GLGRGRLS   Q   +     S+ L          +MV G  +    
Sbjct: 188 TGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQISPPPN 247

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
             F+   +NP    +Y +EL  + V G  ++ +   +F        G ++DSGT+     
Sbjct: 248 MVFS--HSNPYRSPYYNIELKELHVAGKPLK-LKPKVFD----EKHGTVLDSGTTYAYFP 300

Query: 370 RPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADV 422
             A+ AL+DA       LK+   PD +  D CF  +G+    +    P V + F  G  +
Sbjct: 301 EAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKL 360

Query: 423 SLPATNYLI-PVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           SL   NYL      SG +C       + L +++G I  +   V YD    +IGF    C+
Sbjct: 361 SLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCS 420


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 121/408 (29%), Positives = 184/408 (45%), Gaps = 56/408 (13%)

Query: 105 SAVRVPPRNRSRGRANGGFSSSV---ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDV 161
           +A + P + +S   AN    SSV   ++G    +G Y   L +G PP+   + +DTGSD+
Sbjct: 32  AASQTPIKGKSTTPANDRVGSSVFFRVTGNVYPTGHYSVILNIGNPPKAFDLDIDTGSDL 91

Query: 162 VWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCN-RRNTCLYQVSYG 219
            W+QC APCK C    D ++ P  +R    VPC S LC+ + ++ C+     C Y+V Y 
Sbjct: 92  TWVQCDAPCKGCTKPLDKLYKPKNNR----VPCASSLCQAIQNNNCDIPTEQCDYEVEYA 147

Query: 220 DGSITVGDFSTETLTFR---GTRVA-RVALGCGHDNEGLFVAA----AGLLGLGRGRLSF 271
           D   ++G   ++    R   G+ +  R+A GCG+D + L   +    AG+LGLGRG+ S 
Sbjct: 148 DLGSSLGVLLSDYFPLRLNNGSLLQPRIAFGCGYDQKYLGPHSPPDTAGILGLGRGKASI 207

Query: 272 PTQ--TGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA-RFTPLLANPKLDTFYYVE 328
            +Q  T         +C   R T      + FGD  +  +   +TP+L +   DT Y   
Sbjct: 208 LSQLRTLGITQNVVGHCF-SRVTGG---FLFFGDHLLPPSGITWTPMLRSSS-DTLY--- 259

Query: 329 LVGISVGGAHVRGITASLFKLDPAGNGG--VIIDSGTSVTRLTRPAYIALRDAFRAGASS 386
               S G A +      LF   P G  G  +I DSG+S T      Y ++ +  R   S 
Sbjct: 260 ----SSGPAEL------LFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLSG 309

Query: 387 --LKRAPDFSLFDTCFD--------LSGKTEVKVPTV-VLHFRGADVSLPATNYLIPVDS 435
             LK AP+      C+         L  K+  K  T+  +  +   + L   +YLI +  
Sbjct: 310 MPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTINFIKAKNVQLQLAPEDYLI-ITK 368

Query: 436 SGTFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            G  C          +  L++IG+I  Q   VVYD    +IG+ P  C
Sbjct: 369 DGNVCLGILNGGEQGLGNLNVIGDIFMQDRVVVYDNERQQIGWFPTNC 416


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/358 (31%), Positives = 167/358 (46%), Gaps = 30/358 (8%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDP-VFDPAKSRSFATVPCRSP 196
           Y   +G+GTP +   + +DTGS   W+ C  C  C+  T+P  F  ++S + A V C + 
Sbjct: 82  YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCH--TNPRTFLQSRSTTCAKVSCGTS 138

Query: 197 LCRKLDSS-GCNRRNT---CLYQVSYGDGSITVGDFSTETLTFRGT-RVARVALGCGHDN 251
           +C    S   C        C ++VSY DGS + G    +TLTF    ++   + GC  D+
Sbjct: 139 MCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNMDS 198

Query: 252 EGL--FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL-VDRST----SAKPSSMVFGDS 304
            G   F    GLLG+G G +S   Q+   F+  FSYCL + +S     S        G  
Sbjct: 199 FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD-CFSYCLPLQKSERGFFSKTTGYFSLGKV 257

Query: 305 AVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS 364
           A     R+T ++A  K    ++V+L  ISV G  + G++ S+F        GV+ DSG+ 
Sbjct: 258 ATRTDVRYTKMVARKKNTELFFVDLTAISVDGERL-GLSPSVFS-----RKGVVFDSGSE 311

Query: 365 VTRLTRPAYIALRDAFRAGASSLKR-APDFSLFDTCFDLSGKTEVKVPTVVLHF-RGADV 422
           ++ +   A   L    R     LKR A +      C+D+    E  +P + LHF  GA  
Sbjct: 312 LSYIPDRALSVLSQRIRELL--LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 369

Query: 423 SLPATNYLIP--VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRG 478
            L +    +   V     +C AFA T S +SIIG++ Q    VVYDL    IG  P G
Sbjct: 370 DLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPSG 426


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/401 (28%), Positives = 176/401 (43%), Gaps = 64/401 (15%)

Query: 123 FSSSVI---SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDP 178
           F SS I    G    +G YFT + VG+PPR  ++ +DTGSD+ WIQC APC  C    +P
Sbjct: 296 FDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP 355

Query: 179 VFDPAKSRSFATVPCRSPLC----RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
           ++ P K      VP +  LC    R L +  C     C Y++ Y D S ++G  +++ L 
Sbjct: 356 LYKPKKGN---LVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLH 412

Query: 235 FRGTRVARVAL----GCGHDNEGLFVAAA----GLLGLGRGRLSFPTQ--TGRRFNRKFS 284
                 +   L    GC +D +GL + +     G+LGL + ++S P+Q  + R  N    
Sbjct: 413 LMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLG 472

Query: 285 YCLVDRSTSAKPSSMVFGDSAVSRTAR-FTPLLANPKLDTFYYVELVGISVGGAHVRGIT 343
           +CL   +T      M  GD  V      + P+L +   +  Y+ +++ IS G   +    
Sbjct: 473 HCLTSDATGG--GYMFLGDDFVPYWGMAWVPMLNSHSPN--YHSQIMKISHGSRQL---- 524

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF----DTC 399
            SL + D      V+ D+G+S T   + AY AL        +SLK   D  L     D  
Sbjct: 525 -SLGRQD-GRTERVVFDTGSSYTYFPKEAYYAL-------VASLKDVSDEGLIQDGSDPT 575

Query: 400 FDLSGKTEVKVPTVV----------LHFR------GADVSLPATNYLIPVDSSGTFCFAF 443
             +  + +  + +V+          L FR           +P   YLI + + G  C   
Sbjct: 576 LPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLI-ISNKGNVCLGI 634

Query: 444 ---AGTMSGLSII-GNIQQQGFRVVYDLAASRIGFAPRGCA 480
              +    G +II G+I  +G  VVYD    +IG+A   C 
Sbjct: 635 LDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCV 675


>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
          Length = 426

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 122/424 (28%), Positives = 180/424 (42%), Gaps = 57/424 (13%)

Query: 79  RTPEHLFNLRIQRDVLRVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEY 138
           RTP H+  L  +   L  K   +  ++ +  P R   + R          S     +G  
Sbjct: 28  RTPAHIPQLGQE---LWRKPAKSAPKAVINRPFRAPDKDRLG--------SAATDNAGLV 76

Query: 139 FTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLC 198
             ++ VG        V+D  +D +W QC           PV     S  F  V C S  C
Sbjct: 77  VYKISVGVAEEVFSGVVDVATDFIWAQC-----------PV-----SSDFTEVFCFSQTC 120

Query: 199 R----KLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV-ARVALGCGHDNEG 253
           +    + D+ G +   TC Y   YG G  T G  S E +T  GT +  R   GC   +  
Sbjct: 121 QLALDEEDACGNSTSFTCPYAYQYGPGISTTGYISAEEVTAVGTHITGRALFGCSLASTV 180

Query: 254 LFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLV--DRSTSAKPSSMVFGDSAVSRT-- 309
                +G+LG  RG  S  +Q   + +R FSY ++  D       S ++ GD AV +T  
Sbjct: 181 PLDGESGVLGFSRGPYSLLSQL--KISR-FSYFMLPDDADKPDSESVLLLGDDAVPQTNS 237

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG-NGGVIIDSGTSVTRL 368
           +R TPLL N      YYV+L GI V    + GI A  F L   G +GGV++ + + +T L
Sbjct: 238 SRSTPLLRNEAYPDLYYVKLTGIKVDDKSLSGIPAGTFDLAANGCSGGVVMSTLSPITYL 297

Query: 369 TRPAYIALRDAFRAGASSLKRAP------DFSLFDTCFDLSGKTEVKVPTVVLHFRGAD- 421
              AY AL    RA AS +K  P      D +    C+++     +  P + L F G D 
Sbjct: 298 QPAAYNALT---RALASKIKSQPVRPKADDVADLRLCYNIQSVANLTFPKITLVFHGVDG 354

Query: 422 ----VSLPATNYLIPVDSSGTFCFAFAGTMSG---LSIIGNIQQQGFRVVYDLAASRIGF 474
               + L   +Y I  +S+G  C     T +G    S++G++ Q G  ++YDL    + F
Sbjct: 355 RPAPMELTTAHYFIRENSTGLQCLTMLPTPAGSPVSSVLGSLLQTGTHMIYDLRGGSLTF 414

Query: 475 APRG 478
              G
Sbjct: 415 EKGG 418


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 104/383 (27%), Positives = 173/383 (45%), Gaps = 44/383 (11%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           +GL   +G YFT+LG+G+PP+  Y+ +DTGSD++W+ C  C +C  ++D      ++DP 
Sbjct: 61  NGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPK 120

Query: 184 KSRSFATVPCRSPLCRKLDSS---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGT-- 238
            S +   + C    C         GC     C Y ++YGDGS T G +  + LT+     
Sbjct: 121 GSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVND 180

Query: 239 ------RVARVALGCGHDNEGLFVAAA-----GLLGLGRGRLSFPTQTGR--RFNRKFSY 285
                 + + +  GCG    G   +++     G++G G+   S  +Q     +  + FS+
Sbjct: 181 NLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSH 240

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL     + +   +      V      TPL+  P++   Y V L  I V    +  + + 
Sbjct: 241 CL----DNIRGGGIFAIGEVVEPKVSTTPLV--PRM-AHYNVVLKSIEV-DTDILQLPSD 292

Query: 346 LFKLDPAGNG-GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSG 404
           +F    +GNG G IIDSGT++  L    Y  L     A    LK       F +CF  +G
Sbjct: 293 IFD---SGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQF-SCFQYTG 348

Query: 405 KTEVKVPTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAF----AGTMSG--LSIIGNIQ 457
             +   P V LHF  +  +++   +YL      G +C  +    A T +G  ++++G++ 
Sbjct: 349 NVDRGFPVVKLHFEDSLSLTVYPHDYLFQF-KDGIWCIGWQKSVAQTKNGKDMTLLGDLV 407

Query: 458 QQGFRVVYDLAASRIGFAPRGCA 480
                V+YDL    IG+    C+
Sbjct: 408 LSNKLVIYDLENMAIGWTDYNCS 430


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 119/379 (31%), Positives = 176/379 (46%), Gaps = 33/379 (8%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV--FDPAKSR 186
           SG   G+G+YF R  VGTP +   +V DTGSD+ W++C       +   P   F  ++SR
Sbjct: 5   SGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESR 64

Query: 187 SFATVPCRSPLCRK---LDSSGCNR-RNTCLYQVSYGDGSITVGDFSTETLTF------- 235
           S+A + C S  C        + C+   + C Y   Y DGS   G   T+  T        
Sbjct: 65  SWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGS 124

Query: 236 --------RGTRVARVALGCGHDNEGL-FVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYC 286
                   R  ++  V LGC    +G  F ++ G+L LG   +SF ++   RF  +FSYC
Sbjct: 125 EDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYC 184

Query: 287 LVDRSTSAKPSS-MVFGDSAVSRTARF--TPLLANPKLDTFYYVELVGISVGGAHVRGIT 343
           LVD       SS + FG       A    TPL+ + ++  FY V +  + V G  +  I 
Sbjct: 185 LVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALD-IP 243

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLS 403
           A ++  D    GG I+DSGTS+T L  PAY A+  A     ++L R      F+ C++ +
Sbjct: 244 ADVW--DVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVA-MDPFEYCYNWT 300

Query: 404 GKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSSGTFCFAF-AGTMSGLSIIGNIQQQGF 461
                ++P + + F G A +  PA +Y+I   + G  C     G   G+S+IGNI QQ  
Sbjct: 301 AGAP-EIPKLEVSFAGSARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEH 358

Query: 462 RVVYDLAASRIGFAPRGCA 480
              +DL    + F    CA
Sbjct: 359 LWEFDLRDRWLRFKHTRCA 377


>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
          Length = 484

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 121/429 (28%), Positives = 173/429 (40%), Gaps = 71/429 (16%)

Query: 89  IQRDVLRVKSLTAFAES-------------AVRVPPRNRSRGRANGGFSSSVISGLAQGS 135
           + RD LR++SL    E               V +P R        G F            
Sbjct: 89  LHRDALRLRSLLHREEDNHRTPAPAAPPGGGVSIPSRGEPIEELPGAF------------ 136

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSD-VVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
            EY    G GTP + + +  DT +     +QC PC    S  D  FDP+ S S + VPC 
Sbjct: 137 -EYHVVAGFGTPMQKLPVGFDTTTTGATLLQCTPCG---SGADHAFDPSASSSVSQVPCG 192

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSI-----------TVGDFSTETLTFRGTRVARV 243
           SP C      GC+ R +C   VS+ +  +                S     FR   +  +
Sbjct: 193 SPDC---PFHGCSGRPSCTLSVSFNNTLLGNATFFTDTLTLTPSSSATVDKFRFACLEGI 249

Query: 244 ALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQ---TGRRFNRKFSYCLVDRSTSAKPSSMV 300
           A G   D       +AG+L L R   S P++   +       FSYCL   +++A    + 
Sbjct: 250 APGPAED------GSAGILDLSRNSHSLPSRLVASSPPHAVAFSYCL--PASTADVGFLS 301

Query: 301 FGDSA---VSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
            G +    + R   +TPL  +P     Y V+LVG+ +GG  +    A++   D       
Sbjct: 302 LGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGLGLGGPDLPIPPAAIAGDD------T 355

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHF 417
           I++  T+ T L    Y  LRD+FR   S    AP     DTC++ +G     VP V L F
Sbjct: 356 ILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSLDTCYNFTGLDAFSVPAVTLKF 415

Query: 418 R-GADVSLPATNYLIPVDSSGTF---CFAFAGT---MSGLSIIGNIQQQGFRVVYDLAAS 470
             GADV L     +   D    F   C AF        G ++IG++ Q    VVYD+   
Sbjct: 416 AGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSMAQMSTEVVYDVRGG 475

Query: 471 RIGFAPRGC 479
           ++GF P  C
Sbjct: 476 KVGFVPYRC 484


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 105/374 (28%), Positives = 174/374 (46%), Gaps = 39/374 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G YFTR+ +G+PP+  ++ +DTGSD++W+ C+PC  C S +        F+P  S + + 
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148

Query: 191 VPCRSPLCR---KLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFR--------G 237
           +PC    C    +   + C   +   C Y  +YGDGS T G + ++T+ F          
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208

Query: 238 TRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRS 291
              A +  GC +   G       A  G+ G G+ +LS  +Q        + FS+CL  + 
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           +      +V G+  V     +TPL+ +      Y + L  I V G  +  I +SLF    
Sbjct: 267 SDNGGGILVLGE-IVEPGLVYTPLVPS---QPHYNLNLESIVVNGQKLP-IDSSLFTT-- 319

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
           +   G I+DSGT++  L   AY    +A  A  S   R+   S  + CF  S   +   P
Sbjct: 320 SNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSL-VSKGNQCFVTSSSVDSSFP 378

Query: 412 TVVLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTM-SGLSIIGNIQQQGFRVVYD 466
           TV L+F G   +++   NYL+    +D++  +C  +       ++I+G++  +    VYD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438

Query: 467 LAASRIGFAPRGCA 480
           LA  R+G+    C+
Sbjct: 439 LANMRMGWTDYDCS 452


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 163/363 (44%), Gaps = 33/363 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +G+PP+   +++DTGS V ++ C+ C +C +  DP F P  S ++  V C 
Sbjct: 86  NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA--RVALGCGHDN 251
           +      D +G      C Y+  Y + S + G  + + ++F + + +   R   GC    
Sbjct: 146 ADC--NCDENGVQ----CTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETME 199

Query: 252 EG-LFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G L+   A G++GLGRG LS   Q   +     S+ L          +MV G  +    
Sbjct: 200 SGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPG 259

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP---AGNGGVIIDSGTSVT 366
             F+   ++P    +Y +EL  I V G  +        KL+P    G  G I+DSGT+  
Sbjct: 260 MVFSH--SDPSRSPYYNIELKEIHVAGKPL--------KLNPRTFDGKYGAILDSGTTYA 309

Query: 367 RLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKVPTV-----VLHFRG 419
                AY A +DA     S LK+   PD +  D CF  +G+   ++P V     ++   G
Sbjct: 310 YFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369

Query: 420 ADVSLPATNYLI-PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             +SL   NYL      SG +C   F       +++G I  +   V Y+   S IGF   
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429

Query: 478 GCA 480
            C+
Sbjct: 430 NCS 432


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 162/363 (44%), Gaps = 33/363 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +G+PP+   +++DTGS V ++ C+ C +C +  DP F P  S ++  V C 
Sbjct: 86  NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA--RVALGCGHDN 251
           +      D +G      C Y+  Y + S + G  + + ++F + + +   R   GC    
Sbjct: 146 ADC--NCDENGVQ----CTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETME 199

Query: 252 EG-LFVA-AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G L+   A G++GLGRG LS   Q   +     S+ L          +MV G   +S  
Sbjct: 200 SGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLG--GISSP 257

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP---AGNGGVIIDSGTSVT 366
                  ++P    +Y +EL  I V G  +        KL+P    G  G I+DSGT+  
Sbjct: 258 PGMVFSHSDPSRSPYYNIELKEIHVAGKPL--------KLNPRTFDGKYGAILDSGTTYA 309

Query: 367 RLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKVPTV-----VLHFRG 419
                AY A +DA     S LK+   PD +  D CF  +G+   ++P V     ++   G
Sbjct: 310 YFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369

Query: 420 ADVSLPATNYLI-PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             +SL   NYL      SG +C   F       +++G I  +   V Y+   S IGF   
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429

Query: 478 GCA 480
            C+
Sbjct: 430 NCS 432


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 160/363 (44%), Gaps = 33/363 (9%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP F P  S ++  + C 
Sbjct: 85  NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQC- 143

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHDN 251
           +P C   D  G      C Y+  Y + S + G  + + L+F         R   GC    
Sbjct: 144 NPSC-NCDDEG----KQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVE 198

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G   +  A G++GLGRG LS   Q   +     S+ L          +MV G+      
Sbjct: 199 TGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNIPPPPD 258

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP---AGNGGVIIDSGTSVT 366
             F    ++P    +Y +EL  + V G  +        KL+P    G  G ++DSGT+  
Sbjct: 259 MVFA--HSDPYRSAYYNIELKELHVAGKRL--------KLNPRVFDGKHGTVLDSGTTYA 308

Query: 367 RLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSGKTEVKV----PTVVLHF-RG 419
            L   A++A +DA       LK+   PD S  D CF  +G+   ++    P V + F  G
Sbjct: 309 YLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFGNG 368

Query: 420 ADVSLPATNYLI-PVDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPR 477
             +SL   NYL      SG +C   F       +++G I  +   V YD    +IGF   
Sbjct: 369 QKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWKT 428

Query: 478 GCA 480
            C+
Sbjct: 429 NCS 431


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/374 (28%), Positives = 174/374 (46%), Gaps = 39/374 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G YFTR+ +G+PP+  ++ +DTGSD++W+ C+PC  C S +        F+P  S + + 
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148

Query: 191 VPCRSPLCR---KLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFR--------G 237
           +PC    C    +   + C   +   C Y  +YGDGS T G + ++T+ F          
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTA 208

Query: 238 TRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRS 291
              A +  GC +   G       A  G+ G G+ +LS  +Q        + FS+CL  + 
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
           +      +V G+  V     +TPL+ +      Y + L  I V G  +  I +SLF    
Sbjct: 267 SDNGGGILVLGE-IVEPGLVYTPLVPS---QPHYNLNLESIVVNGQKLP-IDSSLFTT-- 319

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVP 411
           +   G I+DSGT++  L   AY    +A  A  S   R+   S  + CF  S   +   P
Sbjct: 320 SNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSL-VSKGNQCFVTSSSVDSSFP 378

Query: 412 TVVLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTM-SGLSIIGNIQQQGFRVVYD 466
           TV L+F G   +++   NYL+    +D++  +C  +       ++I+G++  +    VYD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438

Query: 467 LAASRIGFAPRGCA 480
           LA  R+G+    C+
Sbjct: 439 LANMRMGWTDYDCS 452


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 57/125 (45%), Positives = 76/125 (60%), Gaps = 1/125 (0%)

Query: 131 LAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFAT 190
           L  GSGEY   + +GTPP     + DTGSD++W QC PC KCY Q+ P+FDP KS SF+ 
Sbjct: 85  LTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSH 144

Query: 191 VPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHD 250
           VPC S  C+ +D S C  +  C Y  +YGD + T GD   E +T  G+   +  +GCGH+
Sbjct: 145 VPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITI-GSSSVKSVIGCGHE 203

Query: 251 NEGLF 255
           + G F
Sbjct: 204 SGGGF 208


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 162/360 (45%), Gaps = 27/360 (7%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++DTGS V ++ C+ C++C    DP F P  S ++  V C 
Sbjct: 78  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC- 136

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-RGTRVA--RVALGCGHDN 251
                 LD +  N R  C+Y+  Y + S + G    + ++F   + +A  R   GC +  
Sbjct: 137 -----TLDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENVE 191

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRT 309
            G   +  A G++GLGRG LS   Q   +     S+ L          +MV G   +S  
Sbjct: 192 TGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG--GISPP 249

Query: 310 ARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLT 369
           +      ++P    +Y ++L  I V G  +  +  S+F     G  G ++DSGT+   L 
Sbjct: 250 SDMVFAQSDPVRSPYYNIDLKEIHVAGKRLP-LNPSVFD----GKHGSVLDSGTTYAYLP 304

Query: 370 RPAYIALRDAFRAGASSLKR--APDFSLFDTCFDLSG----KTEVKVPTVVLHF-RGADV 422
             A++A ++A      S  +   PD +  D CF  +G    +     P V + F  G   
Sbjct: 305 EEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHKY 364

Query: 423 SLPATNYLIPVDS-SGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           SL   NY+       G +C   F       +++G I  +   V+YD   ++IGF    CA
Sbjct: 365 SLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNCA 424


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 125/406 (30%), Positives = 172/406 (42%), Gaps = 70/406 (17%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA---PCKKC-----YSQTDPVFDPAKSRS 187
           G Y   + +GTPP+ + ++LDTGS + W+ C     C+ C           VF P  S S
Sbjct: 89  GGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSS 148

Query: 188 FATVPCRSPLCRKLDS---SGC----NRRNTCL---YQVSYGDGSITVGDFSTETLTFRG 237
              V CR+P CR + S   S C    N  N  +   Y V YG GS T G   ++TL    
Sbjct: 149 SRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGS-TSGLLISDTLRLSP 207

Query: 238 TRVA-------RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDR 290
           +  +         A+GC      +    +GL G GRG  S P+Q       KFSYCL+ R
Sbjct: 208 SSSSSAPAPFRNFAIGC--SIVSVHQPPSGLAGFGRGAPSVPSQLKV---PKFSYCLLSR 262

Query: 291 ---STSAKPSSMVFGDSAV-----SRTARFTPLLAN----PKLDTFYYVELVGISVGGAH 338
                SA    +V GD+ V       T ++ PLL N    P    +YY+ L GISVGG  
Sbjct: 263 RFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKP 322

Query: 339 VRGITASLFKLDPAGNGGVIIDSGTSVTRLT----RPAYIALRDAFRAGASSLKRAPDFS 394
           V   + +     P+  GG IIDSGT+ T L     +P   A+  A     +  +   D  
Sbjct: 323 VNLPSRAFV---PSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDAL 379

Query: 395 LFDTCFDLSGKT--EVKVPTVVLHFRGADV-SLPATNYL-------IPVDSSGTFCFAFA 444
               CF L       +++P + L F+G  V  LP  NY         P       C A  
Sbjct: 380 GLRPCFALPPGPGGAMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVV 439

Query: 445 GTMSGLS----------IIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
             +              I+G+ QQQ + + YDL   R+GF  + CA
Sbjct: 440 SDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPCA 485


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 124/406 (30%), Positives = 180/406 (44%), Gaps = 64/406 (15%)

Query: 114 RSRGRANGGFSSSV-----ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-A 167
           R    A GG SSS+     + G     G Y+  + +G PP+  ++ +D+GSD+ W+QC A
Sbjct: 28  RGDKPARGGASSSIAAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDA 87

Query: 168 PCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS--SGCNR----RNTCLYQVSYGDG 221
           PC+ C     P++ P KS+    VPC   LC  L +  +G +R       C Y + Y D 
Sbjct: 88  PCRSCNEVPHPLYRPTKSK---LVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQ 144

Query: 222 SITVGDFSTETLTFRGTR--VAR--VALGCGHDNE----GLFVAAAGLLGLGRGRLSFPT 273
             + G    ++   R T   VAR  VA GCG+D +     L     G+LGLG G +S  +
Sbjct: 145 GSSTGVLINDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLS 204

Query: 274 QTGRRFNRK--FSYCLVDRSTSAKPSSMVFGDSAVS-RTARFTPLLANPKLDTFYYVELV 330
           Q  +R   K    +CL  R        + FGD  V  + A +TP +A      +Y     
Sbjct: 205 QLKQRGVTKNVVGHCLSLRGGGF----LFFGDDLVPYQRATWTP-MARSAFRNYYSPGSA 259

Query: 331 GISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKR 389
            +  G    R +   L K        V+ DSG+S T      Y AL  A + G S +L+ 
Sbjct: 260 SLYFGD---RSLGVRLAK--------VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEE 308

Query: 390 APDFSL---------FDTCFDLSGKTEVKVPTVVLHFRGAD---VSLPATNYLIPVDSSG 437
            PD SL         F +  D+  + E K  ++VL+F       + +P  NYLI V  +G
Sbjct: 309 EPDTSLPLCWKGQEPFKSVLDV--RKEFK--SLVLNFASGKKTLMEIPPENYLI-VTENG 363

Query: 438 TFCFAFAG----TMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             C          +  LSIIG+I  Q   V+YD    +IG+    C
Sbjct: 364 NACLGILNGSEIGLKDLSIIGDITMQDHMVIYDNEKGKIGWIRAPC 409


>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
          Length = 110

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 63/109 (57%), Positives = 77/109 (70%), Gaps = 1/109 (0%)

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPATNYL 430
           AY ++RDAF+    +L+ A   ++FDTC+DLS    V+VPTV  HF    V  LPA NYL
Sbjct: 2   AYESVRDAFKRLTQNLRSAEGVAIFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNYL 61

Query: 431 IPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           IPVDS GTFCFAFA T S LSIIGN+QQQG RV +D+A S +GF+P  C
Sbjct: 62  IPVDSDGTFCFAFAPTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 170/376 (45%), Gaps = 39/376 (10%)

Query: 134 GSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSF 188
           G G Y T++ +GTPPR   + +DTGSD++WI C  C  C   +        FD   S + 
Sbjct: 80  GYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTA 139

Query: 189 ATVPCRSPLCR---KLDSSGCN-RRNTCLYQVSYGDGSITVGDFSTETLTF--------- 235
           A VPC  P+C    +  ++ C+ + N C Y   Y DGS T G + ++ + F         
Sbjct: 140 ALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTP 199

Query: 236 -RGTRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLV 288
                 A +  GC     G       A  G+LG G G LS  +Q   R    + FS+CL 
Sbjct: 200 ANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL- 258

Query: 289 DRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFK 348
            +        +V G+  +  +  ++PL+ +      Y + L  I+V G  V  I  ++F 
Sbjct: 259 -KGDGNGGGILVLGE-ILEPSIVYSPLVPS---QPHYNLNLQSIAVNG-QVLSINPAVFA 312

Query: 349 LDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEV 408
              +   G IIDSGT+++ L + AY  L +A     S    +   S    C+ +    + 
Sbjct: 313 --TSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATS-FISKGSQCYLVLTSIDD 369

Query: 409 KVPTVVLHFR-GADVSLPATNYLIP---VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVV 464
             PTV  +F  GA + L  + YL+     D +  +C  F     G++I+G++  +   VV
Sbjct: 370 SFPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVV 429

Query: 465 YDLAASRIGFAPRGCA 480
           YDLA  +IG+    C+
Sbjct: 430 YDLARQQIGWTNYDCS 445


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 114/401 (28%), Positives = 176/401 (43%), Gaps = 64/401 (15%)

Query: 123 FSSSVI---SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDP 178
           F SS I    G    +G YFT + VG+PPR  ++ +DTGSD+ WIQC APC  C    +P
Sbjct: 83  FDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP 142

Query: 179 VFDPAKSRSFATVPCRSPLC----RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLT 234
           ++ P K      VP +  LC    R L +  C     C Y++ Y D S ++G  +++ L 
Sbjct: 143 LYKPKKGN---LVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLH 199

Query: 235 FRGTRVARVAL----GCGHDNEGLFVAAA----GLLGLGRGRLSFPTQ--TGRRFNRKFS 284
                 +   L    GC +D +GL + +     G+LGL + ++S P+Q  + R  N    
Sbjct: 200 LMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLG 259

Query: 285 YCLVDRSTSAKPSSMVFGDSAVSRTAR-FTPLLANPKLDTFYYVELVGISVGGAHVRGIT 343
           +CL   +T      M  GD  V      + P+L +   +  Y+ +++ IS G   +    
Sbjct: 260 HCLTSDATGG--GYMFLGDDFVPYWGMAWVPMLNSHSPN--YHSQIMKISHGSRQL---- 311

Query: 344 ASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLF----DTC 399
            SL + D      V+ D+G+S T   + AY AL        +SLK   D  L     D  
Sbjct: 312 -SLGRQD-GRTERVVFDTGSSYTYFPKEAYYAL-------VASLKDVSDEGLIQDGSDPT 362

Query: 400 FDLSGKTEVKVPTVV----------LHFR------GADVSLPATNYLIPVDSSGTFCFAF 443
             +  + +  + +V+          L FR           +P   YLI + + G  C   
Sbjct: 363 LPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLI-ISNKGNVCLGI 421

Query: 444 ---AGTMSGLSII-GNIQQQGFRVVYDLAASRIGFAPRGCA 480
              +    G +II G+I  +G  VVYD    +IG+A   C 
Sbjct: 422 LDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCV 462


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 173/372 (46%), Gaps = 39/372 (10%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVP 192
           YFTR+ +G+PP+  ++ +DTGSD++W+ C+PC  C S +        F+P  S + + +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 193 CRSPLCR---KLDSSGCNRRNT--CLYQVSYGDGSITVGDFSTETLTFR--------GTR 239
           C    C    +   + C   +   C Y  +YGDGS T G + ++T+ F            
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236

Query: 240 VARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTS 293
            A +  GC +   G       A  G+ G G+ +LS  +Q        + FS+CL  + + 
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KGSD 294

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
                +V G+  V     +TPL+ +      Y + L  I V G  +  I +SLF    + 
Sbjct: 295 NGGGILVLGE-IVEPGLVYTPLVPS---QPHYNLNLESIVVNGQKLP-IDSSLFTT--SN 347

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
             G I+DSGT++  L   AY    +A  A  S   R+   S  + CF  S   +   PTV
Sbjct: 348 TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSL-VSKGNQCFVTSSSVDSSFPTV 406

Query: 414 VLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTM-SGLSIIGNIQQQGFRVVYDLA 468
            L+F G   +++   NYL+    +D++  +C  +       ++I+G++  +    VYDLA
Sbjct: 407 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 466

Query: 469 ASRIGFAPRGCA 480
             R+G+    C+
Sbjct: 467 NMRMGWTDYDCS 478


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 105/345 (30%), Positives = 150/345 (43%), Gaps = 36/345 (10%)

Query: 164 IQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRN--TCLYQVSYGDG 221
           +QC PC  CY Q DPVF+P  S S+A VPC S  C +LD   C+  +   C Y   Y   
Sbjct: 1   MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGH 60

Query: 222 SITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVA-AAGLLGLGRGRLSFPTQTGRRFN 280
            +T G  + + L   G     V  GC   + G   A A+GL+GLGRG LS  +Q      
Sbjct: 61  GVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSV--- 117

Query: 281 RKFSYCLVDRSTSAKPSSMVFG---DSAVSRTARFTPLLANP-KLDTFYYVELVGISVGG 336
            +F YCL     S     +V G   D+  + + R T  +++  +  ++YY+ L G++VG 
Sbjct: 118 HRFMYCL-PPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGD 176

Query: 337 ---AHVRGITA---------------SLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD 378
                 R  T+                +     A   G+I+D  ++++ L    Y  L D
Sbjct: 177 QTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELAD 236

Query: 379 AFRAGASSLKRAPDFSL-FDTCFDLS---GKTEVKVPTVVLHFRGADVSLPATNYLIPVD 434
                    +  P   L  D CF L    G   V VPTV L F G  + L      +   
Sbjct: 237 DLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFV--- 293

Query: 435 SSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           + G       G  SG+SI+GN Q Q  RV+++L   +I FA   C
Sbjct: 294 TDGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 441

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 116/430 (26%), Positives = 183/430 (42%), Gaps = 83/430 (19%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCA-----PCKKC-----YSQT 176
           +I  +A  +  Y   L +GTPP+   + LDTGSD+ W+ C       C +C      S+ 
Sbjct: 14  IIEPIATYTDGYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGTNTSYQCLECGNEHSISKP 73

Query: 177 DPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCL--------------------YQV 216
            P F  ++S S     C S  C  + SS  N  + C                     +  
Sbjct: 74  TPAFSLSQSYSSTRDLCGSRFCVDVHSSD-NSHDACAAAGCSIPVFMSGLCTRLCPPFAY 132

Query: 217 SYGDGSITVGDFSTETLTFRGT--------RVARVALGCGHDNEGLFVAAAGLLGLGRGR 268
           +YG  ++ +G  + +T+   G+               GC   +        G+ G G+G+
Sbjct: 133 TYGGRALVLGSLARDTIALHGSIYGISVPIEFPGFCFGCVGSS---IREPIGIAGFGKGK 189

Query: 269 LSFPTQTGRRFNRKFSYCLVDRSTSAKP---SSMVFGDSAVSRTA--RFTPLLANPKLDT 323
           LS P+Q G   ++ FS+C +    +  P   S MV GD A+S      FTP+L +     
Sbjct: 190 LSLPSQLGF-LDKGFSHCFLGFWFARNPNITSPMVIGDLALSVKDGFLFTPMLKSLTYPN 248

Query: 324 FYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAG 383
           FYY+ L G+++G         SL  +D  GNGGVI+D+GT+ T L+ P Y A   +  + 
Sbjct: 249 FYYIGLEGVTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDPFY-ASVLSSLSS 307

Query: 384 ASSLKRAPDFSL---FDTCFDL----SGKTEVKVPTVVLHFRGADVSL----PATNYLI- 431
                R+ +  +   FD C  +    +   + ++P + +H  G DV+L     +  Y + 
Sbjct: 308 TVPYNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHL-GGDVTLALPKESCYYAVT 366

Query: 432 -PVDSSGTFCFAFA-----GTMSG---------------LSIIGNIQQQGFRVVYDLAAS 470
            P +S    C  F      G  S                 +++G+ Q Q   VVYDL + 
Sbjct: 367 APRNSVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVVYDLESG 426

Query: 471 RIGFAPRGCA 480
           R+GF PR CA
Sbjct: 427 RVGFQPRDCA 436


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 163/379 (43%), Gaps = 51/379 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G Y+ ++G+GTP +  Y+ +DTGSD++W+ C  CK+C  ++       +++  +S S   
Sbjct: 78  GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKL 137

Query: 191 VPCRSPLCRKLDS---SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG--------TR 239
           V C    C ++     SGC    +C Y   YGDGS T G F  + + +          T 
Sbjct: 138 VSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197

Query: 240 VARVALGCGHDNEGLF-----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRST 292
              V  GCG    G        A  G+LG G+   S  +Q  +  R  + F++CL  R+ 
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNG 257

Query: 293 SAKPSSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
                  +F     V      TPL+ N      Y V +  + VG   +  I A LF+  P
Sbjct: 258 GG-----IFAIGRVVQPKVNMTPLVPN---QPHYNVNMTAVQVGQEFLN-IPADLFQ--P 306

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD---TCFDLSGKTEV 408
               G IIDSGT++  L    Y  L     +   +LK      + D    CF  SG+ + 
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALK----VHIVDKDYKCFQYSGRVDE 362

Query: 409 KVPTVVLHFRGAD-VSLPATNYLIPVDSSGTFCFAFAGTMS------GLSIIGNIQQQGF 461
             P V  HF  +  + +   +YL P +  G +C  +  +         ++++G++     
Sbjct: 363 GFPNVTFHFENSVFLRVYPHDYLFPYE--GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNK 420

Query: 462 RVVYDLAASRIGFAPRGCA 480
            V+YDL    IG+    C+
Sbjct: 421 LVLYDLENQLIGWTEYNCS 439


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 110/381 (28%), Positives = 164/381 (43%), Gaps = 54/381 (14%)

Query: 138 YFTRLGVGTPPRY--VYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVP--- 192
           Y   +GVGT   Y    + +D  +   W+QCAPC  C  Q +PVFDPAKS +F  V    
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHN 160

Query: 193 ---CRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF-----RGTRVARVA 244
              CR P     D         C + ++Y +G+   G  + +T +F         +  + 
Sbjct: 161 AVLCRPPYHPLQDGR-------CGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIV 213

Query: 245 LGCGH-----DNEGLFVAAAGLLGLGRGRLSFPT-----QTGRRFNRKFSYCLVDRSTSA 294
            GC +     D  G   A AG+LG+G G    P      Q       +FSYC +   T+A
Sbjct: 214 FGCANRIARFDTHG---ALAGVLGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTA 270

Query: 295 KPSSMVFGDSAVSRTA-----RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
             S + FG+   S+       +   +LA       YYV+L GISVG   V G+T  +F+ 
Sbjct: 271 Y-SFLRFGNDIPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFER 329

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAY----IALRDAFRAGASSLKRAPDFSLFDTCFDLSGK 405
           D  G GG  ID GT +T + + AY     A+R   +   +   ++P   L   C   +  
Sbjct: 330 DQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVRGHLQRNRARFVQSPGHHL---CVHRTPA 386

Query: 406 TEVKVPTVVLHFRGADVSLPATNYLIPVDSSGT-----FCFAFAGTMSGLSIIGNIQQQG 460
            E ++P++ LHF G         +L  V  S T      C         +++IG +QQ  
Sbjct: 387 IEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPDAE-MTVIGAMQQID 445

Query: 461 FRVVYDLAAS--RIGFAPRGC 479
            R ++DL  +   + F P  C
Sbjct: 446 TRFIFDLHNNIPIVSFNPEDC 466


>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
          Length = 431

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 134/424 (31%), Positives = 179/424 (42%), Gaps = 87/424 (20%)

Query: 104 ESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVW 163
           E  +  P  NR R R N   +  V                VGTPP+ V MVLDTGS++ W
Sbjct: 36  EVELEAPAANRLRFRHNVSLTVPV---------------AVGTPPQNVTMVLDTGSELSW 80

Query: 164 IQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSI 223
           + C       S   P+   +  R         P C    S      N C   +SY D S 
Sbjct: 81  LLCN-----GSYAPPLTRRSTRRWRGRDLPVPPFCDTPPS------NACRVSLSYADASS 129

Query: 224 TVGDFSTETLTFRG------------------TRVARVALGCGHDNEGLFVAAAGLLGLG 265
             G  +T+T    G                  +  A  + G G D   +  AA GLLG+ 
Sbjct: 130 ADGVLATDTFLLTGGAPPVAVGAYFGCITSYSSTTATNSNGTGTD---VSEAATGLLGMN 186

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDS-AVSRTARFTPLLAN----PK 320
           RG LSF TQTG    R+F+YC+   +    P  ++ GD   V+    +TPL+      P 
Sbjct: 187 RGTLSFVTQTG---TRRFAYCI---APGEGPGVLLLGDDGGVAPPLNYTPLIEISQPLPY 240

Query: 321 LDTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
            D   Y V+L GI VG A +  I  S+   D  G G  ++DSGT  T L   AY AL+  
Sbjct: 241 FDRVAYSVQLEGIRVGCALLP-IPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAE 299

Query: 380 FRAGASSLKR---APDFSL---FDTCFDLSGKTEVKV-------PTVVLHFRGADVSLPA 426
           F + A  L      P F     FD CF      E +V       P V L  RGA+V++  
Sbjct: 300 FTSQARLLLAPLGEPGFVFQGAFDACFR---GPEARVAAASGLLPEVGLVLRGAEVAVSG 356

Query: 427 TN--YLIPVDSSG------TFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAASRIGFA 475
               Y++P +  G       +C  F  + M+G+S  +IG+  QQ   V YDL   R+GFA
Sbjct: 357 EKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFA 416

Query: 476 PRGC 479
           P  C
Sbjct: 417 PARC 420


>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
          Length = 343

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 53/90 (58%), Positives = 68/90 (75%), Gaps = 1/90 (1%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSR 186
           V+SG+  GSGEYF+R+GVG+P R +YMVLDTGSDV W+QC PC  CY Q+DPVFDP+ S 
Sbjct: 156 VVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLST 215

Query: 187 SFATVPCRSPLCRKLDSSGC-NRRNTCLYQ 215
           S+A+V C +P C  LD++ C N    CLY+
Sbjct: 216 SYASVACDNPRCHDLDAAACRNSTGACLYE 245


>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 131/401 (32%), Positives = 171/401 (42%), Gaps = 83/401 (20%)

Query: 155 LDTGSDVVWIQCAP--CKKCYSQTDPVFDPAKSRSFAT---------------VPCRSPL 197
           LDTGSD+VW  C P  C  C S+  P   P    S AT               +P  S L
Sbjct: 98  LDTGSDLVWFPCRPFTCILCESKPLPPSPPPTLSSSATTVSCSSPSCSAAHSSLP-SSDL 156

Query: 198 C-------RKLDSSGCNRRNTCL--YQVSYGDGSITVGDFSTETLTFRGTRVARVALGCG 248
           C         +++  CN  +     +  +YGDGS+    FS ++L+     VA    GC 
Sbjct: 157 CAISNCPLDYIETGDCNTSSYPCPPFYYAYGDGSLVAKLFS-DSLSLPSVSVANFTFGCA 215

Query: 249 HDNEGLFVAAAGLLGLGRGRLSFPTQ---TGRRFNRKFSYCLVDRSTSA----KPSSMVF 301
           H          G+ G GRGRLS P Q           FSYCLV  S  +    +PS ++ 
Sbjct: 216 HTT---LAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLIL 272

Query: 302 G---DSAVSRTAR------------------FTPLLANPKLDTFYYVELVGISVGGAHVR 340
           G   D    R A                   FT +L NPK   FY V L GIS+G  ++ 
Sbjct: 273 GRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKHPYFYSVSLQGISIGKRNIP 332

Query: 341 GITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD----FSLF 396
              A L ++D  G GGV++DSGT+ T L    Y ++ + F +    +    D     S  
Sbjct: 333 A-PAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPSSGM 391

Query: 397 DTCFDLSGKTEVKVPTVVLHF--RGADVSLPATNYLIPVDSS----------GTFCFAFA 444
             C+ L+    VKVP +VLHF   G+ V+LP  NY                 G       
Sbjct: 392 SPCYYLN--QTVKVPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEEKRKVGCLMLMNG 449

Query: 445 GTMSGL-----SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           G  S L     +I+GN QQQGF VVYDL   R+GFA R CA
Sbjct: 450 GDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCA 490


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 160/355 (45%), Gaps = 20/355 (5%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-PVFDPAKSRSFATVPC 193
           +G++  ++ +G PP  + + + TGSD+VWI C   K C    D   FDP +S ++  VPC
Sbjct: 95  NGDFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPC 154

Query: 194 RSPLCRKLDSSGCNRRNTCLY------QVSYGDGSITVGDFSTETLTFRGTRVARVALGC 247
            S  C+  +++ C   + C Y      Q S  DG + +   +  + T +   +      C
Sbjct: 155 DSYRCQITNAATCQFSD-CFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFMLPNTGFIC 213

Query: 248 GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA-V 306
           G+   G +    G+LGLG G LS   +     + KFS+C+V  S S + S + FGD A V
Sbjct: 214 GNRIGGDY-PGVGILGLGHGSLSLLNRISHLIDGKFSHCIVPYS-SNQTSKLSFGDKAVV 271

Query: 307 SRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVT 366
           S +A F+  L        Y +   GISVG    + I+A     D   N G+ +DSGT  T
Sbjct: 272 SGSAMFSTRLDMTGGPYSYTLSFYGISVGN---KSISAGGIGSDYYMN-GLGMDSGTMFT 327

Query: 367 RLTRPAYIALRDAFRAGASSLKRAPDFS-LFDTCFDLSGKTEVKVPTVVLHFRGADVSLP 425
                 Y  L    R         PD +     C+  S   +   PT+ +HF G  V L 
Sbjct: 328 YFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRYS--PDFSPPTITMHFEGGSVELS 385

Query: 426 ATNYLIPVDSSGTFCFAFAGTMSGL-SIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           ++N  I + +    C AFA + S   ++ G  QQ    + YDL A  + F    C
Sbjct: 386 SSNSFIRM-TEDIVCLAFATSSSEQDAVFGYWQQTNLLIGYDLDAGFLSFLKTDC 439


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 111/375 (29%), Positives = 174/375 (46%), Gaps = 42/375 (11%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV------FDPAKSRSFA 189
           G Y+T++ +GTPP    + +DTGSDV+W+ C  C  C  QT  +      FDP  S + +
Sbjct: 73  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGC-PQTSGLQIQLNFFDPGSSSTSS 131

Query: 190 TVPCRSPLC----RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETL----TFRGT--- 238
            + C    C    +  D++  ++ N C Y   YGDGS T G + ++ +     F G+   
Sbjct: 132 MIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTT 191

Query: 239 -RVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRS 291
              A V  GC +   G       A  G+ G G+  +S  +Q   +    R FS+CL  + 
Sbjct: 192 NSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL--KG 249

Query: 292 TSAKPSSMVFGDSAVSRTARFTPLL-ANPKLDTFYYVELVGISVGGAHVRGITASLFKLD 350
            S+    +V G+  V     +T L+ A P     Y + L  I+V G  ++ I +S+F   
Sbjct: 250 DSSGGGILVLGE-IVEPNIVYTSLVPAQPH----YNLNLQSIAVNGQTLQ-IDSSVFA-- 301

Query: 351 PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKV 410
            + + G I+DSGT++  L   AY     A  A           S  + C+ ++       
Sbjct: 302 TSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTV-VSRGNQCYLITSSVTEVF 360

Query: 411 PTVVLHFR-GADVSLPATNYLIPVDSSG---TFCFAFAGTM-SGLSIIGNIQQQGFRVVY 465
           P V L+F  GA + L   +YLI  +S G    +C  F      G++I+G++  +   VVY
Sbjct: 361 PQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVY 420

Query: 466 DLAASRIGFAPRGCA 480
           DLA  RIG+A   C+
Sbjct: 421 DLAGQRIGWANYDCS 435


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 114/421 (27%), Positives = 181/421 (42%), Gaps = 57/421 (13%)

Query: 95  RVKSLTAFAESAVRVPPRNRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMV 154
           R +SL+A     VR   R  S    N G +     GL   +G YFT+LG+G+PPR  Y+ 
Sbjct: 32  RKRSLSAVRAHDVRRRGRILSAVDLNLGGN-----GLPTETGLYFTKLGLGSPPRDYYVQ 86

Query: 155 LDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVPCRSPLCRKLDSS---GC 206
           +DTGSD++W+ C  C +C  ++D      ++DP  S +   V C    C         GC
Sbjct: 87  VDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGPIPGC 146

Query: 207 NRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVARVALGCGHDNEGLF--- 255
                C Y ++YGDGS T G +  + LT+           + + +  GCG    G     
Sbjct: 147 KSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSS 206

Query: 256 --VAAAGLLGLGRGRLSFPTQTGR--RFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTAR 311
              A  G++G G+   S  +Q     +  + FS+CL     + +   +      V     
Sbjct: 207 SEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL----DNVRGGGIFAIGEVVEPKVS 262

Query: 312 FTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRP 371
            TPL+  P++   Y V L  I V    +  + + +F  D     G +IDSGT++  L   
Sbjct: 263 TTPLV--PRM-AHYNVVLKSIEV-DTDILQLPSDIF--DSVNGKGTVIDSGTTLAYLPDI 316

Query: 372 AYIALRDAFRAGASSLKRAPDFSLFDT-----CFDLSGKTEVKVPTVVLHFRGA-DVSLP 425
            Y  L          L R P   L+       CF  +G  +   P V LHF+ +  +++ 
Sbjct: 317 VYDELIQKV------LARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVY 370

Query: 426 ATNYLIPVDSSGTFCFAF----AGTMSG--LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
             +YL      G +C  +    A T +G  ++++G++      V+YDL    IG+    C
Sbjct: 371 PHDYLFQF-KDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMVIGWTDYNC 429

Query: 480 A 480
           +
Sbjct: 430 S 430


>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
 gi|194703714|gb|ACF85941.1| unknown [Zea mays]
          Length = 208

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 118/221 (53%), Gaps = 16/221 (7%)

Query: 262 LGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKL 321
           +GLG G  S  +QT     R FSYCL    +S+   ++     + +     TP+L + ++
Sbjct: 1   MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQV 60

Query: 322 DTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 381
            TFY V L  I VGG  +  I AS+F      + G ++DSGT +TRL   AY AL  AF+
Sbjct: 61  PTFYGVRLQAIRVGGRQLS-IPASVF------SAGTVMDSGTVITRLPPTAYSALSSAFK 113

Query: 382 AGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFC 440
           AG      A    + DTCFD SG++ V +P+V L F  GA VSL A+  ++      + C
Sbjct: 114 AGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNC 167

Query: 441 FAFAGTM--SGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            AFAG    S L IIGN+QQ+ F V+YD+    +GF    C
Sbjct: 168 LAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 177/375 (47%), Gaps = 41/375 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G YFTR+ +G P +  ++ +DTGSD++W+ C+PC  C + +        F+P  S + + 
Sbjct: 87  GLYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSR 146

Query: 191 VPCRSPLCRKLDSSG---CNRRNT----CLYQVSYGDGSITVGDFSTETLTFR------- 236
           +PC    C     +G   C   ++    C Y  +YGDGS T G + ++T+ F        
Sbjct: 147 IPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQ 206

Query: 237 -GTRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVD 289
                A V  GC +   G  +    A  G+ G G+ +LS  +Q  +     + FS+CL  
Sbjct: 207 TANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL-- 264

Query: 290 RSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
           + +      +V G+  V     FTPL+ +      Y + L  I+V G  +  I +SLF  
Sbjct: 265 KGSDNGGGILVLGE-IVEPGLVFTPLVPS---QPHYNLNLESIAVSGQKLP-IDSSLFAT 319

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
             +   G I+DSGT++  L   AY    +A  A  S   R+        CF  +   +  
Sbjct: 320 --SNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSSVDSS 376

Query: 410 VPTVVLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVY 465
            PT  L+F+G   +++   NYL+    VD++  +C  +  +  G++I+G++  +    VY
Sbjct: 377 FPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQ-GITILGDLVLKDKIFVY 435

Query: 466 DLAASRIGFAPRGCA 480
           DLA  R+G+A   C+
Sbjct: 436 DLANMRMGWADYDCS 450


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 168/374 (44%), Gaps = 48/374 (12%)

Query: 137 EYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--CYSQTDPVFDPAKSRSFATVPCR 194
            Y     +GTPP+ V  ++D   ++VW QCA C+   C+ Q  PVFDP+ S ++    C 
Sbjct: 61  HYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCG 120

Query: 195 SPLCRKLDSSGCNRRNTCLYQVS--YGDGSITVGDFSTETLTFRGTRVARVALGCGHDNE 252
           SPLC+ + +  C+    C Y+    +GD   T G  ST+ +   G    R+A GC   ++
Sbjct: 121 SPLCKSIPTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAI-GNAEGRLAFGCVVASD 176

Query: 253 GLFVAA----AGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA--- 305
           G    A    +G +GLGR   S     G+     FSYCL       K S++  G SA   
Sbjct: 177 GSIDGAMDGPSGFVGLGRTPWSL---VGQSNVTAFSYCLALHGPGKK-SALFLGASAKLA 232

Query: 306 -VSRTARFTPLL-------ANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
              ++   TPLL       ++   D +Y V+L GI  G   V   +        +G G +
Sbjct: 233 GAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAAS--------SGGGAI 284

Query: 358 II---DSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
            +   ++   ++ L   AY AL     A   S   A     FD CF  +  +   VP +V
Sbjct: 285 TVLQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVS--GVPDLV 342

Query: 415 LHFR-GADVSLPATNYLI-PVDSSGTFCFAFAGTM------SGLSIIGNIQQQGFRVVYD 466
             F+ GA ++   + YL+   + +GT C +   +        G+SI+G++ Q+    ++D
Sbjct: 343 FTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFD 402

Query: 467 LAASRIGFAPRGCA 480
           L    + F P  C+
Sbjct: 403 LEKETLSFEPADCS 416


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 164/388 (42%), Gaps = 58/388 (14%)

Query: 130 GLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAK 184
           GL   +G Y+T +G+GTP +  Y+ +DTGSD++W+ C  C +C  ++       ++DP  
Sbjct: 81  GLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKD 140

Query: 185 SRSFATVPCRSPLCRKLDSS---GCNRRNTCLYQVSYGDGSITVGDFSTETLTFR----- 236
           S + + V C    C         GC     C Y V+YGDGS T G F ++ L F      
Sbjct: 141 SSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGD 200

Query: 237 -GTRVAR--VALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCL 287
             TR A   V  GCG    G       A  G++G G+   S  +Q     +  + F++CL
Sbjct: 201 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 260

Query: 288 VDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLF 347
               T         G+  V    + TPL+ N      Y V L  I VGG  ++ + + +F
Sbjct: 261 ---DTINGGGIFAIGN-VVQPKVKTTPLVPNMP---HYNVNLKSIDVGGTALK-LPSHMF 312

Query: 348 KLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDT----CFDLS 403
             D     G IIDSGT++T L    Y  +  A  A      +  D +  +     CF   
Sbjct: 313 --DTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFA------KHKDITFHNVQEFLCFQYV 364

Query: 404 GKTEVKVPTVVLHFRGADVSLPATNYLIPVD-----SSGTFCFAF--AGTMS----GLSI 452
           G+ +   P +  HF      LP   Y  P D         +C  F   G  S    G+ +
Sbjct: 365 GRVDDDFPKITFHFEN---DLPLNVY--PHDYFFENGDNLYCVGFQNGGLQSKDGKGMVL 419

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +G++      VVYDL    IG+    C+
Sbjct: 420 LGDLVLSNKLVVYDLENQVIGWTEYNCS 447


>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
           Group]
          Length = 260

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/262 (38%), Positives = 131/262 (50%), Gaps = 21/262 (8%)

Query: 230 TETLTFRGTRVA--RVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFSYCL 287
           TET TF     A   +A GC   +EG F   +GL+GLGRG+LS  TQ        F Y L
Sbjct: 2   TETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFGYRL 58

Query: 288 VDRSTSAKPSSMVFG---DSAVSRTARF--TPLLANPKLDT--FYYVELVGISVGGAHVR 340
              S  + PS + FG   D        F  TPLL NP +    FYYV L GISVGG  V+
Sbjct: 59  --SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ 116

Query: 341 GITASLFKLD-PAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTC 399
            I +  F  D   G GGVI DSGT++T L  PAY  +RD   +     K  P  +  D  
Sbjct: 117 -IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI 175

Query: 400 FDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPV---DSSGTFCFAFAGTMSGLSIIGN 455
               G +    P++VLHF  GAD+ L   NYL  +   +     C++   +   L+IIGN
Sbjct: 176 CFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGN 235

Query: 456 IQQQGFRVVYDLAA-SRIGFAP 476
           I Q  F VV+DL+  +R+ F P
Sbjct: 236 IMQMDFHVVFDLSGNARMLFQP 257


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 158/399 (39%), Gaps = 74/399 (18%)

Query: 131 LAQGSGEYFTRLGVGTP--PRYVYMVLDTGSDVVWIQCAP--CKKCY-------SQTDPV 179
           LA GS +Y   L VG P     V + LDTGSD+VW  CAP  C  C        + + P+
Sbjct: 82  LAPGS-DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPL 140

Query: 180 FDPAKSRSFATVPCRSPLCRKLDSSG--------------------CNRRNTCLYQVSYG 219
             P  SR  +   C SPLC    SS                     C          +YG
Sbjct: 141 PPPIDSRRIS---CASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYG 197

Query: 220 DGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
           DGS+                V      C H          G+ G GRG LS P Q     
Sbjct: 198 DGSLVANLRRGRVGLAASMAVENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQ----- 249

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHV 339
                   +  S S    +   G S       +TPLL NPK   FY V L  +SVGG  +
Sbjct: 250 --------LAPSLSGSTDAAAIGASETDFV--YTPLLHNPKHPYFYSVALEAVSVGGKRI 299

Query: 340 RGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRD-----AFRAGASSLKRAPDFS 394
           +     L  +D  GNGG+++DSGT+ T L    +  + D        A  +  + A   +
Sbjct: 300 QA-QPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQT 358

Query: 395 LFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPATNYLIPVDSS-----GTFCFAFAGTMS 448
               C+  S  ++  VP V LHFRG A V+LP  NY +   S      G       G  +
Sbjct: 359 GLAPCYHYS-PSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNN 417

Query: 449 G--------LSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
                       +GN QQQGF VVYD+ A R+GFA R C
Sbjct: 418 DDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 456


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 162/379 (42%), Gaps = 51/379 (13%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G Y+ ++G+GTP +  Y+ +DTGSD++W+ C  CK+C  ++       +++  +S S   
Sbjct: 78  GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKL 137

Query: 191 VPCRSPLCRKLDS---SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRG--------TR 239
           V C    C ++     SGC    +C Y   YGDGS T G F  + + +          T 
Sbjct: 138 VSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197

Query: 240 VARVALGCGHDNEGLF-----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSYCLVDRST 292
              V  GCG    G        A  G+LG G+   S  +Q  +  R  + F++CL  R+ 
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNG 257

Query: 293 SAKPSSMVFG-DSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDP 351
                  +F     V      TPL+ N      Y V +  + VG   +  I A LF+  P
Sbjct: 258 GG-----IFAIGRVVQPKVNMTPLVPN---QPHYNVNMTAVQVGQEFLT-IPADLFQ--P 306

Query: 352 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFD---TCFDLSGKTEV 408
               G IIDSGT++  L    Y  L     +   +LK      + D    CF  SG+ + 
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALK----VHIVDKDYKCFQYSGRVDE 362

Query: 409 KVPTVVLHFRGAD-VSLPATNYLIPVDSSGTFCFAFAGTMS------GLSIIGNIQQQGF 461
             P V  HF  +  + +   +YL P    G +C  +  +         ++++G++     
Sbjct: 363 GFPNVTFHFENSVFLRVYPHDYLFP--HEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNK 420

Query: 462 RVVYDLAASRIGFAPRGCA 480
            V+YDL    IG+    C+
Sbjct: 421 LVLYDLENQLIGWTEYNCS 439


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 160/359 (44%), Gaps = 27/359 (7%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y TR+ +GTPP+   +++DTGS + ++ C+ C++C    DP F P  S ++  + C  
Sbjct: 90  GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-- 147

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDNE 252
                ++ +  +    C+Y   Y + S + G    + ++F      +  R   GC +   
Sbjct: 148 ----SMECTCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203

Query: 253 GLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           G   +  A G++GLGRG LS   Q   +     S+ L          +MV G   +S  A
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLG--GISPPA 261

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
                 ++P    +Y ++L  I + G  +  I   +F     G  G I+DSGT+   L  
Sbjct: 262 GMVFTHSDPARSAYYNIDLKEIHIAGKQLP-INPMVFD----GKYGTILDSGTTYAYLPE 316

Query: 371 PAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADVS 423
           PA+ A +DA     +SLK  + PD +  D CF   G    ++    P V L F  G  +S
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376

Query: 424 LPATNYLIP-VDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           L   NYL     + G +C   F       +++G I  +   V+YD    +IGF    C+
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 113/390 (28%), Positives = 173/390 (44%), Gaps = 36/390 (9%)

Query: 116 RGRANGGFSSSV---ISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP--CK 170
           R   +G  +SS+   IS ++     Y  +  +G+P    Y + D+GS +VW+QC    C+
Sbjct: 76  RSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCR 135

Query: 171 KCYSQTDPVFDPAKSRSFATVPCRSPLCRKL---DSSGCNRRN-TCLYQVSYGDGSITVG 226
            CY Q  P+F+P+KS ++    C +  CR     +   C + N  C Y   Y D S T G
Sbjct: 136 NCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEG 195

Query: 227 DFSTETLTFR------GTRVARVALGCGHDN-EGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
             ST+  TF       G    R+  GCG++N +       GL+GL   + S     G+  
Sbjct: 196 VISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKASL---VGQMD 252

Query: 280 NRKFSYCL-VDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELV-GISVGGA 337
             +FSYC+ +D   + K S  +    A S +   T L+  P  D +Y  + V GI V   
Sbjct: 253 VDQFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLV--PNSDGWYIFKNVDGIYVNEF 310

Query: 338 HVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSL-KRAPDFSLF 396
            V G  A +FK    G GG+ +D+GT+ T L       L        + + ++    S F
Sbjct: 311 EVEGYPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSNSGF 370

Query: 397 DTCF---DLSGKTEVKVPTVVLHF---RGADVSLPATNYLIPVDSSGTFCFAFAGTMSGL 450
           + C+   D  G T   +P + L F   +    S    N   P +     C A   T +G+
Sbjct: 371 ELCYFSDDFLGAT---LPDIELRFTDNKDTYFSFNTRNAWTP-NGRSQMCLAMFRT-NGM 425

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPR-GC 479
           SIIG  Q +  ++ YDL  + + F    GC
Sbjct: 426 SIIGMHQLRDIKIGYDLHHNIVSFTDAFGC 455


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 160/359 (44%), Gaps = 27/359 (7%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRS 195
           G Y TR+ +GTPP+   +++DTGS + ++ C+ C++C    DP F P  S ++  + C  
Sbjct: 90  GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-- 147

Query: 196 PLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDNE 252
                ++ +  +    C+Y   Y + S + G    + ++F      +  R   GC +   
Sbjct: 148 ----SMECTCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203

Query: 253 GLFVA--AAGLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTA 310
           G   +  A G++GLGRG LS   Q   +     S+ L          +MV G   +S  A
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLG--GISPPA 261

Query: 311 RFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTR 370
                 ++P    +Y ++L  I + G  +  I   +F     G  G I+DSGT+   L  
Sbjct: 262 GMVFTHSDPARSAYYNIDLKEIHIAGKQLP-INPMVFD----GKYGTILDSGTTYAYLPE 316

Query: 371 PAYIALRDAFRAGASSLK--RAPDFSLFDTCFDLSGKTEVKV----PTVVLHF-RGADVS 423
           PA+ A +DA     +SLK  + PD +  D CF   G    ++    P V L F  G  +S
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376

Query: 424 LPATNYLIP-VDSSGTFCFA-FAGTMSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           L   NYL     + G +C   F       +++G I  +   V+YD    +IGF    C+
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 116/379 (30%), Positives = 170/379 (44%), Gaps = 59/379 (15%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCR 194
           G Y+  + +G PP+  ++ +D+GSD+ W+QC APC+ C     P++ P KS+    VPC 
Sbjct: 64  GLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSK---LVPCV 120

Query: 195 SPLCRKLDS--SGCNR----RNTCLYQVSYGDGSITVGDFSTETLTFRGTR--VAR--VA 244
             LC  L +  +G +R       C Y + Y D   + G    ++   R T   VAR  VA
Sbjct: 121 HRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVA 180

Query: 245 LGCGHDNE----GLFVAAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAKPSS 298
            GCG+D +     L     G+LGLG G +S  +Q  +R   K    +CL  R        
Sbjct: 181 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGF---- 236

Query: 299 MVFGDSAVS-RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGV 357
           + FGD  V  + A +TP +A      +Y      +  G    R +   L K        V
Sbjct: 237 LFFGDDLVPYQRATWTP-MARSAFRNYYSPGSASLYFGD---RSLGVRLAK--------V 284

Query: 358 IIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSL---------FDTCFDLSGKTE 407
           + DSG+S T      Y AL  A + G S +L+  PD SL         F +  D+  + E
Sbjct: 285 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDV--RKE 342

Query: 408 VKVPTVVLHFRGAD---VSLPATNYLIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQG 460
            K  ++VL+F       + +P  NYLI V  +G  C          +  LSIIG+I  Q 
Sbjct: 343 FK--SLVLNFASGKKTLMEIPPENYLI-VTENGNACLGILNGSEIGLKDLSIIGDITMQD 399

Query: 461 FRVVYDLAASRIGFAPRGC 479
             V+YD    +IG+    C
Sbjct: 400 HMVIYDNEKGKIGWIRAPC 418


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/377 (28%), Positives = 169/377 (44%), Gaps = 53/377 (14%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G YFT++ +G+PP+  ++ +DTGSD++W+ C PC +C S+T+      +FD   S +   
Sbjct: 72  GLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKK 131

Query: 191 VPCRSPLCRKL-DSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GTRVA 241
           V C    C  +  S  C     C Y + Y D S + G+F  + LT          G    
Sbjct: 132 VGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQ 191

Query: 242 RVALGCGHDNEGLF----VAAAGLLGLGRGRLSFPTQ---TGRRFNRKFSYCLVDRSTSA 294
            V  GCG D  G       A  G++G G+   S  +Q   TG    R FS+CL D     
Sbjct: 192 EVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDA-KRVFSHCL-DNVKGG 249

Query: 295 KPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
              ++   DS   +T   TP++ N      Y V L+G+ V G  +  +  S+ +     N
Sbjct: 250 GIFAVGVVDSPKVKT---TPMVPN---QMHYNVMLMGMDVDGTALD-LPPSIMR-----N 297

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPD--FSLFDT--CFDLSGKTEVKV 410
           GG I+DSGT++    +  Y +L +        L R P     + DT  CF  S   +V  
Sbjct: 298 GGTIVDSGTTLAYFPKVLYDSLIETI------LARQPVKLHIVEDTFQCFSFSENVDVAF 351

Query: 411 PTVVLHFRGA-DVSLPATNYLIPVDSSGTFCFAF------AGTMSGLSIIGNIQQQGFRV 463
           P V   F  +  +++   +YL  ++    +CF +       G  + + ++G++      V
Sbjct: 352 PPVSFEFEDSVKLTVYPHDYLFTLEKE-LYCFGWQAGGLTTGERTEVILLGDLVLSNKLV 410

Query: 464 VYDLAASRIGFAPRGCA 480
           VYDL    IG+A   C+
Sbjct: 411 VYDLENEVIGWADHNCS 427


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 168/372 (45%), Gaps = 37/372 (9%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G YFT++ +GTPP    + +DTGSD++W+ C  C  C   +        FD + S S + 
Sbjct: 77  GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136

Query: 191 VPCRSPLCR---KLDSSGC-NRRNTCLYQVSYGDGSITVGDFSTETLTFR--------GT 238
           V C  P+C    +  ++ C  + N C Y   YGDGS T G + +E++ F           
Sbjct: 137 VSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIAN 196

Query: 239 RVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRST 292
             A V  GC     G       A  G+ G G G LS  +Q   R    + FS+CL  +  
Sbjct: 197 SSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL--KGE 254

Query: 293 SAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA 352
                 +V G+  +     ++PL+ +      Y + L  ISV G  +  I  S+F    +
Sbjct: 255 GNGGGILVLGE-VLEPGIVYSPLVPS---QPHYNLYLQSISVNGQTLP-IDPSVFA--TS 307

Query: 353 GNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPT 412
            N G IIDSGT++  L   AY     A  A A S    P  S  + C+ +S       P 
Sbjct: 308 INRGTIIDSGTTLAYLVEEAYTPFVSAITA-AVSQSVTPTISKGNQCYLVSTSVGEIFPL 366

Query: 413 VVLHFRG-ADVSLPATNYLIPV---DSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLA 468
           V L+F G A + L    YL+ +   D +  +C  F     G++I+G++  +    VYDLA
Sbjct: 367 VSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLA 426

Query: 469 ASRIGFAPRGCA 480
             RIG+A   C+
Sbjct: 427 RQRIGWASYDCS 438


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 170/380 (44%), Gaps = 60/380 (15%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPCR 194
           G Y+  + +G PP+  ++ +D+GSD+ W+QC APC+ C     P++ P KS+    VPC 
Sbjct: 62  GLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSK---LVPCV 118

Query: 195 SPLCRKLDSS---GCNR----RNTCLYQVSYGDGSITVGDFSTETLTFRGTR--VAR--V 243
             LC  L ++   G +R       C Y + Y D   + G    ++   R T   VAR  V
Sbjct: 119 HRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLTNGSVARPSV 178

Query: 244 ALGCGHDNE----GLFVAAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAKPS 297
           A GCG+D +     L     G+LGLG G +S  +Q  +R   K    +CL  R       
Sbjct: 179 AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGF--- 235

Query: 298 SMVFGDSAVS-RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGG 356
            + FGD  V  + A +TP +A      +Y      +  G    R +   L K        
Sbjct: 236 -LFFGDDLVPYQRATWTP-MARSAFRNYYSPGSASLYFGD---RSLGVRLAK-------- 282

Query: 357 VIIDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSL---------FDTCFDLSGKT 406
           V+ DSG+S T      Y AL  A + G S +L+  PD SL         F +  D+  + 
Sbjct: 283 VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDV--RK 340

Query: 407 EVKVPTVVLHFRGAD---VSLPATNYLIPVDSSGTFCFAFAG----TMSGLSIIGNIQQQ 459
           E K  ++VL+F       + +P  NYLI V  +G  C          +  LSIIG+I  Q
Sbjct: 341 EFK--SLVLNFASGKKTLMEIPPENYLI-VTENGNACLGILNGSEIGLKDLSIIGDITMQ 397

Query: 460 GFRVVYDLAASRIGFAPRGC 479
              V+YD    +IG+    C
Sbjct: 398 DHMVIYDNEKGKIGWIRAPC 417


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 125/432 (28%), Positives = 194/432 (44%), Gaps = 51/432 (11%)

Query: 87  LRIQRDVLRVKSLTAFAESAVRVPPRNR-SRGRANGGFSSSVISGLAQGS------GEYF 139
           LR+QR V          E   R   R+R SR R  GG +  V+    +GS      G YF
Sbjct: 36  LRLQRAVPHQG--VPLEELRRRDAARHRVSRRRLLGGVAG-VVDFPVEGSANPYMVGLYF 92

Query: 140 TRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVPCR 194
           TR+ +G P +  ++ +DTGSD++W+ C+PC  C + +        F+P  S + + + C 
Sbjct: 93  TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 152

Query: 195 SPLCRKLDSSG---CNRRNT----CLYQVSYGDGSITVGDFSTETLTFR--------GTR 239
              C     +G   C   N+    C Y  +YGDGS T G + ++T+ F            
Sbjct: 153 DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 212

Query: 240 VARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTS 293
            A +  GC +   G       A  G+ G G+ +LS  +Q        + FS+CL  + + 
Sbjct: 213 SASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSD 270

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
                +V G+  V     +TPL+ +      Y + L  I+V G  +  I +SLF    + 
Sbjct: 271 NGGGILVLGE-IVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKLP-IDSSLFTT--SN 323

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
             G I+DSGT++  L   AY     A  A  S   R+   S    CF  S   +   PTV
Sbjct: 324 TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGSQCFITSSSVDSSFPTV 382

Query: 414 VLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTM-SGLSIIGNIQQQGFRVVYDLA 468
            L+F G   +S+   NYL+    VD+S  +C  +       ++I+G++  +    VYDLA
Sbjct: 383 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 442

Query: 469 ASRIGFAPRGCA 480
             R+G+A   C+
Sbjct: 443 NMRMGWADYDCS 454


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/388 (27%), Positives = 171/388 (44%), Gaps = 55/388 (14%)

Query: 129 SGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPA 183
           SG     G Y+ ++G+GTPP+  Y+ +DTGSD++W+ C  CK+C ++++      ++D  
Sbjct: 76  SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIK 135

Query: 184 KSRSFATVPCRSPLCRKLDS---SGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR---- 236
           +S S   VPC    C++++    +GC    +C Y   YGDGS T G F  + + +     
Sbjct: 136 ESSSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 195

Query: 237 ----GTRVARVALGCGHDNEGLF-----VAAAGLLGLGRGRLSFPTQ--TGRRFNRKFSY 285
                +    +  GCG    G        A  G+LG G+   S  +Q  +  +  + F++
Sbjct: 196 DLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAH 255

Query: 286 CLVDRSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITAS 345
           CL           +      V      TPLL +      Y V +  + VG A +   T +
Sbjct: 256 CL----NGVNGGGIFAIGHVVQPKVNMTPLLPD---QPHYSVNMTAVQVGHAFLSLSTDT 308

Query: 346 LFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDF---SLFD--TCF 400
             + D     G IIDSGT++  L    Y  L   ++     + + PD    +L D  TCF
Sbjct: 309 STQGDRK---GTIIDSGTTLAYLPEGIYEPL--VYKI----ISQHPDLKVRTLHDEYTCF 359

Query: 401 DLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTF-CFAF--AGTMS----GLSI 452
             S   +   P V  +F  G  + +   +YL P   SG F C  +  +GT S     +++
Sbjct: 360 QYSESVDDGFPAVTFYFENGLSLKVYPHDYLFP---SGDFWCIGWQNSGTQSRDSKNMTL 416

Query: 453 IGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +G++      V YDL    IG+    C+
Sbjct: 417 LGDLVLSNKLVFYDLENQVIGWTEYNCS 444


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 125/432 (28%), Positives = 194/432 (44%), Gaps = 51/432 (11%)

Query: 87  LRIQRDVLRVKSLTAFAESAVRVPPRNR-SRGRANGGFSSSVISGLAQGS------GEYF 139
           LR+QR V          E   R   R+R SR R  GG +  V+    +GS      G YF
Sbjct: 34  LRLQRAVPHKG--VPLEELRRRDAARHRVSRRRLLGGVAG-VVDFPVEGSANPYMVGLYF 90

Query: 140 TRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFATVPCR 194
           TR+ +G P +  ++ +DTGSD++W+ C+PC  C + +        F+P  S + + + C 
Sbjct: 91  TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 150

Query: 195 SPLCRKLDSSG---CNRRNT----CLYQVSYGDGSITVGDFSTETLTFR--------GTR 239
              C     +G   C   N+    C Y  +YGDGS T G + ++T+ F            
Sbjct: 151 DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 210

Query: 240 VARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTS 293
            A +  GC +   G       A  G+ G G+ +LS  +Q        + FS+CL  + + 
Sbjct: 211 SASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSD 268

Query: 294 AKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAG 353
                +V G+  V     +TPL+ +      Y + L  I+V G  +  I +SLF    + 
Sbjct: 269 NGGGILVLGE-IVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKLP-IDSSLFTT--SN 321

Query: 354 NGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTV 413
             G I+DSGT++  L   AY     A  A  S   R+   S    CF  S   +   PTV
Sbjct: 322 TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGSQCFITSSSVDSSFPTV 380

Query: 414 VLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTM-SGLSIIGNIQQQGFRVVYDLA 468
            L+F G   +S+   NYL+    VD+S  +C  +       ++I+G++  +    VYDLA
Sbjct: 381 TLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 440

Query: 469 ASRIGFAPRGCA 480
             R+G+A   C+
Sbjct: 441 NMRMGWADYDCS 452


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 146/323 (45%), Gaps = 55/323 (17%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TR+ +GTPP+   +++DTGS V ++ C+ C++C    DP F+P  S ++  V C 
Sbjct: 87  NGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSC- 145

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRV---ARVALGCGHDN 251
                 +D +  N R  C+Y+  Y + S + G    + ++F         R   GC +  
Sbjct: 146 -----NIDCTCDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQE 200

Query: 252 EGLFVA--AAGLLGLGRGRLSFPTQTGRR--FNRKFSYCL--VDRSTSAK-------PSS 298
            G   +  A G++GLGRG LS   Q   +   +  FS C   +D    A        PS 
Sbjct: 201 TGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISPPSG 260

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPA---GNG 355
           MVF +S             +P    +Y ++L  I V G  +         LDP+   G  
Sbjct: 261 MVFAES-------------DPVRSQYYNIDLKAIHVAGKQLH--------LDPSIFDGKH 299

Query: 356 GVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKR--APDFSLFDTCF-----DLSGKTEV 408
           G ++DSGT+   L   A+ A +DA     +SLK+   PD +  D CF     D+S  +  
Sbjct: 300 GTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQLSNT 359

Query: 409 KVPTVVLHF-RGADVSLPATNYL 430
             P V + F  G  +SL   NYL
Sbjct: 360 -FPAVEMVFSNGQKLSLSPENYL 381


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 101/326 (30%), Positives = 153/326 (46%), Gaps = 36/326 (11%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCR 194
           +G Y TRL +GTPP+   +++D+GS V ++ CA C++C +  DP F P  S S++ V C 
Sbjct: 86  NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC- 144

Query: 195 SPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTF---RGTRVARVALGCGHDN 251
                 +D +  + +  C Y+  Y + S + G    + ++F      +  R   GC +  
Sbjct: 145 -----NVDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKAQRAVFGCENSE 199

Query: 252 EG-LFVAAA-GLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSAKPSSMVFGDSAVS 307
            G LF   A G++GLGRG+LS   Q   +   N  FS C           +MV G     
Sbjct: 200 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGG--GAMVLGGVPTP 257

Query: 308 RTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTR 367
               F+   ++P    +Y +EL  I V G  +R + + +F        G ++DSGT+   
Sbjct: 258 SDMVFS--RSDPLRSPYYNIELKEIHVAGKALR-VDSRIFD----SKHGTVLDSGTTYAY 310

Query: 368 LTRPAYIALRDAFRAGASSLK--RAPDFSLFDTCF-----DLSGKTEVKVPTVVLHF-RG 419
           L   A++A +DA  +   SLK  R PD S  D CF     ++S   EV  P V + F  G
Sbjct: 311 LPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEV-FPDVDMVFGNG 369

Query: 420 ADVSLPATNYLI---PVDSSGTFCFA 442
             +SL   NYL     VD  G +C  
Sbjct: 370 QKLSLTPENYLFRHSKVD--GAYCLG 393


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 172/376 (45%), Gaps = 41/376 (10%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G YFTR+ +G P +  ++ +DTGSD++W+ C+PC  C + +        F+P  S + + 
Sbjct: 3   GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 62

Query: 191 VPCRSPLCRKLDSSG---CNRRNT----CLYQVSYGDGSITVGDFSTETLTFR------- 236
           + C    C     +G   C   N+    C Y  +YGDGS T G + ++T+ F        
Sbjct: 63  ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 122

Query: 237 -GTRVARVALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVD 289
                A +  GC +   G       A  G+ G G+ +LS  +Q        + FS+CL  
Sbjct: 123 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-- 180

Query: 290 RSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
           + +      +V G+  V     +TPL+ +      Y + L  I+V G  +  I +SLF  
Sbjct: 181 KGSDNGGGILVLGE-IVEPGLVYTPLVPSQP---HYNLNLESIAVNGQKLP-IDSSLFT- 234

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
             +   G I+DSGT++  L   AY     A  A  S   R+   S    CF  S   +  
Sbjct: 235 -TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGSQCFITSSSVDSS 292

Query: 410 VPTVVLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTM-SGLSIIGNIQQQGFRVV 464
            PTV L+F G   +S+   NYL+    VD+S  +C  +       ++I+G++  +    V
Sbjct: 293 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 352

Query: 465 YDLAASRIGFAPRGCA 480
           YDLA  R+G+A   C+
Sbjct: 353 YDLANMRMGWADYDCS 368


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 111/375 (29%), Positives = 166/375 (44%), Gaps = 48/375 (12%)

Query: 138 YFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPV----------FDPAKSRS 187
           Y+TRL +G+PPR  Y+ +DTGSDV+W+ C+ C  C     PV          FDP  S +
Sbjct: 90  YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGC-----PVSSGLHIPLNFFDPGSSPT 144

Query: 188 FATVPCRSPLC----RKLDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFR---GTRV 240
            + + C    C    +  DS    + N C Y   YGDGS T G + ++ L F    G  V
Sbjct: 145 ASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSV 204

Query: 241 AR-----VALGCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVD 289
            +     +  GC     G       A  G+ G G+  +S  +Q   +    R FS+CL  
Sbjct: 205 MKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCL-- 262

Query: 290 RSTSAKPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKL 349
           +   +    +V G+  V     +TPL+ +      Y + L  I V G     I  S+F  
Sbjct: 263 KGDDSGGGILVLGE-IVEPNIVYTPLVPS---QPHYNLNLQSIYVNG-QTLAIDPSVFAT 317

Query: 350 DPAGNGGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVK 409
             + N G IIDSGT++  LT  AY     A  +  S    +P  S  + C+  S      
Sbjct: 318 --SSNQGTIIDSGTTLAYLTEAAYDPFISAITSTVSP-SVSPYLSKGNQCYLTSSSINDV 374

Query: 410 VPTVVLHFRGA-DVSLPATNYLIP---VDSSGTFCFAFAGTM-SGLSIIGNIQQQGFRVV 464
            P V L+F G   + L   +YLI    ++ +  +C  F       ++I+G++  +    V
Sbjct: 375 FPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFV 434

Query: 465 YDLAASRIGFAPRGC 479
           YD+A  RIG+A   C
Sbjct: 435 YDIAGQRIGWANYDC 449


>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 450

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 134/420 (31%), Positives = 179/420 (42%), Gaps = 73/420 (17%)

Query: 107 VRVPPR---NRSRGRANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVW 163
           V  PPR   NR R R N   + SV+               VGTPP+ V MVLDTGS++  
Sbjct: 46  VAPPPRALANRLRFRHNVSLTVSVV---------------VGTPPQNVTMVLDTGSELSG 90

Query: 164 IQCAPCKKCYSQTDPV-FDPAKSRSFATVPCRSPLC----RKLDSSG-CNR--RNTCLYQ 215
           + C       S + P  F+ + S +++ V C SP C    R L     C+     +C   
Sbjct: 91  LLC----NGSSLSPPAPFNASASLTYSAVDCSSPACVWRGRDLPVRPFCDAPPSTSCRVS 146

Query: 216 VSYGDGSITVGDFSTETLTFRGTRVARVALGC----------GHDNEGLFVAAAGLLGLG 265
           +SY D S   G    +T    GT+      GC                   AA GLLG+ 
Sbjct: 147 ISYADASSADGHLVADTFIL-GTQAVPALFGCITSYSSSTAINSSATDPSEAATGLLGMN 205

Query: 266 RGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSAVSRTARFTPLLAN----PKL 321
           RG LSF TQT      +F+YC+              G +A      +TPL+      P  
Sbjct: 206 RGSLSFVTQTA---TLRFAYCIAPGQGPGILLLGGDGGAA--PPLNYTPLIEISQPLPYF 260

Query: 322 DTFYY-VELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDAF 380
           D   Y V+L GI VG A ++ I  S+   D  G G  ++DSGT  T L   AY AL+  F
Sbjct: 261 DRVAYSVQLEGIRVGSALLQ-IPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEF 319

Query: 381 RAGASSLKR---APDFSL---FDTCF----DLSGKTEVKVPTVVLHFRGADVSLPATN-- 428
              A SL      P F     FD CF    +        +P V L  RGA+V++      
Sbjct: 320 LNQARSLLAPLGEPGFVFQGAFDACFRGPEERVSAASRLLPEVGLVLRGAEVAVAGEKLL 379

Query: 429 YLIPVDSSG------TFCFAFAGT-MSGLS--IIGNIQQQGFRVVYDLAASRIGFAPRGC 479
           Y +P +  G       +C  F  + M+G+S  +IG+  QQ   V YDL   R+GFAP  C
Sbjct: 380 YSVPGERRGEEGAEAVWCLTFGNSDMAGMSAYVIGHHHQQDVWVEYDLQNGRVGFAPARC 439


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 125/396 (31%), Positives = 173/396 (43%), Gaps = 58/396 (14%)

Query: 127 VISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKK--------------- 171
           V S L  G  EY   + VGTPP     V DTGSD+VW++C   +                
Sbjct: 71  VSSDLFYGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNS 130

Query: 172 ----CYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG-CN-RRNTCLYQVSYGDGSITV 225
                  +    F+P  S S++ V C  P C  L ++  CN   + C ++ SY DG+   
Sbjct: 131 SPPPPPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASAT 190

Query: 226 GDFSTETLTFRG------TRVARVALGCGHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRF 279
           G  + +T TF G      T  A +  GC     G    A G++GLG G LS  +Q G   
Sbjct: 191 GLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQLG--- 247

Query: 280 NRKFSYCLVDRSTSAKPSSMVFGDSAVSRT--ARFTPLLANPKLDTFYY-VELVGISVGG 336
            RKFS+CL         S + FG  AV     A  TPL+A+      YY + +  + V G
Sbjct: 248 -RKFSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAG 306

Query: 337 AHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIA-LRDAFR--AGASSLKRA--P 391
             V G T S+ K        VI+D+GT +T L R A +A L ++       + L RA  P
Sbjct: 307 QPVPG-TTSVSK--------VIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPP 357

Query: 392 DFSLFDTCFDLSGKTEVK--VPTVVLHF---RGADVSLPATNYLIPVDSSGTFCFAFAGT 446
           D +L + C+D+S   +V   +P V L      G +V L      + V   G  C A   T
Sbjct: 358 DETL-ELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLV-KEGVLCLAVVTT 415

Query: 447 ---MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
              +  LS++GN+  Q   V  DL A    FA   C
Sbjct: 416 SPELQPLSVLGNVALQDLHVGIDLDARTATFATANC 451


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 163/361 (45%), Gaps = 40/361 (11%)

Query: 144 VGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDS 203
           +GTPP+    ++D   ++VW QC+ C +C+ Q  P+F P  S +F   PC +  C+   +
Sbjct: 49  IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108

Query: 204 SGCNRRNTCLYQVSYG---DGSITVGDFSTETLTFRGTRVARVALGCGHDNE-GLFVAAA 259
           S C+  + C Y+ +     D   T+G   TET    GT  A +A GC   ++       +
Sbjct: 109 SNCS-GDVCTYESTTNIRLDRHTTLGIVGTETFAI-GTATASLAFGCVVASDIDTMDGTS 166

Query: 260 GLLGLGRGRLSFPTQTGRRFNRKFSYCLVDRSTSAKPSSMVFGDSA------VSRTARFT 313
           G +GLGR   S   Q       KFSYCL  R T  K S +  G SA       + TA F 
Sbjct: 167 GFIGLGRTPRSLVAQMKL---TKFSYCLSPRGT-GKSSRLFLGSSAKLAGGESTSTAPF- 221

Query: 314 PLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTS-VTRLTRPA 372
            +  +P  D+ +Y  L         +  I A    +  A +GG+++    S  + L   A
Sbjct: 222 -IKTSPDDDSHHYYLL--------SLDAIRAGNTTIATAQSGGILVMHTVSPFSLLVDSA 272

Query: 373 YIALRDAFR---AGASSLKRAPDFSLFDTCF-DLSGKTEVKVPTVVLHFRG-ADVSLPAT 427
           Y A + A      GA+    A     FD CF   +G +    P +V  F+G A +++P  
Sbjct: 273 YRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPA 332

Query: 428 NYLIPV-DSSGTFCFAFAGT-------MSGLSIIGNIQQQGFRVVYDLAASRIGFAPRGC 479
            YLI V +   T C A           + G+S++G++QQ+    +YDL    + F P  C
Sbjct: 333 KYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADC 392

Query: 480 A 480
           +
Sbjct: 393 S 393


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 110/390 (28%), Positives = 169/390 (43%), Gaps = 42/390 (10%)

Query: 111 PRNRSRGR--ANGGFSSSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP 168
           PR   RGR  A+GG   +V+               +GTPP+     +D   ++VW QC+ 
Sbjct: 27  PRRAMRGRLLADGG--GAVVPFHWSPELYNVANFTIGTPPQAASAFIDLTGELVWTQCSQ 84

Query: 169 CKKCYSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSGCNRRNTCLYQVSYGDGSITVGDF 228
           C  C+ Q  PVF P  S +F   PC + +C+ + +  C   + C Y    G G  TVG  
Sbjct: 85  CIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKC-ASDVCAYDGVTGLGGHTVGIV 143

Query: 229 STETLTFRGTRVARVALGC----GHDNEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRKFS 284
           +T+T        A +  GC      D  G     +G +GLGR   S   Q   +  R FS
Sbjct: 144 ATDTFAIGTAAPASLGFGCVVASDIDTMG---GPSGFIGLGRTPWSLVAQ--MKLTR-FS 197

Query: 285 YCLVDRSTSAKPSSMVFGDSA-VSRTARFTPLLA---NPKLDTFYYVELVGISVGGAHVR 340
           YCL    T  K S +  G SA ++    +TP +    N  +  +Y +EL  I  G A + 
Sbjct: 198 YCLAPHDT-GKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATIT 256

Query: 341 GITASLFKLDPAGNGGVIIDSG-TSVTRLTRPAYIALRDAFRAGASSLKRA-PDFSLFDT 398
                     P G   V++ +    V+ L    Y   + A  A   +   A P  + F+ 
Sbjct: 257 ---------MPRGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAPFEV 307

Query: 399 CFDLSGKTEVKVPTVVLHFR-GADVSLPATNYLIPVDSSGTFCFAFAG-------TMSGL 450
           CF  +G +    P +V  F+ GA +++P  NYL  V +  T C +           + GL
Sbjct: 308 CFPKAGVS--GAPDLVFTFQAGAALTVPPANYLFDVGND-TVCLSVMSIALLNITALDGL 364

Query: 451 SIIGNIQQQGFRVVYDLAASRIGFAPRGCA 480
           +I+G+ QQ+   +++DL    + F P  C+
Sbjct: 365 NILGSFQQENVHLLFDLDKDMLSFEPADCS 394


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 134/429 (31%), Positives = 190/429 (44%), Gaps = 69/429 (16%)

Query: 112 RNRSRGRANGGFSS--SVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAP- 168
            +  +G ++GG  S  +  +      G Y     +GTPP+ + ++LDTGS + W+ C   
Sbjct: 75  HHSQKGSSSGGHKSIPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPCTSN 134

Query: 169 --CKKC---YSQTDPVFDPAKSRSFATVPCRSPLCRKLDSSG--------CNRRNTCL-- 213
             C+ C   ++   PVF P  S S   V CR+P C  + S+         C+R   C   
Sbjct: 135 YDCRNCSSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTPA 194

Query: 214 ------YQVSYGDGSITVGDFSTETLTFRGTRVARVALGCGHDNEGLFVAAAGLLGLGRG 267
                 Y V YG GS T G    +TL   G  V+   LGC      +    +GL G GRG
Sbjct: 195 SNVCPPYAVVYGSGS-TAGLLIADTLRAPGRAVSGFVLGC--SLVSVHQPPSGLAGFGRG 251

Query: 268 RLSFPTQTGRRFNRKFSYCLVDR---STSAKPSSMVFGDSAVSRTARFTPLLANPKLD-- 322
             S P Q G     KFSYCL+ R     +A   S+V G    +   ++ PL+ +   D  
Sbjct: 252 APSVPAQLGL---SKFSYCLLSRRFDDNAAVSGSLVLGGD--NDGMQYVPLVKSAAGDKQ 306

Query: 323 ---TFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVIIDSGTSVTRLTRPAYIALRDA 379
               +YY+ L G++VGG  VR + A  F  + AG+GG I+DSGT+ T L    +  + DA
Sbjct: 307 PYAVYYYLALSGVTVGGKAVR-LPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADA 365

Query: 380 FRAGASS-LKRAPDFSL---FDTCFDL-SGKTEVKVPTVVLHFRGADV-SLPATNYLI-- 431
             A      KR+ D         CF L  G   + +P + LHF+G  V  LP  NY +  
Sbjct: 366 VVAAVGGRYKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQLPLENYFVVA 425

Query: 432 ---PVD-------SSGTFCFAFAGTMSGLS----------IIGNIQQQGFRVVYDLAASR 471
              PV        ++   C A      G            I+G+ QQQ + V YDL   R
Sbjct: 426 GRAPVPGAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKER 485

Query: 472 IGFAPRGCA 480
           +GF  + CA
Sbjct: 486 LGFRRQPCA 494


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 117/371 (31%), Positives = 171/371 (46%), Gaps = 37/371 (9%)

Query: 136 GEYFTRLGVGTPPRYVYMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFDPAKSRSFAT 190
           G Y+T++ +GTPPR   + +DTGSDV+W+ C  C  C   ++       FDP  S S + 
Sbjct: 82  GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141

Query: 191 VPCRSPLCRK--LDSSGCNRRNTCLYQVSYGDGSITVGDFSTETLTFRGTRVARVAL--- 245
           V C    C       SGC+  N C Y   YGDGS T G + ++ ++F     + +A+   
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSS 201

Query: 246 -----GCGHDNEGLFV----AAAGLLGLGRGRLSFPTQTGRR--FNRKFSYCLVDRSTSA 294
                GC +   G       A  G+ GLG+G LS  +Q   +    R FS+CL  +   +
Sbjct: 202 APFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL--KGDKS 259

Query: 295 KPSSMVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGN 354
               MV G      T  +TPL+ +      Y V L  I+V G  +  I  S+F +  A  
Sbjct: 260 GGGIMVLGQIKRPDTV-YTPLVPS---QPHYNVNLQSIAVNG-QILPIDPSVFTI--ATG 312

Query: 355 GGVIIDSGTSVTRLTRPAYIALRDAFRAGASSLKRAPDFSLFDTCFDLSGKTEVKVPTVV 414
            G IID+GT++  L   AY     A     S   R   +  +  CF+++       P V 
Sbjct: 313 DGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ-CFEITAGDVDVFPEVS 371

Query: 415 LHFR-GADVSLPATNYLIPVDSSGT--FCFAFAGTMSG--LSIIGNIQQQGFRVVYDLAA 469
           L F  GA + L    YL    SSG+  +C  F   MS   ++I+G++  +   VVYDL  
Sbjct: 372 LSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQ-RMSHRRITILGDLVLKDKVVVYDLVR 430

Query: 470 SRIGFAPRGCA 480
            RIG+A   C+
Sbjct: 431 QRIGWAEYDCS 441


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 111/377 (29%), Positives = 173/377 (45%), Gaps = 55/377 (14%)

Query: 135 SGEYFTRLGVGTPPRYVYMVLDTGSDVVWIQC-APCKKCYSQTDPVFDPAKSRSFATVPC 193
           +G Y+  + +G P +  ++ +DTGSD+ W+QC APC+ C     P++ P  +R    VPC
Sbjct: 50  TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANR---LVPC 106

Query: 194 RSPLCRKLDS-----SGCNRRNTCLYQVSYGDGSITVGDFSTE--TLTFRGTRV-ARVAL 245
            + LC  L S     + C     C YQ+ Y D + + G    +  +L  R + +   +  
Sbjct: 107 ANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSNIRPGLTF 166

Query: 246 GCGHD-----NEGLFVAAAGLLGLGRGRLSFPTQTGRRFNRK--FSYCLVDRSTSAKPSS 298
           GCG+D     N  +  A  G+LGLGRG +S  +Q  ++   K    +CL    ++     
Sbjct: 167 GCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL----STNGGGF 222

Query: 299 MVFGDSAVSRTARFTPLLANPKLDTFYYVELVGISVGGAHVRGITASLFKLDPAGNGGVI 358
           + FGD  V  ++R T +    +    YY    G         G+              V+
Sbjct: 223 LFFGDDVVP-SSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPM----------EVV 271

Query: 359 IDSGTSVTRLTRPAYIALRDAFRAGAS-SLKRAPDFSL---------FDTCFDLSGKTEV 408
            DSG++ T  T   Y A+  A + G S SLK+  D +L         F + FD+  K E 
Sbjct: 272 FDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQKAFKSVFDV--KNEF 329

Query: 409 KVPTVVLHF---RGADVSLPATNYLIPVDSSGTFCFA-FAGTMSGLS--IIGNIQQQGFR 462
           K  ++ L F   + A + +P  NYLI V  +G  C     GT + LS  +IG+I  Q   
Sbjct: 330 K--SMFLSFSSAKNAAMEIPPENYLI-VTKNGNVCLGILDGTAAKLSFNVIGDITMQDQM 386

Query: 463 VVYDLAASRIGFAPRGC 479
           V+YD   S++G+A   C
Sbjct: 387 VIYDNEKSQLGWARGAC 403


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.136    0.408 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,404,006,630
Number of Sequences: 23463169
Number of extensions: 317636853
Number of successful extensions: 755814
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2122
Number of HSP's successfully gapped in prelim test: 2450
Number of HSP's that attempted gapping in prelim test: 743344
Number of HSP's gapped (non-prelim): 6022
length of query: 480
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 334
effective length of database: 8,933,572,693
effective search space: 2983813279462
effective search space used: 2983813279462
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)
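
Note (not part of the BLAST output): the bit scores and E-values reported for each hit can be approximately reproduced from the footer parameters with the standard Karlin-Altschul formulas, bits = (Lambda * S - ln K) / ln 2 and E = K * (search space) * exp(-Lambda * S). The sketch below is an illustration only. It assumes the second (gapped) Lambda/K pair applies to the gapped alignment scores and uses the "effective search space used" value as printed. The function names are hypothetical, and because Lambda and K are printed with limited precision the results match the report only to rounding.

```python
import math

# Values copied from the report footer (gapped Lambda/K pair).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 2983813279462  # "effective search space used"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Expected number of chance alignments scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw score 306 is the value in parentheses on the
    # "Score =  122 bits (306)" lines of the hits above.
    print(f"bits   = {bit_score(306):.1f}")
    print(f"expect = {expect_value(306):.1e}")
```

With raw score 306 this gives roughly 122 bits and an E-value near 4e-25, in line with the "Score" and "Expect" fields of the last three hits. The same conversion with raw score 79 gives about 35.0 bits, matching the S2 line.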